BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 027054
         (229 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  388 bits (997), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 175/212 (82%), Positives = 195/212 (91%), Gaps = 2/212 (0%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFVHHGVVTEECDPYFD+ GCSHPGCEP
Sbjct: 165 MNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEP 224

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            +PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP D+MAE+YKNGPVEVSFTVYEDFAH
Sbjct: 225 GFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAH 284

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 285 YKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECG 344

Query: 200 IEEDVVAGLPSSKN--LVKEITSADMFEDASA 229
           IE+D VAGLPS++N  LV+E+ S D  EDA A
Sbjct: 345 IEDDAVAGLPSARNLDLVREVASMDALEDAFA 376


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 174/210 (82%), Positives = 189/210 (90%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GC+GGYPISAWRYFVHHGVVTEECDPYFD  GCSHPGCEP
Sbjct: 148 MNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEP 207

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            YPTPKC RKCV KNQLW+ SKHY +  YRI+SDPE IMAEIYKNGPVEV+FTVYEDFAH
Sbjct: 208 GYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAH 267

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG +MGGHAVKLIGWGTS+DGE YW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 268 YKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECG 327

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
           IE DVVAGLPS++NLV+E+ S D  EDASA
Sbjct: 328 IEGDVVAGLPSTRNLVREVVSVDAREDASA 357


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  382 bits (980), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 168/197 (85%), Positives = 186/197 (94%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+TGCSHPGCEPA
Sbjct: 150 NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPA 209

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP+CVR CV KNQ+WR +KHY +SAYR+  DP DIMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 210 YPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHY 269

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 270 KSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGI 329

Query: 201 EEDVVAGLPSSKNLVKE 217
           EEDVVAGLPS+KN+ +E
Sbjct: 330 EEDVVAGLPSTKNIARE 346


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 168/197 (85%), Positives = 186/197 (94%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+TGCSHPGCEPA
Sbjct: 149 NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP+CVR CV KNQ+WR +KHY +SAYR+  DP DIMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 209 YPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGI 328

Query: 201 EEDVVAGLPSSKNLVKE 217
           EEDVVAGLPS+KN+ +E
Sbjct: 329 EEDVVAGLPSTKNIARE 345


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/208 (81%), Positives = 188/208 (90%), Gaps = 2/208 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCV+KCV  NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+Y
Sbjct: 209 YRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGI 328

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           EEDV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 329 EEDVTAGLPSTKNLVREVT--DMDADAA 354


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 168/203 (82%), Positives = 186/203 (91%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 153 MNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 212

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC RKCV  NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 213 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 272

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 273 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 332

Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
           IE  VVAGLPS +N+VK IT++D
Sbjct: 333 IEHGVVAGLPSDRNVVKGITTSD 355


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  372 bits (956), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 167/203 (82%), Positives = 185/203 (91%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 151 MNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 210

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC RKCV  NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 211 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 270

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 271 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 330

Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
           IE  VVAGLPS +N+ K IT++D
Sbjct: 331 IEHGVVAGLPSDRNVFKGITTSD 353


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  372 bits (956), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/210 (80%), Positives = 186/210 (88%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            NLSLSVNDLLACCG++CGDGCDGGYPI AWRYFV  GVVTEECDPYFD  GCSHPGCEP
Sbjct: 116 MNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEP 175

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            +PTPKC RKC  KN+LW  SKH+S++AYRI+SDP  IMAE+  NGPVEV+FTVYEDFAH
Sbjct: 176 GFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAH 235

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 236 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECG 295

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
           IEEDVVAGLPS++NLV+E+   D  E ASA
Sbjct: 296 IEEDVVAGLPSTRNLVREVAKIDAHEHASA 325


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score =  372 bits (955), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 164/203 (80%), Positives = 184/203 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHY
Sbjct: 209 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 269 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 328

Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
           E+DV AGLPS+KN+V+E+T  D+
Sbjct: 329 EDDVTAGLPSTKNIVREVTDMDV 351


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score =  372 bits (954), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 164/203 (80%), Positives = 184/203 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 151 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 210

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHY
Sbjct: 211 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHY 270

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 271 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 330

Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
           E+DV AGLPS+KN+V+E+T  D+
Sbjct: 331 EDDVTAGLPSTKNIVREVTDMDV 353


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  371 bits (953), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 168/203 (82%), Positives = 186/203 (91%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 84  MNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 143

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC RKCV  NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 144 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 203

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 204 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 263

Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
           IE  VVAGLPS +N+VK IT++D
Sbjct: 264 IEHGVVAGLPSDRNVVKGITTSD 286


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 163/203 (80%), Positives = 183/203 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 151 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 210

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIM E+YKNGPVEV+FTV+EDFAHY
Sbjct: 211 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHY 270

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 271 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 330

Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
           E+DV AGLPS+KN+V+E+T  D+
Sbjct: 331 EDDVTAGLPSTKNIVREVTDMDV 353


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  369 bits (946), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 165/208 (79%), Positives = 187/208 (89%), Gaps = 2/208 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGGYP+ AW+Y  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 148 NISLSVNDLLACCGFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPA 207

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCV+KCV  NQ+W+ SKHYS++AYR++SDP DIM E+YKNGPVEV+FTVYEDFAHY
Sbjct: 208 YRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHY 267

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGT++DGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 268 KSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGI 327

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           EEDV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 328 EEDVTAGLPSTKNLVREVT--DMDADAA 353


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  363 bits (933), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 164/210 (78%), Positives = 185/210 (88%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD TGCSHPGCEP
Sbjct: 153 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEP 212

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC+RKCV  NQLW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 213 AYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 272

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGT+D+GEDYW+LANQWNRSWG DGYF I+RG+NECG
Sbjct: 273 YKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECG 332

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
           IE++ VAGLPSS+N+ K IT +D    AS 
Sbjct: 333 IEDEPVAGLPSSRNVFKVITGSDDLSVASV 362


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score =  363 bits (932), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 163/196 (83%), Positives = 179/196 (91%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 148 MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEP 207

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            YPTPKCVRKC  +NQLWR +K Y  SAYRI+SDP  IMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 208 GYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAH 267

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           Y+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWILANQWNR+WG DGYF I+RG NECG
Sbjct: 268 YESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECG 327

Query: 200 IEEDVVAGLPSSKNLV 215
           IEE VVAGLPSSKNL+
Sbjct: 328 IEEGVVAGLPSSKNLM 343


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  362 bits (930), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 163/196 (83%), Positives = 179/196 (91%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 182 MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEP 241

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            YPTPKCVRKC  +NQLWR +K Y  SAYRI+SDP  IMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 242 GYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAH 301

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           Y+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWILANQWNR+WG DGYF I+RG NECG
Sbjct: 302 YESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECG 361

Query: 200 IEEDVVAGLPSSKNLV 215
           IEE VVAGLPSSKNL+
Sbjct: 362 IEEGVVAGLPSSKNLM 377


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  362 bits (928), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 161/208 (77%), Positives = 185/208 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS NDLLACCGFLCGDGCDGGYP+ AW+YFV  GVVT+ECDPYFD+ GCSHPGCEPA
Sbjct: 148 NISLSANDLLACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPA 207

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM E+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHY 267

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG++EC I
Sbjct: 268 KSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEI 327

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           E++VVAGLPS++NL  E+  +D F DA+
Sbjct: 328 EDEVVAGLPSARNLNMELDVSDAFLDAA 355


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  360 bits (925), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 160/208 (76%), Positives = 184/208 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS NDL ACCGFLCGDGCDGGYP+ AW+YFV  GVVT+ECDPYFD+ GCSHPGCEPA
Sbjct: 148 NISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPA 207

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM E+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHY 267

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TGD+MGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+NEC I
Sbjct: 268 KSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEI 327

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           E++VVAGLPS++NL  E+  +D F DA+
Sbjct: 328 EDEVVAGLPSARNLNVELDVSDAFLDAA 355


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  360 bits (925), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 160/208 (76%), Positives = 182/208 (87%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 149 NVSLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCVRKCVK NQ+W+ SK++S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YQTPKCVRKCVKGNQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGI
Sbjct: 269 KSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGI 328

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           EEDV AGLPS+KN+ + +   D   D S
Sbjct: 329 EEDVTAGLPSTKNMGRWVMDMDADADVS 356


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  357 bits (917), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 163/209 (77%), Positives = 181/209 (86%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           NLSLSVNDLLACCG++CG GCDGG PI AWRYFV  GVVTEECDPYFD  GCSHPGCEP 
Sbjct: 131 NLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEPG 190

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           +PTPKC RKC  KN+LW  SKH+S++AYRI+SDP  IMAE+  NGPVEV+FTVYEDFAHY
Sbjct: 191 FPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVYEDFAHY 250

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITGD MGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKIKRG+NECGI
Sbjct: 251 KSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI 310

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDASA 229
           E  VVAGLPS++NLV+E+   D  E A+A
Sbjct: 311 EGAVVAGLPSTRNLVREVAGIDGHEHATA 339


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  357 bits (915), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 158/202 (78%), Positives = 181/202 (89%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP 
Sbjct: 149 NVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPT 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKCV +NQLW  SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK+ITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 328

Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
           E+ VVAGLPS KN+ K IT++D
Sbjct: 329 EQSVVAGLPSEKNVFKGITTSD 350


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score =  356 bits (914), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 158/202 (78%), Positives = 181/202 (89%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP 
Sbjct: 171 NVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPT 230

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKCV +NQLW  SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 231 YPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHY 290

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK+ITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 291 KSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 350

Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
           E+ VVAGLPS KN+ K IT++D
Sbjct: 351 EQSVVAGLPSEKNVFKGITTSD 372


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score =  355 bits (912), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 157/202 (77%), Positives = 180/202 (89%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 194 MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 253

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTP C +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 254 AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 313

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 314 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 373

Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
           IEEDVVAG+PS+KN+V+   SA
Sbjct: 374 IEEDVVAGMPSTKNMVRNYDSA 395


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score =  355 bits (911), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 157/202 (77%), Positives = 180/202 (89%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 149 MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 208

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTP C +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 209 AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 268

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 269 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 328

Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
           IEEDVVAG+PS+KN+V+   SA
Sbjct: 329 IEEDVVAGMPSTKNMVRNYDSA 350


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  354 bits (908), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 159/209 (76%), Positives = 184/209 (88%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTP+C+RKCV  N+LW  SKHYS+S Y +NS P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPRCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYEDFAH 269

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTS++GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
           IE++ VAGLPSS+N+ K  T ++    AS
Sbjct: 330 IEDEPVAGLPSSRNVFKVDTGSNDLPVAS 358


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  353 bits (907), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 159/201 (79%), Positives = 177/201 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPA
Sbjct: 141 SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPA 200

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 201 YPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 260

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 261 KSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 320

Query: 201 EEDVVAGLPSSKNLVKEITSA 221
           EE VVAG+PS+KN+V     A
Sbjct: 321 EEGVVAGMPSTKNMVPNFGGA 341


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score =  353 bits (907), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 159/201 (79%), Positives = 177/201 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPA
Sbjct: 141 SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPA 200

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 201 YPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 260

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 261 KSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 320

Query: 201 EEDVVAGLPSSKNLVKEITSA 221
           EE VVAG+PS+KN+V     A
Sbjct: 321 EEGVVAGMPSTKNMVPNFGGA 341


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score =  353 bits (906), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 158/209 (75%), Positives = 177/209 (84%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVND+LACCG LCG GC GG P SAW Y  HHGVVTEECDPYFD  GCSHPGCEP
Sbjct: 142 MNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEP 201

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TPKCV+KCV  NQLW  SKHYS+ AY +NSDP+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 202 TYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAH 261

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKL+GWGTS +GEDYW+LANQWN +WG DGYFKIKRG+NECG
Sbjct: 262 YKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 321

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
           IE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 322 IENAVTAGLPSTKNIVREVTDMDVDADVS 350


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score =  353 bits (906), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 158/209 (75%), Positives = 177/209 (84%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVND+LACCG LCG GC GG P SAW Y  HHGVVTEECDPYFD  GCSHPGCEP
Sbjct: 147 MNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEP 206

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TPKCV+KCV  NQLW  SKHYS+ AY +NSDP+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 207 TYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAH 266

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKL+GWGTS +GEDYW+LANQWN +WG DGYFKIKRG+NECG
Sbjct: 267 YKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 326

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
           IE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 327 IENAVTAGLPSTKNIVREVTDMDVDADVS 355


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score =  352 bits (903), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 159/202 (78%), Positives = 177/202 (87%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            ++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 1   MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61  AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 180

Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
           IEE VVAG+PS+KN+V     A
Sbjct: 181 IEEGVVAGMPSTKNMVPNFGGA 202


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score =  352 bits (903), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 157/202 (77%), Positives = 180/202 (89%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 25  MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 84

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTP C +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 85  AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 144

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 145 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 204

Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
           IEEDVVAG+PS+KN+V+   SA
Sbjct: 205 IEEDVVAGMPSTKNMVRNYDSA 226


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score =  352 bits (902), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 155/196 (79%), Positives = 178/196 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDL+ACCGF+CGDGCDGGYPISAW+Y V +GVVT+ECDPYFD  GC HPGCEPA
Sbjct: 146 NISLSVNDLVACCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPA 205

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KC  +NQ+W+  KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 206 YPTPACEKKCKVQNQVWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 265

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVY+HITG++MGGHAVKLIGWGTS DG+DYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 266 KSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGI 325

Query: 201 EEDVVAGLPSSKNLVK 216
           EEDVVAG+PS+KN V+
Sbjct: 326 EEDVVAGMPSTKNTVR 341


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  351 bits (901), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 159/210 (75%), Positives = 182/210 (86%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC RKCV  N+LW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 269

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
           IE++ VAGLPSSKN+ +  T ++    AS 
Sbjct: 330 IEDEPVAGLPSSKNVFRVDTGSNDLPVASV 359


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  351 bits (901), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 159/209 (76%), Positives = 182/209 (87%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AYPTPKC RKCV  N+LW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 269

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG  +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329

Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
           IE++ VAGLPSSKN+ +  T ++    AS
Sbjct: 330 IEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  350 bits (899), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 157/194 (80%), Positives = 176/194 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPA
Sbjct: 149 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKC  +NQ+W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCHRKCKVENQVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 269 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 328

Query: 201 EEDVVAGLPSSKNL 214
           EEDV AG+PS+KN+
Sbjct: 329 EEDVTAGMPSTKNM 342


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score =  350 bits (897), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 158/192 (82%), Positives = 176/192 (91%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LSVNDLLACCGF CGDGCDGGYPISAW+YF + GVVTEECDPYFD TGCSHPGCEPA
Sbjct: 152 NITLSVNDLLACCGFRCGDGCDGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHPGCEPA 211

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TP+C+RKCV +NQLW  SKHYSI+ Y + S+P+DIMAEIYKNGPVEVSFTVYEDFAHY
Sbjct: 212 YNTPQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIYKNGPVEVSFTVYEDFAHY 271

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNRSWG DGYF I+RG+NECGI
Sbjct: 272 KSGVYKHITGSNIGGHAVKLIGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGI 331

Query: 201 EEDVVAGLPSSK 212
           E++ VAGLPSSK
Sbjct: 332 EDEPVAGLPSSK 343


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score =  350 bits (897), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 157/201 (78%), Positives = 175/201 (87%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS NDL+ACCGF+CGDGCDGGYPI AW+YFV  GVVTEECDPYFD  GC HPGCEPA
Sbjct: 142 NISLSANDLVACCGFMCGDGCDGGYPIKAWQYFVQSGVVTEECDPYFDQVGCKHPGCEPA 201

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKC +KC  +NQ+W   KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 202 YDTPKCEKKCKVQNQVWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 261

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 262 KSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 321

Query: 201 EEDVVAGLPSSKNLVKEITSA 221
           EE+VVAG+PS+KN+     SA
Sbjct: 322 EEEVVAGMPSTKNMAGNHGSA 342


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  349 bits (896), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 157/194 (80%), Positives = 175/194 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPA
Sbjct: 149 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKC  +NQ+W+ +KH S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCHRKCKVENQVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 269 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGGDGYFKIIRGKNECGI 328

Query: 201 EEDVVAGLPSSKNL 214
           EEDV AG+PS+KN+
Sbjct: 329 EEDVTAGMPSTKNM 342


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  349 bits (895), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 159/208 (76%), Positives = 178/208 (85%), Gaps = 2/208 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACC FLCG GCDGGYPI+AWRYF   GVVTEECDPYFD+TGCSHPGCEP 
Sbjct: 148 SISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPL 207

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKC RKCVK N LWR SKHY ++AYR++ DP+ IMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSFTVYEDFAHY 267

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TG  MGGHAVKLIGWGTS+ GEDYW++ N WNR WG DGYFKI+RG+NECGI
Sbjct: 268 KSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTNECGI 327

Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
           E  VVAGLPS++NL  E+   D   DAS
Sbjct: 328 EHSVVAGLPSARNLNVEL--GDAVLDAS 353


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score =  349 bits (895), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 156/195 (80%), Positives = 173/195 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS NDL+ACCGF+CGDGCDGGYPISAW+YFV +GVVTEECDPYFD  GC HPGCEPA
Sbjct: 144 NISLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPA 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KC  +NQ+W+  KH+SI AY++NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 204 YPTPVCEKKCKVQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 264 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 323

Query: 201 EEDVVAGLPSSKNLV 215
           EEDV AG+PS KN+ 
Sbjct: 324 EEDVTAGMPSMKNIA 338


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score =  345 bits (886), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 153/195 (78%), Positives = 173/195 (88%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS NDL+ACCGF+CGDGCDGGYPISAW+YFV +GVVT+ECDPYFD  GC HPGCEPA
Sbjct: 105 NITLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPA 164

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KC  +NQ+W   KH+SI+AY++NSDP DIMAE+Y NGPVEV+FTVYEDFAHY
Sbjct: 165 YPTPVCEKKCKVQNQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHY 224

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 225 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 284

Query: 201 EEDVVAGLPSSKNLV 215
           EEDV AG+PS+KN+ 
Sbjct: 285 EEDVTAGMPSTKNIA 299


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  345 bits (885), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 155/194 (79%), Positives = 172/194 (88%), Gaps = 1/194 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF   GVVT ECDPYFD TGCSHPGCEPA
Sbjct: 144 NVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPGCEPA 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KCVKKN LW  SKH+S++AYR+NSD   IM E+Y NGP EVSFTVYEDFAHY
Sbjct: 204 YPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TG  MGGHAVKLIGWGTS+DGEDYW+LANQWNRSWG DGYFKI RG+NECGI
Sbjct: 264 KSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKIIRGTNECGI 323

Query: 201 EEDVVAGLPSSKNL 214
            EDV AG+PS+KNL
Sbjct: 324 -EDVTAGMPSTKNL 336


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  345 bits (884), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 155/194 (79%), Positives = 171/194 (88%), Gaps = 1/194 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF   GVVT ECDPYFD TGCSHPGCEPA
Sbjct: 144 NVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPGCEPA 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KCVKKN LW  SKH+S++AYR+NSD   IM E+Y NGP EVSFTVYEDFAHY
Sbjct: 204 YPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKH+TG  MGGHAVKLIGWGTS+DGEDYW+LANQWNRSWG DGYFKI RG+NECGI
Sbjct: 264 KSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKIIRGTNECGI 323

Query: 201 EEDVVAGLPSSKNL 214
            EDV AG PS+KNL
Sbjct: 324 -EDVTAGTPSTKNL 336


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score =  344 bits (882), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 154/202 (76%), Positives = 174/202 (86%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
            N+SLSVNDLLACCGFLCG GC+GGYPISAWRYF   GVVT+ECDPYFD  GC HPGCEP
Sbjct: 144 MNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPYFDQVGCKHPGCEP 203

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           AY TPKC +KC  +N++W+  KH+S+ AYR++S+P DIMAE+Y NGPVEV+FTVYEDFAH
Sbjct: 204 AYRTPKCEKKCKVQNEVWKEQKHFSVDAYRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAH 263

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 264 YKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 323

Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
           IEEDVVAG+PS+KN+ +    A
Sbjct: 324 IEEDVVAGMPSTKNMARNYDDA 345


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score =  338 bits (866), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 149/179 (83%), Positives = 164/179 (91%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCV+KCV  NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+Y
Sbjct: 209 YRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYY 268

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           KSGVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 269 KSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECG 327


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score =  335 bits (860), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 153/198 (77%), Positives = 172/198 (86%), Gaps = 2/198 (1%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPA
Sbjct: 145 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 204

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFA 138
           YPTPKC RKC  +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT  +  DFA
Sbjct: 205 YPTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFA 264

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           HYKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NEC
Sbjct: 265 HYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENEC 324

Query: 199 GIEEDVVAGLPSSKNLVK 216
           GIE DV AG+PS+KN  +
Sbjct: 325 GIEGDVTAGMPSTKNTAR 342


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score =  334 bits (857), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 149/195 (76%), Positives = 170/195 (87%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDL+ACCGFLCGDGCDGGYPI AW+YFV +GVVT+ECDP+FD  GC HPGCEPA
Sbjct: 145 NVSLSVNDLVACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQHPGCEPA 204

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP C +KC  +NQ+W   KH+SI AY++NSDP DIMAE+YKNGPVEVSF +YEDFAHY
Sbjct: 205 YPTPVCEKKCKVQNQVWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHY 264

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK ITG ++GGHA KLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG+NECGI
Sbjct: 265 KSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGI 324

Query: 201 EEDVVAGLPSSKNLV 215
           E DV AG+PS+KN+ 
Sbjct: 325 EGDVNAGMPSTKNIA 339


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  330 bits (847), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 160/202 (79%), Positives = 183/202 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVTEECDPYFD+TGCSHPGCEP 
Sbjct: 151 NVSLSANDVVACCGLLCGLGCNGGFPMGAWLYFKYHGVVTEECDPYFDNTGCSHPGCEPG 210

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTPKCVRKCV +NQLW  SKHY +SAYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 211 YPTPKCVRKCVSENQLWGESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFAHY 270

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 271 KSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 330

Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
           E  VVAGLPS +N+ K++T++D
Sbjct: 331 EHGVVAGLPSDRNVFKDVTTSD 352


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  324 bits (830), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 145/197 (73%), Positives = 169/197 (85%), Gaps = 1/197 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+  GVVT ECDPYFD  GC HPGCEP 
Sbjct: 144 NVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPL 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP+CV++C  +NQ W NSK +S +AYRI+S P DIMAE+Y NGPVEVSF+VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK+  GD MGGHAVKL+GWGT +DG DYW++AN WN +WG DGYFKI RGSNECGI
Sbjct: 264 KSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 322

Query: 201 EEDVVAGLPSSKNLVKE 217
           E DVVAG+PS+KNLV +
Sbjct: 323 EGDVVAGMPSTKNLVMD 339


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  324 bits (830), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 145/197 (73%), Positives = 169/197 (85%), Gaps = 1/197 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+  GVVT ECDPYFD  GC HPGCEP 
Sbjct: 144 NVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPL 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP+CV++C  +NQ W NSK +S +AYRI+S P DIMAE+Y NGPVEVSF+VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK+  GD MGGHAVKL+GWGT +DG DYW++AN WN +WG DGYFKI RGSNECGI
Sbjct: 264 KSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 322

Query: 201 EEDVVAGLPSSKNLVKE 217
           E DVVAG+PS+KNLV +
Sbjct: 323 EGDVVAGMPSTKNLVMD 339


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score =  321 bits (823), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 164/200 (82%)

Query: 29  LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 88
            L    F  G    GGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVR
Sbjct: 9   FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68

Query: 89  KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
           KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69  KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128

Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           TG  +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGL
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGL 188

Query: 209 PSSKNLVKEITSADMFEDAS 228
           PS+KN+ + +   D   D S
Sbjct: 189 PSTKNMGRWVMDMDADADVS 208


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  319 bits (817), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 143/197 (72%), Positives = 166/197 (84%), Gaps = 1/197 (0%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS NDL+ACCGF CGDGCDGGYP+SAW+YF+  GVVT ECDPYFD  GC HPGCEP 
Sbjct: 144 NVTLSENDLVACCGFRCGDGCDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQHPGCEPL 203

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           YPTP+CV++C  +NQ W NSK +S +AYRI S P DIMAE+Y  GPVEV F VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQNWGNSKRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHY 263

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVYK+ITGD +GGHAVKLIGWGT ++G DYW++AN WN +WG DGYFKI RGSNEC I
Sbjct: 264 KSGVYKYITGDFLGGHAVKLIGWGT-ENGTDYWLVANSWNTAWGEDGYFKIARGSNECSI 322

Query: 201 EEDVVAGLPSSKNLVKE 217
           EEDVVAG+PS+KNLV +
Sbjct: 323 EEDVVAGMPSTKNLVMD 339


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score =  317 bits (812), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 143/212 (67%), Positives = 175/212 (82%), Gaps = 2/212 (0%)

Query: 9   DALSSSPYVSLQ-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF 67
           +ALS    +  Q N +LS NDL+ACCGF CG GC+GG+P+SAWRYF   GVVT+ECDPYF
Sbjct: 130 EALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWRYFSRRGVVTDECDPYF 189

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           D+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI SDP +IMAE++ NGPV
Sbjct: 190 DNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIKSDPYNIMAEVFNNGPV 248

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           EVSF+VYEDFAHY++GVYKH+ G  +GGHAVKLIGWGT+DDG DYW++AN WN +WG  G
Sbjct: 249 EVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLIANSWNTAWGEGG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 219
           YFKI RG NECGIE D VAG+PS+KNL+++ T
Sbjct: 309 YFKIARGVNECGIERDPVAGMPSAKNLIQDPT 340


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  311 bits (797), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 140/176 (79%), Positives = 158/176 (89%)

Query: 47  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 106
           + AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + 
Sbjct: 1   MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
           AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTS
Sbjct: 61  AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
           DDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 121 DDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score =  303 bits (775), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 134/166 (80%), Positives = 150/166 (90%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1   CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           KHYS+  Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG  +GGHAVKL
Sbjct: 61  KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
            GWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIEEDV A
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score =  295 bits (754), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 135/199 (67%), Positives = 160/199 (80%), Gaps = 2/199 (1%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+SLS NDL+ACC   CG GCDGGYP +AW YF   GVVT +CDPYFD  GC HPGCEP
Sbjct: 146 ENVSLSENDLVACCS-SCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEP 204

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TP CV++CV  N+ WR+SKH+++  Y +NSD  DI AEIYKNGPVEVS+TVYEDFAH
Sbjct: 205 EYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAH 263

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKH+ G+V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F+I RGSNECG
Sbjct: 264 YKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFFQISRGSNECG 323

Query: 200 IEEDVVAGLPSSKNLVKEI 218
           IE + VAG+P  K    +I
Sbjct: 324 IESEPVAGIPLKKTGFSDI 342


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score =  293 bits (750), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 134/199 (67%), Positives = 159/199 (79%), Gaps = 2/199 (1%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+SLS NDL+ACC   CG GC+GGYP +AW YF   GVVT +CDPYFD  GC HPGCEP
Sbjct: 135 ENVSLSENDLVACCS-SCGFGCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEP 193

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TP CV++CV  N+ WR+SKH+++  Y +NSD  DI AEIYKNGPVEVS+TVYEDFAH
Sbjct: 194 EYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAH 252

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YKSGVYKH+ G V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F+I RGSNECG
Sbjct: 253 YKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFFQISRGSNECG 312

Query: 200 IEEDVVAGLPSSKNLVKEI 218
           IE + VAG+P  K    +I
Sbjct: 313 IESEPVAGIPLKKTGFSDI 331


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  290 bits (741), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 130/195 (66%), Positives = 153/195 (78%), Gaps = 1/195 (0%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++SLS NDLLACCGF CGDGCDGGYPI AWRYF   GVVT +CDPYFD  GC HPGC P
Sbjct: 142 ESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGCGHPGCYP 201

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TPKCV+ CV  ++LW  SKH S++AY ++ +PED+MAE+Y NGP+EVSF V+EDFAH
Sbjct: 202 TYRTPKCVKHCVD-DELWVKSKHLSVNAYEVSKEPEDLMAELYTNGPIEVSFEVFEDFAH 260

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YK+GVYKH+ G  +GGHAVKLIGWGT+DDG DYW + N WN +WG  G F+I RG NECG
Sbjct: 261 YKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEHGLFRIARGGNECG 320

Query: 200 IEEDVVAGLPSSKNL 214
           IE   VAGLP  K L
Sbjct: 321 IESYAVAGLPFDKGL 335


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score =  289 bits (740), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 131/166 (78%), Positives = 147/166 (88%), Gaps = 2/166 (1%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPA
Sbjct: 145 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 204

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFA 138
           YPTPKC RKC  +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT  +  DFA
Sbjct: 205 YPTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFA 264

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           HYKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG
Sbjct: 265 HYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWG 310


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  289 bits (739), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 128/195 (65%), Positives = 156/195 (80%), Gaps = 1/195 (0%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++SLS NDLLACCGF CG GC+GGYPI AW+YF H GVVT +CDPYFD  GC+HPGC P
Sbjct: 150 ESVSLSENDLLACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYP 209

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TPKC ++CV  ++ W  SKH  ++AY ++ +PED+MAE+Y NGPVEV+F VYEDFAH
Sbjct: 210 TYETPKCEKQCVD-DEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAH 268

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YK+GVYKH+ G  MGGHAVKLIGWGT+DDG DYW + N WN +WG DG F+I RG++ECG
Sbjct: 269 YKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEDGLFRIVRGNDECG 328

Query: 200 IEEDVVAGLPSSKNL 214
           IE + VAGLPS K L
Sbjct: 329 IESNAVAGLPSRKGL 343


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score =  284 bits (726), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 124/195 (63%), Positives = 157/195 (80%), Gaps = 1/195 (0%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++SLS NDLLACCGF CGDGC+GGYPI AW+YF   GVVT +CDPYFD  GC HPGC P
Sbjct: 148 ESVSLSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGCGHPGCYP 207

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            Y TPKC ++CV  ++LW +SKH  +SAY ++ +PE++MAE++ NGP+EV+F V+EDFAH
Sbjct: 208 TYDTPKCFKRCVD-DELWVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAFDVFEDFAH 266

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           YK+GVYKH+ G  +GGHAVKL+GWGT+DDG DYW + N WN +WG DG F+I RG +ECG
Sbjct: 267 YKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECG 326

Query: 200 IEEDVVAGLPSSKNL 214
           IE + VAGLPS+K L
Sbjct: 327 IESNAVAGLPSNKGL 341


>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
          Length = 201

 Score =  243 bits (621), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 108/132 (81%), Positives = 121/132 (91%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY  +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66  NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCV+KCV  NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185

Query: 141 KSGVYKHITGDV 152
           KSGVYKH+TG V
Sbjct: 186 KSGVYKHVTGYV 197


>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
           unguiculata]
          Length = 195

 Score =  242 bits (617), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 107/130 (82%), Positives = 120/130 (92%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY  +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66  NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
           Y TPKCV+KCV  NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185

Query: 141 KSGVYKHITG 150
           KSGVYKH+TG
Sbjct: 186 KSGVYKHVTG 195


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score =  229 bits (584), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 102/134 (76%), Positives = 120/134 (89%)

Query: 88  RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 147
           +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1   KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60

Query: 148 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           ITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG
Sbjct: 61  ITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAG 120

Query: 208 LPSSKNLVKEITSA 221
           +PS+KN+V+   SA
Sbjct: 121 MPSTKNMVRNYDSA 134


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 108/228 (47%), Positives = 143/228 (62%), Gaps = 23/228 (10%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTE 61
           T ++R  ++S+  V   N+ +S  DLL+CC  G+ CGDGC+GGYPI AWRY+VH+G+VT 
Sbjct: 111 TISDRTCIASNGEV---NVLISAEDLLSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTG 167

Query: 62  E-------CDPYFDS------TGCSHPGCEP-AYPTPKCVRKCVKKNQL---WRNSKHYS 104
                   C PY  +       G + P C      TP+CV++C  K+     +   KHY 
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYG 227

Query: 105 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
            SAY I  +   I  EI +NGPVEV F VY DF  YKSG+YKH+ G  +GGHAVK++GWG
Sbjct: 228 SSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWG 287

Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
             ++G  YW+ AN WN +WG  GYF+I+RG+NECGIE  VVAG+P  K
Sbjct: 288 V-ENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIESSVVAGIPDLK 334


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  204 bits (518), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 104/203 (51%), Positives = 130/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  DLL+CCGF CGDGC+GG+P SAW+Y+   G+VT         C PY     C 
Sbjct: 162 QVEISAEDLLSCCGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPY-QIKPCE 220

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP CV KC     +  N  KHY +S+Y + SDP  I  EI  +GP
Sbjct: 221 HHVPGDRPKCSEGGGTPSCVSKCKGNTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGP 280

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVYKH+TG V+GGHA++++GWG S++G  YW++AN WN  WG  
Sbjct: 281 VEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWG-SENGVAYWLVANSWNTDWGDK 339

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI RGS+ECGIE  VVAG+P
Sbjct: 340 GYFKILRGSDECGIESSVVAGIP 362


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  200 bits (509), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 106/218 (48%), Positives = 134/218 (61%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
           T+R  ++S      Q   +S  DLL CC F CGDGC+GGYP +AW Y+ + G+VT     
Sbjct: 128 TDRTCIASK---GAQTPHISAEDLLTCCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYD 184

Query: 61  --EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
             + C PY        +TG   P C    PTP C R C +  N  + N KH+  S+Y + 
Sbjct: 185 SNQGCQPYSLAKCEHHTTGPYKP-CGDIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVR 243

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
              + I  EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+K+IGWG   DG D
Sbjct: 244 G-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQ-DGTD 301

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YWI+AN WN SWG DG+F IK+G++ECGIE  VVAGLP
Sbjct: 302 YWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAGLP 339


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 101/228 (44%), Positives = 149/228 (65%), Gaps = 20/228 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ Y++++   +S  DLL+CCG  CG+GC+GG+P  AW+Y++  G+V+     
Sbjct: 120 SDRICVHTNGYITIE---VSAEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYD 176

Query: 63  ----CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
               C PY     C H   G  PA       TPKC +KC    +  +++ KHY  +AY +
Sbjct: 177 SHVGCRPY-SIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNV 235

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
            S  ++IMAEIYKNGPVE +F VY DF  YKSGVY+H+TGD++GGHA++++GWG  +DG 
Sbjct: 236 PSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGV-EDGV 294

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
            YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P ++   K+I
Sbjct: 295 PYWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAGIPRTEQYWKKI 342


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  199 bits (507), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 100/204 (49%), Positives = 135/204 (66%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
             + +S  DLL CC   CG GC+GGYP SAW +F   G+VT       + C PY     C
Sbjct: 128 NQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPY-AIPAC 185

Query: 73  SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H        C  + PTPKC + C K  N  ++N KHY +++Y IN+D  +IM EI  NG
Sbjct: 186 DHHVPHSKNPCNGSLPTPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQNEIMREIMTNG 245

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTV+ DF +YKSGVY+H++G+ +GGHA+K++GWG  ++   YW++AN WN SWG 
Sbjct: 246 PVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENN-TPYWLVANSWNPSWGD 304

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RGS+ECGIE++VVAGLP
Sbjct: 305 NGFFKILRGSDECGIEDEVVAGLP 328


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  199 bits (505), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 100/209 (47%), Positives = 133/209 (63%), Gaps = 20/209 (9%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
             LS  DLL+CCG++CG+GC+GG+P +AW Y+V +G+V+      +  TGC     EP  
Sbjct: 146 FDLSSEDLLSCCGYVCGNGCNGGFPQAAWEYWVQNGLVS---GGLYHGTGCQPYAIEPCE 202

Query: 82  ---------------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
                           TPKC  KCV      +   KHY   AYRI ++ + IM EIYKNG
Sbjct: 203 HHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNG 262

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF  YKSGVY H TG  +GGHA++++GWG  ++GE YW+  N WN  WG 
Sbjct: 263 PVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWG-EENGEKYWLCGNSWNTDWGN 321

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           +G+FKIKRG NECGIE ++V G+P+S++L
Sbjct: 322 NGFFKIKRGVNECGIESEMVGGIPASESL 350


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW +    G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ ++S++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 130 SDRICIHTNAHISVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 186

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 187 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 245

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+HITG++MGGHA++++GWG  ++G  
Sbjct: 246 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGV-ENGTP 304

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 305 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 351


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 91/120 (75%), Positives = 105/120 (87%)

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
           R +SDP  IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2   RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61

Query: 169 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 228
           GEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAG+PS KNL  E+  +D F DAS
Sbjct: 62  GEDYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDAS 121


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 122/200 (61%), Gaps = 16/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S  DL+ CC F CG GC GGYP +AW +F   G+VT       + C PY     C H  
Sbjct: 142 ISAQDLMTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPY-SLPNCDHHV 200

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C    PTP C + C    N  + N KH+  +AY +  + + I  EI  NGPVE 
Sbjct: 201 SGQYPACSGEGPTPACKKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEG 260

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVYED   YKSGVY+H TG V+GGHA+K+IGWG  + G DYW +AN WN  WG +G+F
Sbjct: 261 AFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGV-ESGVDYWWVANSWNNDWGDNGFF 319

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KIK+G +ECGIE  +VAG+P
Sbjct: 320 KIKKGVDECGIESQIVAGMP 339


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 102/208 (49%), Positives = 126/208 (60%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
           N  LS  DLL+CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +  
Sbjct: 132 NTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPC 191

Query: 70  ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
                G   P C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI
Sbjct: 192 GETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEI 251

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             NGP+EV+FTVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN 
Sbjct: 252 LTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 310

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 311 AWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  197 bits (500), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +   DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 40  SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 96

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 97  SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 155

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 156 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 214

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 215 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 261


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 24  SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 80

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 81  SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 139

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 140 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 198

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 199 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 245


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 105/216 (48%), Positives = 137/216 (63%), Gaps = 30/216 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           +++ LS  +L++CC   CGDGC+GGYP +A +YFV  G+VT +       C  Y     C
Sbjct: 145 EDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAY-SFPPC 202

Query: 73  SH-------PGCEPAYPTPKCVRKC-----VKK---NQLWRNSKHYSISAYRINSDPEDI 117
           +H       P C+   PTP+C +KC     VK+     L++  K YS+S     SDP+ I
Sbjct: 203 AHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVS-----SDPKAI 257

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
           M EI  NGPVEV+FTVYEDF  YKSGVY+H+TG+ +GGHAVK+IGWG  +D   YW++ N
Sbjct: 258 MTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVEND-TPYWLIVN 316

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
            WN +WG  G FKI RGSNECGIE++VV  LP  K 
Sbjct: 317 SWNETWGDQGTFKILRGSNECGIEDEVVTALPQKKQ 352


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 16/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  S+Y ++ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TG++MGGHAV+++GWG  +DG  YW++ N WN  WG +
Sbjct: 249 VEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 308 GFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  196 bits (499), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 98/226 (43%), Positives = 146/226 (64%), Gaps = 17/226 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 45  SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 101

Query: 63  ----CDPYFDSTGCSH-----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINS 112
               C PY      +H     P C     TPKC + C    +  ++  KHY  ++Y +++
Sbjct: 102 SHVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 161

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
             +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  Y
Sbjct: 162 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPY 220

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           W++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 221 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 266


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  196 bits (499), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 134/212 (63%), Gaps = 15/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
           ++ +S  DLL+CCGF CG GC+GGYP  AWRY+   G+V+         C PY       
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH 189

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              G   P       TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKNGP
Sbjct: 190 HVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGP 249

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F VYEDF  YKSGVY+H+TG+ +GGHA++L+GWG  D+G  YW+ AN WN  WG +
Sbjct: 250 VEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGV-DNGTPYWLAANSWNTDWGDN 308

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE ++VAG+PS++   K +
Sbjct: 309 GFFKILRGEDHCGIESEIVAGIPSTERYWKRV 340


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  196 bits (499), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 102 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 158

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 159 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 217

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 218 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 276

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 277 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  196 bits (498), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 131/206 (63%), Gaps = 14/206 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           +FKI RG N CGIE ++VAG+P +++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQD 334


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  196 bits (498), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 101/203 (49%), Positives = 125/203 (61%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
            N  LS  DL +CC   CG GC+GGYP +AW YF   G+VT       + C PY      
Sbjct: 266 NNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACD 324

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              TG   P C    PTP C   C + N  W + KH+  S+Y + +D + IM EIY NGP
Sbjct: 325 HHVTGKYQP-CGDIQPTPACANSC-QNNATWSSDKHFGASSYSVGTDQQSIMTEIYTNGP 382

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE S+ VY DF  YKSGVY+H+TGD +GGHAVK+IGWG  D    YWI+AN WN  WG +
Sbjct: 383 VEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGV-DGSTPYWIVANSWNNDWGNN 441

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+F I RGS+ECGIE+ +VAG+P
Sbjct: 442 GFFNILRGSDECGIEDGIVAGIP 464


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 143/227 (62%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 109 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 165

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY   +Y ++
Sbjct: 166 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVS 224

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           ++  DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 225 NNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 283

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++ N WN  WG +G+FKI RG + CGIE +VVAG+P +    + I
Sbjct: 284 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWRNI 330


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 41  SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 97

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 98  SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 156

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 157 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 215

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 216 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 255


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 103/208 (49%), Positives = 129/208 (62%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
           N  LS  DLL+CC G L CG+GC+GGYPI AW+++V HG+VT         C PY  +  
Sbjct: 133 NTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPC 192

Query: 70  ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
                G + P C +   PTPKCV  C   N     +   KH+  +AY +    E I  EI
Sbjct: 193 GQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEI 252

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
            KNGPVEV+FTVYEDF  Y +GVY H +G  +GGHAVK++GWG  D+G  YW++AN WN 
Sbjct: 253 LKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 311

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 312 NWGEKGYFRIIRGLNECGIEHSAVAGIP 339


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  196 bits (497), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 39  SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 95

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 96  SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 154

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 155 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 213

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 214 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 253


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  196 bits (497), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGP E +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  196 bits (497), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 133/212 (62%), Gaps = 15/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
           N+ +S  DLL+CCGF CG GC+GGYP  AW+Y+   G+V+         C PY       
Sbjct: 120 NVEISAEDLLSCCGFECGMGCNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEH 179

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
            + G   P       TP+CV+KC       ++  KHY +++Y I    ++IMAEIYKNGP
Sbjct: 180 HTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGP 239

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F VY DF  YKSGVY+H++G+ +GGHA++++GWG  D+G  YW+ AN WN  WG D
Sbjct: 240 VEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWGV-DNGTPYWLAANSWNTDWGED 298

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+F+I RG + CGIE ++VAG+P +    K +
Sbjct: 299 GFFRILRGQDHCGIESEIVAGIPKTSEYWKML 330


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  195 bits (495), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/211 (45%), Positives = 136/211 (64%), Gaps = 16/211 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
           + +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY     C H
Sbjct: 1   VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEH 59

Query: 75  ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPV
Sbjct: 60  HVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 119

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 120 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNG 178

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           +FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 179 FFKILRGQDHCGIESEVVAGIPRTDQYWEKI 209


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 97/207 (46%), Positives = 131/207 (63%), Gaps = 16/207 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 113 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 171

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGP
Sbjct: 172 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 231

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 232 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 290

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
           G+FKI RG N CGIE ++VAG+P ++ 
Sbjct: 291 GFFKILRGENHCGIESEIVAGIPRTQQ 317


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 131/206 (63%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  DLL CC   CG GC+GGYP +AW Y+   G+VT       + C PY     C 
Sbjct: 135 QVDISAEDLLDCCDS-CGAGCNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPY-SLAPCE 192

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTPKCV  C K   + +++ KH+    Y I+SD + I  EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  FTVY DF  YKSGVY+H +GDV+GGHA++++GWGT ++G  YW++AN WN  WG  
Sbjct: 253 VEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDH 311

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
           GYFKI RG +ECGIE+D+ AG+P ++
Sbjct: 312 GYFKILRGKDECGIEDDINAGIPKNE 337


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 140/220 (63%), Gaps = 19/220 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 130 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 186

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C       ++  KHY  ++Y ++
Sbjct: 187 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVS 245

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +   DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 246 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 304

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           YW++ N WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 305 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 344


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 131/206 (63%), Gaps = 16/206 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 51  NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 109

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGP
Sbjct: 110 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 169

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 170 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 228

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
           G+FKI RG N CGIE ++VAG+P ++
Sbjct: 229 GFFKILRGENHCGIESEIVAGIPRTQ 254


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 131/206 (63%), Gaps = 16/206 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 57  NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 115

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGP
Sbjct: 116 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 175

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 176 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 234

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
           G+FKI RG N CGIE ++VAG+P ++
Sbjct: 235 GFFKILRGENHCGIESEIVAGIPRTQ 260


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           +FKI RG N CGIE ++VAG+P ++ 
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 95/205 (46%), Positives = 134/205 (65%), Gaps = 16/205 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           ++ +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 2   SVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCE 60

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGP
Sbjct: 61  HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 120

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 121 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 179

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
           G+FKI RG + CGIE +VVAG+P +
Sbjct: 180 GFFKILRGQDHCGIESEVVAGIPRT 204


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           +FKI RG N CGIE ++VAG+P ++ 
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  194 bits (492), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 62  NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 121

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPV
Sbjct: 122 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 181

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 182 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 240

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           +FKI RG N CGIE ++VAG+P ++ 
Sbjct: 241 FFKILRGENHCGIESEIVAGIPRTQQ 266


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  194 bits (492), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 128/207 (61%), Gaps = 15/207 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
           N+ +S  DLL CCGF CG+GC+GG+P  AW ++   G+V+         C PY       
Sbjct: 153 NVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH 212

Query: 68  DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              G   P       TPKC R C       ++  KH+  S+Y + S   +IMAEIYKNGP
Sbjct: 213 HVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGP 272

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H+TG++MGGHAV+++GWG  +DG  YW++ N WN  WG  
Sbjct: 273 VEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDS 331

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
           G+FKI RG + CGIE ++VAGLP ++ 
Sbjct: 332 GFFKILRGQDHCGIESEIVAGLPCTEQ 358


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  193 bits (490), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 100/206 (48%), Positives = 127/206 (61%), Gaps = 20/206 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
            LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y + TGC    +P C
Sbjct: 147 QLSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKTGCKPYPYPPC 204

Query: 78  E-----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
           E           P+  YPT KC R C     L +    H+  SAY ++    +I  EI  
Sbjct: 205 EHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMT 264

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVEV+F+VYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  W
Sbjct: 265 HGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G +GYF+I RG NECGIE  VV G+P
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGIP 349


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  193 bits (490), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C       ++  KH+  S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  192 bits (489), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 20/206 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
            +S+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y + +GC    +P C
Sbjct: 147 QISISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKSGCKPYPYPPC 204

Query: 78  E-----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
           E           P+  YPT KC   C     L +    H+  SAY ++  P +I  EI  
Sbjct: 205 EHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMT 264

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVEV+FTVYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  W
Sbjct: 265 HGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G +GYF+I RG NECGIE  VV G P
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGTP 349


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  192 bits (489), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C       ++  KH+  S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  192 bits (489), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 99/205 (48%), Positives = 122/205 (59%), Gaps = 17/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N+ LS  D+L+CCG  CG GC GGYPI AWRYF+ HGV T       + C PY     C 
Sbjct: 145 NVGLSATDILSCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CG 203

Query: 74  HPGCEPAY--------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           H   E  Y        PTP+C + C       + + K Y  SAY + ++ + I  EI  N
Sbjct: 204 HHRNEIYYGECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTN 263

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV+ +F VYEDF+ Y+SG+Y H  G   GGHAVKLIGWG  DDG  YW+ AN WN  WG
Sbjct: 264 GPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWLAANSWNSDWG 323

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG + CGIE  VVAG+P
Sbjct: 324 ENGYFRIVRGVDHCGIESAVVAGMP 348


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  192 bits (489), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 131/206 (63%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
            +++S  DLL CC   CG GCDGGYP +AW Y+   G+V++        C PY     C 
Sbjct: 135 QVNISAEDLLDCCDS-CGAGCDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPY-SLAPCE 192

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTPKCV  C K   + +++ KH+    Y I+S+ + I  EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  FTVY DF  YKSGVY+H +GDV+GGHA++++GWGT ++G  YW++AN WN  WG  
Sbjct: 253 VEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDH 311

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
           GYFKI RG +ECGIE+D+ AG+P  +
Sbjct: 312 GYFKILRGKDECGIEDDINAGIPKDE 337


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  192 bits (489), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 105/223 (47%), Positives = 136/223 (60%), Gaps = 23/223 (10%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE- 62
           ++R  ++S+  V   N  LS  D+L CC   + CGDGC+GGYPI AW+Y+V +G+VT   
Sbjct: 119 SDRTCIASNGVV---NTLLSAEDILTCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGS 175

Query: 63  ------CDPYFDS------TGCSHPGCEPA-YPTPKCVRKCVKKNQL---WRNSKHYSIS 106
                 C PY  +       G + P C  +   TPKCV  C   +     +   KHY  +
Sbjct: 176 YESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGAT 235

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
           AY ++   + I +EI KNGPVEV FTVY DF  YKSGVY H+ G  +GGHAVKL+GWG  
Sbjct: 236 AYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGV- 294

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           D+G  YW+ AN WN +WG +GYF+I RG NECGIE  VVAG+P
Sbjct: 295 DNGTPYWLAANSWNTNWGENGYFRILRGVNECGIESQVVAGMP 337


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  192 bits (488), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 100/208 (48%), Positives = 126/208 (60%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
           N  LS  DLL+CC   F CG+GC+GGYPI AW+++  HG+VT         C PY  +  
Sbjct: 132 NTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPC 191

Query: 70  ----TGCSHPGC-EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEI 121
                G + P C E   PTPKCV  C   +     +   KH+  +AY +    E I  EI
Sbjct: 192 GQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEI 251

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
            KNGP+EV+FTVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN 
Sbjct: 252 LKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNI 310

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 311 NWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 100/208 (48%), Positives = 126/208 (60%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
           N  LS  DLL+CC   F CG+GC+GGYPI AW+++  HG+VT         C PY  +  
Sbjct: 132 NTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPC 191

Query: 70  ----TGCSHPGC-EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEI 121
                G + P C E   PTPKCV  C   +     +   KH+  +AY +    E I  EI
Sbjct: 192 GQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEI 251

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
            KNGP+EV+FTVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN 
Sbjct: 252 LKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNI 310

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 311 NWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 94/212 (44%), Positives = 132/212 (62%), Gaps = 16/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C       ++  KHY  ++Y +++  ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+V+ DF  YKSGVY+H+TG++MGGHAV+++GWG  +D   YW++ N WN  WG  
Sbjct: 249 VEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVEND-TPYWLVGNSWNTDWGDH 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE +VVAG+P ++   K I
Sbjct: 308 GFFKILRGRDHCGIESEVVAGIPCTEQYWKRI 339


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  191 bits (484), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 126/204 (61%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  +S  DLLACC   CG+GC GG+P  AWRY+   G+VT       + C PY     C 
Sbjct: 145 NAEISAEDLLACCSS-CGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYM-IPACD 202

Query: 74  H-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H       P  +    TPKC +KC    N  +++ KHY  ++Y ++S  E IM EI  NG
Sbjct: 203 HHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSV-EKIMTEIMTNG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYEDF  YKSGVY+H TG  +GGHAVK++GWG  D+G  YWI+AN WN  WG 
Sbjct: 262 PVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWG-EDNGTPYWIVANSWNPDWGN 320

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F I RG +ECGIE  +VAGLP
Sbjct: 321 QGFFNILRGKDECGIESQIVAGLP 344


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 135/215 (62%), Gaps = 17/215 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
           LQN+ +S  DLL CCGF CG+GC+GG+P  AW ++   G+V+         C PY     
Sbjct: 128 LQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPY-SIPP 186

Query: 72  CSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C H      P C      TPKC + C    +  ++  KH+    Y + SD ++IM EIYK
Sbjct: 187 CEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEIYK 246

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVE +F+VY DF  YKSGVY+H+TG+++GGHAV+++GWG  ++G  YW++ N WN  W
Sbjct: 247 NGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGV-ENGTPYWLVGNSWNTDW 305

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G +G+FKI RG + CGIE ++VAG+P + +  + I
Sbjct: 306 GDNGFFKILRGRDHCGIESEIVAGIPCTGHYSERI 340


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 101/203 (49%), Positives = 127/203 (62%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
           +N  +S  DLL CCGF CG GC+GG    AW +F + G VT       E C PY      
Sbjct: 127 KNPHISAEDLLTCCGFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCE 186

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             ++G   P CE + PTPKC R C +  N  + + KH   S Y I +D E I  EIY NG
Sbjct: 187 HHTSGSKKP-CEGSEPTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNG 245

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY DF +YKSGVYK+ TG+ +GGHA+K++GWG  ++   YW++AN WN  WG 
Sbjct: 246 PVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENN-VPYWLVANSWNPDWGD 304

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G+FKI RGSNECGIE  VVAG+
Sbjct: 305 KGFFKILRGSNECGIEASVVAGM 327


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 124/206 (60%), Gaps = 20/206 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
            +S+S +D+ ACCG  CG+GC+GGYPI AWR++V +G VT     Y + TGC    +P C
Sbjct: 147 QVSISADDINACCGMACGNGCNGGYPIEAWRHYVKNGYVTG--GSYQEKTGCKPYPYPPC 204

Query: 78  E-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
           E               YPT KC R C     L ++   H+  SAY ++    +I  EI  
Sbjct: 205 EHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMT 264

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVEV+FTVY DF  Y  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  W
Sbjct: 265 NGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G +GYF+I RG NECGIE  VV G+P
Sbjct: 324 GENGYFRIIRGVNECGIEHGVVGGIP 349


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  190 bits (482), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 100/218 (45%), Positives = 135/218 (61%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S   +SL+   +S  DLL+CC   CG GC GGYP SAW ++   G+VT     
Sbjct: 115 SDRLCIHSGSKISLE---ISAEDLLSCCD-ECGMGCSGGYPSSAWEFWTKKGLVTGGLCG 170

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRIN 111
               C PY  +  C H      P C+    TPKC +KC+      +   KH+   +Y + 
Sbjct: 171 SEVGCRPYSIAP-CEHHVNGTRPPCQGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLP 229

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  E IM E+YKNGPVE +FTVY DF  YK+GVY+H+TG+V+GGHA+K++GWG  + G  
Sbjct: 230 SQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAIKILGWG-EESGTP 288

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG  G+FKIKRG++ECGIE ++VAG P
Sbjct: 289 YWLAANSWNGDWGDKGFFKIKRGNDECGIESEMVAGTP 326


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  190 bits (482), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 135/206 (65%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 131 NVEVSAEDLLSCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 189

Query: 74  H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H   G  PA       TP C +KC +  +  +++ K+Y  ++Y + S  ++IMAEIYKNG
Sbjct: 190 HHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNG 249

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F+VYEDF HYKSGVY+H+ G+++GGHA++++GWG  ++G  YW+ AN WN  WG 
Sbjct: 250 PVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGV-ENGIRYWLAANSWNIDWGD 308

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
           +G+FK  RG N CGIE +++AG+P +
Sbjct: 309 NGFFKFLRGKNHCGIESEIIAGIPRT 334


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  190 bits (482), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 130/204 (63%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW +++  G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TPKC + C    +  ++  KHY  ++Y ++++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPQCTGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++   YW++AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGV-ENSVPYWLVANSWNVDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
            FKI RG + CGIE ++VAG+P +
Sbjct: 309 LFKILRGEDHCGIESEIVAGIPRT 332


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 101/220 (45%), Positives = 138/220 (62%), Gaps = 20/220 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S   +SL+   +S  DLL CC   CG GC GG+P +AW ++ + G+VT     
Sbjct: 117 SDRICIQSGGKISLE---ISAEDLLTCCD-ECGMGCFGGFPSAAWEFWTNKGLVTGGLFD 172

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRIN 111
               C PY  +  C H      P C+    TPKCV +C     L +   KH+   +Y I 
Sbjct: 173 SKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCVTQCNNGYSLSYPKDKHFGQRSYSIP 231

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  E IM E+YKNGPVE +F+VY DF  YK+GVY+H+TGD++GGHAVK++GWG  ++G  
Sbjct: 232 SQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKILGWG-EENGTP 290

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           YW++AN WN  WG  G+FKIKRG++ECGIE ++VAG P S
Sbjct: 291 YWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAGAPLS 330


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 94/210 (44%), Positives = 131/210 (62%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + +++S  D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C    PTP C RKC     +++R  K Y   AY +    + I +EI KN
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             GYF+I RGSN+CGIE  + AG+  +++L
Sbjct: 313 EKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  189 bits (480), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 129/204 (63%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
           +FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  189 bits (480), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 93/207 (44%), Positives = 131/207 (63%), Gaps = 15/207 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
           ++ +S  DLL+CCGF CG GC+GGYP  AWRY+   G+V+         C PY       
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH 189

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              G   P       TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKNGP
Sbjct: 190 HVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGP 249

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  ++G  YW+ AN WN  WG +
Sbjct: 250 VEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDN 308

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
           G+FKI RG + CGIE ++VAG+P ++ 
Sbjct: 309 GFFKILRGEDHCGIESEIVAGVPRTEQ 335


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  189 bits (480), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 100/205 (48%), Positives = 129/205 (62%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N  LS  D+L+CC   CG GCDGGYPI+AW+Y V  G  T         C PY      +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189

Query: 69  STG-CSHPGC-EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           + G  + P C +  Y TP CV KC   K N  +++ KH+  +AY +      I AEI  +
Sbjct: 190 TVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAH 249

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  YKSGVY H TG  +GGHA++++GWGT D+G  YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG+NECGIE  VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  189 bits (479), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 101/211 (47%), Positives = 133/211 (63%), Gaps = 18/211 (8%)

Query: 14  SPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY 66
           SP    + + LS +DLL+CC   CG+GC+GG+P SAW ++V  G+VT       + C PY
Sbjct: 137 SPSGGPKRVHLSADDLLSCC-RTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPY 195

Query: 67  FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM 118
                C H       P  +   PTP+CV  C K   + + + KHY  S+Y + S+ + I 
Sbjct: 196 -PIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEEKQIQ 254

Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
           AEI  NGPVE  FTVY DF HYKSGVY+  T + +GGHA++L+GWG  ++G  YW+ AN 
Sbjct: 255 AEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGV-ENGVPYWLAANS 313

Query: 179 WNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           WN  WG  G+FKI RGS+ECGIE+DVVAGLP
Sbjct: 314 WNTEWGDKGFFKILRGSDECGIEDDVVAGLP 344


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  189 bits (479), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 100/201 (49%), Positives = 127/201 (63%), Gaps = 18/201 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           L+ +D+L+CC   CG GC+GG+P SAW Y+VH G+VT       E C PY     C H  
Sbjct: 174 LAADDVLSCC-TECGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHV 231

Query: 75  -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
                P  +   PTP+CVR C K   + + + KHY   AY + +  + I AEI  NGPVE
Sbjct: 232 NGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVE 291

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
             FTVYEDF HYKSGVY+  T   +GGHA++L+GWG  ++G  YW+ AN WN  WG  G+
Sbjct: 292 ADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGF 350

Query: 189 FKIKRGSNECGIEEDVVAGLP 209
           FKI RGS+ECGIE D+VAGLP
Sbjct: 351 FKILRGSDECGIESDIVAGLP 371


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 126/204 (61%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           Q+  +S  DLL+CC   CG GC GG+P  AW Y+   G+VT         C PY     C
Sbjct: 124 QSPEISAEDLLSCCD-QCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPY-SIAPC 181

Query: 73  SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H      P C     TPKC   C+ K  + ++  KH+    Y + SD + IM E+Y NG
Sbjct: 182 EHHVNGTRPPCSGEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNG 241

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYEDF  YKSGVY+H+TG  +GGHAVK++GWG  ++G  +W++AN WN  WG 
Sbjct: 242 PVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWG-EENGTPFWLVANSWNSDWGD 300

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +GYFKI RG +ECGIE ++VAGLP
Sbjct: 301 NGYFKILRGHDECGIESEMVAGLP 324


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 101/202 (50%), Positives = 130/202 (64%), Gaps = 17/202 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           + L+ +D+L+CC + CG GC+GG+P +AW Y+V  G+VT       E C PY     C H
Sbjct: 140 VHLAADDVLSCC-WGCGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPY-PVPSCDH 197

Query: 75  P------GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                   C    PTPKCVR C K  N  +++ KHY  S+Y + S+   I  EI KNGPV
Sbjct: 198 HVNGTLGPCGQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSVPSNETQIQMEIMKNGPV 257

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTVY DF  YKSGVYK  + D +GGHA++++GWG  +D   YW++AN WN  WG  G
Sbjct: 258 EGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVEND-VPYWLVANSWNTEWGDKG 316

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI RGSNECGIEED+VAG+P
Sbjct: 317 YFKILRGSNECGIEEDIVAGIP 338


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 99/202 (49%), Positives = 132/202 (65%), Gaps = 17/202 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           + L+ +D+L+CC + CG GC+GG+P +AW Y+V  G+VT       E C PY     C H
Sbjct: 139 VHLAADDVLSCC-WGCGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPY-PVPSCDH 196

Query: 75  ------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                   C    PTPKCVR C K   + +++ KHY  S+Y ++S+   I  EI KNGPV
Sbjct: 197 HVNGTLGPCGQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPV 256

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTVY DF  YKSGVYK  + D +GGHA++++GWG  ++G  +W++AN WN  WG  G
Sbjct: 257 EGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGV-ENGVPFWLVANSWNTEWGDKG 315

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI RGSNECGIEED+VAG+P
Sbjct: 316 YFKILRGSNECGIEEDIVAGIP 337


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/203 (49%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CC   CG GC+GG+P SAW Y+V  G+VT         C PY  ++ C 
Sbjct: 133 NVEISAEDLLTCCD-SCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIAS-CE 190

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP+CV  C K  N  +R  K++   +Y I+   + I  EI  NGP
Sbjct: 191 HHTKGKLPPCGDIVDTPQCVHMCEKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TG+ MGGHAV+++GWGT + G  YW++AN WN  WG  
Sbjct: 251 VEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGT-ESGTPYWLVANSWNTDWGDK 309

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI RGS+ECGIE  +VAGLP
Sbjct: 310 GYFKILRGSDECGIESSIVAGLP 332


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 92/210 (43%), Positives = 131/210 (62%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + +++S  D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C    PTP C RKC     +++R  K Y   AY +    + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRN 253

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             GYF+I RG+N+CGIE  + AG+  +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 128/203 (63%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +++S  DLL CC   CG GC+GGYP +AW ++   G+VT       + C PY+    C 
Sbjct: 134 QVNISAEDLLTCCD-SCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CE 191

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTP+CVR C K   + +   KHY+   Y +++D   I  EI+KNGP
Sbjct: 192 HHTVGPLPNCTGIKPTPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGP 251

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  FTVY DF  YKSGVY+  + D +GGHA++++GWGT ++G  YW++AN WN  WG  
Sbjct: 252 VEADFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGT-ENGVPYWLVANSWNEDWGDK 310

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI RG++ECGIE+D+ AG+P
Sbjct: 311 GYFKILRGNDECGIEDDINAGIP 333


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 102/207 (49%), Positives = 126/207 (60%), Gaps = 21/207 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q +++S +DLL+CC   CG GCDGG P +AW Y+V +G+VT     Y   +GC    +P 
Sbjct: 144 QKVTISADDLLSCCD-ECGFGCDGGDPYAAWSYWVSNGIVTGS--NYTSKSGCKPYPYPP 200

Query: 77  CE-------------PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIY 122
           CE               YPT  C  KC     +  NS KHY  S Y +  D   I  EI 
Sbjct: 201 CEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQDVASIQKEIM 260

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK++GWGT ++G DYWI AN WN  
Sbjct: 261 TNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGT-ENGTDYWICANSWNSD 319

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG +G+F+I RG +EC IE  VVAG P
Sbjct: 320 WGENGFFRILRGVDECQIESSVVAGEP 346


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  +D   YW++ N WN  WG  
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDK 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 132/205 (64%), Gaps = 16/205 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  +D   YW++ N WN  WG  
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDK 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
           G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMPCT 332


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 133/206 (64%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCG LCG+GC+GGYP  AW+Y+   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLSCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCE 188

Query: 74  H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C      TPKC + C    +  ++  K+Y  S+Y + S  ++IMAEIYKNG
Sbjct: 189 HHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F+V+ DF  YKSGVYKH+ G+V+GGHA++++GWG  ++G  YW++ N WN  WG 
Sbjct: 249 PVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
           +G+FKI RG + CGIE +VVAG+P +
Sbjct: 308 NGFFKILRGEDHCGIESEVVAGIPRT 333


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  187 bits (475), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 98/200 (49%), Positives = 127/200 (63%), Gaps = 17/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S  DLL+CC   CG GC+GGYP SAW ++   G+VT       + C PY     C H  
Sbjct: 57  ISAEDLLSCC-ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPY-KIAACDHHV 114

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C+   PTPKC RKC    N  + + KH+  SAY + SDP +I  EI  NGPVE 
Sbjct: 115 VGKLKPCKGDSPTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEG 174

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVY DF  YKSGVY+H +G  +GGHA+K++GWG  ++G  YW++AN WN  WG +G+F
Sbjct: 175 AFTVYADFPTYKSGVYQHTSGSALGGHAIKILGWG-EENGTPYWLVANSWNSDWGDEGFF 233

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KIKRG++ECGIE  +V GLP
Sbjct: 234 KIKRGNDECGIESGIVGGLP 253


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  187 bits (475), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY      +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189

Query: 69  STG-CSHPGCEP-AYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           + G  + P C    Y TP CV KC   N    +++ KH+  +AY +      I AEI  +
Sbjct: 190 TVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAH 249

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  YKSGVY H TG+ +GGHA++++GWGT D+G  YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG+NECGIE  VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  187 bits (474), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 19/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
           ++ +S  DLL+CCGF CG GC+GGYP  AWRY+   G+V+     Y    GC   + P C
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPC 187

Query: 78  E------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           E                TP+C R C    +  ++  KHY I++Y +    ++IMAEIYKN
Sbjct: 188 EHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKN 247

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  ++G  YW+ AN WN  WG
Sbjct: 248 GPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWG 306

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G+FKI RG + CGIE ++VAG+P
Sbjct: 307 ITGFFKILRGEDHCGIESEIVAGVP 331


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  187 bits (474), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 99/214 (46%), Positives = 130/214 (60%), Gaps = 20/214 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            ++LS +DLL+CC   CG GC+GG P+ AW+Y+V HG+VT       + C PY     C 
Sbjct: 171 QVTLSADDLLSCCR-TCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCE 228

Query: 74  H--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           H        P     YPTPKC +KCV   K + + + + Y  +AY + +D   I  EI  
Sbjct: 229 HHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILT 288

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVEV+F VYEDF HY  G+Y H  G + GGHAVKLIGWG  D G  YW++AN WN  W
Sbjct: 289 HGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGI-DQGTPYWLIANSWNTDW 347

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
           G +G+F+I RG +ECGIE  VV G+P S N+ + 
Sbjct: 348 GEEGFFRILRGVDECGIESGVVGGIPKSTNIQRR 381


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  186 bits (473), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 96/207 (46%), Positives = 125/207 (60%), Gaps = 24/207 (11%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG-------- 71
           Q++ LS  ++L CC   CG GC+GGYP SA  Y+V  G+VT +    +++TG        
Sbjct: 140 QDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYVKTGLVTGD---LYNTTGWCQAYSFA 195

Query: 72  -CSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
            C+H       P C    PTPKC + C     Q +  + H    AY +    E IM EI 
Sbjct: 196 PCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY--TVHKGSKAYSVGKTQEAIMTEIQ 253

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            NGPVE +FTVYEDF +YKSGVYKH+TG  +GGHA+K++GWG  ++   YWI+ N WN++
Sbjct: 254 TNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENN-TPYWIVVNSWNQT 312

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG +G FKI RG NECGIE  VV  LP
Sbjct: 313 WGDNGTFKILRGKNECGIEAQVVTALP 339


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  186 bits (473), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 133/206 (64%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCGF CG GC+GGYP  AW+++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLSCCGFECGMGCNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPY-SIPPCE 188

Query: 74  H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H   G  PA       TPKCV++C      ++ + KH+  ++Y + S  ++IMAEIYKNG
Sbjct: 189 HHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VY DF  YKSGVY+H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG 
Sbjct: 249 PVEGAFLVYADFPMYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
           +G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEIVAGIPKN 333


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  186 bits (473), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 97/207 (46%), Positives = 125/207 (60%), Gaps = 21/207 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS---- 69
           N  +S  DLL+CC   CGDGCDGGYP+ AWRY+V  G+V+         C PY  +    
Sbjct: 128 NTFVSAEDLLSCCT-SCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQ 186

Query: 70  --TGCSHPGCEPAY--PTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
              G + P C PA    TP+C   C  K+     +   KHY +SAY +      I  EI 
Sbjct: 187 TVNGVTWPKC-PAQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEIL 245

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
           ++GPVE  F VY DF  YKSG+Y H++G  +GGHAVK++GWG  ++G  YW++AN WN +
Sbjct: 246 QHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKILGWGV-ENGTKYWLVANSWNIN 304

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG  GYF+I RG NECGIE  VVAG+P
Sbjct: 305 WGEKGYFRILRGRNECGIESAVVAGIP 331


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  186 bits (473), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY      +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGE 189

Query: 69  STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           + G  + P C +  Y TP CV KC   N    +++ KH+  +AY +      I AEI  +
Sbjct: 190 TVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAH 249

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  YKSGVY H TG  +GGHA++++GWGT D+G  YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG+NECGIE  VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  186 bits (473), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 125/208 (60%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           Q++ LS  +LL CC   CGDGCDGG+P +A  Y+V+ G+VT +       C  Y  +  C
Sbjct: 141 QDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAP-C 198

Query: 73  SH-------PGCEPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
           +H       P C    PTP C+  C   +     +    H    AY I  D + IMAEIY
Sbjct: 199 AHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIY 258

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
           KNGP+EV+ TVYEDF  YK+GVY+H+TGD +GGHAVK++GWG  ++G  YW + N WN S
Sbjct: 259 KNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGV-ENGTPYWTIVNSWNES 317

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPS 210
           WG  G FKI RG NECGIE   V  LP+
Sbjct: 318 WGDKGTFKILRGKNECGIESSCVTALPA 345


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  186 bits (473), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 133/206 (64%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCG  CGDGC+GGYP +AW+Y+   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLSCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C      TPKC + C    +  ++  KH+   +Y ++S+ ++IMAEIYKNG
Sbjct: 189 HHVNGTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTV+ DF  YK+GVYKH+ G+++GGHA++++GWG  ++G  YW++ N WN  WG 
Sbjct: 249 PVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
            G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 SGFFKIVRGEDHCGIESEIVAGIPRT 333


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  186 bits (472), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 127/205 (61%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY      +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189

Query: 69  STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           + G  + P C +  Y TP CV KC  KN    +   KH+  +AY +      I AEI  +
Sbjct: 190 TVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH 249

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  YK+GVY H TG  +GGHA++++GWGT D+G  YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +GYF+I RG+NECGIE  VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  186 bits (472), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 134/212 (63%), Gaps = 16/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C       ++  KHY  S+Y ++S  ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H+TG++MGGHAV+++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 308 GFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  186 bits (471), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++ WG  ++G  YW+ AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWLAANSWNLDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
           +FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  186 bits (471), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 95/213 (44%), Positives = 133/213 (62%), Gaps = 17/213 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CC   CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C+     TPKC + C    +  ++  KHY  S+Y + S  ++IMAEIYKNG
Sbjct: 189 HHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F+VY DF  YKSGVY+H+TG+ +GGHA++++GWG  ++G  YW+ AN WN  WG 
Sbjct: 249 PVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           +G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 308 NGFFKILRGQDHCGIESEIVAGIPRTDQYWKKI 340


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  186 bits (471), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 131/203 (64%), Gaps = 17/203 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHP- 75
           +S  DLL+CC   CG GC+GG+P +AW YF   G+V+       + C PY  +  C H  
Sbjct: 175 ISSEDLLSCCSS-CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAP-CEHHV 232

Query: 76  -----GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C    PTPKC R C K  ++ + + K++  +AY +++D + IM EI  NGPVE 
Sbjct: 233 NGTRLPCSGEGPTPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEG 292

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVY DF  YKSGVY+H++G  +GGHA++++GWG  +DG  YW++AN WN  WG +G+F
Sbjct: 293 AFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGV-EDGTPYWLVANSWNSDWGDNGFF 351

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           KI RG NECGIE ++VAGLP  +
Sbjct: 352 KILRGQNECGIEGEIVAGLPKKQ 374


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 127/204 (62%), Gaps = 17/204 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 147 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHV 204

Query: 75  ----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C   C    N  +   K Y    YRI+S+PE IM E+ +NGPVEV
Sbjct: 205 IGPLPSCDGDVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEV 264

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG  GYF
Sbjct: 265 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNSDWGDKGYF 323

Query: 190 KIKRGSNECGIEEDVVAGLPSSKN 213
           KI RG NECGIE DV AG+P  KN
Sbjct: 324 KIVRGKNECGIESDVNAGIPKIKN 347


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKN PV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG   +G  YW+ AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVG-NGVPYWLAANSWNLDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
           +FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 14/204 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY       
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
               S P C     T +C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
           +FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 127/206 (61%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +++S  DLL CC   CG GC+GG P +AW Y+   G+VT       + C PY     C 
Sbjct: 135 QVNISAEDLLDCCDS-CGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPY-SLAPCE 192

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTPKCV  C K   + +++ KH+    Y I+SD + I  EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  F V  DF  YKSGVY+H + DV+GGHA++++GWGT ++G  YW+ AN WN  WG  
Sbjct: 253 VEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGT-ENGTPYWLAANSWNEDWGDH 311

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
           GYFKI RG +ECGIEED+ AG+P ++
Sbjct: 312 GYFKILRGKDECGIEEDINAGIPKNR 337


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 131/205 (63%), Gaps = 16/205 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CC   CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
           G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMPCT 332


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  185 bits (469), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 93/192 (48%), Positives = 120/192 (62%), Gaps = 17/192 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PG 76
           LS  D+L+CC   CG GC+GG+P  AWR+F  HG+ TE   PY     C H         
Sbjct: 68  LSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVFPP-CEHHINKTHYKP 126

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
           C P+ PTPKCVR   KK       +++  S Y ++  P  I AEI  NGPVE +FTVY+D
Sbjct: 127 CGPSQPTPKCVRASEKK------PRYHGKSVYSVS--PAKIQAEIMTNGPVEAAFTVYQD 178

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           F  Y+SGVY+H++G  +GGHA+K++GWG  + G  YW++AN WN  WG  G FKI RG +
Sbjct: 179 FLAYQSGVYRHVSGPELGGHAIKIMGWGV-EAGNKYWLVANSWNEDWGDKGTFKIARGDD 237

Query: 197 ECGIEEDVVAGL 208
           ECGIE  VVAG+
Sbjct: 238 ECGIESSVVAGM 249


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  185 bits (469), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 100/214 (46%), Positives = 130/214 (60%), Gaps = 23/214 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           + ++LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P 
Sbjct: 38  KQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPP 94

Query: 77  CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPTPKCV+KC K   + ++  K+Y  S Y + S+ E I  EI 
Sbjct: 95  CEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGQSVYNVESNVESIQKEIM 154

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             GPVE SF VY DF +Y  G+YKH+ G + GGHAVK++GWG  D G  YW+ AN WN  
Sbjct: 155 TLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWLAANSWNTD 213

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
           WG DGYF+I RG NECGIE  ++AG+P  K L K
Sbjct: 214 WGEDGYFRILRGVNECGIESGIIAGIP--KQLAK 245


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  185 bits (469), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 97/205 (47%), Positives = 126/205 (61%), Gaps = 19/205 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +N+ +S  DL +CC   CG+GC+GG+P +AW Y+   G+VT       + C PY     C
Sbjct: 138 ENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPY-TIKAC 195

Query: 73  SH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H       P  +   PTPKC   C    N  +   KHY +SAY ++   E IM EI  N
Sbjct: 196 DHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAYSVHG-VEKIMTEIMTN 254

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT ++G+DYW++AN WN  WG
Sbjct: 255 GPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-ENGDDYWLVANSWNPDWG 313

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G+FKI RG +ECGIE  + AG P
Sbjct: 314 DQGFFKILRGQDECGIESQISAGEP 338


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  185 bits (469), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 16/204 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST-- 70
           Q+  +S  DL+ACC   CG GC+GGY  +AWRYF H G+VT       E C PY  ++  
Sbjct: 125 QSAHISAEDLMACCE-TCGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCD 183

Query: 71  ----GCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
               G   P       TP+C + C     + +   KH+  SAY + S  E I  EI  NG
Sbjct: 184 HHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNG 243

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY DF  YKSGVY+H +G ++GGHA++++GWGT ++G  YW++AN WN  WGA
Sbjct: 244 PVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGA 302

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            GYFKI RG ++CGIE  + AG+P
Sbjct: 303 MGYFKIIRGKDDCGIESQITAGMP 326


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  DLL+CC   CG GC GGYP +AW Y+   G+VT       + C PY     C 
Sbjct: 128 TVEISAEDLLSCCE-ECGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCE 185

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C+    TPKC  KC+      +   K++    Y + S  E IM E+YKNGP
Sbjct: 186 HHVNGTRPPCQGEGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGP 245

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VYEDF  YKSGVY+H+TGD++GGHA+K++GWG  ++   YW+ AN WN  WG  
Sbjct: 246 VEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENN-TPYWLAANSWNTDWGNQ 304

Query: 187 GYFKIKRGSNECGIEEDVVAGLPS 210
           G+FKI RG +ECGIE +VVAG+P 
Sbjct: 305 GFFKILRGGDECGIESEVVAGIPQ 328


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 132/206 (64%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCGF CG GC+GGYP  AWR++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C+     TPKC++ C +     + + KH+  ++Y + S  ++IMA+IYKNG
Sbjct: 189 HHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VY DF  YKSGVY+H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG 
Sbjct: 249 PVEGAFVVYADFPLYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
           +G+FKI RG + CGIE +VVAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEVVAGIPKN 333


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/195 (48%), Positives = 121/195 (62%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY     C+
Sbjct: 134 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 192

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+
Sbjct: 193 SGNC-PESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFS 251

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI 
Sbjct: 252 VYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIY 310

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 311 RGDDQCGIESAVVAG 325


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 99/214 (46%), Positives = 129/214 (60%), Gaps = 23/214 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           + ++LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P 
Sbjct: 172 KQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPP 228

Query: 77  CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPTPKCV+KC K   + ++  K+Y    Y + S+ E I  EI 
Sbjct: 229 CEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIM 288

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             GPVE SF VY DF +Y  G+YKH+ G + GGHAVK++GWG  D G  YW+ AN WN  
Sbjct: 289 TLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWLAANSWNTD 347

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
           WG DGYF+I RG NECGIE  ++AG+P  K L K
Sbjct: 348 WGEDGYFRILRGVNECGIESGIIAGIP--KQLAK 379


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  184 bits (467), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 99/223 (44%), Positives = 139/223 (62%), Gaps = 19/223 (8%)

Query: 2   SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
           S   ++R  + S+  +S++   LS  DLL+CC   CG GC+GGYP +AW ++   G+V+ 
Sbjct: 29  SEAMSDRICIHSNAKISVE---LSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVSG 84

Query: 62  E-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISA 107
                   C PY           S P C      TP+CV +C       ++  KHY  ++
Sbjct: 85  GLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKTS 144

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
           Y ++SD +DI  EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+K++GWG  +
Sbjct: 145 YSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-EE 203

Query: 168 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           +G  YW+ AN WN  WG +G+FKI RGSN CGIE ++VAG+P+
Sbjct: 204 NGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 99/218 (45%), Positives = 131/218 (60%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
           ++R  ++S+  V +    LS  +L+ACC   CG GC GG+P +AW Y+   G+VT     
Sbjct: 118 SDRTCVASNGKVQVH---LSSENLMACCE-TCGMGCHGGFPEAAWEYWKQDGLVTGGPYG 173

Query: 61  --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
             + C PY +   C H      P C    PTP+C + C    N  +   KHY+ SAY ++
Sbjct: 174 SMQGCQPY-EIAPCEHHINGSRPACGKIEPTPRCKKTCESGYNVTFNKDKHYAKSAYSVS 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  + I  EI  NGPVE +FTVY DF HYKSGVY+H +G  +GGHAVK+IGWG  +    
Sbjct: 233 SKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGM-EGSTP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW++AN WN  WG  G+FKI RG +ECGIE D+VAG P
Sbjct: 292 YWLIANSWNSDWGDMGFFKILRGQDECGIERDIVAGEP 329


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 95/204 (46%), Positives = 132/204 (64%), Gaps = 22/204 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S  D+L+CCG  CG GC GGY I A +Y+++ GVVT        C PY      S
Sbjct: 144 QQPIISAEDILSCCGSTCGKGCQGGYTIEAMKYWMNSGVVTGGDYNGAGCMPY------S 197

Query: 74  HPGCEPA----YPTPKCVRKCVKKNQL--WRNSKHYSISAYRINSDPE---DIMAEIYKN 124
            P C+ +    + TP C   C +K     ++N KH++ SAY++++       I  EIY N
Sbjct: 198 FPPCKKSPCVEFSTPSCKTTCQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHN 257

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE S+ V+EDF  YKSGVY H++G+++GGHAVK+IGWGT ++G DYW++AN W  S+G
Sbjct: 258 GPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGT-ENGVDYWLVANSWGTSFG 316

Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
             G+FKI+RG+NEC IE ++VAGL
Sbjct: 317 EKGFFKIRRGTNECQIESNIVAGL 340


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  184 bits (466), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 99/208 (47%), Positives = 129/208 (62%), Gaps = 17/208 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
           L    LS  +L+ACC   CG GC+GG+P SAW Y+   G+VT +       C PY +   
Sbjct: 140 LHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPY-EFPP 197

Query: 72  CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           C H      P CE    TPKC   C    N  +   K Y  + YR++S+ E IM E+ ++
Sbjct: 198 CEHHVVGPRPSCEGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEH 257

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++G  YW++AN WN  WG
Sbjct: 258 GPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWG 316

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
            +GYFKI RG NECGIE DV AG+P  K
Sbjct: 317 DNGYFKIIRGRNECGIESDVNAGIPKLK 344


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  184 bits (466), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 124/200 (62%), Gaps = 17/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S  DL +CC   CG+GC+GG+P +AW Y+   G+VT       + C PY +   C H  
Sbjct: 133 ISAEDLNSCCKS-CGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHI 190

Query: 75  ----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C    PTP+C + C    N  +   KHY+ +AY ++S  + I  EI  NGPVE 
Sbjct: 191 NGSRPACGKLEPTPRCKKSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEA 250

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVY DF HYKSGVY+H +G  +GGHAVK+IGWGT +    YW++AN WN  WG  G+F
Sbjct: 251 AFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGT-EGSTPYWLIANSWNTDWGNMGFF 309

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI RG +ECGIE D+VAG P
Sbjct: 310 KILRGQDECGIERDIVAGEP 329


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  184 bits (466), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 120/194 (61%), Gaps = 20/194 (10%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 78
           LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y D TGC    +P CE
Sbjct: 148 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCKPYPYPPCE 205

Query: 79  -----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                      P+  YPT KC R C     L ++   H+  SAY ++    +I  EI  +
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTH 265

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVEV+FTVYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  WG
Sbjct: 266 GPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324

Query: 185 ADGYFKIKRGSNEC 198
            +GYF+I RG NEC
Sbjct: 325 ENGYFRIIRGVNEC 338


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  183 bits (465), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V       T+ C PY +   C
Sbjct: 131 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPY-EIPPC 188

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC++KC    N  ++  KHY    Y +    + I AE+YKNG
Sbjct: 189 EHHVPGNRLPCSGDTKTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKH+ GD +GGHA+K++GWG  ++G  YW++AN WN  WG 
Sbjct: 249 PVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEP 331


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  183 bits (465), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 97/200 (48%), Positives = 123/200 (61%), Gaps = 17/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           +S  DL+ CC   CG GC GGYP  AW Y+V +G+VT       + C PY     C H  
Sbjct: 141 ISPEDLVDCCAD-CGMGCQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPY-SFPPCEHHV 198

Query: 77  CEPAYP------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
             P  P      TP+CV+KC  +  + + N K Y + AY I+SD E IM ++   GP+EV
Sbjct: 199 VGPRKPCTGDPTTPQCVKKCQPEYPKTYENDKWYGLKAYSIHSDQEAIMRDLMTYGPLEV 258

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF  Y SGVY+H+ G ++GGHAV+L+GWG  +DG DYW++AN WN  WG  GYF
Sbjct: 259 DFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGV-EDGADYWLIANSWNTDWGDGGYF 317

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI+RG NECGIE D  AG P
Sbjct: 318 KIRRGVNECGIESDANAGHP 337


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  183 bits (464), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 130/210 (61%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + +++S  D++ CC   CGDGC+GG+PI AW+YF++ GVV+         C PY     C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPY-PIHPC 193

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C    PTP C ++C     +++R  K Y   AY +    + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRN 253

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             GYF+I RG+N+CGIE  + AG+  +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  183 bits (464), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 95/191 (49%), Positives = 121/191 (63%), Gaps = 12/191 (6%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
           +S  D++ CCG  CG GCDGGY I A R++V  GVVT      + C PY     C+  GC
Sbjct: 140 ISPMDMVDCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCKPY---QFCNSAGC 196

Query: 78  EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
            P   TP+C   C  K N  +   K++  SAY +      I  +I  NGPVE SF VYED
Sbjct: 197 -PDAVTPECALSCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYED 255

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           F  YKSGVYK+I G ++GGHA+K+IGWGT ++G  YW++AN W   WG +G+FKI+RG N
Sbjct: 256 FYKYKSGVYKYIAGKMLGGHAIKIIGWGT-ENGTAYWLIANSWGTKWGENGFFKIRRGVN 314

Query: 197 ECGIEEDVVAG 207
           ECGIE +VVAG
Sbjct: 315 ECGIENNVVAG 325


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  183 bits (464), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 97/226 (42%), Positives = 142/226 (62%), Gaps = 19/226 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 118 SDRICIRTNGHVSVE---VSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C       ++  KHY  S+Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHAV+++GWG  ++G  
Sbjct: 234 SSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
           YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P +    K+
Sbjct: 293 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTDQYWKK 338


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  182 bits (463), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 129/212 (60%), Gaps = 30/212 (14%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     Y+DS    H GC P 
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP- 181

Query: 81  YPTPKC----------------VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMA 119
           Y  P C                 R+C K  +      ++  KH+  ++Y +++  + IMA
Sbjct: 182 YTIPPCEHHVNGSRPPCTGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMA 241

Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
           EIYKNGPVE +FTV+ DF  YKSGVYKH  GD+MGGHA++++ WG  ++G  YW  AN W
Sbjct: 242 EIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWAAANSW 300

Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
           N  WG +G+FKI RG N CGIE ++VAG+P +
Sbjct: 301 NLDWGDNGFFKILRGENHCGIESEIVAGIPRT 332


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  182 bits (462), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 96/227 (42%), Positives = 140/227 (61%), Gaps = 18/227 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 118 SDRICIRTNGHVSVE---VSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYD 174

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TPKC + C    +  ++  KHY  S+Y ++
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVS 234

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  ++IMAEI+KNGPVE +FTVY DF  YKSGVY+H+ GD+MGGHAV+++GWG  ++G  
Sbjct: 235 SSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTP 293

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P +    K I
Sbjct: 294 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTDQYWKRI 340


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 127/204 (62%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 131 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 188

Query: 73  SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKCV++C    ++ ++  KHY    Y +    + I AE+YKNG
Sbjct: 189 EHHVPGNRLPCSGDTKTPKCVKECESGYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKH+TGD +GGHA+K++GWG  ++G  YW++AN WN  WG 
Sbjct: 249 PVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEP 331


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 120/208 (57%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST- 70
           N  LS  D+L CC   F CGDGC+GGYPI AWRY+V +G+VT         C PY  +  
Sbjct: 123 NTLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPC 182

Query: 71  -----GCSHPGCEPAYP-TPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
                G + P C      TPKC   C   N     +   KH+  SAY I    + I  EI
Sbjct: 183 GETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEI 242

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             +GPVEV F VYEDF  YK+G+Y H+ G  +GGHAVK++GWG  D+G  YW+ AN WN 
Sbjct: 243 LAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWLAANSWNT 301

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WG  GYF+I RG +ECGIE   VAG+P
Sbjct: 302 VWGEKGYFRILRGVDECGIESAAVAGMP 329


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/218 (44%), Positives = 132/218 (60%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
           T+R  + S+  V +    +S  DL+ CC   CG GC+GG+   AW Y+V++G+VT     
Sbjct: 120 TDRICIHSNGKVKVH---ISAEDLMTCCT-SCGMGCNGGFLPQAWHYWVNNGIVTGGQYH 175

Query: 61  --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
             + C PY +   C H        C    PTPKC +KC    N+ +   KH+   +Y I 
Sbjct: 176 SHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPGYNKTFNQDKHFGKKSYSIT 234

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           ++ + I  EI  NGPVE +FTVY DF  YKSGVY+H TG  +GGHAVK++GWGT ++   
Sbjct: 235 NNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENN-TP 293

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW++AN WN +WG  GYFKI RG +ECGIE  +VAG+P
Sbjct: 294 YWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMP 331


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 101/207 (48%), Positives = 125/207 (60%), Gaps = 21/207 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q +++S +DLL+CC   CG GCDG  P +AW Y+V +G+VT     Y   +GC    +P 
Sbjct: 29  QKVTISADDLLSCCD-ECGFGCDGRDPYAAWSYWVSNGIVTGS--NYTSKSGCKPYPYPP 85

Query: 77  CE-------------PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIY 122
           CE               YPT  C  KC     +  NS KHY  S Y +  D   I  EI 
Sbjct: 86  CEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQDVASIQKEIM 145

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK++GWGT ++G DYWI AN WN  
Sbjct: 146 TNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGT-ENGTDYWICANSWNSD 204

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG +G+F+I RG +EC IE  VVAG P
Sbjct: 205 WGENGFFRILRGVDECEIESGVVAGEP 231


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  182 bits (461), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/203 (46%), Positives = 125/203 (61%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
           ++ +S  DLL CC   CG GC+GGYP +AW ++   G+VT         C PY       
Sbjct: 129 SVEISSQDLLTCCDS-CGMGCNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH 187

Query: 68  DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              G   P       TP+CV +C       ++  KHY  ++Y + S+ E I +EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGP 247

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F VYEDF  YKSGVY+H+TG  +GGHA+K+IGWG  ++G  YW+ AN WN  WG +
Sbjct: 248 VEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWG-EENGVPYWLCANSWNTDWGDN 306

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RGSN CGIE +VVAG+P
Sbjct: 307 GFFKILRGSNHCGIESEVVAGIP 329


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/195 (48%), Positives = 117/195 (60%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C       +   KH+  SAY +      I  EI  NGPVE +FT
Sbjct: 194 SGSC-PESKTPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFT 252

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+FKI 
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIF 311

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 312 RGDDQCGIESAVVAG 326


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHNT 205

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           KI RG NECGIE DV AG+P  K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  181 bits (460), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/205 (47%), Positives = 124/205 (60%), Gaps = 19/205 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +N  +S  DL +CC   CG+GC+GG+P +AW Y+   G+VT       + C PY     C
Sbjct: 138 ENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPY-TIKAC 195

Query: 73  SH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H       P  +   PTPKC   C    N  +   KHY  SAY ++   E IM EI  N
Sbjct: 196 DHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAYSVHG-VEKIMTEIMTN 254

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT ++G+DYW++AN WN  WG
Sbjct: 255 GPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-ENGDDYWLVANSWNPDWG 313

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G+FKI RG +ECGIE  + AG P
Sbjct: 314 DQGFFKILRGQDECGIESQISAGEP 338


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 148 LSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           KI RG NECGIE DV AG+P  K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 132/206 (64%), Gaps = 17/206 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCG  CG GC+GGYP  AW+++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDLLSCCGDECGMGCNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H   G  PA       TPKCV++C +  +  +   KH+  ++Y + +  ++IMAEIYKNG
Sbjct: 189 HHVNGSRPACKGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VY DF  YKSGVY+H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG 
Sbjct: 249 PVEGAFLVYADFPLYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
           +G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEIVAGVPKN 333


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 98/210 (46%), Positives = 127/210 (60%), Gaps = 24/210 (11%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           + + LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P 
Sbjct: 170 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTG--SDYTNHSGCRPYPFPP 226

Query: 77  CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPTPKC ++C K   + ++  K+Y   AY + +D E I  EI 
Sbjct: 227 CEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIM 286

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             GPVE SF VY DF HY SG+YKH+ G V GGHAVK++GWG  D G  YW+ AN WN  
Sbjct: 287 TLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNND 345

Query: 183 WGAD---GYFKIKRGSNECGIEEDVVAGLP 209
           WG D   GYF+I RG++ECGIE  +VAG+P
Sbjct: 346 WGEDVFSGYFRILRGADECGIESGIVAGIP 375


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
           D +  + +  LQ ++LS +DLL+CC   CG GC+GG P++AWRY+V  G+VT        
Sbjct: 143 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 200

Query: 62  ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
            C PY     C H        P     YPTPKC +KCV    ++ +   K +  SAY + 
Sbjct: 201 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 259

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
            D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  DDG  
Sbjct: 260 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 318

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           YW +AN WN  WG DG+F+I RG +ECGIE  VV G+P   +L 
Sbjct: 319 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 362


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 99/218 (45%), Positives = 134/218 (61%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  V ++   +S  DLL+CC   CG GCDGG+P SAW ++V  G+ T     
Sbjct: 34  SDRHCIHSNGKVKIE---VSPEDLLSCCS-SCGMGCDGGFPPSAWEFWVDKGIATGGLWN 89

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY +   C H      P C     TPKCV  C K  N  +R+ KH+   +Y I 
Sbjct: 90  SHIGCQPY-EIPACEHHTTGDRPPCSDIVDTPKCVHLCEKGYNTSYRDDKHFGKKSYSIE 148

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S  + I  EI+KNGPVE +F+VY DF +YKSGVY+H +G+ +GGHA++++GWG  +D   
Sbjct: 149 SLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVLGWGYEND-VP 207

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG  GYFKI RGS+ECGIE  +VAG+P
Sbjct: 208 YWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAGIP 245


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           KI RG NECGIE DV AG+P  K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
           D +  + +  LQ ++LS +DLL+CC   CG GC+GG P++AWRY+V  G+VT        
Sbjct: 144 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201

Query: 62  ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
            C PY     C H        P     YPTPKC +KCV    ++ +   K +  SAY + 
Sbjct: 202 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 260

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
            D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  DDG  
Sbjct: 261 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 319

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           YW +AN WN  WG DG+F+I RG +ECGIE  VV G+P   +L 
Sbjct: 320 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 363


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/195 (48%), Positives = 117/195 (60%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C       +   KH+  SAY +      I  EI  NGPVE +FT
Sbjct: 194 SGSC-PESKTPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFT 252

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+FKI 
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIF 311

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 312 RGDDQCGIESAVVAG 326


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + LS  +L++CC   CG GCDGGYP SAW Y+ + G+V+       + C PY  +  C 
Sbjct: 131 QVHLSAENLVSCCDS-CGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAP-CE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP C  +C K++ + +    +Y  SAY +  + + I AEI KNGP
Sbjct: 189 HHVPGPRPACSGEGSTPDCRNQCDKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVYED  +YK GVY+H+ G V+GGHA+K++GWG  +D   YW++AN WN  WG +
Sbjct: 249 VEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVEND-TPYWLVANSWNTDWGNN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG +ECGIE DV AGLP
Sbjct: 308 GFFKILRGKDECGIEIDVSAGLP 330


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           KI RG NECGIE DV AG+P  K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 126/203 (62%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +++S  DLL CC   CG GC+GGYP +AW+++   G+VT       + C PY+    C 
Sbjct: 13  QVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPP-CE 70

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTP+C + C +   + +   KH+    Y I+SD   I  EI KNGP
Sbjct: 71  HHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGP 130

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  F VY DF  YKSGVY+  + +++GGHA++++GWGT +DG  YW++AN WN  WG  
Sbjct: 131 VEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDK 189

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI+RG++ECGIE D+ AG+P
Sbjct: 190 GYFKIRRGNDECGIENDINAGIP 212


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
           D +  + +  LQ ++LS +DLL+CC   CG GC+GG P++AWRY+V  G+VT        
Sbjct: 134 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 191

Query: 62  ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
            C PY     C H        P     YPTPKC +KCV    ++ +   K +  SAY + 
Sbjct: 192 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 250

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
            D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  DDG  
Sbjct: 251 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 309

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           YW +AN WN  WG DG+F+I RG +ECGIE  VV G+P   +L 
Sbjct: 310 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 353


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 127/208 (61%), Gaps = 17/208 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
           L    LS  +L+ACC   CG GC+GG+P SAW Y+   G+VT +       C PY +   
Sbjct: 140 LHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPY-EFPP 197

Query: 72  CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           C H      P C     TPKC   C    N  +   K Y  + YR++S+ E IM E+  +
Sbjct: 198 CEHHVVGPRPSCGGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDH 257

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++G  YW++AN WN  WG
Sbjct: 258 GPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWG 316

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
            +GYFKI RG NECGIE DV AG+P  K
Sbjct: 317 DNGYFKIIRGRNECGIESDVNAGIPKLK 344


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/203 (47%), Positives = 123/203 (60%), Gaps = 18/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + LS +D+L+CC + CGDGCDGGYPISAW YFV  GVVT       + C PY +   C
Sbjct: 143 KTVELSADDILSCC-YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPY-EIPPC 200

Query: 73  SHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H   E  Y        TP CV  C     + + + K +   +Y I S    I  EI   
Sbjct: 201 GHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTY 260

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +F VYEDF HY  G+YKH++G   GGHAV+++GWG  + G  YW++AN WN  WG
Sbjct: 261 GPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWG-EEKGTAYWLVANSWNTDWG 319

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +GYF+I RGSNECGIEE+VVAG
Sbjct: 320 ENGYFRILRGSNECGIEENVVAG 342


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/204 (47%), Positives = 128/204 (62%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           ++ +S  DLL CC   CG GC+GGYP SAW ++   G+V+         C PY  S  C 
Sbjct: 129 SVEISSEDLLTCCDS-CGMGCNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISP-CE 186

Query: 74  H------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C      TP+C+ +C    +  ++  KHY  S+Y +    E I AEI KNG
Sbjct: 187 HHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNG 246

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYEDF  YKSGVY+H++G V+GGHA+K++GWG  +DG  YW+ AN WN  WG 
Sbjct: 247 PVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWG-EEDGIPYWLCANSWNTDWGD 305

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RGSN CGIE ++VAG+P
Sbjct: 306 NGFFKILRGSNHCGIESEIVAGIP 329


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 94/195 (48%), Positives = 116/195 (59%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY     C+
Sbjct: 171 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 229

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C       +   KH+  SAY +      I  EI  NGPVE +FT
Sbjct: 230 SGNC-PESKTPSCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFT 288

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+F+I 
Sbjct: 289 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGNSWGESGFFRIF 347

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 348 RGDDQCGIESAVVAG 362


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 118/201 (58%), Gaps = 16/201 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           +S+S  DL  CC + CGDGC+GG+P  AW Y+   G+VT       + C  Y     C H
Sbjct: 136 VSISTEDLNTCC-YECGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCKAY-TVPPCEH 193

Query: 75  ------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
                 P C    PTP+C ++C     +   S     SAY+ +SD   I  EI  NGPVE
Sbjct: 194 HTEGDLPACGDIVPTPQCKKECDAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVE 253

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
             F VYEDF +YKSGVY+  TG+  GGHA+K++GWG  +DG  YW+ AN WN  WG  GY
Sbjct: 254 ADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGV-EDGTPYWLAANSWNEDWGDKGY 312

Query: 189 FKIKRGSNECGIEEDVVAGLP 209
           FKI RG NECGIE D++ G+P
Sbjct: 313 FKILRGQNECGIESDIIGGIP 333


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  180 bits (457), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 126/203 (62%), Gaps = 15/203 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE------------ECDPYFDS 69
           + LS +DLL+CC   CG GC+GG+P  AW ++ H G+V+             E  P    
Sbjct: 151 VRLSADDLLSCCRD-CGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHH 209

Query: 70  TGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
              + P CE   PTPKC   C ++ ++ ++  KHY++  Y ++S+ + I  E+  +GPVE
Sbjct: 210 VNGTRPPCEGDAPTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVE 269

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
             F VY DF  YKSGVY+H++G ++GGHA+KL+GWG  +DG  YW+ AN WN  WG  G+
Sbjct: 270 ADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWG-EEDGVPYWLCANSWNTDWGEGGF 328

Query: 189 FKIKRGSNECGIEEDVVAGLPSS 211
           FKI RG N CGIE D+VAG+P +
Sbjct: 329 FKILRGKNHCGIESDIVAGIPQN 351


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 16/205 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
           + +S +DLL+CCG  CG GC+GG P +AWRY+   G+V+         C PY +   C H
Sbjct: 135 VRISADDLLSCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPY-EIPPCEH 193

Query: 75  ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 P C+    TPKC R+CV+  +  ++  KH++ + Y + +  EDIM EI   GPV
Sbjct: 194 HTSGNRPDCKGNSKTPKCQRQCVESFDGKYQADKHFASNVYNVRASEEDIMNEILVYGPV 253

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E  F VY DF  YKSGVY+H+ G  +GGHAVK++GWG  ++G  YW+ AN WN  WG  G
Sbjct: 254 EADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWG-EENGVPYWLCANSWNTDWGDGG 312

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSK 212
           +FKI RG N C IE D+ AG+P  +
Sbjct: 313 FFKILRGYNHCKIEADINAGIPKIR 337


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           L+  D L+CC + CG GC GGYP  AW Y++  G+VT         C P+   T C H G
Sbjct: 139 LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 196

Query: 77  -------CEP-AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                  C    YPTP C R C    N+ +   K Y  S+Y +      IM EI KNGPV
Sbjct: 197 DSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 256

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           EV+F +++DF  Y+SG+Y H+ G  +G HAV++IGWG  ++G +YW++AN WN  WG +G
Sbjct: 257 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENG 315

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF++ RG NECGIE +VVAG+P
Sbjct: 316 YFRMVRGRNECGIESEVVAGMP 337


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 18/203 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           + L+ +D+L+CC   CG GC+GG+P +AW Y+VH G+VT       E C PY     C H
Sbjct: 140 VHLAADDVLSCC-MSCGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDH 197

Query: 75  -------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
                  P  +   PTP+CVR C K  N  + + KHY   +Y + S+   I  EI  NGP
Sbjct: 198 HVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNVTQIQVEIMTNGP 257

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  FTVY DF  YKSGVY+  T   +GGHA++L+GWG  + G  YW+ AN WN  WG  
Sbjct: 258 VEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGV-EKGVPYWLAANSWNTEWGDK 316

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RGS+ECGIE+DVVAG+P
Sbjct: 317 GFFKILRGSDECGIEDDVVAGIP 339


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 95/208 (45%), Positives = 125/208 (60%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           Q++ LS  +L+ CC   CG GCDGG+P +A  Y+V++G+VT +       C  Y     C
Sbjct: 141 QDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAY-SLAPC 198

Query: 73  SH-------PGCEPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
           +H       P C    PTP CV+ C   +     +    H    AY I+ + + IM EI 
Sbjct: 199 AHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQ 258

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            NGP+EV+FTVYEDF  YKSGVY+H+TG  +GGHAVK++GWG  ++G  YWI+ N WN S
Sbjct: 259 TNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGV-ENGTPYWIIVNSWNES 317

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPS 210
           WG  G FKI RG NECGIE + V  LP+
Sbjct: 318 WGDKGTFKILRGQNECGIESECVTALPA 345


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  179 bits (455), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 128/206 (62%), Gaps = 18/206 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC       
Sbjct: 140 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAP 196

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TP CV+KC +  ++ +    H+  SAY I +D + I  EIY N
Sbjct: 197 CEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN  WG
Sbjct: 257 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWG 316

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
           +DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 317 SDGFFKILRGSDECGIEGQINAGLPA 342


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  179 bits (455), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 94/195 (48%), Positives = 118/195 (60%), Gaps = 10/195 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S +DLL+CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C    +  +   KH+  SAY +      I  EI  NGPVE +FT
Sbjct: 194 SGNC-PESKTPACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFT 252

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI 
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTNWGESGFFKIL 311

Query: 193 RGSNECGIEEDVVAG 207
           RG ++CGIE  VVAG
Sbjct: 312 RGDDQCGIEGAVVAG 326


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  179 bits (454), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 131/222 (59%), Gaps = 19/222 (8%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
           + ++R  + S   +S++   LS  +LL+CC   CG GC GG P  AW Y+ + G+VT   
Sbjct: 51  SMSDRICIHSKNKISVE---LSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVTGGS 106

Query: 61  ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAY 108
                 C PY        S+  S+P CE  Y PTP+C   C     + ++  K Y  S+Y
Sbjct: 107 NETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKSSY 166

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
            + S+   IM EI  NGPVE  F VYEDF +YKSGVYKHITG  +GGHA+++IGWG   +
Sbjct: 167 NVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQN 226

Query: 169 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
              YW+ AN WN  WG  GYFKI RG+NECGIE  V AGLP+
Sbjct: 227 HIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPN 268


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  179 bits (454), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V       T+ C PY +   C
Sbjct: 132 KHFHFSSEDLLSCCP-ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPY-EIPPC 189

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N +++  K Y    Y +++  + I AE+YKNG
Sbjct: 190 EHHVPGNRMPCSGDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNG 249

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKHI GD +GGHA+K++GWG  +D + YW++AN WN  WG 
Sbjct: 250 PVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK-YWLVANSWNTDWGD 308

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG N CGIE  ++AG P
Sbjct: 309 NGFFKILRGENHCGIEGSIIAGEP 332


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  179 bits (454), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 127/203 (62%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----D 68
           N   S +DL++CC + CG GC+GGYP +AW Y+V  G+V+       + C PY       
Sbjct: 127 NFHFSSDDLVSCC-WTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH 185

Query: 69  STGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
            T  S P C+ +   TPKC + C    ++ + N  H+   AY I+SD + I AEI +NGP
Sbjct: 186 HTNGSRPACDASEGNTPKCAKSCESNYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGP 245

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF +YK+GVY+HI G  +GGHA+++ GWG  ++   YW++AN WN  WG  
Sbjct: 246 VEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENN-TPYWLIANSWNTDWGDS 304

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI RGS+ CGIE  +VAGLP
Sbjct: 305 GTFKILRGSDHCGIESGIVAGLP 327


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/204 (46%), Positives = 128/204 (62%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CC   CG GC+GGYP +AW ++   G+V+         C PY  +  C 
Sbjct: 129 NVEISSEDLLTCCDS-CGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAP-CE 186

Query: 74  H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C      TP+CVR+C       +   KHY  ++Y + SD + I  EIYKNG
Sbjct: 187 HHVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNG 246

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYEDF  YK+GVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG 
Sbjct: 247 PVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDWGD 305

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +GYFKI RGS+ CGIE ++VAG+P
Sbjct: 306 NGYFKILRGSDHCGIESEIVAGIP 329


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 93/210 (44%), Positives = 126/210 (60%), Gaps = 17/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + +++S  DL+ CC   CG GCDGG+ I AW YF + G+V+         C PY     C
Sbjct: 134 KQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPY-PIHPC 192

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TP C +KC     +L+R  K Y   A+++    E I  E+ KN
Sbjct: 193 GHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKN 252

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  SF VYEDF+ YKSG+Y+H  G++ G HAVK+IGWGT ++  DYW++AN W+  WG
Sbjct: 253 GPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGT-ENRTDYWLIANSWHDDWG 311

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
            +GYF+I RG N+CGIEE+V AGL   ++L
Sbjct: 312 ENGYFRIIRGINDCGIEENVAAGLIDVESL 341


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/222 (44%), Positives = 134/222 (60%), Gaps = 20/222 (9%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
           + ++R  + S   +S++   LS  +LL+CC   CG GC+GG P  AW Y+   G+VT   
Sbjct: 128 SMSDRICIHSKGRISIE---LSAVNLLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGS 183

Query: 61  ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAY 108
                 C PY        ST  +H  CE  Y  TP+C + C     + + N K+Y  S+Y
Sbjct: 184 NETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSY 243

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD- 167
            + SD   IM EI  NGPVE +F V++DF +YK+GVYK++TG ++GGHA+++IGWG S  
Sbjct: 244 YVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTL 303

Query: 168 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +   YW+ AN WN+ WG  GYFKI RGSNECGIE  V AGLP
Sbjct: 304 NHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMVTAGLP 345


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/202 (47%), Positives = 127/202 (62%), Gaps = 24/202 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYP 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE+C PY FD   CSH G    YP
Sbjct: 150 MSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDP--CSHHGNSEKYP 206

Query: 83  --------TPKCVRKCVKKNQL----WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
                   TPKC   C ++N++    ++ S  YS+   +      ++M E+  NGP+E++
Sbjct: 207 PCPSTIYDTPKCNTTC-ERNEMDLVKYKGSTSYSVKGEK------ELMIELMTNGPLELT 259

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
             VY DF  YKSGVYKH+ GD +GGHAVKL+GWGT  DG  YW +AN WN  WG  GYF 
Sbjct: 260 MQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGT-QDGVPYWKVANSWNTDWGDKGYFL 318

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I+RG+NEC IE   VAG+P+ +
Sbjct: 319 IQRGNNECKIESGGVAGIPAQE 340


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 17/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  DLL CC   CG GC+GG+P  AW +F   GV T       + C+ Y +   C H  
Sbjct: 122 LSDQDLLTCCE-SCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAY-EFPKCDHHV 179

Query: 75  ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C    PTP+CV KC +   + ++  KH+   AY + S+ E I  E+  NGP+EV
Sbjct: 180 EGKYPPCGETQPTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEV 239

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F+VYEDF  YKSG+Y+H+ G  +GGHAVKL+GWG  +DG +YW +AN WN  WG +GYF
Sbjct: 240 DFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGV-EDGVEYWKIANSWNEDWGENGYF 298

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           +I  G NECGIE D VAG+P
Sbjct: 299 RIIAGKNECGIESDGVAGIP 318


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 15/200 (7%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPG 76
           SLS  DL++CCG+ CG GC GGYP +AW ++  +G+VT   + DP     +    CSH G
Sbjct: 132 SLSSIDLVSCCGY-CGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG 190

Query: 77  CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +         Y TPKCV KC   N  +   K  +   Y +      IM EI  NGPVE 
Sbjct: 191 SKKYPPCPHRIYDTPKCVPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEA 250

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F VYEDF  YK GVY H TG+ +GGHA++++GWG  ++G  YW++AN WN  WG DGYF
Sbjct: 251 AFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWG-EENGTPYWLIANSWNEGWGEDGYF 309

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           K+ RG NECGIE++V AGLP
Sbjct: 310 KMLRGKNECGIEDEVTAGLP 329


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 125/202 (61%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL CC   CG GC+GG    AW Y+V  G+VT         C+PY      
Sbjct: 138 QNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCE 196

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C +K +  +   KH   S+Y + +D + I  EI K G
Sbjct: 197 HHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++AN WN  WG 
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RG +EC IE +V+AG
Sbjct: 316 NGYFRIVRGRDECSIESEVIAG 337


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 128/205 (62%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +N+++S  +LL+CC + CG GC+GG+P +AWR++ + G+V+       + C PY     C
Sbjct: 129 KNVNISAENLLSCC-YTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEP-C 186

Query: 73  SH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKN 124
            H        C     TPKC + C  KN      K  S   S+Y I SDP+ I  +I  N
Sbjct: 187 EHHVNGTRKPCAEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTN 246

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F+VY DF  YKSGVY+H+ G ++GGHA++++GWG  + G  YW++AN WN  WG
Sbjct: 247 GPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGM-EKGTPYWLVANSWNTDWG 305

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G FKI RGS+ CGIE+ VVAGLP
Sbjct: 306 DNGTFKILRGSDHCGIEDSVVAGLP 330


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           L+  D L+CC + CG GC GGYP  AW Y++  G+VT         C P+   T C H G
Sbjct: 47  LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 104

Query: 77  -------C-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                  C    YPTP C R C    N+ +   K Y  S+Y +      IM EI KNGPV
Sbjct: 105 DSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 164

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           EV+F +++DF  Y+SG+Y H+ G  +G HAV++IGWG  ++G +YW++AN WN  WG +G
Sbjct: 165 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENG 223

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF++ RG NECGIE +VVAG+P
Sbjct: 224 YFRMVRGRNECGIESEVVAGMP 245


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 125/200 (62%), Gaps = 18/200 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S N+LLACC   CGDGC+GGYP +AW  F H GVVT       + C PY  +  C H  
Sbjct: 12  VSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-CDHHV 69

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C+    TP+C +KC    N  +++ KHY   +Y ++S   DIM E+   GPVE 
Sbjct: 70  VGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRGPVEA 128

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVY DF  Y SGVY+H TG  +GGHAVK++G+G  ++G+ YW++AN WN  WG  G+F
Sbjct: 129 AFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWLVANSWNPDWGDQGFF 187

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI RG +ECGIE  +VAG P
Sbjct: 188 KILRGVDECGIEGQIVAGEP 207


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 92/199 (46%), Positives = 124/199 (62%), Gaps = 14/199 (7%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSH 74
           +LS   L  CC + CG+GCDGG P SAW +F+ HG+VT       + C PY     G   
Sbjct: 143 NLSAEQLNTCC-YRCGNGCDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGR 201

Query: 75  PGCEPAYP-TPKC-VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             C    P TP C ++ C   N  + +R   HY  + Y ++   EDIM ++YKNGPV+ +
Sbjct: 202 NTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAA 261

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF +YKSGVY +  G + GGHA+K++GWG  DDG  YW+ AN W+RSWG +G F+
Sbjct: 262 FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGV-DDGTKYWLCANSWSRSWGENGLFR 320

Query: 191 IKRGSNECGIEEDVVAGLP 209
           I RG+NEC IE+ V+AG+P
Sbjct: 321 ILRGNNECHIEDRVIAGMP 339


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/218 (43%), Positives = 132/218 (60%), Gaps = 19/218 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+VT     
Sbjct: 117 SDRVCIHSNARVSVE---ISSEDLLTCCES-CGMGCNGGYPTAAWDFWTKEGLVTGGLYD 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP+C+ +C       ++  KHY  ++Y + 
Sbjct: 173 SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVE 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           ++   I  EIYKNGPVE +F VYEDF  YKSGVY+H++G ++GGHA+K++GWG  +DG  
Sbjct: 233 ANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGV-EDGVP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG +GYFKI RGS+ CGIE +VVAG+P
Sbjct: 292 YWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVAGIP 329


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/229 (43%), Positives = 135/229 (58%), Gaps = 23/229 (10%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 68
           D +  + +  LQ +SLS +DLL+CC   CG GC+GG P++AWRY+V  G+VT     Y  
Sbjct: 145 DRICIASHGELQ-VSLSADDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGS--NYTA 200

Query: 69  STGCS---HPGCE-------------PAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRI 110
           ++GC     P CE               YPTPKC +KC+    ++ +   K Y  SAY +
Sbjct: 201 NSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGV 260

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
             D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  +DG 
Sbjct: 261 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGI 319

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 219
            YW  AN WN  WG DG+F+I RG +ECGIE  VV G+P   ++   ++
Sbjct: 320 PYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSVSSRLS 368


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/214 (44%), Positives = 124/214 (57%), Gaps = 20/214 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +SLS +DLL+CC   CG GCDGG P++AW+Y+V  G+VT       + C PY     C 
Sbjct: 130 QVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCE 187

Query: 74  H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           H        P     YPTPKC +KC  +   + +   K +  +AY +  D   I  EI  
Sbjct: 188 HHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILT 247

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVEV+F VYEDF  Y  G+Y H  G + GGHAVK++GWG  + G  YW++AN WN  W
Sbjct: 248 HGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSWNTDW 306

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
           G DG+F+I RG +ECGIE  VV GLP      K+
Sbjct: 307 GEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 340


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY     C
Sbjct: 12  KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPP-C 69

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N L++  K Y    Y +    + I AE++KNG
Sbjct: 70  EHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNG 129

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKH+ GD +GGHA+K+IGWG  ++G  YW++AN WN  WG 
Sbjct: 130 PVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYWLIANSWNTDWGN 188

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 189 NGFFKILRGEDHCGIESSIVAGEP 212


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 130/216 (60%), Gaps = 18/216 (8%)

Query: 9   DALSSSPYVSLQ-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 60
           +A+S    VS Q N+ +S  +L+ CC F CG+GC GG+   AW Y+V  G+VT       
Sbjct: 117 EAMSDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWVKDGLVTGGQYGSD 175

Query: 61  EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 113
           E C PY     C+H  PG    C     TP+C R C       +    HY   AY ++ +
Sbjct: 176 EGCQPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVHRE 234

Query: 114 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
            E I  EI  NGPVE +FTVY DF  YKSGVY+H+ G  +GGHA++++GWGT ++G  YW
Sbjct: 235 VEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGT-ENGVPYW 293

Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           ++AN WN SWG  GYFK+ RG ++CGIE ++VAG P
Sbjct: 294 LIANSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTP 329


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 96/219 (43%), Positives = 135/219 (61%), Gaps = 21/219 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+V+     
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLTCC-MSCGMGCNGGYPSAAWDFWTKEGLVSGGLYD 172

Query: 63  ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
               C PY  +  C H      P C      TP+C+ KC       ++  KH+  ++Y +
Sbjct: 173 SHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTV 231

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
            SD E I +EI+KNGPVE +F VYEDF  YKSGVY+H++G  +GGHA+K++GWG  +DG 
Sbjct: 232 LSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGV-EDGV 290

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            YW+ AN WN  WG +G+FK  RGS+ CGIE +VVAG+P
Sbjct: 291 PYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIP 329


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 124/204 (60%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q+   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 133 QHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 190

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N  +R  K Y    + ++S  + I AE++KNG
Sbjct: 191 EHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNG 250

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D  +YK+GVYKH  GD +GGHAVK++GWG  ++G  YW++AN WN  WG 
Sbjct: 251 PVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/214 (44%), Positives = 124/214 (57%), Gaps = 20/214 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +SLS +DLL+CC   CG GCDGG P++AW+Y+V  G+VT       + C PY     C 
Sbjct: 171 QVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCE 228

Query: 74  H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           H        P     YPTPKC +KC  +   + +   K +  +AY +  D   I  EI  
Sbjct: 229 HHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILT 288

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVEV+F VYEDF  Y  G+Y H  G + GGHAVK++GWG  + G  YW++AN WN  W
Sbjct: 289 HGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSWNTDW 347

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
           G DG+F+I RG +ECGIE  VV GLP      K+
Sbjct: 348 GEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 381


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 126/202 (62%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL+CC   CG GC+GG    AW ++V  G+VT         C+PY      
Sbjct: 143 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCE 201

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C KK +  +   KH   S+Y + +D + I  EI K G
Sbjct: 202 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++AN WN  WG 
Sbjct: 262 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 320

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RG +EC IE +V+AG
Sbjct: 321 NGYFRIVRGRDECFIESEVIAG 342


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/218 (45%), Positives = 130/218 (59%), Gaps = 21/218 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------ 62
           D +  + +  LQ +SLS +DLL+CC   CG GC+GG P++AWRY+V  G+VT        
Sbjct: 159 DRICIASHGELQ-VSLSADDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANS 216

Query: 63  -CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRIN 111
            C PY     C H        P     YPTPKC ++C  +  ++ +   K Y  SAY + 
Sbjct: 217 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVK 275

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
            D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  +DG  
Sbjct: 276 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIP 334

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW +AN WN  WG DG+F+I RG +ECGIE  VV G+P
Sbjct: 335 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIP 372


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/218 (43%), Positives = 133/218 (61%), Gaps = 19/218 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+VT     
Sbjct: 117 SDRVCIQSNAKVSVE---ISSQDLLTCCDS-CGMGCNGGYPSAAWDFWTTDGLVTGGLYN 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP C  KC    + L++  KH+  ++Y + 
Sbjct: 173 SHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVP 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S+   IMAE++KNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G  
Sbjct: 233 SNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWG-EENGVP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 292 YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 124/204 (60%), Gaps = 16/204 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           +++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+VT         C PY      
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCE 197

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C E  Y TPKC +KC K  +  ++  K+Y   +Y + S  + I  EI  +G
Sbjct: 198 HHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMHG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY DF +YKSG+YKH+ G V+GGHAV++IGWG  +    YW++AN WN  WG 
Sbjct: 258 PVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            GYF+I RG + CGIE  V AGLP
Sbjct: 317 KGYFRILRGKDVCGIESAVTAGLP 340


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 124/204 (60%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q+   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 133 QHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 190

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N  +R  K Y    + ++S  + I AE++KNG
Sbjct: 191 EHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNG 250

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D  +YK+GVYKH  GD +GGHAVK++GWG  ++G  YW++AN WN  WG 
Sbjct: 251 PVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 128/206 (62%), Gaps = 18/206 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
           L N+ +S  DLL+CC   CG GC+GGYP +AW ++   G+V+         C PY  +  
Sbjct: 127 LMNVEISAEDLLSCCDS-CGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAP- 184

Query: 72  CSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C H      P C      TP+C +KC       +   KHY   +Y ++   ++I  EIYK
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYK 244

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+K++GWG  ++G  YW+ AN WN  W
Sbjct: 245 NGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDW 303

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 304 GDNGFFKILRGSDHCGIESEIVAGIP 329


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 124/202 (61%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL+CC   CG GC+GG    AW Y+V  G+VT         C+PY      
Sbjct: 138 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCE 196

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C KK +  +   KH   S+Y + +D + I  EI K G
Sbjct: 197 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++AN WN  WG 
Sbjct: 257 PVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RG +EC IE +V AG
Sbjct: 316 NGYFRIVRGRDECSIESEVTAG 337


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 93/218 (42%), Positives = 133/218 (61%), Gaps = 20/218 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL+CC   CG GC GG+P +AW Y+   G+VT     
Sbjct: 117 SDRYCIHSNGKVSVE---ISAEDLLSCCD-ACGMGCMGGFPSAAWDYWAESGLVTGGLYG 172

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
               C PY  +  C H      P C     TPKCV +C       ++  K +    Y + 
Sbjct: 173 SNIGCRPYSIAP-CEHHVNGTRPPCTGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSVP 231

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
              + IM E+YKNGPVE +F+VYEDF  YK+GVY+H+TG ++GGHA+K++GWG  ++   
Sbjct: 232 PKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWG-KENNTP 290

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW++AN WN  WG +G+FKI RG +ECGIE ++VAG+P
Sbjct: 291 YWLVANSWNTDWGDNGFFKILRGKDECGIESEIVAGIP 328


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 124/202 (61%), Gaps = 16/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + LS  +L++CC   CG GCDGG+P SAW Y+ + G+V+       + C PY  +  C 
Sbjct: 131 QVHLSAENLVSCCD-SCGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAP-CE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           H      P C     TP C  +C + + +  +  HY         + + I AEI KNGPV
Sbjct: 189 HHVPGSRPACSGGGDTPDCRNQCDEGSGISYDQDHYYGETVYTLDEAKQIQAEILKNGPV 248

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTVYED  +YK GVY+H+ G+ +GGHA+K++GWG  +D   YW++AN WN  WG +G
Sbjct: 249 EAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVEND-TPYWLVANSWNTDWGNNG 307

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           +FKI RGS+ECGIE+ +VAGLP
Sbjct: 308 FFKILRGSDECGIEDQIVAGLP 329


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 126/206 (61%), Gaps = 18/206 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
           S   + +S +D+L+CCG  CG GC GG+PI A+++    GVVT       + C PY    
Sbjct: 136 STIRVMISDSDILSCCGISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPY-AFY 194

Query: 71  GCSHPGCEPAY--------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 121
            C H   +P Y        PTPKC + C +K N+ ++  KH++  AY + ++  +I  EI
Sbjct: 195 PCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNERNIRQEI 254

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           YKNGPV  +F VY+DF++YK G+Y H  G   G HAVK++GWG  ++  DYW++AN WN 
Sbjct: 255 YKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWLIANSWNT 313

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
            WG  GYF+I RG+NECGIE  +V G
Sbjct: 314 DWGESGYFRIVRGTNECGIEAQMVGG 339


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 126/213 (59%), Gaps = 24/213 (11%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           + + LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P 
Sbjct: 185 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTG--SDYTNHSGCRPYPFPP 241

Query: 77  CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPTPKC R+C K   + ++  K+Y   AY + +D E I  EI 
Sbjct: 242 CEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKADKYYGEQAYNVENDVELIQKEIM 301

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             GPVE SF VY DF HY  G+YKH+ G V GGHAVK++GWG  D G  YW+ AN WN  
Sbjct: 302 TLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNTD 360

Query: 183 WGAD---GYFKIKRGSNECGIEEDVVAGLPSSK 212
           WG D   GYF+I RG +ECGIE  +VAG+P  +
Sbjct: 361 WGEDVFSGYFRILRGVDECGIESGIVAGIPRKE 393


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 98/219 (44%), Positives = 131/219 (59%), Gaps = 23/219 (10%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 68
           D +  + +  LQ +SLS +DLL+CC   CG GC+GG P++AWRY+V  G+VT     Y  
Sbjct: 144 DRICIASHGELQ-VSLSADDLLSCCRS-CGFGCNGGDPLAAWRYWVKDGIVTGS--NYTA 199

Query: 69  STGCS---HPGCE-------------PAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRI 110
           ++GC     P CE               YPTPKC +KC+    ++ +   K Y  SAY +
Sbjct: 200 NSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGV 259

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
             D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + GGHAVKL+GWG  ++G 
Sbjct: 260 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGI-ENGI 318

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            YW  AN WN  WG DG+F+I RG +ECGIE  VV G+P
Sbjct: 319 PYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGVP 357


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  176 bits (447), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP 
Sbjct: 155 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 212

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C K        K+   ++Y +  + E +M E+  NGP+EV+  VY 
Sbjct: 213 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 269

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGS
Sbjct: 270 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 328

Query: 196 NECGIEEDVVAGLPSSK 212
           NECGIE   VAG P+ +
Sbjct: 329 NECGIESGGVAGTPAQE 345


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  176 bits (447), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
           Q   +S  DLL+CC  +CG GC GG P  AW ++V +G+VT       + C PY      
Sbjct: 151 QKPHISSTDLLSCCK-ICGFGCQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCN 209

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNG 125
             S G   P      PTP C + C    ++  N  K+Y + AY +++   D+  E+  NG
Sbjct: 210 HHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNG 269

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           P+EV+F VYEDF  YK+GVY+H TG V+GGHAV+L+GWG  ++G  YW+LAN WN  WG 
Sbjct: 270 PMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWG-EENGVPYWLLANSWNTEWGD 328

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G+FKI RG NECGIE + VAGL
Sbjct: 329 KGFFKIYRGRNECGIESEAVAGL 351


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP 
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C K        K+   ++Y +  + E +M E+  NGP+EV+  VY 
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGS
Sbjct: 265 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323

Query: 196 NECGIEEDVVAGLPSSK 212
           NECGIE   VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 93/198 (46%), Positives = 120/198 (60%), Gaps = 10/198 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +SV D+L+CCG  CG GC GGY I A R++  +G VT        C PY  +    
Sbjct: 142 QQPIISVEDILSCCGTTCGKGCQGGYSIEAMRFWKSNGAVTGGDYNGNGCMPYSFAPCQK 201

Query: 74  HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI---NSDPEDIMAEIYKNGPVEVS 130
            P  E   PT K   +       +   KHY  SAYR+   N+    I  EIY NGPVE S
Sbjct: 202 SPCVESTTPTCKTTCQSSYTTANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEAS 261

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           + VYEDF  YKSGVY +++G ++GGHAVK+IGWGT +D  DYW++AN W   +G  G+FK
Sbjct: 262 YKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTEND-VDYWLVANSWGIKFGEGGFFK 320

Query: 191 IKRGSNECGIEEDVVAGL 208
           I+RG+NEC IE +VVAG+
Sbjct: 321 IRRGTNECQIESNVVAGV 338


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 88/197 (44%), Positives = 126/197 (63%), Gaps = 16/197 (8%)

Query: 36  LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYP 82
           +CGDGC+GGYP  AW ++   G+V+         C PY     C H      P C     
Sbjct: 1   MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGD 59

Query: 83  TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
           TPKC + C    +  ++  KHY  ++Y +++  + IMAEIYKNGPVE +F+VY DF  YK
Sbjct: 60  TPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYK 119

Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
           SGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE
Sbjct: 120 SGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 178

Query: 202 EDVVAGLPSSKNLVKEI 218
            +VVAG+P +    ++I
Sbjct: 179 SEVVAGIPRTDQYWEKI 195


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  176 bits (446), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP 
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C K        K+   ++Y +  + E +M E+  NGP+EV+  VY 
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGS
Sbjct: 265 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323

Query: 196 NECGIEEDVVAGLPSSK 212
           NECGIE   VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  176 bits (446), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 121/202 (59%), Gaps = 15/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------F 67
           N SLS  DLL+CC   CGDGCDGG+P  AW ++  HG+VT    EE   C PY       
Sbjct: 136 NKSLSAVDLLSCCK-DCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQH 194

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
            S G   P     YPTPKCV+ C      ++  K  + ++Y ++     IM EI  NGPV
Sbjct: 195 HSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPV 254

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +F V+EDF  YKSG+Y H  G  +GGHA++++GWG  ++G  YW++AN WN  WG  G
Sbjct: 255 EATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWG-EENGVPYWLIANSWNEDWGEKG 313

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           Y +  RG NECGIEE+  AGLP
Sbjct: 314 YLRFLRGHNECGIEEEATAGLP 335


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  176 bits (446), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 91/197 (46%), Positives = 120/197 (60%), Gaps = 10/197 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q  ++S  D+LACCG  CGDGC+GGYPI A+R++   GVVT        C PY     C+
Sbjct: 85  QQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWNSRGVVTGGDFRGSGCRPY-PFAPCN 143

Query: 74  HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C P   TP C   C    +  +   K + +SAY +  +   I  EI  NGPV  +FT
Sbjct: 144 SYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 202

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           +YED   YKSGVY+H  G ++GGHA+K+IGWGT  +G  YW++AN W   WG +G+ K++
Sbjct: 203 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGADWGENGFLKMR 261

Query: 193 RGSNECGIEEDVVAGLP 209
           RG NECGIE  VVAG+P
Sbjct: 262 RGVNECGIESAVVAGMP 278



 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/62 (58%), Positives = 46/62 (74%), Gaps = 1/62 (1%)

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVE SFTVYEDF  YK GVY++  G V+G HA+K++GWGT + G DYW++AN W    
Sbjct: 3   NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWLIANSWGAQC 61

Query: 184 GA 185
           G+
Sbjct: 62  GS 63


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  176 bits (446), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 120/198 (60%), Gaps = 11/198 (5%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTG 71
           + Q   +SV D+L+CCG  CG GC GGY I A R++   G VT        C PY     
Sbjct: 144 ATQTPIISVEDILSCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAP 202

Query: 72  CSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
           C    C     TP C   C    K   +   KH+  +AY+I +    I  EIY NGPVE 
Sbjct: 203 CKKDSCAQG-TTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEA 261

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           SF VYEDF  YKSGVY++ +G ++GGHAVK+IGWGT ++G DYW++AN W  ++G  G+F
Sbjct: 262 SFKVYEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGT-ENGVDYWLIANSWGTTFGDSGFF 320

Query: 190 KIKRGSNECGIEEDVVAG 207
           K++RG+NE GIE +VVAG
Sbjct: 321 KMRRGTNEVGIEGNVVAG 338


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  176 bits (446), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 93/198 (46%), Positives = 123/198 (62%), Gaps = 16/198 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYP 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE+C PY FD   CSH G    YP
Sbjct: 150 MSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDP--CSHHGNSEKYP 206

Query: 83  --------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
                   TPKC   C +        K+   ++Y +  + E +M E+  NGP+E++  VY
Sbjct: 207 PCPSTIYDTPKCNTTCERSEM--DLVKYKGSTSYSVKGEKE-LMIELMTNGPLELTMQVY 263

Query: 135 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
            DF  YKSGVYKH+ G+ +GGHAVKL+GWGT  DG  YW +AN WN  WG  GYF I+RG
Sbjct: 264 SDFVGYKSGVYKHVLGEFLGGHAVKLVGWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRG 322

Query: 195 SNECGIEEDVVAGLPSSK 212
           +NEC IE   VAG+P+ +
Sbjct: 323 NNECKIESGGVAGIPAQE 340


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  176 bits (446), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 119/197 (60%), Gaps = 12/197 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q  ++S  D+LACCG  CGDGC GGYPI A+R++   GVVT        C PY  +   S
Sbjct: 130 QQPTISPTDMLACCGNSCGDGCKGGYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCIS 189

Query: 74  HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
            P       TP C   C    +  +   K + +SAY +  +   I  EI  NGPV  +FT
Sbjct: 190 CP----EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 245

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           +YED   YKSGVY+H  G ++GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++
Sbjct: 246 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMR 304

Query: 193 RGSNECGIEEDVVAGLP 209
           RG NECGIE  VVAG+P
Sbjct: 305 RGVNECGIERAVVAGMP 321


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  176 bits (445), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 129/217 (59%), Gaps = 20/217 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +SV D+L+CCG  CG GC GGY I A R++   G VT        C PY  S    
Sbjct: 141 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 198

Query: 74  HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEV 129
              C P   TP C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE 
Sbjct: 199 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA 257

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G  G+F
Sbjct: 258 SYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFF 316

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
           KI+RG+NEC IE +VVAG      + K  T ++ +ED
Sbjct: 317 KIRRGTNECQIEGNVVAG------IAKLGTHSETYED 347


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  176 bits (445), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 122/203 (60%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  LS  +L++CC + CG GC+GG+P +AW ++V  G+VT       + C PY     C 
Sbjct: 139 NAHLSAENLVSCC-YTCGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPA-CE 196

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC++ C     + +    HY  S+Y ++   EDI  EI  NGP
Sbjct: 197 HHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGP 256

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE + TVYEDF  YKSGVY+H+ G  +GGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 257 VEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGV-EEGVPYWLIANSWNTDWGDN 315

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GY K+ RG + CGIE  + AGLP
Sbjct: 316 GYIKLLRGKDHCGIESQITAGLP 338


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  176 bits (445), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 98/223 (43%), Positives = 133/223 (59%), Gaps = 30/223 (13%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV------ 59
           T+RD + S+     +N   S  +L++CC  LCG GC+GG+P +A++Y+VH G+V      
Sbjct: 117 TDRDCIHSN---GTKNFHYSAENLVSCC-HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFN 172

Query: 60  -TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVK------KNQLWRNSKHYSIS 106
            T+ C PY +   C H      P C     TPKC + C        ++ L   SKHYS+ 
Sbjct: 173 STQGCQPY-EIAPCEHHVSGPRPKCAEGGSTPKCHKNCESNYVVDYESDLHHGSKHYSV- 230

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
               + D   I  +I  NGPVE +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  
Sbjct: 231 ----DKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWG-E 285

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +DG  YW+ AN WN  WG +GYFKI RGS+ CGIE ++ AGLP
Sbjct: 286 EDGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEISAGLP 328


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  176 bits (445), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 122/205 (59%), Gaps = 20/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
           N   S +DL++CC   CG GC+GG+P +AW Y+V  G+V+    PY  S GC        
Sbjct: 138 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWGYWVRKGIVSG--GPYGSSQGCRPYEIAPC 194

Query: 73  ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                 + P CE  Y  TP+C  KC    ++ ++  KH+   AY I+ +  DI  EI  N
Sbjct: 195 EHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTN 254

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYED   YK GVY+H+ G  +GGHA+++IGWG   D   YW++AN WN  WG
Sbjct: 255 GPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-TPYWLIANSWNTDWG 313

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G+FKI RG + CGIE  + AGLP
Sbjct: 314 NNGFFKILRGKDHCGIESSISAGLP 338


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  176 bits (445), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 123/202 (60%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL+CC   CG GC+GG    AW Y+V  G+VT         C+PY      
Sbjct: 52  QNVELSAVDLLSCCE-SCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCE 110

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C KK +  +   KH   S+Y + +D + I  EI K G
Sbjct: 111 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 170

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +    YW++AN WN  WG 
Sbjct: 171 PVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKA-PYWLIANSWNEDWGE 229

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RG +EC IE +V AG
Sbjct: 230 NGYFRIVRGRDECSIESEVTAG 251


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 126/203 (62%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N   S  +L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C 
Sbjct: 130 NFHYSSENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 187

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKCV++C     + + +  H+   AY I  D + I  EI KNGP
Sbjct: 188 HHVPGPRPKCSEGGGTPKCVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 247

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +
Sbjct: 248 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWG-EENGTPYWLCANSWNTDWGDN 306

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI RGS+ CGIE ++ AGLP
Sbjct: 307 GLFKILRGSDHCGIESEISAGLP 329


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 17/200 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GG+P  AW ++V HG+V+E C PY F S  C+H   
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPFPS--CAHHVN 195

Query: 75  ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C   Y TPKC   C  KK  L R   ++S     + S  E    E+  NGP EV
Sbjct: 196 SSDLAPCSGDYKTPKCNSTCTEKKIPLIRYRGNHSY----VLSGEEHFKRELLLNGPFEV 251

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F VY DF  Y  GVYKH+ GD++GGHAV+L+GWG   +GE YW +AN WN  WG +GYF
Sbjct: 252 AFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWG-ELNGEPYWKIANSWNHEWGMNGYF 310

Query: 190 KIKRGSNECGIEEDVVAGLP 209
            I RG NECGIE + VAG P
Sbjct: 311 LIARGVNECGIESNGVAGTP 330


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 92/202 (45%), Positives = 120/202 (59%), Gaps = 18/202 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH------- 74
           +++S  D L CC  +CG GC+GG P  AW ++  +G+VT     Y D+ GC         
Sbjct: 136 VNISAEDPLDCC-TICGMGCNGGMPAMAWLHWTVNGIVTG--GNYEDTNGCKAYSFAPCE 192

Query: 75  -------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                  P C P  PTP C ++C   + L   +     S Y I+  P+ I  EI  NGPV
Sbjct: 193 HHVDGDLPPCGPTKPTPDCKKECDSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPV 252

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E SF+VYEDF  YKSGVY+H+ G+  GGHA+K++GWG  +D   YW++AN WN  WG  G
Sbjct: 253 EASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVEND-TPYWLVANSWNEDWGDKG 311

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI RGSNECGIE  +VAG+P
Sbjct: 312 YFKILRGSNECGIEGSIVAGIP 333


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 93/201 (46%), Positives = 125/201 (62%), Gaps = 16/201 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
           QN+ LS  DLL+CC   CG GC+GG    AW ++V  G+VT         C+PY      
Sbjct: 138 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCE 196

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C KK +  +   KH   S+Y + +D + I  EI K G
Sbjct: 197 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++AN WN  WG 
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315

Query: 186 DGYFKIKRGSNECGIEEDVVA 206
           +GYF+I RG +EC IE +V+A
Sbjct: 316 NGYFRIVRGRDECFIESEVIA 336


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 92/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 132 KHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 189

Query: 73  SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC +KC     + ++  K Y    Y ++ D + I AE++KNG
Sbjct: 190 EHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNG 249

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKH  GD +GGHAVK++GWG  +D + YW++AN WN  WG 
Sbjct: 250 PVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK-YWLIANSWNSDWGD 308

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +V G P
Sbjct: 309 NGFFKILRGEDHCGIESSIVTGEP 332


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 124/206 (60%), Gaps = 18/206 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC   CG GC+GG+P +AW Y+   G+V+    PY    GC       
Sbjct: 138 KNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAP 194

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TP CV+KC    ++ +    H   SAY + +D + I  EIY N
Sbjct: 195 CEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTN 254

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN  WG
Sbjct: 255 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNSDWG 314

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
           +DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 315 SDGFFKILRGSDECGIEGQINAGLPA 340


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  175 bits (443), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 94/205 (45%), Positives = 124/205 (60%), Gaps = 17/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 131 KHFHFSAEDLLSCCP-VCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPY-EIPPC 188

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC R C K+    +++ K Y    Y +    E I AEI+KNG
Sbjct: 189 EHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNG 248

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVYKH  G+ +GGHA+K++GWG  ++G  YW++AN WN  WG 
Sbjct: 249 PVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
           +G+FKI RG + CGIE  +VAG PS
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEPS 332


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  175 bits (443), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/194 (49%), Positives = 118/194 (60%), Gaps = 14/194 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S  +LL+CC F+CG GC GG P  AW ++V  GV TE C PY     CSH G    YP 
Sbjct: 150 ISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTELCQPY-PFGPCSHHGNSSKYPP 207

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C   N      K+  +S+Y I  + E +M E+  NGP+EV+  VY 
Sbjct: 208 CPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGERE-LMVELMNNGPLEVAMQVYA 264

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVYKH++GD +GGHAVKL+GWG   DG  YW +AN WN  WG  GYF I+RG+
Sbjct: 265 DFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWKIANSWNTDWGDKGYFLIQRGN 323

Query: 196 NECGIEEDVVAGLP 209
           +ECGIE   VAG P
Sbjct: 324 DECGIESSGVAGKP 337


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 98/219 (44%), Positives = 130/219 (59%), Gaps = 21/219 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S   VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+V+     
Sbjct: 117 SDRVCIHSGSKVSVE---ISSEDLLTCCD-ACGMGCNGGYPSAAWDFWTKEGLVSGGLYN 172

Query: 63  ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
               C PY     C H      P C      TPKCV  C    +  +   KHY  S+Y +
Sbjct: 173 SHIGCRPYTIPP-CEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSV 231

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
            +  E I AEI +NGPVE +F VYEDF  YKSGVY+H TG  +GGHA+K++GWG  +DG 
Sbjct: 232 EASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWG-EEDGV 290

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            YW+ AN WN  WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 291 PYWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAGIP 329


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 124/209 (59%), Gaps = 18/209 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           ++  +  D+L+CC   CG+GC+GGYP++A  YFV  G+VT       + C PY     C 
Sbjct: 144 DVMYAAEDVLSCC-LTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPY-TLEACE 201

Query: 74  H------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C     TPKC  +C+     + +++ K +   AY + +D   I  EI   G
Sbjct: 202 HHVPGDRPPCTEGGGTPKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY DF  YKSGVY+H +G  +GGHA+K+IGWGT + G+DYW++ N WN  WG 
Sbjct: 262 PVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGT-EGGDDYWLINNSWNSDWGD 320

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
            G FKI RGSNECGIE +VVA    +  L
Sbjct: 321 KGTFKILRGSNECGIEGEVVAATVDASTL 349


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 94/218 (43%), Positives = 132/218 (60%), Gaps = 19/218 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+V+     
Sbjct: 117 SDRLCIHSNAKVSVE---ISAEDLLTCCD-SCGMGCNGGYPSAAWDFWTKEGLVSGGLYD 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP+C+ +C       +R  KHY  ++Y + 
Sbjct: 173 SHVGCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVL 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           SD  +I  EIYKNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G  
Sbjct: 233 SDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWG-EENGVP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG +G+FK  RGS+ CGIE ++VAG+P
Sbjct: 292 YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVAGIP 329


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
             ++S +DLL+CC   CG GCDGG+P +AW Y+V  G+V+         C PY       
Sbjct: 143 QFTVSADDLLSCCD-ECGFGCDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEH 201

Query: 68  DSTGCS-HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            + G   HP  +  YPT  C  KC       + N K Y   AY + +  + I  EI  +G
Sbjct: 202 HTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVEV++ VYEDF HY  G+YKH  G  +GGHAVK+IGWGT ++G  YWI +N WN  WG 
Sbjct: 262 PVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIGWGT-ENGIPYWICSNSWNSDWGE 320

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+F+I RG++ECGIE  VVAGLP
Sbjct: 321 NGFFRILRGTDECGIESGVVAGLP 344


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V       T+ C PY +   C
Sbjct: 133 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPY-EVPPC 190

Query: 73  SH--PG----CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N  ++  KHY    Y ++ + ++I AE++KNG
Sbjct: 191 EHHVPGNRLPCNGDTKTPKCQKTCEAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNG 250

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D   YKSGVY+H  G  +GGHAVK++GWG  ++G  YW++AN WN  WG 
Sbjct: 251 PVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGV-ENGSKYWLIANSWNSDWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +V G P
Sbjct: 310 NGFFKILRGEDHCGIESSIVTGEP 333


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 86/187 (45%), Positives = 119/187 (63%), Gaps = 16/187 (8%)

Query: 37  CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 83
           CG GC+GGYP +AW+++    +VT       + C PY+    C H      P C    PT
Sbjct: 3   CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61

Query: 84  PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
           P+C + C +  Q  +   KH+    Y I+SD   I  EIYKNGPVE  F+VY DF  YKS
Sbjct: 62  PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121

Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
           GVY+  + +++GGHA++++GWGT +DG  YW++AN WN  WG  GYFKI+RG++ECGIE+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIED 180

Query: 203 DVVAGLP 209
           D+ AG+P
Sbjct: 181 DINAGIP 187


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 121/203 (59%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D+L+CCG  CG GC+GG+PI A+ YF   G VT         C PY     C
Sbjct: 139 KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 197

Query: 73  SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TPKCVRKC     + ++  +     AY + +  + I  EI KN
Sbjct: 198 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 257

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG
Sbjct: 258 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 316

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +GYF+I RGSN CGIEE+VVAG
Sbjct: 317 ENGYFRILRGSNHCGIEENVVAG 339


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 89/186 (47%), Positives = 112/186 (60%), Gaps = 18/186 (9%)

Query: 41  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 86
           C+GGYPI AW+++V HG+VT         C PY  +       G + P C E   PTPKC
Sbjct: 14  CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73

Query: 87  VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           V  C   N     +   KH+  +AY +    E I  EI  +GP+EV+FTVYEDF  Y +G
Sbjct: 74  VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           VY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE  
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHS 192

Query: 204 VVAGLP 209
            VAGLP
Sbjct: 193 AVAGLP 198


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 125/203 (61%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N   S  +L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C 
Sbjct: 129 NFHYSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 186

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C K   + + +  H+   AY I  D + I  EI KNGP
Sbjct: 187 HHVPGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 246

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDN 305

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI RGS+ CGIE ++ AGLP
Sbjct: 306 GLFKILRGSDHCGIESEISAGLP 328


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           +   S  DL++CC   CG GC+GG+P +AW Y+V  G+V+         C PY  +  C 
Sbjct: 135 HFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAP-CE 192

Query: 74  H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P CE     TPKCV+KC +  N  ++  K +  S+Y I      I  EI  NG
Sbjct: 193 HHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYED  HYK GVY+H+TG ++GGHA++++GWG  ++G  YW++AN WN  WG 
Sbjct: 253 PVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGV-ENGTKYWLIANSWNSDWGD 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG +  GIE  + AGLP
Sbjct: 312 NGFFKILRGEDHLGIESSISAGLP 335


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 88/187 (47%), Positives = 115/187 (61%), Gaps = 16/187 (8%)

Query: 37  CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 83
           C   C+GG+P SAW Y+   G+VT       + C PY     C H        C+   PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPY-QIKSCDHHVNGTKGPCQGEGPT 227

Query: 84  PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
           P+C  KC    +  +   KHY++S   I+++PE    EI  NGPVE  FTVYEDF  YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287

Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
           GVY+H TG V+GGHA+K++GWG  ++G  YW++AN WN  WG +G+FKI RGSNECGIE 
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIES 346

Query: 203 DVVAGLP 209
           D+  G+P
Sbjct: 347 DINFGIP 353


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+ H G+V+       E C PY +   C 
Sbjct: 136 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPY-EVEPCE 193

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP+C+ KC     + +   KH+   AY +N +P DI  EI  NGP
Sbjct: 194 HHVNGTRPPCHSG-STPRCMHKCESGYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK+GVY+H+ G  +GGHA++++GWG   D+   YW++ N WN  WG 
Sbjct: 253 VEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWLIGNSWNTDWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+F+I RG + CGIE  + AGLP
Sbjct: 313 NGFFRILRGEDHCGIESAISAGLP 336


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 128/204 (62%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N+ LS +DL++CC + CG GC+GG+P +AW Y+V+ G+V+       + C PY +   C 
Sbjct: 130 NVRLSADDLVSCC-YSCGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPY-EIAPCE 187

Query: 74  H--PGCEPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H   G  P        TP C ++C K  N  ++  K++   AY I+S+ + I  EI  NG
Sbjct: 188 HHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNG 247

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYED   YK GVY+H+ G+ +GGHA++++GWGT + G  YW++AN WN  WG 
Sbjct: 248 PVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGT-EKGTPYWLIANSWNSDWGD 306

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI RG + CGIE  +VAG+P
Sbjct: 307 NGTFKILRGEDHCGIESSIVAGIP 330


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 92/202 (45%), Positives = 123/202 (60%), Gaps = 19/202 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  D+++CC + CG GC+GG P  +W Y+   GVVT         C PY     CSH  
Sbjct: 139 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 196

Query: 75  --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             PG  P     YPTPKC +KC    N+ +   K    S+Y +     DIM EI KNGPV
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPV 256

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           +  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG  ++G  YW++AN WN  WG  G
Sbjct: 257 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKG 315

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF+++RG+NECGIE  + AGLP
Sbjct: 316 YFRMRRGNNECGIEARINAGLP 337


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/197 (48%), Positives = 121/197 (61%), Gaps = 18/197 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++ LS  DL+ C      DGC+GG  +SAW +    GVVT+EC PY      + P C PA
Sbjct: 126 DVQLSFLDLVTC--DQSDDGCEGGDDVSAWNFLKKQGVVTQECKPY------TIPTCPPA 177

Query: 81  YP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
                    TP CV++C   + L +   KH     Y INS  E IM EI  NGPVE  F+
Sbjct: 178 QQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAKIYSINS-VEAIMQEISTNGPVEACFS 236

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVY+H TG  +GGH VK+ G+GT  +G +YW +AN W  SWG +G F IK
Sbjct: 237 VYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTL-NGVNYWSVANSWTTSWGDNGIFLIK 295

Query: 193 RGSNECGIEEDVVAGLP 209
           RGS+ECGIE++VVAG+P
Sbjct: 296 RGSDECGIEDEVVAGIP 312


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  173 bits (439), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/208 (46%), Positives = 131/208 (62%), Gaps = 21/208 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----FVHHGVVT-------EECDPYFD 68
           + +++S  +LL+CC   CG GCDGGYP +AWR+     ++ G+VT         C PY  
Sbjct: 133 EQVNISAENLLSCCE-TCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPY-T 190

Query: 69  STGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 121
              C H  PG    C  +  TP C R C+   ++ +R+ KHY  ++Y I+SD   I  EI
Sbjct: 191 IPKCDHHEPGPYENCSGSQSTPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEI 250

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             NGPVE +F+VY DF  Y SGVY+H TG  +GGHA+K++GWGT ++G  YW++AN WN 
Sbjct: 251 MTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGT-ENGVPYWLVANSWNP 309

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           SWG  G+FKI RG +ECGIE  +VAG+P
Sbjct: 310 SWGDSGFFKIIRGKDECGIESSIVAGMP 337


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  173 bits (439), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 94/197 (47%), Positives = 120/197 (60%), Gaps = 14/197 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP 
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C K        K+   ++Y +  + E +M E+  NGP+EV+  VY 
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSG YKH++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGS
Sbjct: 265 DFVGYKSGGYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323

Query: 196 NECGIEEDVVAGLPSSK 212
           NECGIE   VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  173 bits (439), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 94/219 (42%), Positives = 132/219 (60%), Gaps = 19/219 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+VT     
Sbjct: 117 SDRVCIHSNAKVSVE---ISAQDLLTCCDG-CGMGCNGGYPSAAWDFWSSDGLVTGGLYN 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP C   C    +  ++  KH+  ++Y + 
Sbjct: 173 SHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVP 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S+ +DIM E+YKNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G  
Sbjct: 233 SNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAIKILGWG-EENGVP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P 
Sbjct: 292 YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIPQ 330


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 122/203 (60%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D+L+CCG  CG GC+GG+PI A+ YF   G VT         C PY     C
Sbjct: 51  KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109

Query: 73  SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TPKCVRKC     + ++  +     AY + +  + I  EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 169

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  ++G  YW++AN W+  WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KENGVPYWLIANSWHNDWG 228

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +GYF+I RGSN CGIEE+VVAG
Sbjct: 229 ENGYFRILRGSNHCGIEENVVAG 251


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  173 bits (438), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 124/203 (61%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N   S  +L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C 
Sbjct: 129 NFHYSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 186

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C K   + + +  H+   AY I  D + I  EI  NGP
Sbjct: 187 HHVSGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGP 246

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDN 305

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI RGS+ CGIE ++ AGLP
Sbjct: 306 GLFKILRGSDHCGIESEISAGLP 328


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  173 bits (438), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 89/197 (45%), Positives = 118/197 (59%), Gaps = 12/197 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q  ++S  D+LACCG  CGDGC G YPI A+R++   GVVT        C PY  +   S
Sbjct: 130 QQPTISPTDMLACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCIS 189

Query: 74  HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
            P       TP C   C    +  +   K + +SAY +  +   I  EI  NGPV  +FT
Sbjct: 190 CP----EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 245

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           +YED   YKSGVY+H  G ++GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++
Sbjct: 246 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMR 304

Query: 193 RGSNECGIEEDVVAGLP 209
           RG NECGIE  VVAG+P
Sbjct: 305 RGVNECGIERAVVAGMP 321


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 120/198 (60%), Gaps = 21/198 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           + + LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P 
Sbjct: 126 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPP 182

Query: 77  CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPTPKC ++C K   + ++  K+Y   AY + +D E I  EI 
Sbjct: 183 CEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIM 242

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             GPVE SF VY DF HY SG+YKH+ G V GGHAVK++GWG  D G  YW+ AN WN  
Sbjct: 243 TLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNND 301

Query: 183 WGADGYFKIKRGSNECGI 200
           WG DGYF+I RG++ECG+
Sbjct: 302 WGEDGYFRILRGADECGM 319


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 124/205 (60%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +N+++S  +LL+CC + CG GC+GG+P +AW+Y+   G+V+         C PY D   C
Sbjct: 130 KNVNISAENLLSCC-YSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPC 187

Query: 73  SH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKN 124
            H        C     TPKC R C  +N      K  S   S+Y I SDP+ I  EI  N
Sbjct: 188 EHHVNGTRQPCAEGGRTPKCHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDN 247

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F+VY DF + KSGVY+H+ G ++GGHA++++GWG  + G  YW++AN WN  WG
Sbjct: 248 GPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGV-EKGTPYWLVANSWNTDWG 306

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G FKI RGS+ CGIE  VV GLP
Sbjct: 307 DKGTFKILRGSDHCGIEGSVVTGLP 331


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 136 KHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 193

Query: 73  SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C    N  +   K Y    Y ++S  + I AE+YKNG
Sbjct: 194 EHHVPGNRMPCNGDSKTPKCHKTCESSYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNG 253

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D  +YK+GVYKH  G+ +GGHA+K++GWG  ++G  YW++AN WN  WG 
Sbjct: 254 PVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGNKYWLIANSWNSDWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 313 NGFFKILRGEDHCGIESSIVAGEP 336


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 122/203 (60%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CG GC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 13  QSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCE 71

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C +KC K  +  ++  KHY   +Y + S+ + I  EI  NG
Sbjct: 72  HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISNEKAIQKEIMMNG 131

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG       YW++AN WN  WG 
Sbjct: 132 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYWLIANSWNEDWGE 190

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G F+I RG +EC IE +VVAGL
Sbjct: 191 KGLFRIVRGRDECSIESNVVAGL 213


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 96/202 (47%), Positives = 125/202 (61%), Gaps = 14/202 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S  D+L+CCG  C +GC GGY I A +Y+++ GVVT        C PY     CS
Sbjct: 134 QQPIISPEDILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCS 192

Query: 74  HPGCEPAYPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              C+     P C   C    K    +R     S +A   N+  + I  EIY NGPVEV+
Sbjct: 193 --TCKEPKDAPSCKTTCQASYKAKSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVA 249

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           + VY+DF HYKSGVY H+ GD   GHAVK+IGWGT +   DYW++AN W+ ++G +G+FK
Sbjct: 250 YQVYDDFYHYKSGVYYHVYGDKPSGHAVKIIGWGT-EKKVDYWLVANSWSTTFGENGFFK 308

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I+RG+NECGIEE+VVAGLP SK
Sbjct: 309 IRRGTNECGIEENVVAGLPKSK 330


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 20/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
           N   S +DL++CC   CG GC+GG+P +AW Y+V  G+V+    PY  S GC        
Sbjct: 138 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGIVSG--GPYGSSQGCRPYEIAPC 194

Query: 73  ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                 + P CE  Y  TP+C  KC    ++ ++  KH+   AY I+ +  DI  EI  +
Sbjct: 195 EHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTH 254

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYED   YK GVY+H+ G  +GGHA+++IGWG   D   YW++AN WN  WG
Sbjct: 255 GPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-IPYWLVANSWNTDWG 313

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G+FKI RG + CGIE  + AGLP
Sbjct: 314 NNGFFKILRGKDHCGIESSISAGLP 338


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 123/204 (60%), Gaps = 18/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPYFDSTGC 72
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT   EE    C PY     C
Sbjct: 139 QSAELSALDLISCCED-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C    Y TP+C + C K  +  +   KHY    Y + S+ + I  EI   
Sbjct: 197 EHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMY 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  + G+ YW++AN WN  WG
Sbjct: 257 GPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
             G F++ RG +EC IE  VVAGL
Sbjct: 316 EKGLFRMVRGRDECSIESHVVAGL 339


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 122/209 (58%), Gaps = 25/209 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF--------- 67
           +S  D+L+CCG  CG GC+GG+PI AWR+F   G  T         C PY          
Sbjct: 136 ISDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHL 195

Query: 68  ---DSTGCSHPG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMA 119
              D   C +      C     TP+C R+C+    + + + ++Y  SAY +    + I  
Sbjct: 196 KRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQR 255

Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
           EI KNGPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG  ++  D+W++AN W
Sbjct: 256 EIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWG-KENNTDFWLIANSW 314

Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           ++ WG  GYF+I RG NECGIE DVVAG+
Sbjct: 315 HQDWGEKGYFRIVRGKNECGIETDVVAGI 343


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  172 bits (436), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 122/206 (59%), Gaps = 26/206 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +  S    D+LACC   CGDGC GGY   AW+++V  GV +    PY    GC HP    
Sbjct: 136 EQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP---- 187

Query: 80  AYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            YP            TPKC ++C        +W++ + Y   AY I +D + IM EIY N
Sbjct: 188 -YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYIN 245

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV+ +F  Y+D   YKSGVY+H+ G + GGHAVKL+GWG  ++G  YW++AN W   WG
Sbjct: 246 GPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWG 304

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
            +G+FKI RG N CGIE+DV AGLPS
Sbjct: 305 DNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  172 bits (436), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 122/206 (59%), Gaps = 26/206 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +  S    D+LACC   CGDGC GGY   AW+++V  GV +    PY    GC HP    
Sbjct: 136 EQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP---- 187

Query: 80  AYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            YP            TPKC ++C        +W++ + Y   AY I +D + IM EIY N
Sbjct: 188 -YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYIN 245

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV+ +F  Y+D   YKSGVY+H+ G + GGHAVKL+GWG  ++G  YW++AN W   WG
Sbjct: 246 GPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWG 304

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
            +G+FKI RG N CGIE+DV AGLPS
Sbjct: 305 DNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  172 bits (436), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 40/243 (16%)

Query: 2   SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
           S   ++R  + S+  VS++   LS  DLL CC   CG GC+GGYP SAW ++V  G+V+ 
Sbjct: 113 SEAMSDRVCIHSNAKVSVE---LSAQDLLTCCNS-CGMGCNGGYPSSAWNFWVSDGLVSG 168

Query: 62  -------------------ECDPYFDSTGC--------------SHPGCE-PAYPTPKCV 87
                                D  F S GC              S P C      TP+C+
Sbjct: 169 GLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSGEGGDTPECI 228

Query: 88  RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 146
            +C    +  ++  KH+  ++Y ++S+ ++I  EIYKNGPVE +FTVYEDF  YKSGVY+
Sbjct: 229 FRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQ 288

Query: 147 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG++ CGIE ++VA
Sbjct: 289 HVSGSALGGHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVA 347

Query: 207 GLP 209
           G P
Sbjct: 348 GNP 350


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 121/203 (59%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D+L+CCG  CG GC+GG+PI A+ YF   G VT         C PY     C
Sbjct: 51  KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109

Query: 73  SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TPKCVRKC     + ++  +     AY + +  + I  EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 169

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 228

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +GYF+I RGSN CGIEE+VVAG
Sbjct: 229 ENGYFRILRGSNHCGIEENVVAG 251


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  D+++CC + CG GC+GG P  +W Y+   GVVT         C PY     CSH  
Sbjct: 139 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 196

Query: 75  --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             PG  P     YPTPKC +KC    N+ +   K    S+Y +     D M EI KNGPV
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPV 256

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           +  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG  ++G  YW++AN WN  WG  G
Sbjct: 257 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKG 315

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF+++RG+NECGIE  + AGLP
Sbjct: 316 YFRMRRGNNECGIEARINAGLP 337


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 120/203 (59%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + LS  +LL+CC   CGDGC GG P SAW Y+   G+V+       + C PY     C 
Sbjct: 132 QVHLSAENLLSCCD-SCGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPY-SIAPCE 189

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC ++C K   + +  + +Y    Y I +D + I AEI KNGP
Sbjct: 190 HSIHGSSPACGGVTDTPKCKKQCEKGYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGP 249

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           +  SF VYED   YK GVY+H+ G+ +GGH +K+ GWG  ++G  YW++AN WN  WG +
Sbjct: 250 IVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGI-ENGTPYWLVANSWNTDWGNN 308

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG +ECGIE DV AGLP
Sbjct: 309 GFFKIPRGKDECGIEIDVSAGLP 331


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 121/203 (59%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 13  QSAELSALDLISCC-EDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCE 71

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C K  +  +   KHY   +Y + S+ + I  EI  NG
Sbjct: 72  HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNEKAIQKEIMMNG 131

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG 
Sbjct: 132 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 190

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G F+I RG +EC IE  VVAGL
Sbjct: 191 KGLFRIVRGRDECSIESHVVAGL 213


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 96/219 (43%), Positives = 133/219 (60%), Gaps = 21/219 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S   VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+VT     
Sbjct: 117 SDRVCIHSDAKVSVE---ISSQDLLTCCD-SCGMGCNGGYPSAAWDFWATEGLVTGGLYN 172

Query: 63  ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 110
               C PY     C H      P C      TP C  KC    +  ++  KH+  ++Y +
Sbjct: 173 SHIGCRPYTIEP-CEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSV 231

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
            S+   IMAE++KNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G 
Sbjct: 232 PSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWG-EENGV 290

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 291 PYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  171 bits (434), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 120/203 (59%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N   S +DL+ CC   CG GC+GG+P +AW Y+   G+V       TE C PY +   C 
Sbjct: 138 NFHFSADDLVTCC-HTCGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPY-EVEPCE 195

Query: 74  HPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           H    P  P     TP C  +C     + +   KH+  S+Y IN +P +I  EI  NGPV
Sbjct: 196 HHVDGPRPPCHSGSTPHCKHQCQPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPV 255

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGAD 186
           E +FTVYED   YK+GVY+H+ G  +GGHA+++IGWG   + +  YW++AN WN  WG +
Sbjct: 256 EGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVPYWLIANSWNTDWGDN 315

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+F+I RG + CGIE  + AGLP
Sbjct: 316 GFFRILRGKDHCGIESQISAGLP 338


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  171 bits (434), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 122/204 (59%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           +   S  DL++CC   CG GC+GG+P +AW Y+VH G+V+         C PY  +  C 
Sbjct: 133 HFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAP-CE 190

Query: 74  H------PGCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P CE     TPKCV+KC     + +   K Y   +Y I    + I  EI  NG
Sbjct: 191 HHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNG 250

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYED  HYK GVY+H+TG ++GGHA++++GWG  ++ + YW++AN WN  WG 
Sbjct: 251 PVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENNTK-YWLIANSWNSDWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG +  GIE  + AGLP
Sbjct: 310 NGFFKILRGEDHLGIESSIAAGLP 333


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q++ +S  DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +
Sbjct: 118 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 176

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             YP        TPKC   C        N +  S ++Y +  + +D M E++  GP EV+
Sbjct: 177 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 233

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VYEDF  Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF 
Sbjct: 234 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 292

Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
           I+RGS+ECGIE+   AG+P + N
Sbjct: 293 IRRGSSECGIEDGGSAGIPLAPN 315


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q++ +S  DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +
Sbjct: 141 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 199

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             YP        TPKC   C        N +  S ++Y +  + +D M E++  GP EV+
Sbjct: 200 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 256

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VYEDF  Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF 
Sbjct: 257 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 315

Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
           I+RGS+ECGIE+   AG+P + N
Sbjct: 316 IRRGSSECGIEDGGSAGIPLAPN 338


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
           +++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+VT         C PY      
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197

Query: 67  FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
             +TG  +P C E  Y TPKC +KC K  +  ++  K+Y   +Y + ++   I  EI  +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG  +    YW++AN WN  WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF+I RG +ECGIE +V  GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q++ +S  DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +
Sbjct: 119 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 177

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             YP        TPKC   C        N +  S ++Y +  + +D M E++  GP EV+
Sbjct: 178 NGYPPCSQFNFDTPKCDYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 234

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VYEDF  Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF 
Sbjct: 235 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 293

Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
           I+RGS+ECGIE+   AG+P + N
Sbjct: 294 IRRGSSECGIEDGGSAGIPLAPN 316


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q++ +S  DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +
Sbjct: 141 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 199

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             YP        TPKC   C        N +  S ++Y +  + +D M E++  GP EV+
Sbjct: 200 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 256

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VYEDF  Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF 
Sbjct: 257 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 315

Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
           I+RGS+ECGIE+   AG+P + N
Sbjct: 316 IRRGSSECGIEDGGSAGIPLAPN 338


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
           +++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+VT         C PY      
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197

Query: 67  FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
             +TG  +P C E  Y TPKC +KC K  +  ++  K+Y   +Y + ++   I  EI  +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG  +    YW++AN WN  WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF+I RG +ECGIE +V  GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  171 bits (432), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 17/205 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           +S  DL++CCG+ CG GC GG+P +AW ++   G+VT         C  Y     CSH G
Sbjct: 133 ISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 190

Query: 77  CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +         Y TP CV+KC   +  +   K  +   Y + +    IM EI  NGPVE 
Sbjct: 191 SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 250

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F VYEDF  YKSGVY H  G ++GGHA++++GWG  ++G  YW++AN WN  WG DGYF
Sbjct: 251 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGYF 309

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
           K+ RG NECGIE++V AGLP   ++
Sbjct: 310 KMLRGKNECGIEDEVTAGLPELSSI 334


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  171 bits (432), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 94/200 (47%), Positives = 120/200 (60%), Gaps = 17/200 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195

Query: 75  ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C   Y TP C   C  KK  L +    Y  +   I S  E    E+  NGP EV
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKIPLIK----YRGNTSYILSGEESFKRELLLNGPFEV 251

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           SF+VY DF  Y  GVYKH+TG  +GGHAV+++GWG   +GE YW +AN WN  WG +GYF
Sbjct: 252 SFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG-ELNGEPYWKIANSWNHEWGMNGYF 310

Query: 190 KIKRGSNECGIEEDVVAGLP 209
            I RG +ECGIE   VAG+P
Sbjct: 311 LIARGVDECGIEGSGVAGIP 330


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  171 bits (432), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 127/199 (63%), Gaps = 19/199 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292

Query: 172 YWILANQWNRSWGADGYFK 190
           YW++AN WN  WG +G+FK
Sbjct: 293 YWLVANSWNTDWGDNGFFK 311


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 94/207 (45%), Positives = 120/207 (57%), Gaps = 21/207 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q + LS +D+L+CC   CG GC+GG    AW Y+   G+VT     Y   +GC    +P 
Sbjct: 143 QQVILSADDILSCCT-ECGYGCEGGDTYKAWNYWTTDGIVTGS--NYTTKSGCKPYPYPP 199

Query: 77  CE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 122
           CE               YPT  C  KC     + +   KHY    Y +  D   I  EI 
Sbjct: 200 CEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGAYPYVLVGDASFIQQEIM 259

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            +GPVEV+F VYEDF HY SG+YKH+ G+ +G HAVK++GWGT ++G DYWI AN WN  
Sbjct: 260 NHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGT-ENGVDYWICANSWNSD 318

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG +G+F+I RG NECGIE +VVAG P
Sbjct: 319 WGENGFFRILRGENECGIESNVVAGKP 345


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/202 (45%), Positives = 118/202 (58%), Gaps = 21/202 (10%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195

Query: 75  ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 C   Y TP C   C  K      +R +  Y +S        E    E+  NGP 
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +AN WNR WG +G
Sbjct: 250 EVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG-ELNGEPYWKIANSWNREWGMNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF I RG +ECGIE   VAG P
Sbjct: 309 YFLIARGVDECGIEGSGVAGTP 330


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 122/192 (63%), Gaps = 16/192 (8%)

Query: 41  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 87
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 87  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 145

Query: 88  RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 146
           + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 146 KICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 205

Query: 147 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVA
Sbjct: 206 HVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 264

Query: 207 GLPSSKNLVKEI 218
           G+P +    ++I
Sbjct: 265 GIPRTDQYWEKI 276


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 127/203 (62%), Gaps = 18/203 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           +++S  +LL+CC + CG GC+GG+P +AW ++   G+V+       + C PY  +  C H
Sbjct: 133 VNVSAENLLSCC-YSCGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAP-CEH 190

Query: 75  ------PGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
                 P C     TPKC   C  ++    +   K +  S+Y + SDP+ I  EI  NGP
Sbjct: 191 HANGTRPPCSGGGRTPKCHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF +YKSGVY+H+ G ++GGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 251 VEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 309

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI +GS+ CGIE  +VAGLP
Sbjct: 310 GTFKILKGSDHCGIEGSIVAGLP 332


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 92/197 (46%), Positives = 112/197 (56%), Gaps = 10/197 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +S  DLL CCG  CG+GCDGG+P  A++++   GVVT        C PY     C+
Sbjct: 132 QQPIISPTDLLTCCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCN 190

Query: 74  HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              C     TP C   C       + N K+Y  SAY +      I A+IY NGPV  +F 
Sbjct: 191 SDNCV-NLQTPPCRLSCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFI 249

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSG+Y+HI G   GGHAVKLIGWGT + G  YW+  N W   WG  G F+I 
Sbjct: 250 VYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGT-ERGTPYWLAVNSWGSQWGESGTFRIL 308

Query: 193 RGSNECGIEEDVVAGLP 209
           RG +ECGIE  +VAGLP
Sbjct: 309 RGVDECGIESRIVAGLP 325


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 17/205 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           +S  DL++CCG+ CG GC GG+P +AW ++   G+VT         C  Y     CSH G
Sbjct: 24  ISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 81

Query: 77  CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +         Y TP CV+KC   +  +   K  +   Y + +    IM EI  NGPVE 
Sbjct: 82  SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 141

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F VYEDF  YKSGVY H  G ++GGHA++++GWG  ++G  YW++AN WN  WG DGYF
Sbjct: 142 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGYF 200

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
           K+ RG NECGIE++V AGLP   ++
Sbjct: 201 KMLRGKNECGIEDEVTAGLPELSSI 225


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 132/218 (60%), Gaps = 19/218 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL+CC   CG GC+GGYP +AW ++   G+VT     
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGGLYD 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP+C  +C       ++  KH+  ++Y + 
Sbjct: 173 SHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLP 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           S+ + IMAE+ KNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  + G  
Sbjct: 233 SEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWG-EEGGTP 291

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 292 YWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAGVP 329


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 14/194 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
           +S  +LL+CC F+CG GC GG P  AW ++V  GV TE C PY     CSH G    YP 
Sbjct: 150 ISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTELCQPY-PFGPCSHHGNSSKYPP 207

Query: 83  -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                  TPKC   C   N      K+  +S+Y I  + E +  E+  NGP+EV+  VY 
Sbjct: 208 CPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGERE-LDHELMNNGPLEVAMQVYA 264

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVYKH++GD +GGHAVKL+GWG   DG  YW +AN WN  WG  GYF I+RG+
Sbjct: 265 DFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWKIANSWNTDWGDKGYFLIQRGN 323

Query: 196 NECGIEEDVVAGLP 209
           +ECGIE   VAG P
Sbjct: 324 DECGIESSGVAGKP 337


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           +  +S  DL++CC   CG GC+GG+P +AW Y+V  G+V+       + C PY  +  C 
Sbjct: 138 HFRVSSEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAP-CE 195

Query: 74  H------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P CE     TPKCV+KC    N  +   K Y  S+Y I +  + I  EI  NG
Sbjct: 196 HHVNGSRPSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNG 255

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVYED  +YK GVY H+ G ++GGHA++++GWG  +DG  YW++AN WN  WG 
Sbjct: 256 PVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGV-EDGTKYWLIANSWNSDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG +  GIE  + AGLP
Sbjct: 315 NGFFKILRGEDHLGIESSIAAGLP 338


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 118/205 (57%), Gaps = 22/205 (10%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
           + LS +DLL+CC   CGDGCDGG    +W Y+ + G+VT         C PY D   C+H
Sbjct: 130 VRLSASDLLSCC-TSCGDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPY-DFPACAH 187

Query: 75  PGCEPAYP--------TPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
               P YP        TPKC + CV       +    HY  S+Y +      I  EI  +
Sbjct: 188 HEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNH 247

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVY DF  Y+SGVYKH +G V+GGHA+ ++GWGT + G  YW++ N WN SWG
Sbjct: 248 GPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGT-ESGSPYWLVKNSWNPSWG 306

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G+FKI RG  +CGI  DVV GLP
Sbjct: 307 DGGFFKILRG--DCGINNDVVGGLP 329


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  170 bits (430), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 124/205 (60%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
           +++ LS  DLL+CC   CG GC GG+P SAW Y+V  GVVT         C PY      
Sbjct: 139 KSVELSAVDLLSCC-RECGLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCE 197

Query: 67  FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            ++TG  +P C +  Y TPKC +KC K  +  ++  KHY   AY + ++ + I  EI  +
Sbjct: 198 HNTTG-KYPACGQKIYETPKCQKKCQKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV   FTVY DF +YKSG+YKH+ G  +G H V+++GWG  + G  YW++AN WN  WG
Sbjct: 257 GPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVGWGV-EKGTPYWLIANSWNEGWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF+I RG +EC IE  V+ GLP
Sbjct: 316 EKGYFRILRGKDECDIESLVIGGLP 340


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS---- 69
            +++S  DLL CC   CG GC GG+P +AW ++   G+V+       + C PY  +    
Sbjct: 135 QVNISAEDLLDCCD-TCGHGCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY 193

Query: 70  -TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
            T C  P C P   TP+CV  C K  ++ ++  KH+    Y I+ D + I  EI+ NGPV
Sbjct: 194 HTKCRIPNCIPIVHTPECVHHCRKGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPV 253

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E  F VY DF  YKSGVY+  + D  G HA++++GWGT ++G  YW+ AN WN +WG  G
Sbjct: 254 EADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGT-ENGTPYWLAANSWNENWGDKG 312

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI R +NECGIEE + AG+P
Sbjct: 313 YFKILRRTNECGIEEHIYAGIP 334


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 93/204 (45%), Positives = 121/204 (59%), Gaps = 18/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPYFDSTGC 72
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT   EE    C PY     C
Sbjct: 101 QSAELSALDLISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKC 158

Query: 73  SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C    Y TP+C + C K  +  +   KHY    Y + S+ + I  EI   
Sbjct: 159 EHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMY 218

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG
Sbjct: 219 GPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWG 277

Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
             G F+I RG +EC IE  VVAGL
Sbjct: 278 EKGLFRIVRGRDECSIESHVVAGL 301


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 118/200 (59%), Gaps = 13/200 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   LL+CC   CGDGCDGGYP SAW Y+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLLSCCK-DCGDGCDGGYPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+    +Y +    +D   E+Y NGP  V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNDSYVLLHGEDDFKRELYFNGPFVVA 252

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+  WG +G+F 
Sbjct: 253 FQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311

Query: 191 IKRGSNECGIEEDVVAGLPS 210
           I RG+NECGIE    AGLP+
Sbjct: 312 ILRGNNECGIESTGYAGLPA 331


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 123/203 (60%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           +  +S  DL++CC   CG GC+GG+P +AW Y+V  G+V+       + C PY  S  C 
Sbjct: 133 HFRVSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISP-CE 190

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H        C     TPKCV+KC    N  +   K +  S+Y I S  + I  E++ NGP
Sbjct: 191 HHVNGTRGPCNGEGKTPKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVYED  +YK GVY+H  G ++GGHA++++GWG  +D + +W++AN WN  WG +
Sbjct: 251 VEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTK-FWLIANSWNSDWGDN 309

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI RGS+  GIE  + AGLP
Sbjct: 310 GYFKILRGSDHLGIESSIAAGLP 332


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  169 bits (429), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 122/203 (60%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 139 QSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C +KC K  +  +   K+Y    Y + S+ + I  EI   G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+ G ++GGHA+++IGWG  + G+ YW++AN WN  WG 
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
           +G F++ RG +EC IE  VVAGL
Sbjct: 317 NGLFRMVRGRDECSIESHVVAGL 339


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  169 bits (429), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 91/201 (45%), Positives = 118/201 (58%), Gaps = 12/201 (5%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY------FDSTGC 72
           + +L +S   LL+CC F+CG GC GG P  AW ++V  G+ +E C PY        + G 
Sbjct: 145 ITDLRVSTGHLLSCC-FVCGMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGG 203

Query: 73  SHPGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
            +P C    Y TP C   C   +     +KH    +Y +  + E  M E+   GP EV+F
Sbjct: 204 KYPACPSTIYDTPTCNSTCADSHTAL--TKHKGEKSYSLRGERE-YMIELMTYGPFEVAF 260

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY DF  YKSGVY H TG+ +GGHAVKL+GWG   +G  YW +AN WN  WG +GYF I
Sbjct: 261 DVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGV-QNGTPYWKIANSWNSDWGDNGYFLI 319

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++ECGIE   VAGLPS K
Sbjct: 320 RRGTDECGIESTGVAGLPSLK 340


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  169 bits (429), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 94/202 (46%), Positives = 120/202 (59%), Gaps = 21/202 (10%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 139 VRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195

Query: 75  ----PGCEPAYPTPKCVRKCV-KKNQL--WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 C   Y TP C   C  KK  L  +R +  Y +S        E    E+  NGP 
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPF 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +AN WNR WG +G
Sbjct: 250 EVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG-ELNGEPYWKIANSWNREWGMNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF I RG +ECGIE   VAG P
Sbjct: 309 YFLIARGVDECGIEGSGVAGTP 330


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  169 bits (429), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 121/203 (59%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 139 QSAELSALDLISCCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C K  +  +   KHY   +Y + ++ + I  +I   G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG 
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G F++ RG +EC IE DVVAGL
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGL 339


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  169 bits (429), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 126/205 (61%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
           +++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+VT         C PY      
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197

Query: 67  FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
             +TG  +P C E  Y TPKC +KC K  +  +   K+Y   +Y + ++   I  EI  +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG  +    YW++AN WN  WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF+I RG +ECGIE +V  GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 117/199 (58%), Gaps = 17/199 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           +S  DLL CCG  CG GC+GG+P  AW YF + G+VT +       C PY     C H  
Sbjct: 126 ISTEDLLTCCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHV 184

Query: 75  -----PGCEPAYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
                  C  + PTP CV+ C  ++ + + + K  SI +Y ++S  E I  EI   GPVE
Sbjct: 185 DDGKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVE 244

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
            SFTVYEDF  YKSGVY+++ G  +GGHAVK+IGWG   +   YW++ N WN  WG +G 
Sbjct: 245 ASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKN-VPYWLVVNSWNEGWGENGL 303

Query: 189 FKIKRGSNECGIEEDVVAG 207
           FKI RGSN  GIE  + AG
Sbjct: 304 FKILRGSNHVGIEGGIYAG 322


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  169 bits (428), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 96/225 (42%), Positives = 125/225 (55%), Gaps = 20/225 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
            + T T  D +  +    LQ  S+S  DLL CC   CG+GC GGYP +AW+Y    GV T
Sbjct: 118 FAATETYSDRICIASNQELQT-SISSEDLLECCA-TCGNGCQGGYPSAAWKYMKATGVST 175

Query: 61  -------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSI 105
                    C PY     C H      P C P  PTPKCV++C  +   + ++   H+  
Sbjct: 176 GGLYGDDSSCKPYVFPP-CDHHVVGQYPPCGPIKPTPKCVKQCNSQYTEKTYQQDLHHPS 234

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 164
             Y++ ++ E I  EI  +GPV+ SF V  DF  YKSGVY +       GGH+VK+IGWG
Sbjct: 235 KVYQLPNNAEAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWG 294

Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
             + G  YW++AN WN  WG +G FK+ RG NECGIE +VVAGLP
Sbjct: 295 V-EQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIEAEVVAGLP 338


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  169 bits (428), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 118/205 (57%), Gaps = 19/205 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           QN  +S  DL +CC   CG+GC+GG+   AW Y+   G+VT       + C PY     C
Sbjct: 163 QNAHISAEDLTSCC-RSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKAC 220

Query: 73  SH-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H       P  +    TP C  +C    N  +   KHY  +AY +    + IM EI  N
Sbjct: 221 DHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTN 279

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT + G+DYW++AN WN  WG
Sbjct: 280 GPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGT-EGGDDYWLVANSWNPDWG 338

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G FKI RG +ECGIE  + AG P
Sbjct: 339 NQGTFKILRGRDECGIESQIAAGEP 363


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  169 bits (427), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 95/226 (42%), Positives = 131/226 (57%), Gaps = 45/226 (19%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
           +SV D+L+CCG  CG+GC GGYP+   +++++ GVVT        C PY     CS   C
Sbjct: 129 ISVEDILSCCGSSCGEGCKGGYPLEGLKFWMNSGVVTGGDYNGTGCQPY-TFPPCSS--C 185

Query: 78  EPAYPTPKCVRKC--------VKKNQLWRNSKH---------YSI--------SAYRINS 112
           E +  TP C +KC         K ++ + N +          Y +        SAYR+++
Sbjct: 186 EASKSTPSCQKKCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLST 245

Query: 113 DPED----------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 162
                         I  EIY NGPVEVS+ V+EDF  YKSGVY +++G + G HAVK+IG
Sbjct: 246 TTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIG 305

Query: 163 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           WGT ++  DYW++AN W   +G  G+FKI+RG+NECGIEE+VVAGL
Sbjct: 306 WGT-ENKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGL 350


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 118/200 (59%), Gaps = 14/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +++L +S  DLL+CC   CGDGCDGGYP  AW YF   G+V++ C PY     C H G  
Sbjct: 138 VRDLGISAGDLLSCCT-SCGDGCDGGYPDEAWLYFTESGLVSDYCQPY-PFPPCKHSGGR 195

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       ++++  +Y +  + ED   E+Y  GP EV+
Sbjct: 196 SKNPSCHDMHFHTPKCNATCTDKRIP--VVRYFASESYSLQGE-EDYKRELYLRGPFEVA 252

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           FTVYEDF  Y+SGVYKH++G  +GGHAV+++GWG   +G  YW +AN WN  WG +GY  
Sbjct: 253 FTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWG-ERNGVPYWKIANSWNTDWGENGYLY 311

Query: 191 IKRGSNECGIEEDVVAGLPS 210
             RG +ECGIE    AG PS
Sbjct: 312 FYRGKDECGIESQGSAGTPS 331


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 121/202 (59%), Gaps = 16/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           +   S  DLL CC   CG GC+GG P +AW Y+V  G+V+       + C PY     C 
Sbjct: 143 HFHFSAEDLLTCCSS-CGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEP-CE 200

Query: 74  HPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           H       P     TP+CV++C +   + +   +H+  SAY +    + I  E+  NGP 
Sbjct: 201 HHVNGTRKPCGEGDTPRCVKRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPA 260

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E + TVY+DF HY++GVY+H++G  +GGHAV+L+GWG  +DG  YW+LAN WN  WG +G
Sbjct: 261 EAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGV-EDGTPYWLLANSWNYDWGDNG 319

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF+I RG +ECGIE D+  GLP
Sbjct: 320 YFRILRGQDECGIESDINGGLP 341


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 93/217 (42%), Positives = 130/217 (59%), Gaps = 21/217 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
           T+R  + S   V   N  LS  DL +CC   CG+GC+GG+   AW Y    G+VT     
Sbjct: 124 TDRLCIQSKGIV---NAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNYLKRDGIVTGGPYN 179

Query: 61  --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
             + C PY +   C H        C+   PTP+C ++C    N  +   +H++ + + + 
Sbjct: 180 SHQGCLPY-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYSKDEHHAKTVHAVE 238

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
              E IM EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+K +GWG ++DG+D
Sbjct: 239 G-VEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWG-NEDGKD 296

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           YW++AN WN  WG +G+FKI RG +ECGIE ++VAG+
Sbjct: 297 YWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAGM 333


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 116/203 (57%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
            +++S  DLL CC + C  GC GG P  AW ++   G+VT       + C PY      +
Sbjct: 133 QVNISAQDLLTCCDY-CRTGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRY 191

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
            +TG   P      P P C R+C K   + +   KHY    Y ++ D   I  EI+KNGP
Sbjct: 192 TTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGP 251

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  F VY DF  YKSGVY+  +    G HA++++GWGT ++G  YW+ AN W   WG  
Sbjct: 252 VEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGT-ENGVPYWLAANSWTEHWGDK 310

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI+RG+NECGIEED+ AG+P
Sbjct: 311 GYFKIRRGNNECGIEEDINAGIP 333


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
           +++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+VT         C PY      
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197

Query: 67  FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
             +TG  +P C E  Y TPKC +KC K  +  ++  K+Y   +Y + ++   I  EI  +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVEV+FTV+ DF +YKSG+YK++TG  +G HAV++IGWG  +    YW++AN WN  WG
Sbjct: 257 GPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF++ RG +ECGIE  V +GLP
Sbjct: 316 EKGYFRMLRGKDECGIESAVTSGLP 340


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/202 (44%), Positives = 118/202 (58%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           QN+ LS  DLL+CC   CGDG +GG+P  AW Y+V  G+VT         C PY      
Sbjct: 116 QNVELSAVDLLSCCEH-CGDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCE 174

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C E  Y TP C   C K  +  +   KH   S Y + +D + I  EI K G
Sbjct: 175 HHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYG 234

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+YKHITG ++  HA+++IGWG  ++   YW++ N WN  WG 
Sbjct: 235 PVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGV-ENNTPYWLIPNSWNEDWGE 293

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +G F+I RG +EC IE +V AG
Sbjct: 294 NGNFRILRGRHECSIESEVTAG 315


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 123/202 (60%), Gaps = 17/202 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + +S  D+++CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C
Sbjct: 140 KQVLISAQDVVSCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPC 197

Query: 73  SHPGCEPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H G E  Y        TP+C R+C+        S  Y   AY++ +  + I  +I KNG
Sbjct: 198 GHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  ++TVYEDFAHY+SG+YKH  G   G HAVK+IGWG  + G  YWI+AN W+  WG 
Sbjct: 258 PVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGE 316

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +G+F++ RGSN+CG EE + AG
Sbjct: 317 NGFFRMHRGSNDCGFEERMAAG 338


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 125/208 (60%), Gaps = 12/208 (5%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           T+R  + S   ++ +   LS  +L++CC  +CG GCDGGYP  A+ Y+   G+ T    P
Sbjct: 122 TDRICIES---IAAKQPLLSEEELVSCCK-ICGYGCDGGYPDKAFIYWATRGIPTG--GP 175

Query: 66  YFDSTGCS----HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAE 120
           Y  + GC         E    TP C R+C+ +        +H+    Y +NS+ E IM E
Sbjct: 176 YGSTKGCKPYSIGSNSEDEAETPLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEEQIMQE 235

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           +YKNGPV V+F VYEDF +Y  GVY+H  G  +GGHAVKLIGWG  ++ + YW+++N WN
Sbjct: 236 LYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGI-ENSKKYWLISNSWN 294

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
            +WG +G+FKI RG N C IE  VVAG+
Sbjct: 295 TTWGENGFFKIIRGKNCCAIESYVVAGM 322


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  167 bits (424), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 123/203 (60%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCS 73
            L +S  D+LACCG  CGDGC GG+P  AW +   +GV T         C PY      +
Sbjct: 135 KLHVSDTDILACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGN 194

Query: 74  HP-----GCEP--AYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      G  P  ++PTP+C + C +   + ++  K Y+  +Y + +D ++I  +I KNG
Sbjct: 195 HENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNG 254

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV+ +F VYEDF  YK G+YKH  G   GGHAVK+IGWG  D+G DYW++AN W++ WG 
Sbjct: 255 PVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWG-KDNGTDYWLIANSWSKDWGE 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G+F++ RG N+C IE+ + AG+
Sbjct: 314 SGFFRMVRGENDCEIEDMITAGI 336


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 93/218 (42%), Positives = 124/218 (56%), Gaps = 22/218 (10%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +    LS  DL++CC + CG+GC GG P +AW Y+  +G+VT         C PY     
Sbjct: 124 MMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQ 181

Query: 72  CSHPGCEP--------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           C HPG            YPTP C   C    ++ +   K Y  ++Y ++     IM EI 
Sbjct: 182 CRHPGSRSQLNPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIM 241

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
           KNGPVE  F VY DFA YKSG+Y H++G   G HA+++IGWG  ++G  YW+ AN WN  
Sbjct: 242 KNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVKYWLTANSWNVG 300

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 220
           WG +GYF+I RG++EC IE  VVAG+P    L K IT+
Sbjct: 301 WGENGYFRILRGTDECRIESIVVAGMP---RLQKNITN 335


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYF-----D 68
            + +S  DL+ CC   CG GC GG   +AW+Y+   G+V       T+ C PY       
Sbjct: 21  QVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEH 79

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           S+  S P C    PTPKC R+C +   + + + K+++ + Y IN   + I  EI++NGPV
Sbjct: 80  SSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPV 139

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E  FT Y DF  YKSGVY+H + D++G HA++++GWG S+D   YW+LAN WN  WG  G
Sbjct: 140 EAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SEDNNPYWLLANSWNEDWGDHG 198

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFK+ RG NEC IE  V AG+P
Sbjct: 199 YFKMLRGVNECDIESFVNAGIP 220


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  167 bits (423), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 93/211 (44%), Positives = 120/211 (56%), Gaps = 30/211 (14%)

Query: 24  LSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVT------------EECDPYFDS 69
           +S  +LL+CC   F CG GC+GGY   AW Y+V  G+V+             EC PY   
Sbjct: 139 ISSENLLSCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPY-SF 197

Query: 70  TGCSH------PGCE--PAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 117
             CSH        C   P + TPKC  +C   +Q  +NS     H  +S+Y +    E I
Sbjct: 198 PPCSHHVQGEYQACTDLPQFNTPKCYTEC--NSQYTQNSYEQDLHKGVSSYSVPKSEEQI 255

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
            AEIY+ G    SF VY DF  Y SGVY++ +G  MGGHA+K++GWG  ++G  YW+ AN
Sbjct: 256 KAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGV-ENGTPYWLCAN 314

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
            WN SWG +G+FKI RGSNECGIE  +VAG 
Sbjct: 315 SWNSSWGENGFFKILRGSNECGIESGMVAGF 345


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  167 bits (423), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 87/205 (42%), Positives = 120/205 (58%), Gaps = 18/205 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S  DL++CC   CG GC+GG+P +AW Y+ H G+V+       E C PY +   C 
Sbjct: 140 NFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPY-EIEPCE 197

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C+    TP C  +C     + +   KH+   +Y I  +P +I  EI  NGP
Sbjct: 198 HHVNGTRPPCKNGR-TPSCKHQCESSYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGP 256

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
           VE +FTVYED   YKSGVYKH+ G  +GGHA++++GWG   D +  YW++ N WN  WG 
Sbjct: 257 VEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWLIGNSWNTDWGD 316

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
           +G+F+I RG + CGIE  + AGLP+
Sbjct: 317 NGFFRIVRGEDHCGIESAISAGLPA 341


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  167 bits (423), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----D 68
            + LS  +LL+CC   CG GC GG   +AW Y+   G+V+       + C PY       
Sbjct: 131 QVHLSAENLLSCCDS-CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEH 189

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           S   S P CE    TPKC ++C K   + + +   Y    Y I +D + I AEI KNGP+
Sbjct: 190 SIPGSRPACEGVRDTPKCKKQCEKGYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPI 249

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
             S  VYED   YK+GVY+H+ G+V+GGH +K++GWG  +D   YW++AN WN  WG +G
Sbjct: 250 VASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVEND-TPYWLVANSWNTDWGNNG 308

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           +FKI RGS+ECGIE+ +VAG+P
Sbjct: 309 FFKILRGSDECGIEDQIVAGIP 330


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  167 bits (423), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 119/200 (59%), Gaps = 13/200 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   LL+CC   CGDGCDGGYP +AWRY+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLLSCCK-DCGDGCDGGYPDAAWRYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       ++    +Y +    +D   E+Y NGP  V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IEYRGNDSYVLLHGEDDFKRELYFNGPFVVA 252

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F V+ DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+  WG +G+F 
Sbjct: 253 FQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311

Query: 191 IKRGSNECGIEEDVVAGLPS 210
             RG+NECGIE +  AGLP+
Sbjct: 312 FLRGNNECGIEFEGYAGLPA 331


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  167 bits (422), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 93/219 (42%), Positives = 134/219 (61%), Gaps = 21/219 (9%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL+CC   CG GC+GGYP +A  ++   G+V+     
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLSCCES-CGMGCNGGYPSAACDFWTKEGLVSGGLYD 172

Query: 63  ----CDPYFDSTGCSH------PGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAYRI 110
               C PY     C H      P C+     TP+C  +C       ++  KH+   +Y +
Sbjct: 173 SHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSV 231

Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
            SD ++IM E+YKNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  + G 
Sbjct: 232 PSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWG-EEGGI 290

Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 291 PYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGIP 329


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  166 bits (421), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 17/204 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C
Sbjct: 136 KHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPY-EIPPC 193

Query: 73  SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            H  PG    C     TPKC + C     + +   K Y    Y ++S  + I AE++KNG
Sbjct: 194 EHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNG 253

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +FTVY D  +YK+GVYKH  G+ +GGHA+K++GWG  ++G  Y ++AN WN  WG 
Sbjct: 254 PVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGNKYRLIANSWNSDWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+FKI RG + CGIE  +VAG P
Sbjct: 313 NGFFKILRGEDHCGIESSIVAGEP 336


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  166 bits (421), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 119/208 (57%), Gaps = 28/208 (13%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N+ +S  D+  CC   CG GC+GGYP +AW ++V  GVV+       E C PY       
Sbjct: 135 NIHISAEDINDCCKS-CGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVK------KNQLWRNSKHYSISAYRINSDPEDIMAEI 121
            +TG   P C    PTPKC +KC+        N   R  K Y +         + IM E+
Sbjct: 194 HTTGKYQP-CPAVVPTPKCEKKCLTGYPKSYSNDKTRGKKSYGVRGV------QSIMQEL 246

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             NGPV  +F VY DF  YK+GVY+H TG   GGHAVK+IG+GT + G+DYW++AN WN 
Sbjct: 247 VDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGT-ESGQDYWLVANSWNE 305

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WG  G+FKI +G +ECGIE  +VAG P
Sbjct: 306 DWGDKGFFKIAKGKDECGIESSIVAGDP 333


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  166 bits (420), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 118/203 (58%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + LS  +L+ CCG  CG GC GG P SAW Y+   G+V+       E C PY     C 
Sbjct: 131 QVHLSAENLVTCCGS-CGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPY-SIAPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     T  C ++C K   + +    HY+   Y    D ++I  EI KNGP
Sbjct: 189 HHIPGSRPPCRGEGHTADCRKQCEKGYSIPYDKDLHYAEFVYSTERDVKEIQTEILKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F VYED   YK GVYKH+ G  +GGHA+K++GWG  ++G  YW++AN WN  WG +
Sbjct: 249 VEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGV-ENGTPYWLIANSWNTDWGNN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RGS+ECGIE DV AGLP
Sbjct: 308 GFFKILRGSDECGIEIDVSAGLP 330


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  166 bits (420), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 120/205 (58%), Gaps = 17/205 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           +S  DL++CCG+ CG GC GG+P  AW ++   G+VT         C  Y     CSH G
Sbjct: 133 ISAVDLISCCGY-CGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 190

Query: 77  CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +         Y TP CV+KC   +  +   K  +   Y + +    IM EI  NGPVE 
Sbjct: 191 SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 250

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F VYEDF  YKSGVY H  G ++GGHA++++GWG  ++G  YW++AN WN  WG DG F
Sbjct: 251 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGCF 309

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
           K+ RG NECGIE++V AGLP   ++
Sbjct: 310 KMLRGKNECGIEDEVTAGLPELSSI 334


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  166 bits (419), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 120/203 (59%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CG GC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 76  QSAELSALDLISCCE-DCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 134

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C + C K  +  +   KHY   +Y + ++ + I  +I   G
Sbjct: 135 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYG 194

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG 
Sbjct: 195 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 253

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
            G F+I RG +EC IE +VVAGL
Sbjct: 254 KGLFRIVRGRDECSIESNVVAGL 276


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  166 bits (419), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 115/204 (56%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
           N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC     P C
Sbjct: 136 NKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 192

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           E              YPTP+CV++C   +  +   K  +  +Y I +    IM EI   G
Sbjct: 193 EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FT+YEDF  Y SGVY H  G  M GHAV+++GWG   +   YW++AN WN  WG 
Sbjct: 253 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGE 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +GY K  RG NECGIE+DV AGLP
Sbjct: 312 EGYMKFLRGYNECGIEDDVTAGLP 335


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  166 bits (419), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 118/202 (58%), Gaps = 18/202 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
             +S  DLL CC   CG GCDGG P + W++++  G+V+    P+    GC     EP  
Sbjct: 134 FRVSAEDLLTCCTN-CGHGCDGGAPGAGWKHWIEKGLVSG--GPFGSDQGCRPYTIEPCV 190

Query: 82  P-------------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                         TPKC++KC+   N  +   K +  S Y I +D   I  EI+ NGPV
Sbjct: 191 HVENGAQSPCKDSITPKCIKKCLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPV 250

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTV++DFA YK G+Y+H +G++ G HAV+++GWG  ++G  YW+ AN WN  WG +G
Sbjct: 251 EATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGV-ENGTKYWLAANSWNSDWGDNG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI RGSN   IE  +VAGLP
Sbjct: 310 YFKILRGSNHVDIESAIVAGLP 331


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 98/233 (42%), Positives = 129/233 (55%), Gaps = 36/233 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           Q   +SV D+L+CCG  CG GC GGY I A R++   G VT        C PY  S    
Sbjct: 79  QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 136

Query: 74  HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYS----------------ISAYRINSDPE 115
              C P   TP C   C    K + ++  KHY                  SAY++ +   
Sbjct: 137 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKS 195

Query: 116 --DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
             +I  EIY  GPVE S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG  ++G DYW
Sbjct: 196 VTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYW 254

Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
           ++AN W  S+G  G+FKI+RG+NEC IE +VVAG      + K  T ++ +ED
Sbjct: 255 LIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAG------IAKLGTHSETYED 301


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 115/203 (56%), Gaps = 24/203 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
             S    DLL+CC   CGDGC GG    AW+++V  GV +    PY    GC HP     
Sbjct: 139 QFSFGAYDLLSCC-HSCGDGCQGGNLGPAWQFWVQRGVSSG--GPYNSRQGC-HP----- 189

Query: 81  YP------------TPKCVRKCVKKNQLWRNS--KHYSISAYRINSDPEDIMAEIYKNGP 126
           YP            TPKC RKC     +   S  + +   AY ++ D E I  EI++NGP
Sbjct: 190 YPVDVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGP 249

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           V+ SF VY DF  YK+GVY+H+ G + GGHAVK+IGWG  ++G  YW+ +N W   WG  
Sbjct: 250 VQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGV-ENGTKYWLCSNSWGEDWGER 308

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG N CGIE DV AGLP
Sbjct: 309 GFFKIVRGENHCGIESDVHAGLP 331


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 14/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   LL+CC   CG GCDGGYP +AWRY+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLLSCCKD-CGYGCDGGYPDAAWRYYVSHGLASSYCQPY-PFPHCDHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+    +Y ++ + ED   E+Y NGP  V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-EDYKRELYFNGPFVVA 251

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN W+  WG +G+F 
Sbjct: 252 FQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 310

Query: 191 IKRGSNECGIEEDVVAGLPS 210
           I RG +ECGIE    AG P+
Sbjct: 311 ILRGKDECGIEHQGYAGSPA 330


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 120/202 (59%), Gaps = 16/202 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
           + + LS +D+L+CC    G GCDGG+P+SAW+YFV  GVVT       + C PY      
Sbjct: 115 KKVELSADDILSCC-TDGGYGCDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCG 173

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
                  +  C     TP C   C     + + + K Y  +AY +++    I  EI   G
Sbjct: 174 IHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYG 233

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +FTVY+DF HYK+G+YKH++G   GGHAV+++GWG    G  YW++AN WN  WG 
Sbjct: 234 PVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWG-QQGGVPYWLVANSWNTDWGE 292

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF+I RGS+ECGIE+ VVAG
Sbjct: 293 NGYFRILRGSDECGIEDGVVAG 314


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/193 (44%), Positives = 116/193 (60%), Gaps = 11/193 (5%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
           +S  DLL+CCG  CG GC G  P+ A+R++   GVVT        C PY     C+   C
Sbjct: 128 ISPTDLLSCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPC 186

Query: 78  EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
             +  TP+C   C    ++ +   K++   AY +  D   I  EI  NGPVE +F VY+D
Sbjct: 187 TKS-ETPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDD 244

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           F HY+SGVY+H+ G ++GGHAVK+IGWG   +G  YW++AN W   WG +G+FK+ RG +
Sbjct: 245 FNHYRSGVYRHVAGKLVGGHAVKIIGWGI-QNGAPYWLMANSWGPYWGENGFFKMLRGVD 303

Query: 197 ECGIEEDVVAGLP 209
           ECGIE  +VAG P
Sbjct: 304 ECGIESTIVAGKP 316


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 119/202 (58%), Gaps = 18/202 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  D ++CC   CG GCDGG+PI A+ ++ + G VT       + C PY     C 
Sbjct: 144 QMHISSIDFVSCCE-SCGYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCG 201

Query: 74  HPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H G       C     TPKC R+C +   + +   K Y   AY +    + I  EI KNG
Sbjct: 202 HHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  +D   YW++AN W+  WG 
Sbjct: 262 PVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGE 320

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF++ RG NECGIE++VVAG
Sbjct: 321 EGYFRMIRGINECGIEQEVVAG 342


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 118/200 (59%), Gaps = 13/200 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   L++CC   CGDGC GG P SAW Y+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGDGCKGGAPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+   ++Y + +  +D   E+Y NGP  V 
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNNSYMLLNGEDDYKRELYFNGPFVVD 252

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN W+  WG +G+F 
Sbjct: 253 FGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311

Query: 191 IKRGSNECGIEEDVVAGLPS 210
           I RG+NECGIE    AGLP+
Sbjct: 312 ILRGNNECGIESTGYAGLPA 331


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 117/201 (58%), Gaps = 17/201 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  DLL CC   CG GCDGG+   AWR+F   GV T       + C+ Y     C H  
Sbjct: 122 LSEQDLLTCCD-SCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAY-SFPKCEHHA 179

Query: 75  ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C  +  TP+CV++C +   + +   KH+   AY +    + I  E+  NGP+EV
Sbjct: 180 EGKYPPCGESQETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEV 239

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           SF VYEDF  YKSG+Y+H+ G  +GGHAVKL+GWG  +DG +YW +AN WN  WG +GYF
Sbjct: 240 SFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGV-EDGIEYWKIANSWNEDWGENGYF 298

Query: 190 KIKRGSNECGIEEDVVAGLPS 210
           +I  G  ECGIE   + G+P 
Sbjct: 299 RIVAGKGECGIEVGPIGGIPK 319


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 118/203 (58%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D+L+CCG  CG GC+GG+PI A+ YF   G VT         C PY     C
Sbjct: 51  KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109

Query: 73  SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TPKCVRKC     + ++  +     AY   +  +    EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKN 169

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 228

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +GYF+I  GSN CGIEE+VVAG
Sbjct: 229 ENGYFRILCGSNHCGIEENVVAG 251


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  164 bits (414), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 82/205 (40%), Positives = 122/205 (59%), Gaps = 20/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
           +   S +DL++CC   CG GC+GG+P +AW Y+   G+V+    PY  S GC        
Sbjct: 142 HFHFSADDLVSCC-HTCGFGCNGGFPGAAWAYWTRKGIVSG--GPYGSSQGCRPYEIAPC 198

Query: 73  ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                 + P C+  +  TP C  +C K   + ++  KH+   +Y +  + +DI  EI +N
Sbjct: 199 EHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQN 258

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE +FTVYED   YK GVY+H+ G  +GGHA++++GWG  ++   YW++AN WN  WG
Sbjct: 259 GPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRILGWGV-ENKTPYWLIANSWNTDWG 317

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G+FK+ RG + CGIE  + AGLP
Sbjct: 318 NNGFFKMLRGEDHCGIESAIAAGLP 342


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  164 bits (414), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/200 (44%), Positives = 120/200 (60%), Gaps = 14/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   LL+CC   CG GCDGGYP +AW Y+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLLSCCKD-CGYGCDGGYPGTAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+    +Y ++ + +D   E+Y NGP  V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYGLDGE-DDYKRELYFNGPFVVA 251

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN W+  WG +G+F 
Sbjct: 252 FQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 310

Query: 191 IKRGSNECGIEEDVVAGLPS 210
           I RG +ECGIE +  AGLP+
Sbjct: 311 ILRGKDECGIESEGYAGLPA 330


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  163 bits (413), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 135/212 (63%), Gaps = 16/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CG+GC+GGYP +AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY  S+Y +    ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWGT ++G  YW++AN WN  WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGT-ENGTPYWLVANSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE ++VAG+P +     +I
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPRTDQYWAKI 339


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 122/204 (59%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  LS +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY +   C 
Sbjct: 138 NFHLSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPY-EIEPCE 195

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP+C   C    ++ ++  K++   +Y I ++  DI  EI  NGP
Sbjct: 196 HHVNGTRPPCSSG-STPRCQHVCESSYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YKSGVY+H+ G  +GGHA++++GWG   D+   YW++AN WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWLIANSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+F+I RG + CGIE  + AGLP
Sbjct: 315 NGFFRIVRGKDHCGIESSISAGLP 338


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  D+L+CCG  CG GC+   PI A+R+     VVT       + C PY      +
Sbjct: 137 RVMISDTDILSCCGISCGYGCEV-LPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGN 195

Query: 74  H-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H       P     +PTPKC + C +K N+ +   K+++  +Y + S+   I  EIYKNG
Sbjct: 196 HTNERYYGPCPRGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNERSIREEIYKNG 255

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +F VY+DF++Y+ G+Y H  G   G HAVK++GWG  ++G DYW++AN WN  WG 
Sbjct: 256 PVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWG-RENGTDYWLIANSWNTDWGE 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
           +GYF+I RGSNECGIE  +V+G+
Sbjct: 315 NGYFRIARGSNECGIEGQMVSGV 337


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 123/208 (59%), Gaps = 17/208 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 66
           +L N+ LS  DLLACC   CG GC GG+   AW Y+  +G+VT         C PY    
Sbjct: 134 TLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPP 192

Query: 67  ---FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 121
                + G  +P C E  Y TP+CV +C K     + + K  + ++Y +      I  EI
Sbjct: 193 CRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTTIQKEI 252

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           +  GPVE +  VY DFA+Y  GVYKH TG+++GGHA++L+GWG  +DG  YW+ AN WN 
Sbjct: 253 WMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANSWNP 312

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  162 bits (411), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 118/202 (58%), Gaps = 18/202 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + +S  D ++CC   C  GCDGG+PI A+ ++ + G VT       + C PY     C 
Sbjct: 144 QMHISSIDFVSCCE-SCSYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCG 201

Query: 74  HPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H G       C     TPKC R+C +   + +   K Y   AY +    + I  EI KNG
Sbjct: 202 HHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNG 261

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  +D   YW++AN W+  WG 
Sbjct: 262 PVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGE 320

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +GYF++ RG NECGIE++VVAG
Sbjct: 321 EGYFRMIRGINECGIEQEVVAG 342


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  162 bits (410), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 123/208 (59%), Gaps = 17/208 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 66
           +L N+ LS  DLLACC   CG GC GG+   AW Y+  +G+VT         C PY    
Sbjct: 134 TLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPP 192

Query: 67  ---FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 121
                + G  +P C E  Y TP+CV +C K     + + K  + ++Y +      I  EI
Sbjct: 193 CRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTAIQKEI 252

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           +  GPVE +  VY DFA+Y  GVYKH TG+++GGHA++L+GWG  +DG  YW+ AN WN 
Sbjct: 253 WMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANSWNP 312

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
           SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/214 (43%), Positives = 120/214 (56%), Gaps = 27/214 (12%)

Query: 20  QNLSLSVNDLLACC-GFLCG--DGCDGGYPISAWRYFVHHGVVT----------EECDPY 66
           Q L LS  D  ACC GF CG   GC+GG P SAW++F   GVVT            C PY
Sbjct: 343 QLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPY 402

Query: 67  --------FDSTGCSHPGC-EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDP 114
                    D     +P C +  YPTP+C+ +C + N     +   K  +  AY + +  
Sbjct: 403 EFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGI 461

Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYW 173
           E+I  ++ K G V  +F+V+ DF  Y  GVY H +G  MGGHAVK+IGWGT +  GEDYW
Sbjct: 462 ENIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYW 521

Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           ++AN WN SWG  G F+I RG NECGIE  +VAG
Sbjct: 522 LIANSWNPSWGEGGLFRILRGVNECGIEGQIVAG 555


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 119/208 (57%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGGY + +W Y+V HG+VT       + TGC     P 
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGYFLPSWDYWVSHGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 116/203 (57%), Gaps = 18/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C +  Y TP+C RKC K     + + KHY   A  +  +   I  EI   
Sbjct: 197 EHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIKNELAIQKEIMMY 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    ++EDF +YKSG+YK+ TG  +G H V++IGWG  ++G  YW+ AN WN  WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
             GYF+I RG NEC IE  VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY + + C 
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  DI  EI  NGP
Sbjct: 195 HHVNGTRPPCAHGGATPKCSHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   D+   YW++ N WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 315 QGFFRILRGQDHCGIESSISAGLP 338


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  161 bits (408), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 119/225 (52%), Gaps = 19/225 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
            + T T  D +  +   +LQ  S+S  DLL CC   CG GC GGYP +AW Y    GV T
Sbjct: 119 FAATETFSDRICIASNQTLQT-SISSEDLLECCADYCGMGCKGGYPSAAWGYMKRQGVST 177

Query: 61  -------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSI 105
                    C PY         TG   P C P  PTP+CV++C  +     +    H++ 
Sbjct: 178 GGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPIQPTPQCVKECNSEYTQNTYEKDLHFAS 236

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 164
             Y I  + + I  EI  +GPV+ SF V  DF  YKSGVY ++      GGH+VK+IGWG
Sbjct: 237 QTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWG 296

Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
             +    YW++AN WN  WG  G F++ RG NECGIE  +VAGLP
Sbjct: 297 -KEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAGLP 340


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 92/203 (45%), Positives = 113/203 (55%), Gaps = 19/203 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
           N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC     P C
Sbjct: 136 NKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 192

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           E              YPTP+CV++C   +  +   K  +  +Y I +    IM EI   G
Sbjct: 193 EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FT+YEDF  Y SGVY H  G  M GHAV+++GWG   +   YW++AN WN  WG 
Sbjct: 253 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGE 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
           +GY K  RG NECGIE+DV A L
Sbjct: 312 EGYMKFLRGYNECGIEDDVTAVL 334


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 119/202 (58%), Gaps = 21/202 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  DL++CC + CG GC+GGYP  AW Y+  HG+V+         C PY     CSH  
Sbjct: 139 LSAVDLVSCCPY-CGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLE 196

Query: 75  --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             PG  P     Y TPKC ++C    ++     K    S+Y +     DIM EI  NGPV
Sbjct: 197 ETPGLAPCPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPV 256

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
              + ++EDF  YKSG+Y++ +G +MGGH +  IGWG  ++G  YW+ AN WN  WG +G
Sbjct: 257 STIYYIFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGV-ENGVKYWLAANSWNEGWGENG 313

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF+I+RG+NECGIE  + AGLP
Sbjct: 314 YFRIRRGTNECGIESRINAGLP 335


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 121/204 (59%), Gaps = 19/204 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D+L+CC   CGDGCDGGY I A+++F   G VT       + C PY     C
Sbjct: 140 KQVYVSATDILSCC-HSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPY-PFHPC 197

Query: 73  SHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRIN-SDPEDIMAEIYK 123
            H G E  Y        TP+CVRKC +  +  +   +     AYR+     + I  EI +
Sbjct: 198 GHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSVKAIQKEIMR 257

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPV  +F V++DF+ Y+ G+Y H+ G   GGHAVK+IGWGT + G  YWI+AN W+  W
Sbjct: 258 NGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGT-EHGVPYWIIANSWHSDW 316

Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
           G DGYF++ RG N+CGIE +VVAG
Sbjct: 317 GEDGYFRMVRGINDCGIETNVVAG 340


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 16/202 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVVT-------EECDPY-FDSTG 71
           NL L+  DL+ CC   CG+GC+GG+   +A++Y+V  G+V+       E C PY F+   
Sbjct: 141 NLELATEDLMGCCK-DCGNGCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCKPYPFEP-- 197

Query: 72  CSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
           CS+P  GC      PKC+  C+   ++ +R  K +  +AY+I +D   I  EI  NGPV 
Sbjct: 198 CSYPFVGCHHEKKNPKCLHHCINGYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVA 257

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
             F V+EDF  Y SGVYKH+ G  +G HA++++GWGT ++G  YW++AN +  +WG  G+
Sbjct: 258 TGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGT-ENGTPYWLIANSYGDTWGDKGF 316

Query: 189 FKIKRGSNECGIEEDVVAGLPS 210
           FK+ RGSN  GIE  V+AGLP 
Sbjct: 317 FKMLRGSNHLGIESTVIAGLPQ 338


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 97/227 (42%), Positives = 143/227 (62%), Gaps = 19/227 (8%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + ++ +VS++   +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNGHVSVE---VSAEDLLTCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174

Query: 63  ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KH+  ++Y + 
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPACTGEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLP 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           ++  +IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  
Sbjct: 234 TNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWG-EENGVP 292

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG  G+F+I RG + CGIE +VVAG+P +    ++I
Sbjct: 293 YWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  160 bits (406), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 83/178 (46%), Positives = 111/178 (62%), Gaps = 13/178 (7%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC----VKKN 94
           GC+GG+  +A+ +   +G++ E+C PY     C HPGC   +PTPKC + KC     K  
Sbjct: 143 GCNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKCYPNDTKST 200

Query: 95  QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
           +LW     ++ S+Y + S+  DI  EIY+NGPV  SF VYED + Y+SGVY+H+TG   G
Sbjct: 201 ELW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEG 255

Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
            HA+K++GWG   DG  YW + N W   WG DG   I+RG +ECGIE DVVAG P  K
Sbjct: 256 LHAIKVVGWGIL-DGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKLK 312


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  160 bits (405), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 116/203 (57%), Gaps = 18/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C +  Y TP+C RKC K     + + KHY   +  +  +   I  EI   
Sbjct: 197 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMY 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG  ++G  YW+ AN WN  WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
             GYF+I RG NEC IE  VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score =  160 bits (405), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 121/205 (59%), Gaps = 16/205 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A++S+  +   N+ LS  DL++C       GCDGGYPI+AW Y    GVVT+ C P
Sbjct: 117 SDRLAIASNNSI---NVVLSPQDLVSCDS--TDYGCDGGYPINAWHYMQSLGVVTDTCYP 171

Query: 66  YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           Y    G S         TP C      K +          +AY++ ++   I +EI  NG
Sbjct: 172 YTSGNGDSGTCQITGKKTPACATATFYKAK----------TAYQVANNMAAIQSEILANG 221

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE +F+VY+DF  Y SGVY H +G + GGHAVK++GWG  D    YWI+AN W  SWG 
Sbjct: 222 PVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGV-DGTTPYWIVANSWGTSWGQ 280

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
            G+F IKRG++ECGIE+ +VAGL +
Sbjct: 281 AGFFWIKRGNDECGIEDGIVAGLAA 305


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  160 bits (405), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 14/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S  DL+ACC   CG GC+GGYP +AW Y+V HG+ + +C PY     C H G + 
Sbjct: 139 KQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGITSSQCQPY-PFPRCEHRGAQG 196

Query: 80  AYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
             P        TP+C   C  K+      K+    +Y +  + ED   E+Y NGP  V F
Sbjct: 197 KKPPCSKYKFVTPQCNATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            V+ DF  YKSGVY+H+ G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I
Sbjct: 254 QVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLI 312

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG NEC IE    AG P    L 
Sbjct: 313 LRGDNECNIEHLGFAGTPDPSQLA 336


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  160 bits (405), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 88/207 (42%), Positives = 118/207 (57%), Gaps = 18/207 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT         C PY     C
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H        C +  Y TP+C + C K  N  +   KHY   +Y + S    I  +I  +
Sbjct: 197 DHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  WG
Sbjct: 257 GPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSS 211
             GYF+I RG NEC IE ++ AGL  S
Sbjct: 316 EKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  160 bits (405), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 116/203 (57%), Gaps = 18/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C +  Y TP+C RKC K     + + KHY   +  +  +   I  EI   
Sbjct: 197 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQNEIMMY 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG  ++G  YW+ AN WN  WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
             GYF+I RG NEC IE  VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 88/205 (42%), Positives = 118/205 (57%), Gaps = 18/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +++ LS  DLL+CC   CG GC  G+P  AW Y+V  G+VT         C PY     C
Sbjct: 139 KSVELSAVDLLSCC-IECGLGCQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C E  Y  PKC +KC K  +  +   K+Y   +Y +  + + I  EI  +
Sbjct: 197 EHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE SF V+ DF +YKSG+YKH+TG  +G H V++IGWG   +   YW++AN WN  WG
Sbjct: 257 GPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKE-TPYWLIANSWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             GYF++ RG +ECGIE  V +GLP
Sbjct: 316 EKGYFRMLRGKDECGIESAVTSGLP 340


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 88/203 (43%), Positives = 117/203 (57%), Gaps = 18/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C
Sbjct: 113 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 170

Query: 73  SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C +  Y TP+C RKC K  +  + + KHY   +  +  +   I  EI   
Sbjct: 171 EHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMY 230

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG  ++G  YW+ AN WN  WG
Sbjct: 231 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 289

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
             GYF+I RG NEC +E  VVAG
Sbjct: 290 EKGYFRIVRGRNECSVESVVVAG 312


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/196 (46%), Positives = 114/196 (58%), Gaps = 11/196 (5%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDG-----GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSH 74
            +++S  DLLACC   CG GCDG        I   R  V   V TE+ C PY  S     
Sbjct: 137 QVNISAEDLLACC-HTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCV 193

Query: 75  PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
           P C    PTPKC   C K   + +   KH++ + YR+    + I  +IYKNGPVE +F V
Sbjct: 194 PNCTHPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFV 253

Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           Y DF  YKSGVY+      MG HA+K++GWGT +DG  YW++AN WN  WG  GYFKI R
Sbjct: 254 YADFPSYKSGVYQQHMIKFMGVHAIKILGWGT-EDGVPYWLVANSWNVGWGDKGYFKILR 312

Query: 194 GSNECGIEEDVVAGLP 209
           G +ECGIEE + AG+P
Sbjct: 313 GKDECGIEEVIDAGIP 328


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+         C PY +   C 
Sbjct: 138 NFRFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPY-EIAPCE 195

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H        C     TPKC  +C    N  +   KH+   +Y +  +  DI  EI  NGP
Sbjct: 196 HHVNGTRAPCNHDSKTPKCQHQCEAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGP 255

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
           VE +FTVYED   YKSGVY+H  G  +GGHA++++GWG     E  YW++AN WN  WG 
Sbjct: 256 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEEVPYWLIANSWNDDWGD 315

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 316 KGFFRILRGEDHCGIESSISAGLP 339


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  159 bits (403), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 88/207 (42%), Positives = 118/207 (57%), Gaps = 18/207 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT         C PY     C
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPY-PFPKC 196

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H        C +  Y TP+C + C K  N  +   KHY   +Y + S    I  +I  +
Sbjct: 197 DHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMH 256

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  WG
Sbjct: 257 GPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWG 315

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSS 211
             GYF+I RG NEC IE ++ AGL  S
Sbjct: 316 EKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  159 bits (403), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 86/193 (44%), Positives = 117/193 (60%), Gaps = 16/193 (8%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
           DLL+CC   CG GC GG    AW+++V  G+ +       + C PY     C  PG +  
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239

Query: 81  YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             TPKC  KC        +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D 
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
             YKSG+Y+H+ G + GGHAVKL+GWG  ++G  YW++AN W R WG +G+FKI RG N 
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKIVRGENH 355

Query: 198 CGIEEDVVAGLPS 210
           CGIEE++ AGLP+
Sbjct: 356 CGIEENIHAGLPN 368


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score =  159 bits (402), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 115/198 (58%), Gaps = 20/198 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+ LS  D++ C      +GC+GG   SAW +    G V+EEC PY      + P C P
Sbjct: 126 ENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLRKQGAVSEECLPY------TIPTCPP 177

Query: 80  AYP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           A         TP C ++C   + L +   KH     Y  +SD E IM EI  NGPVE  F
Sbjct: 178 AQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAKIYSFDSD-EAIMQEIVTNGPVEACF 236

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           TV+EDF  YKSGVY H TG  +GGH VKL+G+GT  +G DY+   NQW  SWG +G F I
Sbjct: 237 TVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL-NGVDYYAANNQWTTSWGDNGTFLI 295

Query: 192 KRGSNECGIEEDVVAGLP 209
           KRG  +CGI +DVVAGLP
Sbjct: 296 KRG--DCGISDDVVAGLP 311


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  159 bits (402), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
           N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC     P C
Sbjct: 136 NKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDYWGAHGIVTGGSKE--DPSGCRSYPFPKC 192

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           E              YPTP+CV+ C      +   K  +  +Y I S    IM EI   G
Sbjct: 193 EHHVQGHYPPCPHQYYPTPECVQHCDTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FTVYEDF  YK GVY H  G  +  HA++++GWG   D   YW++AN WN  WG 
Sbjct: 253 PVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGD-VPYWLIANSWNEDWGE 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            GY K  RG NECGIE+DV AGLP
Sbjct: 312 KGYMKFLRGLNECGIEDDVTAGLP 335


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  159 bits (402), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY + + C 
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  +I  EI  NGP
Sbjct: 195 HHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   D+   YW++ N WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  159 bits (402), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 88/207 (42%), Positives = 122/207 (58%), Gaps = 19/207 (9%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
           DLL+CC   CG GC GG    AW+++V  G+ +       + C PY     C  PG +  
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239

Query: 81  YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             TPKC  KC        +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D 
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
             YKSG+Y+H+ G + GGHAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N 
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355

Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
           CGIEE++ AGLP   N  ++  +A  F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  159 bits (402), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 88/207 (42%), Positives = 122/207 (58%), Gaps = 19/207 (9%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
           DLL+CC   CG GC GG    AW+++V  G+ +       + C PY     C  PG +  
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239

Query: 81  YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             TPKC  KC        +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D 
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
             YKSG+Y+H+ G + GGHAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N 
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355

Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
           CGIEE++ AGLP   N  ++  +A  F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY +   C 
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EIAPCE 194

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  DI  EI  NGP
Sbjct: 195 HHVNGTRPPCGHGGGTPKCSHVCESGYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++   YW++ N WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+F+I RG + CGIE  + AGLP
Sbjct: 315 NGFFRILRGQDHCGIESSISAGLP 338


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 121/204 (59%), Gaps = 20/204 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           + + +S  D ++CC   CG GC+GG+PI A+ Y+ + GVVT         C PY     C
Sbjct: 143 KQVHISSIDFVSCCD-SCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPC 200

Query: 73  SHPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
            H G E  Y        TP+CV++C K  KN  +R  K +    Y + +  + I  EI +
Sbjct: 201 GHHGNETYYGECPKEESTPECVKQCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMR 259

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPV  SFTVY+DF++Y  G+YKH  G   G HA+K+IGWGT +    YWI+AN W+  W
Sbjct: 260 SGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGT-EKNVPYWIIANSWHNDW 318

Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
           G  G+F++ RG+N CGIEEDVVAG
Sbjct: 319 GEKGFFRMVRGTNHCGIEEDVVAG 342


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAIDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 118/204 (57%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N  LS +DL++CC  +CG GC+GG+P +AW Y+   G+V       T+ C PY +   C 
Sbjct: 136 NFHLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCE 193

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP C  KC     + +   K++   +Y +  +  +I  EI  NGP
Sbjct: 194 HHVNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
           VE +FTVYED   YKSGVY+H  G  +GGHA++++GWG   + +  YW++ N WN  WG 
Sbjct: 253 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G+F+I RG + CGIE  + AGLP
Sbjct: 313 NGFFRILRGQDHCGIESSISAGLP 336


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 17/199 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
            SV D+L CC   CG GCDGG+P +AW YFV  GVVT         C PY  S   +HP 
Sbjct: 146 FSVEDILTCCD-ECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPN 204

Query: 77  CEPAY------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            E  Y       TP C   C K   + +++ K     +Y + +    I  +I K+GP+  
Sbjct: 205 -ETFYRNCTGVSTPSCKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVA 263

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F+VYEDF +YK G+Y++  G   GGHAV+++GWG  ++ + YWI+AN WN  WG DG+F
Sbjct: 264 TFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVK-YWIIANSWNTDWGEDGFF 322

Query: 190 KIKRGSNECGIEEDVVAGL 208
           ++ RG N+CGIEE V AGL
Sbjct: 323 RMVRGINDCGIEESVSAGL 341


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY + + C 
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  +I  EI  NGP
Sbjct: 195 HHVNGTRPPCANGSGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++   YW++ N WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGNEKIPYWLIGNSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/197 (42%), Positives = 117/197 (59%), Gaps = 17/197 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
           + ++   S  DL++CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +  
Sbjct: 90  ATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIP 147

Query: 71  GCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
            C H  PG    C     TPKC + C    N  ++  K Y    Y ++   + I AE++K
Sbjct: 148 PCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFKKDKRYGKHVYSVSGHEDHIKAELFK 207

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVE +FTVY D   YK+GVYKH  G+ +GGHA+K+IGWG  ++ + YW++AN WN  W
Sbjct: 208 NGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNK-YWLIANSWNSDW 266

Query: 184 GADGYFKIKRGSNECGI 200
           G +G+FKI RG + CGI
Sbjct: 267 GDNGFFKILRGEDHCGI 283


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/198 (43%), Positives = 123/198 (62%), Gaps = 16/198 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
           +S  D+L CC      GC GG+ + A +++   GVVT      + C PY     CS   C
Sbjct: 133 ISPEDILTCC--TNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDGCIPY-SYGSCSD--C 187

Query: 78  EPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDP--EDIMAEIYKNGPVEVSFTV 133
             A  TPKC  +C  K     ++  K+Y  SAYR+++      I +EI +NGPVE ++ V
Sbjct: 188 HTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQV 247

Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           YEDF +YKSGVY++I+G  MGGHAVK+IGWG  ++  +YW++AN W   +G +G+FK++R
Sbjct: 248 YEDFYYYKSGVYEYISGRHMGGHAVKIIGWGV-EENVNYWLIANSWGTGFGENGFFKMRR 306

Query: 194 GSNECGIEEDVVAGLPSS 211
           G+NECGIE  VVAG+  S
Sbjct: 307 GNNECGIENYVVAGMAKS 324


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/214 (42%), Positives = 119/214 (55%), Gaps = 22/214 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP 75
           + +  S  D+L CCG  CG GC GG+PI AW++F + GVV+    PY     CS    HP
Sbjct: 138 KQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHP 195

Query: 76  -----------GCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEI 121
                       C    PTP C RKC    + ++R  K Y      Y +      I  +I
Sbjct: 196 CGRHGNDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDI 255

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQWN 180
            + G V   F VYEDF+HY+SG+YKH  G   GG HAVK+IGWG  D+G DYW++AN W+
Sbjct: 256 KERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWLIANSWH 314

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
             WG +G+F++ RG N CGIEE V AG+   ++L
Sbjct: 315 DDWGENGFFRMIRGINNCGIEEQVDAGIVDVESL 348


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/205 (41%), Positives = 114/205 (55%), Gaps = 17/205 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + +++S  D+L CC   CG GC GG+ I AW YFV+ GVV+         C PY     C
Sbjct: 133 KQVNISSTDILTCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPY-PIHPC 191

Query: 73  SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H G       C     TP C +KC     +++R  K     AY +    E I  EI ++
Sbjct: 192 GHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRH 251

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSW 183
           GPV  SF VYEDF+ YK+GVYKH  G + G HAVK++GWG  S     YW++AN W+  W
Sbjct: 252 GPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDW 311

Query: 184 GADGYFKIKRGSNECGIEEDVVAGL 208
           G +GYF+  RG N+C IE+ V AG+
Sbjct: 312 GENGYFRFIRGINDCEIEDTVAAGI 336


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/197 (45%), Positives = 112/197 (56%), Gaps = 12/197 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCEP 79
             S    DL++CC   CGDGC GG    AW Y+V  GV +    PY    GC S+P    
Sbjct: 148 TFSFGSFDLISCC-HSCGDGCQGGVLGPAWDYWVQKGVSSG--GPYNSKQGCHSYPFDTC 204

Query: 80  AYP-----TPKCVRKCVKKNQLWRNSK--HYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
             P      PKC RKC     +   SK   +   AY + +D   IM EI+ NGPV+ +F 
Sbjct: 205 HSPDEDDDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQ 264

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VY DF  YKSGVY+H+TG + GGHA+K++GWG  ++G  YW+ +N W   WG  G+FKI 
Sbjct: 265 VYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGV-ENGTKYWLCSNSWGEDWGDHGFFKIV 323

Query: 193 RGSNECGIEEDVVAGLP 209
           RG N  GIE DV AGLP
Sbjct: 324 RGENHLGIETDVHAGLP 340


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY + + C 
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  +I  EI  NGP
Sbjct: 195 HHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGP 254

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++   YW++ N WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 314

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 106 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 162

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 163 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 222

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 223 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 281

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 282 GEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 51  NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 109

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 110 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 169

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 170 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 228

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 229 GFFKILRGQDHCGIESEIVAGMP 251


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S       +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSGESVFQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMV 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 116/203 (57%), Gaps = 19/203 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +++S  DLL CC   CG GC GGYP +AW Y+   G+VT       + C PY+    C 
Sbjct: 75  QVNISAQDLLTCC-HQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP-CE 132

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C    PTPKC++ C K   + +   K+++ + Y ++SD   I  EIYKNGP
Sbjct: 133 HHTKGPLPNCTDTKPTPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGP 192

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE  F+VY DF  YKSGVY+  + ++       L GW         W++AN WN+ WG  
Sbjct: 193 VEADFSVYTDFLAYKSGVYQRHSYELWEARHQNL-GWALKR--RSVWLVANSWNQDWGDK 249

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYFKI+RG+NECGIE D+ AG+P
Sbjct: 250 GYFKIRRGNNECGIENDINAGIP 272


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY + + C 
Sbjct: 127 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 184

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC   C     + +   KH+   +Y +  +  +I  EI  NGP
Sbjct: 185 HHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGP 244

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
           VE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++   YW++ N WN  WG 
Sbjct: 245 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 304

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+F+I RG + CGIE  + AGLP
Sbjct: 305 HGFFRILRGQDHCGIESSISAGLP 328


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC I+ ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECSIDSEIAAGLIKS 342


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 113/204 (55%), Gaps = 14/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S  DL+ACC   CG GC+GGYP +AW Y+V HG+ + +C PY     C H G + 
Sbjct: 139 KQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGIASSQCQPY-PFPRCEHRGAQG 196

Query: 80  A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TP+C   C  K       K+    +Y +  + ED   E+Y NGP  V F
Sbjct: 197 KKTPCSKYKFVTPQCNATCTDKTIPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            V+ DF  YK+GVY+H+ G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I
Sbjct: 254 QVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLI 312

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG NEC IE    AG P    L 
Sbjct: 313 LRGDNECNIEHLGFAGTPDPSQLT 336


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C 
Sbjct: 3   NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 61

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 62  HHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 121

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 122 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 180

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 181 GFFKILRGQDHCGIESEIVAGMP 203


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GG P  AW ++   G+V+         C PY     C 
Sbjct: 51  NVEVSAEDMLTCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 109

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 110 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 169

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 170 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 228

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI RG + CGIE ++VAG+P
Sbjct: 229 GFFKILRGQDHCGIESEIVAGMP 251


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 82/198 (41%), Positives = 118/198 (59%), Gaps = 17/198 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
           + ++   S  DL++CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +  
Sbjct: 91  ATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIP 148

Query: 71  GCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
            C H  PG    C     TPKC + C     + ++  K Y    Y ++   ++I AE++K
Sbjct: 149 PCEHHVPGNRMPCNGDTKTPKCEKTCESSYTVPFKKDKRYGKHVYSVSGHEDNIKAELFK 208

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPVE +FTVY D   YKSGVY+H  G+ +GGHA+K++GWG  ++G  YW++AN WN  W
Sbjct: 209 NGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGV-ENGSKYWLIANSWNSDW 267

Query: 184 GADGYFKIKRGSNECGIE 201
           G +G+ KI RG + CGIE
Sbjct: 268 GDNGFLKILRGEDHCGIE 285


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GP E    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 116/204 (56%), Gaps = 20/204 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
           G  GYF+I RG NEC IE ++ AG
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAG 338


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  156 bits (395), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 87/189 (46%), Positives = 112/189 (59%), Gaps = 18/189 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGC-DGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCE 78
           N+ LS  DLL+C     G GC DGG    AWRY    GVV   C PY   +TG       
Sbjct: 93  NIILSSEDLLSC--DKAGRGCSDGGRLSEAWRYMQKKGVVANRCKPYTSGATGF------ 144

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                P+C+ KC  +   ++  K Y +  Y ++ + + I  EI  NGPVE +FTVY D  
Sbjct: 145 ----IPECMSKCTGEGHAYQ--KFYGLYLYTVSGENQ-IKVEIMTNGPVEAAFTVYSDIV 197

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           HYKSGVY H +G  +GGHAVK++GWG  D+ E+YW++AN W   WG  G+FKIKRGS+EC
Sbjct: 198 HYKSGVYHHTSGGKLGGHAVKVLGWGVEDE-EEYWLVANSWGPDWGDQGFFKIKRGSDEC 256

Query: 199 GIEEDVVAG 207
           GIE  V+ G
Sbjct: 257 GIESRVLTG 265


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  156 bits (395), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)

Query: 73  SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F
Sbjct: 128 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 187

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI
Sbjct: 188 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 246

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
            RG + CGIE +VVAG+P +    ++I
Sbjct: 247 LRGQDHCGIESEVVAGIPRTDQYWEKI 273


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  156 bits (395), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 87/207 (42%), Positives = 121/207 (58%), Gaps = 19/207 (9%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
           DLL+CC   CG GC GG    AW+++V  G+ +       + C PY     C  PG +  
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239

Query: 81  YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             TPKC  KC        +W++ +H    AY + +D   IM EI+ NGPV+ +F  Y D 
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
             YKSG+Y+H+ G + GGHAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N 
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355

Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
           CGIEE++ AGLP   N  ++  +A  F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  156 bits (395), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y +      I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  156 bits (394), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 84/200 (42%), Positives = 116/200 (58%), Gaps = 14/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   L++CC   CGDGCDGGYP ++W Y+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGDGCDGGYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+    +Y ++ + +D   E+Y NGP  V 
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVV 251

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+  WG +G+  
Sbjct: 252 FWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLL 310

Query: 191 IKRGSNECGIEEDVVAGLPS 210
             RG+NECGIE    AG P+
Sbjct: 311 FLRGNNECGIEAAGYAGSPA 330


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  156 bits (394), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 80/189 (42%), Positives = 112/189 (59%), Gaps = 12/189 (6%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC-DPYFDSTGCSHPGCEPA 80
           ++LS   L+ C   L   GC GG+PI+AW Y V  G++TE+C  PY+         C   
Sbjct: 132 VTLSAQQLVDCD--LDNSGCSGGWPINAWNYMVKTGLLTEQCYGPYY----AKQYTCRLT 185

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
             T  C  +   K + +     Y + A  +    E I  +I  NGPVE  FT+++DF  Y
Sbjct: 186 ANTTDCPWQPGVKARFYHAKSAYKLPAKNV----EAIQTDIMNNGPVEADFTIFQDFYAY 241

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           +SG+Y H TG  +GGHA+K++GWGT D+  DYW+ AN W  +WG  GYFKI+RG++ECGI
Sbjct: 242 RSGIYVHATGKQLGGHAIKILGWGTEDN-VDYWLCANSWGANWGIQGYFKIRRGTDECGI 300

Query: 201 EEDVVAGLP 209
           E+ + AGLP
Sbjct: 301 EDGLAAGLP 309


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  156 bits (394), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y +      I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  155 bits (393), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)

Query: 73  SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F
Sbjct: 8   SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI
Sbjct: 68  SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 126

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
            RG + CGIE +VVAG+P +    ++I
Sbjct: 127 LRGQDHCGIESEVVAGIPRTDQYWEKI 153


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  155 bits (393), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 78/174 (44%), Positives = 110/174 (63%), Gaps = 16/174 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           ++ +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 24  SVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCE 82

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TPKC + C    +  ++  KHY   +Y +++  +DIMAEIYKNGP
Sbjct: 83  HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGP 142

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           VE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN
Sbjct: 143 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWN 195


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVKLSAVDLISCCEN-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 94/205 (45%), Positives = 119/205 (58%), Gaps = 23/205 (11%)

Query: 22  LSLSVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTE---ECDPYFDSTGCSH- 74
           + LS     ACC    G    GCDGG P SAWR+F  HGVV+E    C PY +   CSH 
Sbjct: 180 VPLSAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPY-NFPECSHH 238

Query: 75  ---PGCEPAY---PTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMAEIYKN 124
               G EP     P+P C   C  +N  ++ S    +H++        + ++I  EI  N
Sbjct: 239 VETKGMEPCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDN 296

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +FTVYEDF +YKSGVYKH+ G  +GGHAVK+IGWGT D  E YW++ N WN +WG
Sbjct: 297 GPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGT-DQNEQYWLVMNSWNVNWG 355

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
             G FKI  G  ECGI+ +V AG+P
Sbjct: 356 DQGIFKIAIG--ECGIDSEVTAGIP 378


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 120/200 (60%), Gaps = 16/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY--FDSTGCSH 74
           LS  D+LACC   CG GC GG+ I AW YF + GV T       + C PY  +     S+
Sbjct: 143 LSDTDILACCPN-CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESY 201

Query: 75  PGC-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
             C + ++PTPKC + C  K ++ + + K+Y+ SAYRI  +   I  EI +NGPV  SF 
Sbjct: 202 GKCPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFR 261

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGED--YWILANQWNRSWGA-DGY 188
           +Y DF  Y+ GVY    G  +GGHA+K+IGWGT   +G D  YW++AN W   WG  +GY
Sbjct: 262 IYPDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGY 321

Query: 189 FKIKRGSNECGIEEDVVAGL 208
           F+I RG N C IE+ V+AG+
Sbjct: 322 FRILRGQNHCQIEQKVIAGM 341


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/212 (41%), Positives = 124/212 (58%), Gaps = 14/212 (6%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 63
           T ++R  ++S+   + +    S  DLLACC   CG GC GGY   AW+Y+V  G+V+   
Sbjct: 115 TMSDRLCIASN---ATKKFEFSAQDLLACCK-ECGHGCGGGYSSRAWQYWVTDGIVSG-- 168

Query: 64  DPYFDSTGCSHPGCEPAY---PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIM 118
             +  S GC HP    A+    TP C   C   K  + +   K Y   +YRI  + E I 
Sbjct: 169 GDFNTSQGC-HPYSVQAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQ 227

Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
           AEI  +GPV+ S+ VY+DF  Y++GVY+H+ G+V G H+VK++GWG  ++G DYW++AN 
Sbjct: 228 AEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWG-RENGTDYWLVANS 286

Query: 179 WNRSWGA-DGYFKIKRGSNECGIEEDVVAGLP 209
           W R WG   G+FK  RG N C IE +++ G P
Sbjct: 287 WGRDWGRLGGFFKFLRGENHCDIESNILGGDP 318


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 133/212 (62%), Gaps = 16/212 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  D+L CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 130 NVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPY-SIPPCE 188

Query: 74  H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP+C + C    +  ++  KHY  S+Y ++SD  +I AEIYKNGP
Sbjct: 189 HHVNGSRPACTGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGP 248

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +FTVY DF  YKSGVY+H TGD+MGGHA++++GWG  ++G  YW++AN WN  WG  
Sbjct: 249 VEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWG-EENGVPYWLVANSWNTDWGDK 307

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           G+FKI RG + CGIE ++VAG+P +    ++I
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPRTDQYWRQI 339


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/201 (43%), Positives = 112/201 (55%), Gaps = 14/201 (6%)

Query: 21  NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----- 73
            L +S  D++ CC       DGC GG P   +  +   G V+     Y  + GC      
Sbjct: 131 QLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSG--GEYNSTNGCMSYPLP 188

Query: 74  --HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEV 129
             +P C+  Y  P C ++C K + L +   KHY+  AYRI S  E  I  EI KNGPV  
Sbjct: 189 RCNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVA 248

Query: 130 SFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
           SFTVY DF HY SGVYK      ++GGHAV++IGWG  +    YW+++N WN  WG  G 
Sbjct: 249 SFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGL 308

Query: 189 FKIKRGSNECGIEEDVVAGLP 209
           FKI RG NECGIEE++ AGLP
Sbjct: 309 FKIWRGKNECGIEEEITAGLP 329


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 81/184 (44%), Positives = 109/184 (59%), Gaps = 13/184 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS  DL++C  +    GCDGG   +AW Y  H G+VT++C PY    G +       
Sbjct: 55  NVVLSPQDLVSCNWY--NAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVA------- 105

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
              P C + C   +    + K+ +   Y + S  E IM EI  NGPV+  F+VY+DF  Y
Sbjct: 106 ---PSCPKYCNGTSTPIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSY 162

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSGVY H TG  +GGHA+K++GWG  ++ + YW++AN W   WG +G FKIKRG NECGI
Sbjct: 163 KSGVYTHQTGSFLGGHAIKIVGWGVENNVK-YWLVANSWGPDWGLNGLFKIKRGDNECGI 221

Query: 201 EEDV 204
           E DV
Sbjct: 222 EADV 225


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 116/208 (55%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGG    +W Y+V HG+VT       + TGC     P 
Sbjct: 106 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVSHGIVTGGSKE--NHTGCRPYPFPK 162

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 163 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 222

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +G VE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 223 HGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 281

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 282 GEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 78/210 (37%), Positives = 119/210 (56%), Gaps = 20/210 (9%)

Query: 19  LQNLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-- 67
            +N  LS  +LL+CC   F CG+GC+GG P  AW+Y   HG+ T         C PY   
Sbjct: 124 FKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIP 183

Query: 68  ---DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMA 119
               + G  ++P C     PTP C +KC  +          +HY +S  ++ +   +I +
Sbjct: 184 PCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQS 243

Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
           ++  NGP++ +F VY+DF  Y +G+Y H+TG+  G  +V++IGWG    G  YW+ AN W
Sbjct: 244 DVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVW-QGVPYWLCANSW 302

Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            R WG +G F++ RG+NECG+E + V+G+P
Sbjct: 303 GRQWGENGTFRVLRGTNECGLESNCVSGMP 332


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score =  154 bits (388), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/185 (44%), Positives = 111/185 (60%), Gaps = 14/185 (7%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSH 74
           +LS   L  CC + CG+GCDGG P +AW +F+ HG+VT       + C PY     G   
Sbjct: 28  NLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGR 86

Query: 75  PGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
             C +    TP C +R C   N  + +R   HY  + Y ++   EDIM +IYKNGPV+ +
Sbjct: 87  NTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAA 146

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF +YKSGVY +  G + GGHA+K++GWG  DD   YW+ AN W+RSWG +G F+
Sbjct: 147 FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGV-DDNTKYWLCANSWSRSWGENGLFR 205

Query: 191 IKRGS 195
           I RG+
Sbjct: 206 ILRGN 210


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  154 bits (388), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C PY     C H G + 
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196

Query: 80  A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K       K+   + Y +    ED   E+Y NGP    F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN W+  WG  GY  I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE    AG P +  L 
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C PY     C H G + 
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196

Query: 80  A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K       K+   + Y +    ED   E+Y NGP    F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN W+  WG  GY  I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE    AG P +  L 
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C PY     C H G + 
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196

Query: 80  A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K       K+   + Y +    ED   E+Y NGP    F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN W+  WG  GY  I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE    AG P +  L 
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 110/204 (53%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C PY     C H G + 
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196

Query: 80  A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K+      K+   + Y +    ED   E+Y NGP    F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+++ GD +GG AVK++GWG   +G  YW +AN W+  WG DGY  I
Sbjct: 255 YVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL-NGTPYWKVANSWDTDWGMDGYLLI 313

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE    AG P +  L 
Sbjct: 314 LRGNNECNIEHLGFAGTPETSQLT 337


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  153 bits (387), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 113/201 (56%), Gaps = 22/201 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-----------EECDPYFD 68
            +L +S  +L+ CC   CG+GC+GG+  +AW Y+   G+VT           + C PY  
Sbjct: 127 MHLLISAANLMECCRN-CGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPY-P 184

Query: 69  STGCSHP--GCEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
              C H   G +PA P     TP+CV  C       +    HY  SAY +     +I  E
Sbjct: 185 LPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVAEIQTE 244

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           I  NGPVE +FTVY DF  YKSGVYK  +   +GGHAVK+IGWG  +DG  YW++AN WN
Sbjct: 245 IMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWG-EEDGIPYWLIANSWN 303

Query: 181 RSWGADGYFKIKRGSNECGIE 201
             WG  GYFKI RG +ECGIE
Sbjct: 304 SDWGDHGYFKIVRGQDECGIE 324


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  153 bits (387), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGG    +W Y+V HG+VT       + TGC     P 
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y +      I  EI  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
            GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 YGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTSYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG +EC IE  +VAG   S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGG    +W Y+V HG+VT       + TGC     P 
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y +      I  EI  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVESVIQKEIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
            GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 YGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTSYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG +EC IE  +VAG   S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 84/195 (43%), Positives = 106/195 (54%), Gaps = 16/195 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++ LS  DL+ C      +GC GG   +A ++    G+V+ +C PY      + P C P
Sbjct: 117 EDVLLSFQDLVTC--DQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAP 168

Query: 80  AYP-------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
           A         TP+CV KC   +  +    H+    Y +N     I  EI  NGPVE  F 
Sbjct: 169 AQQPCLNFVDTPQCVEKCSNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFE 228

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVY+H TG  +GGH VK+IGWGT ++ E YWI  N W   WG  G F IK
Sbjct: 229 VYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGTQNN-ELYWICNNSWTTYWGNQGVFWIK 287

Query: 193 RGSNECGIEEDVVAG 207
            G NECGIE DVVA 
Sbjct: 288 AGVNECGIESDVVAA 302


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  153 bits (386), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 83/200 (41%), Positives = 115/200 (57%), Gaps = 14/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +Q L +S   L++CC   CG GCDGGYP ++W Y+V HG+ +  C PY     C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGYGCDGGYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194

Query: 79  PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              P        TPKC   C  K       K+    +Y ++ + +D   E+Y NGP  V 
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVV 251

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+  WG +G+  
Sbjct: 252 FWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLL 310

Query: 191 IKRGSNECGIEEDVVAGLPS 210
             RG+NECGIE    AG P+
Sbjct: 311 FLRGNNECGIEAAGYAGSPA 330


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  152 bits (385), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 86/203 (42%), Positives = 112/203 (55%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + LS  D+LACCG  CG GCDGGY   AW++    GVVT         C PY      
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204

Query: 73  SHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           +H G      P++P  TP C   C     + + N K  + + Y + +D   I  EI K G
Sbjct: 205 AHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKG 264

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +F +YEDF HY  GVY H  G + GGH++K+IGWG  D G  YW++AN W+  WG 
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323

Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
           D GYF++ RG N C IE  V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  152 bits (385), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT       + TGC     P 
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIG G  ++G  YW+ AN WN  W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG NEC IE ++ AGL  S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  152 bits (385), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 109/204 (53%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S  DL+ACC   CGDGC GG+P  AW Y+V +G+ + +C PY     C H G + 
Sbjct: 138 KQLRISAADLMACCK-QCGDGCKGGFPGFAWLYYVEYGITSSQCQPY-PFPHCEHRGAQG 195

Query: 80  --------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K+      K+   + Y +    ED   E+Y NGP    F
Sbjct: 196 NKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 253

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+++ GD +GG AV+++GWG   +G  YW +AN W+  WG +GY  I
Sbjct: 254 FVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLI 312

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE     G P    L 
Sbjct: 313 LRGNNECNIEHLGFTGFPDPSQLT 336


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  152 bits (385), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 95/216 (43%), Positives = 127/216 (58%), Gaps = 30/216 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
           +S  D+L CCG  CG+GC GG  + A +++  +G VT      + C PY     CS+  C
Sbjct: 123 ISAEDILTCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--C 179

Query: 78  EPAYPTPKCVRKCVKKNQL--WRNSKHYS---------------ISAYRINSDPED---I 117
             +  TP C  KC     +  ++  KHY                 SAYR+++       I
Sbjct: 180 VESKTTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPII 239

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
             EIY+NGPVEV++TVY+DF HYKSGVY H+TG   GGHAVK+IGWGT + G DYW++ N
Sbjct: 240 QNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWLVTN 298

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
            W  S+G  G+FKI+RG+NECGIE +VVAG+    N
Sbjct: 299 SWGTSFGDKGFFKIRRGTNECGIESNVVAGMAKVGN 334


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
           Q++ LS  DL++CC   CG GCDGG    +W Y+V HG+VT       + TGC     P 
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195

Query: 77  CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C+              Y TP+C + C K  N  +   KHY   +Y +      I  EI  
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
            GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  W
Sbjct: 256 YGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
           G  GYF+I RG +EC IE  +VAG   S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 112/199 (56%), Gaps = 17/199 (8%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
           + +S  +LL+CC   CG GC+GGYP  AW Y++  G+ T       + C PY     C H
Sbjct: 131 VPVSAENLLSCCDS-CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPY-SLQPCEH 188

Query: 75  ------PGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                   C    Y TP C  KC      +++   +   + R      +I  EI  NGPV
Sbjct: 189 HTEGNKVQCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTNGPV 248

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +F VY DF +YKSGVY+H+ G+ +GGHAV+++GWG  + G  YW++AN WN  WG  G
Sbjct: 249 EAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWG-EESGVPYWLVANSWNEDWGDKG 307

Query: 188 YFKIKRGSNECGIEEDVVA 206
            FKI+RG+NE G E+ +VA
Sbjct: 308 LFKIRRGNNESGFEDSIVA 326


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 113/203 (55%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + LS  D+LACCG  CG GCDGGY   AW++    GVVT         C PY      
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204

Query: 73  SHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           +H G      P++P  TP C   C     + + N K  + + Y + +D   I  EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +F +YEDF HY+ GVY H  G + GGH++K+IGWG  D G  YW++AN W+  WG 
Sbjct: 265 PVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323

Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
           D GYF++ RG N C IE  V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 111/202 (54%), Gaps = 23/202 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
             +    D+L+CC   CG GCDGG P + W Y+V +G+ +            SH GC+ +
Sbjct: 181 QFNFGAYDVLSCC-HRCGFGCDGGVPSAVWHYWVENGITS-------GGAFGSHEGCQ-S 231

Query: 81  YP------------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           YP            TP+C+R C    N  +   KHY   AY +  D E IM E++  GP 
Sbjct: 232 YPFDVCKKSGDSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPA 291

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           + +FT+Y DF  YKSGVY+H  G  +G H+VK++GWG  +D + YW+ AN W   WG  G
Sbjct: 292 QATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVK-YWLCANSWGAQWGDGG 350

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           +FKI RG +    E +VVAGLP
Sbjct: 351 FFKIVRGEDHLSFETNVVAGLP 372


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 113/206 (54%), Gaps = 22/206 (10%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
           + LS  +LL+CC  LCG GC GG+P  AW ++  HG+VT     Y    GC      P Y
Sbjct: 164 VRLSAGNLLSCCK-LCGKGCKGGFPGGAWMHWSKHGIVTG--GSYSSDYGCQKYQFFPCY 220

Query: 82  -PTPK----------------CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
            P  K                C   C    N+ ++   +Y  S YRI +D   I  EI +
Sbjct: 221 QPRTKGSIKNKCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPNDARAIQLEIME 280

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           NGPV+ +  +YEDF HYK GVY+H+ G  +  HAVK+ GWGT + G  YW+ AN W++ W
Sbjct: 281 NGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGT-EGGTPYWLAANPWSKRW 339

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G  G+FKI RGSN   IE+ V+AG+P
Sbjct: 340 GNGGFFKILRGSNHAEIEDHVMAGIP 365


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      F
Sbjct: 135 NELLSAEELAFCC-HKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPF 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++    Y+  AY +N   + I  ++   GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYLNY--QIIQNDLMTYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E S+ VY+DF +YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 109/200 (54%), Gaps = 20/200 (10%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
              S +DLLACC   CG GCDGG P  A+ Y+V  G+V+       E C PY  S   + 
Sbjct: 133 FEFSADDLLACCT-ACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYEGSAFLNS 191

Query: 75  PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSF 131
                   TPKC  KC+  K    +   KHY     Y  + +  +I  EI  NGPV    
Sbjct: 192 V-------TPKCSTKCLNSKYTTPYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHM 244

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG-ADGYFK 190
            VYEDF  YKSGVY+H++G+ MGGHAVK+IGWGT + G  YW++AN W   W   DG++K
Sbjct: 245 DVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGT-EKGVPYWLIANSWGAKWADLDGFYK 303

Query: 191 IKRGSNECGIEEDVVAGLPS 210
           I RG N C IE  +  G P 
Sbjct: 304 ILRGKNHCKIETYIYGGTPQ 323


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 110/203 (54%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  ++  CC   CG GC+GGYPI AW+YF  HG+VT       E C+PY       
Sbjct: 138 NELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           D  G S    +P     +C R C     L  N  H     Y   +    I  ++   GP+
Sbjct: 197 DEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPI 255

Query: 128 EVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           E SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N WN  WG +
Sbjct: 256 EASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGDN 314

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI+RG++ECGI+    AG+P
Sbjct: 315 GLFKIRRGTDECGIDSAATAGVP 337


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 103/165 (62%), Gaps = 1/165 (0%)

Query: 47  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 105
           +S   Y  + G +  E  P       +   C+    TP CV+KC +  ++ +    H+  
Sbjct: 18  VSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 77

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
           SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG 
Sbjct: 78  SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 137

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
            +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 138 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 182


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 120/201 (59%), Gaps = 15/201 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           ++ L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V       T+ C PY     C
Sbjct: 135 DVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPY-PFKPC 192

Query: 73  SHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +P  GC P   TP C   C +  +  +R  K+Y  +AY++ +D   I  EI  NGPVE 
Sbjct: 193 LYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVES 251

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F+VY+D   YK+GVY+H+ G  +G HAV+LIGWG  + G  YW++AN +   WG  GYF
Sbjct: 252 GFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWLIANSYGEDWGEHGYF 310

Query: 190 KIKRGSNECGIEEDVVAGLPS 210
           K  RGSN  GIE  V+AGLP 
Sbjct: 311 KFLRGSNHLGIESVVIAGLPK 331


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 120/201 (59%), Gaps = 15/201 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           ++ L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V       T+ C PY     C
Sbjct: 135 DVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPY-PFKPC 192

Query: 73  SHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            +P  GC P   TP C   C +  +  +R  K+Y  +AY++ +D   I  EI  NGPVE 
Sbjct: 193 LYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVES 251

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F+VY+D   YK+GVY+H+ G  +G HAV+LIGWG  + G  YW++AN +   WG  GYF
Sbjct: 252 GFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWLIANSYGEDWGEHGYF 310

Query: 190 KIKRGSNECGIEEDVVAGLPS 210
           K  RGSN  GIE  V+AGLP 
Sbjct: 311 KFLRGSNHLGIESVVIAGLPK 331


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSAEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 111/203 (54%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  +L  CC   CG+GC+GGYPI AW+YF  HG+VT       E C+PY       
Sbjct: 137 NELLSAEELTFCC-HTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPR 195

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           +  G S    +P     +C R C     L  N  H     Y   +    I  ++   GP+
Sbjct: 196 NEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPI 254

Query: 128 EVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           E SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N W+  WG +
Sbjct: 255 EASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWLMVNSWSAQWGDN 313

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI+RG++ECGI+    AG+P
Sbjct: 314 GLFKIRRGTDECGIDSATTAGVP 336


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 108/206 (52%), Gaps = 20/206 (9%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           ++ L +S  DLLACCG  CG GC GG P  AW YF   G+ +  C PY     CSH    
Sbjct: 137 VRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYFSSEGIASGRCQPY-PFPRCSHYTNS 194

Query: 79  PAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             YP        TP C   C       + +R  K YS+S        ED   E+Y  GP 
Sbjct: 195 TTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSLSG------EEDFRRELYFRGPF 248

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           +  F V+ D   YK GVYKH+ G  +G HAV+++GWG +  G  YW +AN WN  WG  G
Sbjct: 249 QAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIANSWNAEWGDRG 307

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           YF + RG NECGIE+   AG+P+  N
Sbjct: 308 YFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 85/195 (43%), Positives = 117/195 (60%), Gaps = 13/195 (6%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHP 75
           LS  D+LACCG  CG GC+GGYPI A+ Y  + GV +         C PY F     ++ 
Sbjct: 141 LSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYG 200

Query: 76  GC--EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 131
            C  E A+ TPKC + C  +  + +   K +  +++ +  D E  I  EI+ NGPV  +F
Sbjct: 201 PCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANF 260

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            V+EDF HYK G+YK   G  +G HA+KLIGWGT ++G DYW++AN +N  WG +G F+I
Sbjct: 261 YVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGT-ENGTDYWLVANSYNYDWGENGTFRI 319

Query: 192 KRGSNECGIEEDVVA 206
            RG+N C IE  V+A
Sbjct: 320 LRGTNHCLIESQVIA 334


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  +S  +L  CC   CG GC+GG P+ AW+YF  HGVVT       + C PY       
Sbjct: 134 NELISAEELTFCC-HRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVK 192

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P  P  KC R C           HY   +AY +N D   +  +    GP
Sbjct: 193 DEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDT--MQKDTIAYGP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF +Y+SGVY+       +GGHAVK+IGWG  +DG  YW++ N W   WGA
Sbjct: 251 IEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWG-EEDGTPYWLMVNSWGEQWGA 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI RG+NECGIE    AG+P
Sbjct: 310 NGMFKILRGTNECGIEGSPTAGVP 333


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 110/204 (53%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  ++  CC   CG GC GGYPI AW+YF  HG+VT       E C+PY       
Sbjct: 138 NELLSAEEITFCC-HTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPR 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     +C R C     L  N  H ++   Y +      I  ++   GP
Sbjct: 197 DDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG--SIQKDVMTYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  ++G  YW++ N WN  WG 
Sbjct: 255 IEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+    AG+P
Sbjct: 314 KGLFKIRRGTNECGIDNSTTAGVP 337


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 87/214 (40%), Positives = 121/214 (56%), Gaps = 21/214 (9%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
           T ++R  + SS     +    S  DLL+CC   CG  C GGY ++A+ +++  GVV+   
Sbjct: 117 TMSDRICIHSS---GAKKFFFSAEDLLSCCT-ACGS-CSGGYMMAAFDFYIKQGVVSGGD 171

Query: 61  ----EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE 115
               E C PY   T  +H        TP C + C K     + + KHY    Y +++   
Sbjct: 172 LNSNEGCRPY---TADAHDKG----VTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVS 224

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
           +I  EI  NGP+ VSF VY+DF +Y SGVY H++G+  G H VK++GWGT  + +DYW++
Sbjct: 225 NIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKE-QDYWLI 283

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           AN W  SWG  G+FKI RG NECGIE +  A LP
Sbjct: 284 ANSWGSSWGEHGFFKILRGKNECGIENNPYAVLP 317


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 108/204 (52%), Gaps = 13/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S  DLL+CC   CGDGC GG+P  AW Y+V +G+ +  C PY     C H G + 
Sbjct: 138 KQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYYVEYGIASSGCQPY-PFPHCEHRGAQG 195

Query: 80  --------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                    + TPKC   C  K+      K+   + Y +    ED   E+Y NGP    F
Sbjct: 196 NKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 253

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            VY D   YKSGVY+++ GD +GG AV+++GWG   +G  YW +AN W+  WG +GY  I
Sbjct: 254 FVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLI 312

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE     G P    L 
Sbjct: 313 LRGNNECNIEHLGFTGFPDPSQLT 336


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 118/208 (56%), Gaps = 20/208 (9%)

Query: 21  NLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST- 70
           N  LS  +LL+CC G L CG+GC GG    AW+Y+  HG+ T         C PY  +  
Sbjct: 78  NTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPC 137

Query: 71  -----GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
                  ++P C     PTP C +KC  KN         +HY  S  ++ +   +I +++
Sbjct: 138 GKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDV 197

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
             NGP+E +F VY+DF  Y +G+Y H+TG+  G  +V+++GWG   +G  YW+LAN W +
Sbjct: 198 MLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGK 256

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WG +G F+  RG+NECG+E + V+G+P
Sbjct: 257 EWGENGTFRALRGTNECGLEANCVSGMP 284


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 107/194 (55%), Gaps = 9/194 (4%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCE 78
           +  +    D LACC       CDGGY    W+Y+V  G+ +E   PY    GC S+P   
Sbjct: 130 KQFTFGATDYLACCTDCFK--CDGGYVGKTWQYWVDSGLTSE--GPYKSGQGCNSYPFGS 185

Query: 79  PAY--PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
                P P C R C     L +     Y  SAYR+  +   IM EIY+NGPV V F V+ 
Sbjct: 186 YCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVVQFEVFA 245

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVY+H+TG   G HAV++IGWG  ++G  YW++AN W   WG  G+FK  RG 
Sbjct: 246 DFYQYKSGVYRHVTGATEGWHAVRVIGWGV-ENGVKYWLVANSWGVRWGDKGFFKFVRGE 304

Query: 196 NECGIEEDVVAGLP 209
           N  GIE+ V AGLP
Sbjct: 305 NHLGIEDFVYAGLP 318


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 115/203 (56%), Gaps = 17/203 (8%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
           S   + +S  D+L+CCG  CG GC GG+PI A+R+    GVVT       + C PY    
Sbjct: 135 STIKVMISDTDILSCCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYP 194

Query: 71  GCSH-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
              H       P     +PTPKC +   +K N+ ++  KH++  +Y + ++   I  EIY
Sbjct: 195 CGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIY 254

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
           KNGPV  +F VYED++    G+Y H  G   G HA K+IGWG  ++G DYW++AN WN  
Sbjct: 255 KNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAHADKVIGWG-RENGTDYWLIANSWNTD 312

Query: 183 WGADGYFKIKRGSNECGIEEDVV 205
           WG DGY++I R ++ C IE  +V
Sbjct: 313 WGEDGYYRIVRETDNCEIERQMV 335


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 107/206 (51%), Gaps = 20/206 (9%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           ++ L +S  DLLACCG  CG GC GG P  AW YF   G+ +  C PY     CSH    
Sbjct: 137 VRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY-PFPRCSHYTNS 194

Query: 79  PAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             YP        TP C   C       + +R  K YS S        ED   E+Y  GP 
Sbjct: 195 TTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSFSG------EEDFRRELYFRGPF 248

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           +  F V+ D   YK GVYKH+ G  +G HAV+++GWG +  G  YW +AN WN  WG  G
Sbjct: 249 QAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIANSWNAEWGDRG 307

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           YF + RG NECGIE+   AG+P+  N
Sbjct: 308 YFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  149 bits (376), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 138 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   GP
Sbjct: 197 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 255 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 314 QGLFKIRRGTNECGIDNSTTGGVP 337


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  149 bits (376), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  149 bits (376), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC+GGYPI AW  F  HG+VT       E C PY       
Sbjct: 134 NELLSAEELTFCC-HKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPL 192

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     +C R C     L + N  HY+  AY +      I  ++   GP
Sbjct: 193 DEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNNDHHYTRDAYYLTYGT--IQNDVLTYGP 250

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 251 IEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 310 QGLFKIRRGTNECGIDNSTTGGVP 333


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  149 bits (376), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 138 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   GP
Sbjct: 197 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 255 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 314 QGLFKIRRGTNECGIDNSTTGGVP 337


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 109/199 (54%), Gaps = 15/199 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSH 74
           N SLS  DLL+CC   CG GC  G+   AW ++  HG+VT   + +P     F    C H
Sbjct: 101 NKSLSATDLLSCCED-CGLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGH 159

Query: 75  ------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 P C    YPTP+C+++C +    +   K  +  +Y +      IM EI  NGPV
Sbjct: 160 RRKGRYPPCPRHIYPTPECIKQCDEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPV 219

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E SF +Y DF  Y  GVY H  G  +  HA++++GWG  DDG  YW++AN WN  WG  G
Sbjct: 220 EASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWG-EDDGVPYWLIANSWNEDWGEKG 278

Query: 188 YFKIKRGSNECGIEEDVVA 206
           Y +  RG NECGIEE+V A
Sbjct: 279 YVRFLRGHNECGIEEEVTA 297



 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 86/266 (32%), Positives = 113/266 (42%), Gaps = 76/266 (28%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
           N SLS  DL++CC   CG GC GGY   AW ++  HG+VT         TGC     P C
Sbjct: 689 NKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDFWKTHGIVTGGSKE--KPTGCRSYPFPSC 745

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSK------------------------ 101
           E              YPTP+C+++C  K   +   K                        
Sbjct: 746 EHRGKGQYPPCPHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFG 805

Query: 102 ----------------HYSIS-----------------AYRINSDPEDIMAEIYKNGPVE 128
                           H+SI                  +Y +    + +M EI   GPV 
Sbjct: 806 EASAQRTLHLTCLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVG 865

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
               VYED   YKSGVY H+ G  +G H ++++GWG  +DG  YW++AN WN  WG  GY
Sbjct: 866 AILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWLVANSWNEDWGEKGY 924

Query: 189 FKIKRGSNECGIEEDVVAGLPSSKNL 214
            ++ R  NECGI + V AGLP   N 
Sbjct: 925 MRVLRWRNECGIVDQVTAGLPDLSNF 950


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 111/203 (54%), Gaps = 16/203 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + LS  D+LACCG  CG GCDGGY   AW++    GVVT         C PY      
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204

Query: 73  SHPGCE----PAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           +H G      P++P     RK   +    + + N K  + + Y + +D   I  EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV  +F +YEDF HY  GVY H  G + GGH++K+IGWG  D G  YW++AN W+  WG 
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323

Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
           D GYF++ RG N C IE  V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 110/204 (53%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW +F  HG+VT       E C PY       
Sbjct: 135 NELLSAEELAFCC-HKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C    +L ++   H++  AY +      I  ++   GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTY--TTIQKDVMAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF +YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI RG+NECGI+     G+P
Sbjct: 311 QGLFKILRGTNECGIDNSTTGGVP 334


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/184 (42%), Positives = 109/184 (59%), Gaps = 18/184 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS  DL++C     G  C+GGY  ++W + +  G+ TE C PY   +G            
Sbjct: 113 LSPQDLISCDSNDLG--CNGGYQENSWTWVLTTGITTESCWPYRSGSG----------RI 160

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           P C  +CV  + L RN+    I+ YR   D  ++  E+Y NGP++V++ VYEDF +Y  G
Sbjct: 161 PSCPHRCVNGSVLQRNT----INNYR-RLDSSELQDELYNNGPIQVTYVVYEDFFYYSKG 215

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           +YKH++G+ +GGHAV L+GWG  +DG  YW++ N W   WG  GYF+I RGSNECGIE  
Sbjct: 216 IYKHLSGNKVGGHAVVLMGWGI-EDGVKYWLVQNSWGYEWGEQGYFRILRGSNECGIESS 274

Query: 204 VVAG 207
             AG
Sbjct: 275 AYAG 278


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 106/179 (59%), Gaps = 17/179 (9%)

Query: 45  YPISAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKC 90
           +P  AW Y+V  G+VT   EE    C PY     C H      P C    Y TP+C + C
Sbjct: 163 FPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTC 221

Query: 91  VKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 149
            K  +  +   KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+T
Sbjct: 222 QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVT 281

Query: 150 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           G ++GGHA+++IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 282 GSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 111/206 (53%), Gaps = 23/206 (11%)

Query: 24  LSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA- 80
           +S  DL +CC   F CG GCDGGY    W Y+   G+VT     Y  S GC     EP  
Sbjct: 137 VSAEDLNSCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTG--GAYNSSQGCKDYSLEPCE 194

Query: 81  ---------------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
                          + TP+CVR C + +  +  S  +        ++ + +  EI KNG
Sbjct: 195 HHVEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFTNEKQ-MQLEILKNG 253

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           P+E +FTVY DF  YKSGVY+    D  +GGHA+K++GWG  ++G  YW++AN WN  WG
Sbjct: 254 PIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGV-EEGTKYWLIANSWNTDWG 312

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
            +GYFK  RG + CGIE +  A LP+
Sbjct: 313 DNGYFKFLRGVDHCGIESETAASLPA 338


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 18/189 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS  D+++C       GCDGG   +AW +  + G+V + C PY    G            
Sbjct: 137 LSPQDMVSCD--YNDMGCDGGNLDNAWWWMKNKGIVPDSCMPYVSGGG----------NV 184

Query: 84  PKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P C   C   N     QL+       IS +       DI  EIY NGPV+  F+VY+DF 
Sbjct: 185 PACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFM 244

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           +YKSGVY H TG  +GGHA+K+IGWG  + G DYW++AN W+  WG DG FKI RG NEC
Sbjct: 245 NYKSGVYSHKTGSFLGGHAIKIIGWGV-EGGVDYWLVANSWSTDWGIDGTFKILRGHNEC 303

Query: 199 GIEEDVVAG 207
           GIE+DV AG
Sbjct: 304 GIEDDVYAG 312


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/184 (41%), Positives = 104/184 (56%), Gaps = 18/184 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           +S  DL++C       GC+GG P+ +W +  H G+ TEEC PY    G            
Sbjct: 112 MSPQDLVSC--DKVDHGCNGGSPLFSWEWVKHSGITTEECIPYVSGGG----------RV 159

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           P C +KC   + + R +K  S+   +     + +  E+Y  GP E +F+VYEDF  YKSG
Sbjct: 160 PSCPKKCTNGSAIVR-TKAKSVGLVK----GDKMQNELYSRGPFEAAFSVYEDFKSYKSG 214

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           VY HITG ++GGHAV ++GWG  +DG  YW++ N W  +WG  G+FKI RG NECGIE  
Sbjct: 215 VYHHITGKMLGGHAVMVVGWGV-EDGTPYWLIQNSWGTTWGEQGFFKILRGKNECGIETT 273

Query: 204 VVAG 207
              G
Sbjct: 274 CFQG 277


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 80/198 (40%), Positives = 114/198 (57%), Gaps = 13/198 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPG--C 77
             S    D+L+CC   CG GCDGG P + W Y+V +G+ +     Y    GC S+P   C
Sbjct: 185 QFSFGAYDVLSCC-HRCGFGCDGGVPSAVWHYWVENGITSG--GAYESHEGCQSYPFGVC 241

Query: 78  EPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           +P      +    C+R+C    N  +   KH+   AY +  D + I+ E++  GPV+ SF
Sbjct: 242 KPQEIFAPHVDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASF 301

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           TVY DF  YKSGVY+H  G  +G H+VK++GWG  ++G  +W+ AN W   WG +G+FKI
Sbjct: 302 TVYTDFIQYKSGVYRHTYGVRVGDHSVKIVGWGV-ENGTKFWLCANSWGAEWGENGFFKI 360

Query: 192 KRGSNECGIEEDVVAGLP 209
            RG +   +E +VVAGLP
Sbjct: 361 IRGEDHLSVESNVVAGLP 378


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     +C R C     L ++   HY+  AY +      I  ++   GP
Sbjct: 194 DEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 80/191 (41%), Positives = 106/191 (55%), Gaps = 19/191 (9%)

Query: 37  CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------------PAY 81
           CG GCDGG+   +W Y+V  G+VT       + TGC     P C+              Y
Sbjct: 142 CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLY 199

Query: 82  PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
            TP+C + C K  N  +   KHY   +Y + S    I  +I  +GPVE    +YEDF +Y
Sbjct: 200 KTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNY 259

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC I
Sbjct: 260 KSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSI 318

Query: 201 EEDVVAGLPSS 211
           E ++ AGL  S
Sbjct: 319 ESEIAAGLIKS 329


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 78/176 (44%), Positives = 106/176 (60%), Gaps = 17/176 (9%)

Query: 48  SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
            AW Y+V  G+VT   EE    C PY     C H      P C    Y TP+C + C K 
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224

Query: 94  NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
            +  ++  KHY   +Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG +
Sbjct: 225 YKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSI 284

Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +GGHA+++IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 86/210 (40%), Positives = 113/210 (53%), Gaps = 21/210 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  +S  +L++CC + CG GC+GG+P +AW +   HG+VT       + C PY     C 
Sbjct: 142 NGHISSRELMSCCSY-CGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPY-PIAPCE 199

Query: 74  H------PGCE--PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
           H      P C   P  PTP C   C   + L ++  +    SAY +    +    EI+KN
Sbjct: 200 HHMEGSKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKN 259

Query: 125 GPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           GP+  +F VYEDF  YKSGVYK H      G HAVK+IGWG   +G  YW++ N W+  W
Sbjct: 260 GPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWG-EQNGLPYWLVQNSWDYDW 318

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
           G  G FKI RG NEC  E+ + AGLP  K 
Sbjct: 319 GDKGLFKIARG-NECDFEKSMTAGLPKYKK 347


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  146 bits (369), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 114/206 (55%), Gaps = 21/206 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  +S  +L  CC   CG GC+GGYP+ AW+YF  HGVVT       + C PY       
Sbjct: 134 NELISAEELTFCC-HRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVK 192

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     KC +KC   + +     HY    AY + +        +Y  GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--GP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV--MGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           +E SF VY+DF +Y+SGVY+  TG+   +GGHAVK+IGWG  ++G  YW++ N W   WG
Sbjct: 251 IEASFDVYDDFMNYESGVYQR-TGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
             G FKI RG++ECGIE    AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score =  146 bits (369), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 114/204 (55%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS  +L  CC   CG+GC+GGYPI AWRYF   GV T       E C PY     ++
Sbjct: 135 NELLSPEELAFCCK-DCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYN 193

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G +  G +P     +C + C  K       ++ + S Y INS  + I  +I   GPVE
Sbjct: 194 KQGKNTCGGKPMERNHQCPKTCYGKTT--DQKRYKTKSEYVINS-IKTIEQDIKTYGPVE 250

Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF VY+DF+ YKSG+Y+         GH+VK+IGWG  ++G  YW+  N W++ WG  G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG-QENGTPYWLAVNSWSKFWGDHG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
            FKI +G NECGIE  V AG+PSS
Sbjct: 310 TFKIIKGKNECGIERAVTAGIPSS 333


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC+GGYPI AW +F  HG+VT       E C+PY      +
Sbjct: 138 NQLLSAEELTFCC-HKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D +G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   GP
Sbjct: 197 DESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           VE SF VY+DF  YKSGVY +      +GGHA KLIGWG  + G  YW++ N WN  WG 
Sbjct: 255 VEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWG-EEYGVPYWLMVNSWNADWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI+RG+NECGI+     G+P
Sbjct: 314 NGLFKIQRGTNECGIDNSTTGGVP 337


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 68/137 (49%), Positives = 93/137 (67%), Gaps = 13/137 (9%)

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
           CE  Y T             ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ D
Sbjct: 2   CEAGYSTS------------YKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 49

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           F  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N
Sbjct: 50  FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGEN 108

Query: 197 ECGIEEDVVAGLPSSKN 213
            CGIE ++VAG+P ++ 
Sbjct: 109 HCGIESEIVAGIPRTQQ 125


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 106/204 (51%), Gaps = 20/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            + LS  D+L CC   CG  C GGY   AW Y    GVVT       E C  Y     CS
Sbjct: 119 QVRLSAEDVLECCK-DCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSY-PFPPCS 176

Query: 74  HPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKN 124
           H G E  YP         PKC   C +   +      Y  S  Y++ ++ + I  EI +N
Sbjct: 177 H-GIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMEN 235

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV+ SF VYEDF  YKSG+Y H+ G  M  H VK+IGWG  ++GE YW   N WN  WG
Sbjct: 236 GPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWG-EENGEAYWKAVNSWNSEWG 294

Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
            +G F+I+ G+NEC IE  V  GL
Sbjct: 295 ENGLFRIRLGTNECTIESQVEGGL 318


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 83/209 (39%), Positives = 109/209 (52%), Gaps = 19/209 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
           N SLS  DL++CC   CG GC GGY   AW  +  HG+VT         TGC     P C
Sbjct: 136 NKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDLWKTHGIVTGGSKE--KPTGCRSYPFPSC 192

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           E              YPTP+C+++C  K   +   K  +  +Y +    + +M EI   G
Sbjct: 193 EHRGKGQYPPCPHQLYPTPECIKRCDTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PV     VYED   YKSGVY H+ G  +G H ++++GWG  +DG  YW++AN WN  WG 
Sbjct: 253 PVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWLVANSWNEDWGE 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
            GY ++ R  NECGI + V AGLP   N 
Sbjct: 312 KGYMRVLRWRNECGIVDQVTAGLPDLSNF 340


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 117/203 (57%), Gaps = 18/203 (8%)

Query: 24  LSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST---- 70
           LS  +LL+CC G L CG+GC GG P+ AW+Y+  HG+ T         C PY  +     
Sbjct: 138 LSAQELLSCCTGVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKT 197

Query: 71  --GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
               ++P C     PTP C +KC     +     +HY +S  ++ +   +I +++  NGP
Sbjct: 198 IGNVTYPPCTNTTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGP 257

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           VE +  +Y+DF  Y +G+Y H+ G+  G  +V+++GWG   +G  YW+LAN W + WG +
Sbjct: 258 VEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRILGWGMF-EGVPYWLLANSWGKEWGEN 316

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G F++ RG NECG+E + ++G+P
Sbjct: 317 GTFRVLRGVNECGLEANCISGMP 339


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 108/203 (53%), Gaps = 18/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS   L  CC + CG GC GG PI AW+YF  HG+ T       E C PY     +D
Sbjct: 136 NEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPCYD 194

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G      +P     KC R C   + +      Y + +  +    + I  +I K GPVE
Sbjct: 195 DQGEFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVKSIYVLDSSKTIEQDIRKYGPVE 251

Query: 129 VSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF VY+DF  YKSG+Y+       +GGH+VKLIGWG  +DG  YW+L N W++ WG  G
Sbjct: 252 ASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQG 310

Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
            F+I +G NECGIE    AG+PS
Sbjct: 311 TFRIIKGRNECGIERSATAGVPS 333


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 107/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C + C     L ++   HY+  AY +      I  ++   GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY  +     +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECG +     G+P
Sbjct: 311 QGLFKIRRGTNECGTDNSTTGGVP 334


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GG PI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   HY+  AY +      I  ++   GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQYDVLAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 86/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPL 193

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C + C     L ++   HY+  AY +      I  ++   GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECG +     G+P
Sbjct: 311 QGLFKIRRGTNECGTDNSTTGGVP 334


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  145 bits (367), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 83/195 (42%), Positives = 109/195 (55%), Gaps = 17/195 (8%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 74
           D+L+CC + CG GCDGG P +A+ + + +GV T         C PY       H      
Sbjct: 146 DILSCC-WNCGMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYF 204

Query: 75  -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
            P  +  +PTPKC + C +K N  +++ K Y   AY + ++   IM EI+ NGPV  SF+
Sbjct: 205 GPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFS 264

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           V+ DFA YK GVY        G HAVK+IGWG  D G  YW++AN WN  WG +GY +  
Sbjct: 265 VFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQD-GLKYWLIANSWNNDWGDEGYVRFL 323

Query: 193 RGSNECGIEEDVVAG 207
           RG N CGIE  VV G
Sbjct: 324 RGDNHCGIESRVVTG 338


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  145 bits (367), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)

Query: 49  AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 94
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 95  -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           GGHAVK+IGWG  + G  YW++AN W+  WG  G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS  +L  CC   CG GC GGYPI AW+YF   GV T       E C PY     +D
Sbjct: 135 NQLLSPEELAFCC-MDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYD 193

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G +  G +P     +C + C  K  +    ++ + + Y INS  E I  ++   GPVE
Sbjct: 194 EQGKNTCGGKPMERNHQCPKTCYGKTTV--QDRYKTKNEYVINS-IETIEQDLMTYGPVE 250

Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF VY+DF+ YKSG+Y+        GGH++K+IGWG  ++G  YW+  N W++ WG  G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWLAVNSWSKFWGDHG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
            FKI +G NECGIE  V AG+PS+
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPST 333


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/175 (45%), Positives = 106/175 (60%), Gaps = 18/175 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           + + LS  D+LACC + CG GC+GG+P+ AW+YF   GVVT         C PY +   C
Sbjct: 23  KQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPY-EFPPC 80

Query: 73  SHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
              G EP Y        TPKC + C +   + ++  KH+  SAYR+ ++ + I  +I KN
Sbjct: 81  GRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKN 140

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
           GPV   F VYEDFAHYKSG+YKH  G + GGHAVK+IGWG  + G  YW++AN W
Sbjct: 141 GPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWG-KEXGTPYWLIANSW 194


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 78/176 (44%), Positives = 104/176 (59%), Gaps = 17/176 (9%)

Query: 48  SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
            AW Y+V  G+VT   EE    C PY     C H      P C    Y TP+C + C K 
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224

Query: 94  NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
            +  +   KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSI 284

Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +GGHA+++IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/200 (41%), Positives = 111/200 (55%), Gaps = 15/200 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  LS   + +CC + CG GC GGYPI AWRY+  HG+VT       E C PY       
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192

Query: 74  HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 130
           +  C   +    KC +KC     + +R  + Y   S Y +  D  ++  +I   GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250

Query: 131 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           F VY+DF  YKSGVY K      +GGH+VK IGWG  +    YW++ N WN +WG  GYF
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGV-ERNVSYWLMMNSWNSTWGDGGYF 309

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI+RG+NEC +E+   AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGVP 329


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/183 (43%), Positives = 106/183 (57%), Gaps = 18/183 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           +S  DL++C       GC+GGY   AW +   HG+ TE+C PY   +G            
Sbjct: 112 MSPQDLVSC--DTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSG----------RV 159

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           P C  KCV  + + RN    S+S  ++N+  + +M E+Y+NGP+ V+FTVY DF +YKSG
Sbjct: 160 PACPAKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSG 214

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           VY H TG + GGHAV  +GWG  +D   YW+  N W  +WG  G+FKI RGSN CGIE  
Sbjct: 215 VYVHKTGGIAGGHAVLCVGWGV-EDNTPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQ 273

Query: 204 VVA 206
             A
Sbjct: 274 SYA 276


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 112/195 (57%), Gaps = 21/195 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           Q + +S  D+L+C       GC+GGYP  A+ ++   GVVT        S   ++ GC+P
Sbjct: 158 QKVHISAQDILSCATDR-SQGCNGGYPDEAFEHYAQSGVVT-------GSGNSANQGCKP 209

Query: 80  ---------AYPTPKCVRKC--VKKNQLWRNSKHYSISAYRIN-SDPEDIMAEIYKNGPV 127
                     Y TP+C +KC   +  + ++  KH+ +S Y +  SDP DI  EI  NGPV
Sbjct: 210 YPFLPHTTVEYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPV 269

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGAD 186
           E +  VY DF  YKSGVY+ +    +GGHAV+++GWG     +  YW++AN WN  WG D
Sbjct: 270 EANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLVANSWNTDWGED 329

Query: 187 GYFKIKRGSNECGIE 201
           GYF+I+RG++E  IE
Sbjct: 330 GYFRIRRGTDESYIE 344


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/167 (45%), Positives = 99/167 (59%), Gaps = 16/167 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC+GGY   AW +   HGV TE+C PY   +G            P C  KCV  + + RN
Sbjct: 126 GCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRN 175

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
               S+S  ++N+  + +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV 
Sbjct: 176 K---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVL 230

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
            +GWG  D+   YW+  N W  +WG  G+FKI RGSN CGIE    A
Sbjct: 231 CVGWGVEDN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/200 (41%), Positives = 112/200 (56%), Gaps = 18/200 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           LS  +LL+CC   CG GC+GGYP   ++Y+V+ G+ T       + C PY        P 
Sbjct: 343 LSDAELLSCCT-SCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPY------PIPP 395

Query: 77  CE--PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
           C       TPKC + C+    L  N  +HY  + Y+     + +M +I   GP+    +V
Sbjct: 396 CSNCSETRTPKCSKSCISTYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSV 455

Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           YEDF HYK GVY   +G  +GGHAV++IGWG  D+   YW++AN WN ++G DG FKI+R
Sbjct: 456 YEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDN-IPYWLVANSWNTTFGEDGLFKIRR 514

Query: 194 GSNECGIEEDVVAGLPSSKN 213
           G +ECGIE  V AG    K 
Sbjct: 515 GFDECGIESYVSAGRAKCKQ 534


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/211 (36%), Positives = 119/211 (56%), Gaps = 21/211 (9%)

Query: 19  LQNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS 69
           + N  LS  +LL+CC G L CG+GC GG    AW+Y+  HG+ T         C PY  +
Sbjct: 120 MINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIA 179

Query: 70  T------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAY-RINSDPEDIM 118
                    ++P C     PTP C +KC  KN         +HY  S+  ++ +   +I 
Sbjct: 180 PCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQ 239

Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
           +++  NGP+E +F VY+DF  Y +G+Y H+TG+  G  +V+++GWG   +G  YW+LAN 
Sbjct: 240 SDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANS 298

Query: 179 WNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           W + WG +G F+  RG+NECG+E + V+ +P
Sbjct: 299 WGKEWGENGTFRALRGTNECGLEANCVSAMP 329


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 108/204 (52%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 133 NELLSAEELTFCC-HKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPL 191

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C     L ++   H++  AY +      I  ++   GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGP 249

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E S+ VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 250 IEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGD 308

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 309 KGLFKIRRGTNECGIDNSTTGGVP 332


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/200 (39%), Positives = 110/200 (55%), Gaps = 13/200 (6%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH-------HGVVTEECDPYFDST 70
           + + L +S  DLL C       GC+GG+P  AW  + +       +G + + C  YF   
Sbjct: 129 ATKKLLVSSQDLLTCG---TAGGCNGGWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEG 185

Query: 71  GCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
              HP  C     TP CV +C + +  ++  + Y  + Y I  + E I  EI  NGPVE 
Sbjct: 186 CDDHPNKCRNYVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGE-EQIQYEIMTNGPVEA 244

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +  VY DFA Y+SG+Y+  T +  GGHAVK++GWG  +DG  YW++AN WN  WG +G F
Sbjct: 245 TMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGV-EDGVKYWLVANSWNERWGENGLF 303

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           +I RG +E GIE  + A LP
Sbjct: 304 RIIRGRDEVGIESTIDAALP 323


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  +L  CC   CG GC+GGYPI AW+YF  HG+VT       + C+PY       
Sbjct: 137 NELLSAEELTFCC-HACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR 195

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           +  G S    +P     +C R C     L  +  H ++   Y +      I  ++   GP
Sbjct: 196 NEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGP 253

Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N WN  WG 
Sbjct: 254 IEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI+RG++EC I+    AG+P
Sbjct: 313 NGLFKIRRGTDECRIDSATTAGVP 336


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/206 (39%), Positives = 113/206 (54%), Gaps = 19/206 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  ++  CC   CG+GC+GGYPI AW+ F +HG+VT       E C+PY      +
Sbjct: 136 NQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY 194

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     KC +KC     +  N  H Y+   Y +      I  ++   GP
Sbjct: 195 DKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGP 252

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF +YKSG+Y K      +GGH+VKLIGWG  + G  YW++ N WN  WG 
Sbjct: 253 IETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGD 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
            G FKI+RG+NEC ++     G+P +
Sbjct: 312 KGLFKIRRGTNECRVDNSTTGGVPDT 337


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/164 (48%), Positives = 100/164 (60%), Gaps = 16/164 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           QN+ +S  DLL+CCGF CG GC+GGYP  AW+Y+   G+V+         C PY     C
Sbjct: 62  QNVEVSAEDLLSCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPP-C 120

Query: 73  SH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C      TPKCV+KC       +   K Y  SAY + S PE IM EIYK+
Sbjct: 121 EHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKD 180

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
           GPVE +FTVYEDF  YKSGVY+H TG+ +GGHA+K++GWG  ++
Sbjct: 181 GPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILGWGIENN 224


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 115/202 (56%), Gaps = 20/202 (9%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
           + LS  +L++CC   C  GC+ GY  SAW Y+V +G+VT E       C PY     C H
Sbjct: 135 VELSAIELVSCCS-KCAVGCNFGYSESAWYYWVENGLVTGESNGNNSGCLPY-PFPKCDH 192

Query: 75  PGCEPAYPT--------PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
            G   +YP         P C   C     + + + KH+  SAY++  +  DI  EI   G
Sbjct: 193 -GSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYG 251

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE S  +Y+DF  YKSGVYKH+TG ++   +V++IGWG  ++G  YW+ AN WN  WG 
Sbjct: 252 PVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGI-ENGIPYWLCANSWNEEWGL 310

Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
           +G+FKI RGSNEC IE  V AG
Sbjct: 311 NGFFKILRGSNECEIEAFVNAG 332


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 118/224 (52%), Gaps = 20/224 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           M+ +    D L  +       L LS  ++  CC   CG GC+GGYPI AW  F + G+VT
Sbjct: 119 MATSSAFADRLCVATNADFNEL-LSAEEITFCCS-SCGYGCNGGYPIKAWESFNNRGLVT 176

Query: 61  -------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSIS 106
                  E C+PY      +D+ G +    +P     +C R C     L  N  H ++  
Sbjct: 177 GGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRD 236

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGT 165
           +Y +      I  ++ + GP+E SF +Y+DF  YKSGVY +      +GGHAVKLIGWG 
Sbjct: 237 SYYLTY--SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWG- 293

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            + G  YW++ N WN  WG +G FKI+RG+NECGI+     G+P
Sbjct: 294 EEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  ++  CC   CG GC+GGYPI AW  F   G+VT       E C+PY      +
Sbjct: 138 NELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D+ G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   GP
Sbjct: 197 DAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 255 IEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNADWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI+RG+NECGI+    AG+P
Sbjct: 314 NGLFKIRRGTNECGIDNSTTAGVP 337


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 77/176 (43%), Positives = 104/176 (59%), Gaps = 17/176 (9%)

Query: 48  SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
            AW Y+V  G+VT   EE    C PY     C H      P C    Y TP+C + C K 
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224

Query: 94  NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
            +  +   KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSI 284

Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +GGHA+++IGWG  + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 111/204 (54%), Gaps = 16/204 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF---DST 70
           N  LS  ++  CC   CG GC GGYPI AW+ F  HG+VT       E C+PY     + 
Sbjct: 138 NELLSAEEITFCC-HTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSND 196

Query: 71  GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEV 129
           G S    +P      C R C     +  N  H Y+   Y +      I  ++   GP+E 
Sbjct: 197 GNSSSSDQPLAINHICRRHCYGNQSIDFNDDHRYTRDYYYLTYG--SIQKDVLTYGPIEA 254

Query: 130 SFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
           SF VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N WN  WG +G+
Sbjct: 255 SFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWG-EEDGTPYWLMVNSWNTQWGDNGF 313

Query: 189 FKIKRGSNECGIEEDVVAGLPSSK 212
           FKI+RG+NECG++    AG+P + 
Sbjct: 314 FKIRRGTNECGVDNSTTAGVPVTN 337


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 77/176 (43%), Positives = 104/176 (59%), Gaps = 17/176 (9%)

Query: 48  SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
            AW Y+V  G+VT   EE    C PY     C H      P C    Y TP+C + C K 
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224

Query: 94  NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
            +  +   KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSI 284

Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +GGHA+++IGWG  + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 114/204 (55%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS  +L  CC   CG GC GGYPI AW+YF   GV T       E C PY     ++
Sbjct: 135 NQLLSPEELAFCCK-DCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYN 193

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G +  G +P     +C + C  K  +   +++ + S Y INS  + I  ++   GPVE
Sbjct: 194 KQGKNTCGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYSINS-IKTIEQDLKTYGPVE 250

Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF VY+DF+ YKSG+Y+        G H++K+IGWG  ++G  YW+  N W++ WG  G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWG-QENGTTYWLAVNSWSKFWGEHG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
            FKI +G NECGIE  V AG+PSS
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPSS 333


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  ++  CC   CG GC+GGYPI AW  F   G+VT       E C+PY      +
Sbjct: 138 NELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D+ G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   GP
Sbjct: 197 DAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 255 IEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNADWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
           +G FKI+RG+NECGI+    AG+P
Sbjct: 314 NGLFKIRRGTNECGIDNSTTAGVP 337


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 114/206 (55%), Gaps = 23/206 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  ++  CC   CG GC+GGYPI AW  F  HG+VT       E C+PY      +
Sbjct: 138 NQLLSAEEITFCC-HKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPY 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAY--RINSDPEDIMAEIYKN 124
           D +G +    +P     +C R C     L  +  H ++  +Y   I S  +D+M      
Sbjct: 197 DESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIGSIQKDVMTY---- 252

Query: 125 GPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           GP+E SF VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  W
Sbjct: 253 GPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWG-EEYGTPYWLMMNSWNADW 311

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
           G +G FKI+RG+NECG++    AG+P
Sbjct: 312 GDEGLFKIRRGTNECGVDNSTTAGVP 337


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/221 (42%), Positives = 113/221 (51%), Gaps = 33/221 (14%)

Query: 2   SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVT 60
           S   ++R A++S+  V   N  LS  DL++C     GD GC GGY   AW Y   +G+VT
Sbjct: 126 SEVLSDRFAIASNGTV---NKILSPEDLVSCDK---GDMGCQGGYLDKAWDYLKTNGIVT 179

Query: 61  EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
           E C PY    G +          P C   CV         K Y  S Y   +  EDIM E
Sbjct: 180 ESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQLTTEEDIMKE 225

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS------DDGEDYW 173
           IY NGPVE  F VY  F  YKSGVY H   D+M GGHA+K++GWG             YW
Sbjct: 226 IYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQKPTKYW 285

Query: 174 ILANQWNRSWGADGYFKIKRGSN-----ECGIEEDVVAGLP 209
           I AN W   WG +G+FKI+RG N     ECGIE+ V AG P
Sbjct: 286 ICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHP 326


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 113/206 (54%), Gaps = 21/206 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  +S  +L  CC   C  GC+GGYP+ AW+YF  HGVVT       + C PY       
Sbjct: 134 NELISAEELTFCC-HRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVK 192

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     KC +KC   + +     HY    AY + +        +Y  GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--GP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV--MGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           +E SF VY+DF +Y+SGVY+  TG+   +GGHAVK+IGWG  ++G  YW++ N W   WG
Sbjct: 251 IEASFDVYDDFMNYESGVYQR-TGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
             G FKI RG++ECGIE    AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 113/203 (55%), Gaps = 21/203 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------CDPY-------- 66
            L +S  D+++CC  LCG GCDGG+PI A+ YF   G VT E      C PY        
Sbjct: 144 QLHISSIDIVSCCK-LCGYGCDGGWPIEAFDYFSRQGAVTGETTSKDGCRPYPFHPLWTY 202

Query: 67  -FDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
             D+ G    G C+ +    + V++ V +N   R     +    RI    +      + N
Sbjct: 203 GNDTVGRRMSGRCKHSKTVGEGVKR-VTRNHTRRTG--LTARRLRITEFCQSHSEGDHGN 259

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV   FTVYEDF++YK G+Y HI G   G HA+K+IGWG  ++G  YW++AN W+  WG
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGV-ENGLPYWLIANSWHDDWG 318

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
             G F+I RG NECGIE++VVAG
Sbjct: 319 EQGLFRIVRGINECGIEQEVVAG 341


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/196 (38%), Positives = 110/196 (56%), Gaps = 5/196 (2%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +    S  ++++CC   CG GC GG+    ++Y+V +G+ +     Y    GC       
Sbjct: 133 KKFIFSAEEVVSCCT-ACGGGCRGGFLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAV 189

Query: 80  AYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           +  TP+C + CV    + W     ++ SAY++N     I  EI  NGPV     VYEDF 
Sbjct: 190 SGETPQCQKACVSGYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFY 249

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
            Y +G+Y+H +G  +GGHAVK+IGWG+ +D   YWI AN W   +G DG+F+I RGSN  
Sbjct: 250 SYGTGIYQHTSGSFVGGHAVKIIGWGSEND-VPYWIAANSWGTGFGEDGFFRILRGSNCA 308

Query: 199 GIEEDVVAGLPSSKNL 214
           GIE  +VAG P++  +
Sbjct: 309 GIESYIVAGYPNTSEV 324


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 85/226 (37%), Positives = 116/226 (51%), Gaps = 20/226 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           MS +    D L  +       L LS  ++  CC   CGDGC GGYPI AW+ +  HG+VT
Sbjct: 56  MSTSSAFSDRLCVATNGDFNQL-LSAEEITFCC-HTCGDGCSGGYPIRAWKRYKKHGLVT 113

Query: 61  -------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSIS 106
                  E C+PY       D  G +    +P     +C R C     L  +  H Y+  
Sbjct: 114 GGNYKSGEGCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRD 173

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGT 165
            Y +      I  ++   GP+E SF VY+DF  YKSG+Y K      +GGH+VKLIGWG 
Sbjct: 174 HYYLTY--RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG- 230

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
            + G  YW++ N WN  WG  G FKI+RG+NECG++     G+P++
Sbjct: 231 EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPAT 276


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 117/218 (53%), Gaps = 33/218 (15%)

Query: 21  NLSLSVNDLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
           N  LS  D+LACC    F    GC GG PI++W +   +G+V+             + C 
Sbjct: 151 NQLLSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCW 210

Query: 65  PYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINS 112
           PY +   C+H        P  +  Y TP C   C   K    +   +HY+ S +  R  S
Sbjct: 211 PY-NFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS 269

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
               I  EI  NGP   +F+VYEDF  YKSGVYKH +G  +GGHAV++IGWGT + G DY
Sbjct: 270 T-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDY 327

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           W++ N WN  WG  G FKI +G  +CGI++ ++AG P+
Sbjct: 328 WLVMNSWNEEWGDHGTFKIVQG--DCGIDDMILAGTPA 363


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 81/205 (39%), Positives = 109/205 (53%), Gaps = 19/205 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  +S  +L  CC   CG GC+GG P+ AW+YF  HGVVT       + C PY       
Sbjct: 134 NELISAEELTFCC-HTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVR 192

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     KC +KC     +     HY    AY +++        +Y  GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMVY--GP 250

Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  Y+SGVY+       +GGHAVK+IGWG  ++G  YW++ N W   WG 
Sbjct: 251 IEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWGD 309

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
            G FKI RG++ECG+E    AG+PS
Sbjct: 310 KGMFKILRGTDECGVESSCTAGVPS 334


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 89/216 (41%), Positives = 112/216 (51%), Gaps = 37/216 (17%)

Query: 23  SLSVNDLLACCGFLC----GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
            +S  DLL+CCG  C      GCDGGYP  AW+Y    G+VT         C PY     
Sbjct: 123 QISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-SFPP 181

Query: 72  CSH-------PGCEPAY-----PTPKCVRKCVKKNQLWRNSKHYSI-------SAYRINS 112
           CSH         CE  +      TP C +KC  +      S+ Y +       + Y++  
Sbjct: 182 CSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQF-----SRTYDVDKIRSRENPYKLIK 236

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
           D E I  EIY NGPV+  FTV++DF +YKSGVY+  TG   G HAVK+IGWGT ++G  Y
Sbjct: 237 DQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGT-ENGVPY 295

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           W   N WN  WG +G FKI RG N   IE +V A +
Sbjct: 296 WEAINSWNDGWGINGKFKILRGFNHLDIEGEVYASI 331


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  ++  CC   CG GC+GGYPI AW+ F   G+VT       E C+PY       
Sbjct: 136 NELLSAEEITFCC-HTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPN 194

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P     +C R C     L  +  H Y+   Y +      I  ++   GP
Sbjct: 195 DDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG--SIQKDVMTYGP 252

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 253 IEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWG-EEYGVPYWLMVNSWNEDWGD 311

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G+FKI+RG+NECG++    AG+P
Sbjct: 312 HGFFKIQRGTNECGVDNSTTAGVP 335


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/168 (44%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L    
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLY 190

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           K      Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  G V GGHAV +
Sbjct: 191 KATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDM 248

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 249 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 82/200 (41%), Positives = 110/200 (55%), Gaps = 15/200 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  LS   + +CC + CG GC GGYPI AWRY+  HG+VT       E C PY       
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192

Query: 74  HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 130
           +  C   +    KC +KC     + +R  + Y   S Y +  D  ++  +I   GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250

Query: 131 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           F VY+DF  YKSGVY K      +GGH+VK IGWG   +   YW++ N WN +WG  G F
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWLMMNSWNNTWGDGGNF 309

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI+RG+NEC +E+   AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGMP 329


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 59/113 (52%), Positives = 89/113 (78%), Gaps = 1/113 (0%)

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
           ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H++G++MGGH
Sbjct: 6   YKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGH 65

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           A++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 66  AIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 82/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 133 NELLSAEELTFCC-HTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPL 191

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +PA    +C R C   +++ ++    ++  AY +      I  ++   GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYGT--IQKDVMTYGP 249

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E S+ VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 250 IEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGD 308

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECGI+     G+P
Sbjct: 309 RGLFKIRRGTNECGIDNSTTGGVP 332


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/139 (51%), Positives = 89/139 (64%), Gaps = 3/139 (2%)

Query: 74  HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 131
           +P C+  Y  P C ++C K + L +   KHY+  AYRI S  E  I  EI KNGPV  SF
Sbjct: 121 NPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF 180

Query: 132 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           TVY DF HY SGVYK      ++GGHAV++IGWG  +    YW+++N WN  WG  G FK
Sbjct: 181 TVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFK 240

Query: 191 IKRGSNECGIEEDVVAGLP 209
           I RG NECGIEE++ AGLP
Sbjct: 241 IWRGKNECGIEEEITAGLP 259


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 107/203 (52%), Gaps = 18/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS   L  CC + CG GC GG PI AW+YF   G+ T       E C PY     +D
Sbjct: 136 NEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYD 194

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G      +P     KC R C   + +      Y + +  +    + I  +I   GPVE
Sbjct: 195 DQGEFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVESIYVLDSFKTIEQDIRTYGPVE 251

Query: 129 VSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF VY+DF  YKSG+Y+     + +GGH+VKLIGWG  +DG  YW+L N W++ WG  G
Sbjct: 252 ASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQG 310

Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
            F+I +G NECGIE    AG+PS
Sbjct: 311 TFRIIKGRNECGIERSATAGIPS 333


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/210 (40%), Positives = 112/210 (53%), Gaps = 18/210 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S  +LL+CC   C  GC G     AW ++V HG+V+       E C PY     C
Sbjct: 100 KHFHFSALNLLSCCD-SCEKGCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYHLPP-C 157

Query: 73  SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKN 124
            H        C    PTP C R C    ++ + +  H+    Y +    E I+  EI+ N
Sbjct: 158 EHHRAGPRRNCTKYGPTPSCARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHN 217

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSW 183
           GPVE +   YEDF  Y+SG+Y HI G  +  HAVK+IGWGT       YW++AN +N  W
Sbjct: 218 GPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDW 277

Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
           G  G+FKIKRG NECGIE  + AG+P+ KN
Sbjct: 278 GEYGFFKIKRGVNECGIENKITAGIPAYKN 307


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 16/205 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF---- 67
           + N  LS  D+L+CC  +CG  C GGYP +AW Y+   G+V+       + C PY     
Sbjct: 137 VMNFRLSGLDMLSCCA-ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPC 195

Query: 68  -DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             S   S P C       +C   C    ++ ++  K+++   Y I++D  +I  EI  NG
Sbjct: 196 DHSGNGSRPVCTVGGGV-RCQHLCEPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNG 254

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWG 184
           PV+   TVYEDF  YK+GVY H+ G+ +G HAV+++GWG        YW++AN W   WG
Sbjct: 255 PVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWG 314

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G+F I RG N C IE  ++AGLP
Sbjct: 315 DNGFFHIFRGENHCDIEGYIMAGLP 339


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/179 (39%), Positives = 105/179 (58%), Gaps = 18/179 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC       
Sbjct: 77  KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAP 133

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TPKCV+KC    ++ +    H   SAY +++D + I  EIY N
Sbjct: 134 CEHHVNGTRGPCKEGGKTPKCVKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTN 193

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN  W
Sbjct: 194 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDW 252


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/171 (43%), Positives = 104/171 (60%), Gaps = 16/171 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 26  NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPP-CE 84

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P       TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGP
Sbjct: 85  HHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGP 144

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
           VE +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN
Sbjct: 145 VEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAAN 194


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/202 (36%), Positives = 107/202 (52%), Gaps = 32/202 (15%)

Query: 25  SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
           S  D++ C  +    GCDGG+P    +Y + +G+  E CDPY              +   
Sbjct: 242 SPQDIVDCSAY--SQGCDGGFPFLVGKYAMDYGLTVESCDPY------------QGHDLG 287

Query: 85  KCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           KC  +C V + Q   +S +Y +  Y  NS    +M EIY+NGP+ + F VY D  +YK G
Sbjct: 288 KCSNQCPVNRQQRLHSSNYYFVGGYYGNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHG 347

Query: 144 VYKHITGDVMGG----------------HAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           VYKH+T + +                  HAV ++GWG  ++G  YW + N W+ +WG +G
Sbjct: 348 VYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLMVGWGV-ENGTPYWKIKNSWSTTWGDNG 406

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YFKI RGS+ECG+E D  AG+P
Sbjct: 407 YFKILRGSDECGVESDAEAGIP 428


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 115/194 (59%), Gaps = 18/194 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
            S  DLL+CC   CGD C GGY +SA  ++++ G+V+       E C PY   T  +H  
Sbjct: 136 FSPEDLLSCCT-SCGD-CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQ 190

Query: 77  CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
            +    TP C + C    +  +   KHY  + Y ++S  + I  E+  NGP+ V+F V++
Sbjct: 191 GQ----TPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQ 246

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF +Y SGVY+H++G+ +G H VK++GWG  ++G  YW++AN W  SWG  G+FK+ RG 
Sbjct: 247 DFYNYVSGVYRHVSGESVGFHVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQ 305

Query: 196 NECGIEEDVVAGLP 209
           NECGIE    A +P
Sbjct: 306 NECGIENYPYAVMP 319


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/185 (44%), Positives = 105/185 (56%), Gaps = 20/185 (10%)

Query: 24  LSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           LSV DL++C     GD GC+GG    + ++ V +GV TEEC PY    G           
Sbjct: 112 LSVQDLVSCDK---GDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNG----------R 158

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
            P C  KC   +Q+ R  K+     Y +    ++I  E+ KNGPV   FTVY DF +YKS
Sbjct: 159 VPACAAKCSNGSQIIR-YKYEKAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKS 213

Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
           GVY+H +G   GGHAV LIGWG  +DG  YW+L N W  +WG  G+FKI RG NECG E+
Sbjct: 214 GVYQHKSGYQEGGHAVLLIGWGV-EDGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQ 272

Query: 203 DVVAG 207
              AG
Sbjct: 273 GFYAG 277


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 76/171 (44%), Positives = 98/171 (57%), Gaps = 12/171 (7%)

Query: 50  WRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN-- 99
           W Y+V  GV +       + C PY     C  P  E  YP  P C  +C     +  +  
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLR 199

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
            + +   AY I +D   IM +I+ NGPV+  F  YED  +Y  GVY+H +G + GGHAVK
Sbjct: 200 DRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVK 259

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           LIGWG  +DG  YW++AN W R WG DG+FK+ RG N CGIEE+V AGLPS
Sbjct: 260 LIGWGV-EDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPS 309


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 75/172 (43%), Positives = 100/172 (58%), Gaps = 17/172 (9%)

Query: 52  YFVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL- 96
           Y V  G+VT         C PY     C H      P C    Y TP+C +KC K  +  
Sbjct: 9   YLVKRGIVTGGSKENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTP 67

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
           +   K+Y    Y + S+ + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGH
Sbjct: 68  YEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGH 127

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           A+++IGWG  +    YW++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 128 AIRIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 178


>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 218

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 61/72 (84%), Positives = 65/72 (90%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           ++SLSVNDLLACC FLCG GCDGGYPI+AWRYF   GVVTEECDPYFD+TGCSHPGCEP 
Sbjct: 146 SISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPL 205

Query: 81  YPTPKCVRKCVK 92
           YPTPKC RKCVK
Sbjct: 206 YPTPKCHRKCVK 217


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  +L  CC   CG GC+GGYPI AW  F  HG+VT       E C+PY       
Sbjct: 138 NEFLSPEELTFCC-HTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRH 196

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
            + G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   GP
Sbjct: 197 HAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG--SIQKDVMNYGP 254

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 255 IEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWG-EESGVPYWLMVNSWNTDWGD 313

Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
            G FKI+RG+NECG++    AG+P
Sbjct: 314 KGLFKIQRGTNECGVDNSTTAGVP 337


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 108/199 (54%), Gaps = 19/199 (9%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 74
           +L  CC   CG GC GGYPI AW+ F +HG+VT       E C+PY      +D  G + 
Sbjct: 1   ELTFCC-HTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNT 59

Query: 75  PGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
              +P     +C R C    +L  +  H Y+   Y +      I  ++   GP+E SF V
Sbjct: 60  CAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDV 117

Query: 134 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           Y DF  YKSG+Y+       +GGHAVKLIGWG    G  YW++ N WN  WG +G FKI+
Sbjct: 118 YSDFPSYKSGIYERTENATYLGGHAVKLIGWG-EQYGIPYWLMVNSWNEDWGDNGLFKIR 176

Query: 193 RGSNECGIEEDVVAGLPSS 211
           RG+NECG++    AG+P +
Sbjct: 177 RGTNECGVDNSTTAGVPVT 195


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 106/206 (51%), Gaps = 30/206 (14%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST----------- 70
           + LS  ++L+C       GCDGG+  +AWRY   +GV+   C PY               
Sbjct: 236 VQLSAQNILSCTRR--QQGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKVQRHRGR 293

Query: 71  GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
                GC+PA+         V ++  +     YS+S         DIMAEIY +GPV+ +
Sbjct: 294 SLKAYGCQPAHG--------VNRDNFYTVGPAYSLSR------EADIMAEIYHSGPVQAT 339

Query: 131 FTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            TVY DF  Y SGVY+H     G   G H+VKL+GWG   +G  YWI AN W   WG  G
Sbjct: 340 MTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGPWWGERG 399

Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
           YF+I RGSNECGIEE V+A  P   N
Sbjct: 400 YFRILRGSNECGIEEYVLASWPHVYN 425


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 116/218 (53%), Gaps = 33/218 (15%)

Query: 21  NLSLSVNDLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
           N  LS  ++LACC    F    GC GG PI++W +   +G+V+             + C 
Sbjct: 30  NQLLSAANMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCW 89

Query: 65  PYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINS 112
           PY     C+H        P  +  Y TP C   C   K    +   +HY+ S +  R  S
Sbjct: 90  PY-SFPKCAHHQDGSDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS 148

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
               I  EI  NGP   +F+VYEDF  YKSGVYKH +G  +GGHAV++IGWGT + G DY
Sbjct: 149 T-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDY 206

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           W++ N WN  WG  G FKI +G  +CGI++ ++AG P+
Sbjct: 207 WLVMNSWNEEWGDHGTFKIVQG--DCGIDDTILAGTPA 242


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 114/199 (57%), Gaps = 13/199 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
           N++L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V       T+ C PY     C
Sbjct: 137 NVALAAEDLMGCC-VDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPY-PFKPC 194

Query: 73  SHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
            +P  +     +PKC   C    ++ +   K +   AY +  D   I  EI  NGPVE  
Sbjct: 195 EYPFNDCHVEISPKCTHHCRDGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAG 254

Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
           F VYED   YKSGVY+H+ G+ +G HAV++IGWG  D G  YW++AN +   WG  GYFK
Sbjct: 255 FDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG-RDGGIPYWLIANSYGDDWGDHGYFK 313

Query: 191 IKRGSNECGIEEDVVAGLP 209
             RGSN  GIE  ++ GLP
Sbjct: 314 FVRGSNHLGIESKIITGLP 332


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 109/204 (53%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
           N  LS  ++L+CC   CG GC GGYP  A+ Y   +G+ T       + C PY     C 
Sbjct: 144 NRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPY-AFYPCG 202

Query: 74  HPGCEPAY--------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
           +   EP Y        PTP C R C     + +   K ++   Y I  +  +I  EI   
Sbjct: 203 NHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTR 262

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  ++ VY DF +YK GVY H  G+V G HAVK+IGWG  +D   YW++AN WN  WG
Sbjct: 263 GPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGND-VPYWLVANSWNTDWG 321

Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
            +GYF+I RG++ C IE  +V G+
Sbjct: 322 DNGYFRIVRGTDNCEIERQMVGGI 345


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 106/203 (52%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  +L  CC  LCG  C GGYPI AW YF  HG+VT       E C PY       
Sbjct: 140 NELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFS 198

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           +  G +    +P     +C R C    ++  +  H     Y   +    I  ++   GP+
Sbjct: 199 EEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYA-SIQKDVMTYGPI 257

Query: 128 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           E S  VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N W+  WG  
Sbjct: 258 EASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGDK 316

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI+RG+NEC ++  + AG+P
Sbjct: 317 GLFKIRRGTNECSVDNSMTAGVP 339


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 111/205 (54%), Gaps = 28/205 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDST 70
           LS   +L+CC +LCGDGC GG    +W ++  HG+V+       E C PY         T
Sbjct: 106 LSAQQILSCC-YLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTET 164

Query: 71  GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK------HYSISAYRINSDPEDIMAEIYKN 124
              +        TP+C  +C   +   R  K      HY + AY         M EIY+N
Sbjct: 165 AVENACSNKTLFTPECKVQCYNPDYGTRYVKDNHQGTHYRVPAYTA-------MKEIYEN 217

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GP+  SF +Y+DF +Y+SGVY + +G  +   AVK++GWG  ++G  YW+ AN +N  WG
Sbjct: 218 GPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWG 276

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
            +G+ KI RG+NEC IEE + AGLP
Sbjct: 277 DNGFVKILRGANECYIEEFMYAGLP 301


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 80/191 (41%), Positives = 104/191 (54%), Gaps = 11/191 (5%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHP--GCEPA 80
           LS  ++ AC  F    GC GG P SAW +    G+ T E   P   S   + P    +  
Sbjct: 196 LSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDI 252

Query: 81  YPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           YPTP CV +C   K     R+ +H+ + +   +    D    I  +GPV  SFTVYEDF 
Sbjct: 253 YPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFL 312

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
            YKSGVYKH +G  +GGHAVK+IGWG    G+ YW+  N WN  WG  G FKI  G+  C
Sbjct: 313 AYKSGVYKHTSGSYLGGHAVKIIGWG-EKSGQAYWLAVNSWNEDWGDKGLFKIALGN--C 369

Query: 199 GIEEDVVAGLP 209
           GI++D++ G P
Sbjct: 370 GIDDDLLGGTP 380


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 106/203 (52%), Gaps = 17/203 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
           N  LS  +L  CC  LCG  C GGYPI AW YF  HG+VT       E C PY       
Sbjct: 140 NELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFS 198

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
           +  G +    +P     +C R C    ++  +  H     Y   +    I  ++   GP+
Sbjct: 199 EEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYA-SIQKDVMTYGPI 257

Query: 128 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           E S  VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N W+  WG  
Sbjct: 258 EASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGDK 316

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G FKI+RG+NEC ++  + AG+P
Sbjct: 317 GLFKIRRGTNECSVDNSMTAGVP 339


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 65/119 (54%), Positives = 85/119 (71%), Gaps = 1/119 (0%)

Query: 94  NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
           N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++
Sbjct: 3   NVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALL 62

Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
           GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 63  GGHAVRLLGWGEENN-VPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 120


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/222 (38%), Positives = 120/222 (54%), Gaps = 39/222 (17%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           ++   S+ D+L+CCG+ CG+GC+GG    AW Y+   G+V+       + C PY     C
Sbjct: 133 EHFYFSIKDVLSCCGY-CGNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPY-TIPPC 190

Query: 73  SH---------------PGCE--PAYP--------TPKCVRKCVKKNQL-WRNSKHYSIS 106
           +H               P C+  P  P        TP+C +KC K  ++ +   KH   S
Sbjct: 191 NHLVWGEIEQCKNIPMTPKCKNIPVIPEQCKYIPITPECEKKCNKNYKVCYSKDKHRGKS 250

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            YR+     +I  EIY+ GPV   FTVYEDF +YK G+Y + +G  +G H+VK+IGWG  
Sbjct: 251 VYRVKKS--EIFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGWG-E 307

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKR-GSNECGIEEDVVAG 207
           + G  YW+ AN +N  WG  G+FKI R G   CGI ++VVAG
Sbjct: 308 ERGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGISDNVVAG 349


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 107/200 (53%), Gaps = 13/200 (6%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPA 80
           + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY      C  P    +
Sbjct: 191 IQLSPQNILSCTRR--QQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACKIPHNSRS 248

Query: 81  YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                C     V +++L+     YS++      +  DIMAEI+ +GPV+ + TVY DF  
Sbjct: 249 LRNNGCRSYSGVDRDELYTVGPAYSLN------NETDIMAEIFMSGPVQATLTVYRDFFS 302

Query: 140 YKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           Y  G+Y+H     G  +G H+VKLIGWG   DG  YWI  N W   WG  G F+I RGSN
Sbjct: 303 YSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRILRGSN 362

Query: 197 ECGIEEDVVAGLPSSKNLVK 216
           ECGIEE V+A  P+  N  K
Sbjct: 363 ECGIEEYVLAAWPNVYNYFK 382


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/204 (41%), Positives = 107/204 (52%), Gaps = 21/204 (10%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
            SV  T  D LS    +      +S  DL++C       GC+GGY   AW +   HGV  
Sbjct: 92  FSVAETMGDRLS---IIGCGRGHMSPQDLVSC--DTTDMGCNGGYMDKAWAWTKSHGVTN 146

Query: 61  EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
           EEC PY    G            P C  KCV  + + R +K  S + +  +     +  E
Sbjct: 147 EECMPYQSGGG----------RVPACPAKCVNGSTIVR-TKSQSFTHFTAS----QMQQE 191

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           +Y+NGP+ V+FTVY DF +YKSGVY H TG V GGHAV  IGWG  D+   YW+  N W 
Sbjct: 192 LYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDN-TPYWLCQNSWG 250

Query: 181 RSWGADGYFKIKRGSNECGIEEDV 204
            +WG  G+FKI RGSN CGIE  V
Sbjct: 251 PAWGEKGHFKILRGSNHCGIENQV 274


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 70/176 (39%), Positives = 104/176 (59%), Gaps = 18/176 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC       
Sbjct: 75  KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAP 131

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TP CV+KC +  ++ +    H+  SAY I +D + I  EIY N
Sbjct: 132 CEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN 191

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 192 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 74/202 (36%), Positives = 107/202 (52%), Gaps = 18/202 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS  D+  CC   CG GC+GGYPI AW+YF   GV T       E C PY     FD
Sbjct: 135 NELLSPEDVAFCCQ-NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFD 193

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G +    +P     +C + C     +    K Y +    + + P  +  ++ K GP+E
Sbjct: 194 QKGKNTCAGKPLERNHQCPKTCYGSTTV---QKRYKVKNEYVLNSPNTMEQDLIKYGPIE 250

Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF +++D + YKSG+Y+       + GH++K+IGWG  ++G  YW+  N W++ WG  G
Sbjct: 251 ASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWG-KENGVPYWLAVNSWSKFWGEQG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
            F+I +G NECGIE    AG+P
Sbjct: 310 TFRIIKGRNECGIERSATAGIP 331


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 106/202 (52%), Gaps = 30/202 (14%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-----------FDST 70
           + LS  ++L+C       GC+GG+  +AWRY    GV+ E+C PY            +S 
Sbjct: 236 VQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKIQRHNSR 293

Query: 71  GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
                GC+PAY         V ++ L+     YS+S         DIMAEIY +GPV+ +
Sbjct: 294 SLKANGCQPAYG--------VNRDSLYTVGPAYSLSR------EADIMAEIYHSGPVQAT 339

Query: 131 FTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
             +Y DF  Y  G+Y+      G   G H+VKL+GWG   DG  YWI AN W   WG  G
Sbjct: 340 MRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHG 399

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
           YF+I RGSNECGIEE V+A  P
Sbjct: 400 YFRILRGSNECGIEEYVLASWP 421


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/208 (39%), Positives = 111/208 (53%), Gaps = 17/208 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           L +   S  ++ ACC   CG+ C GG   +A+ ++V  G V+       E C PY     
Sbjct: 124 LVDFRFSSENVAACCT-ECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPY-SVEE 181

Query: 72  CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           C H      P CE   P   C   C ++  + +     Y + AY +  D   I  EI  N
Sbjct: 182 CEHHIEGPRPPCEGDMPELVCSETCHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTN 241

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GPV  +F VY+DF  YKSGVY+H TG + G HAV++IGWG  ++G  YW++AN WN  WG
Sbjct: 242 GPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIGWG-EEEGTPYWLVANSWNTDWG 300

Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
            +G FKI RGS+EC  E D+ A   SSK
Sbjct: 301 DNGLFKILRGSDECEFEGDMAAATYSSK 328


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 100/169 (59%), Gaps = 15/169 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GCDGGY  +AW +    G+ +++C PY    G               V  C  K Q   +
Sbjct: 78  GCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGD--------------VAACPSKCQDGSS 123

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
            K Y     +  +D   IM ++ +NGPV+ +F+VY DF  YKSGVY H++G ++GGHA+K
Sbjct: 124 VKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIK 183

Query: 160 LIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           ++GWG  S   + YWI+AN W  SWG +G+F I RGS+ECGIE++V +G
Sbjct: 184 MVGWGVDSATNKPYWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSG 232


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/193 (40%), Positives = 105/193 (54%), Gaps = 10/193 (5%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+ LS  ++L+C       GC+GG+  +AWRY    GVV E C PY          C+ 
Sbjct: 234 ENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYTQH----RDTCKI 287

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            +        C K   + R+S +    AY +N +  DIMAEI+ +GPV+ +  V  DF  
Sbjct: 288 RHSRSLKANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFA 346

Query: 140 YKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  GYF+I RGSN
Sbjct: 347 YSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSN 406

Query: 197 ECGIEEDVVAGLP 209
           ECGIEE V+A  P
Sbjct: 407 ECGIEEYVLASWP 419


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 106/205 (51%), Gaps = 29/205 (14%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF---DSTGCSHP--- 75
           + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY    DS    H    
Sbjct: 236 VQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKIRHNSRS 293

Query: 76  ----GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
               GC PAY         V ++ L+     YS+          DIMAEIY +GPV+ + 
Sbjct: 294 LKANGCRPAYG--------VNRDSLYTVGPAYSLKG------ETDIMAEIYHSGPVQATM 339

Query: 132 TVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
            VY DF  Y  GVY+      G   G H+VK++GWG   DG  YWI AN W   WG  GY
Sbjct: 340 RVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGY 399

Query: 189 FKIKRGSNECGIEEDVVAGLPSSKN 213
           F+I RGSNECGIEE V+A  P+  N
Sbjct: 400 FRILRGSNECGIEEYVLASWPNVYN 424


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 73/160 (45%), Positives = 96/160 (60%), Gaps = 16/160 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CCG  CG GC+GGYP  AW ++   G+V+         C PY     C 
Sbjct: 63  NVEISAEDLLSCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPP-CE 121

Query: 74  H------PGCEPAY-PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           H      P C      TPKCV +C       +   KH+  ++Y ++S+  DI  EIYKNG
Sbjct: 122 HHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNG 181

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
           PVE +FTVYEDF  YKSGVYKH+TGD +GGHA++++GWG 
Sbjct: 182 PVEGAFTVYEDFLQYKSGVYKHVTGDAVGGHAIRILGWGV 221


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/186 (40%), Positives = 102/186 (54%), Gaps = 16/186 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++ LS  DL+ C      +GC+GG P +A++Y   +GVVT  C PY      + P C P
Sbjct: 117 ESVQLSFQDLITCDN--QDNGCEGGDPYTAYKYVQKNGVVTSNCQPY------TIPTCPP 168

Query: 80  AYP-------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
           A         TP C  KC   +  ++   H+  + Y +  +   I  EI  NGPVE  F 
Sbjct: 169 AQQPCMNFVNTPPCSAKCANSSVNFQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFE 228

Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           VYEDF  YKSGVY H +G  +GGH +K++G+G S +G  YWI  N W  SWG +G F I+
Sbjct: 229 VYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVS-NGTPYWICNNSWTTSWGNNGIFWIE 287

Query: 193 RGSNEC 198
            G NEC
Sbjct: 288 AGKNEC 293


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 97/161 (60%), Gaps = 15/161 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           N+ +S  DLL+CC   CG+GC+GGYP  AW ++ + G+V+         C PY  S  C 
Sbjct: 45  NVEVSAEDLLSCCKLECGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISP-CE 103

Query: 74  H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP+C R+C    +  +   KHY +++Y I SD  +IM EIYKNGP
Sbjct: 104 HHVNGSRPKCSGEIETPRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGP 163

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
           VE +  V++DF  YKSGVY+H TG  +GGHA+K++GWG  +
Sbjct: 164 VEAALEVFKDFLLYKSGVYQHKTGGSIGGHAIKILGWGEEN 204


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/181 (40%), Positives = 102/181 (56%), Gaps = 21/181 (11%)

Query: 45  YPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVR-KC 90
           +P  AW+Y   +G+ T       E C PY       ++  CS    +    TP+C + +C
Sbjct: 160 HPEKAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENED----TPQCYKDQC 215

Query: 91  VKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
              N      +  +Y+   Y +   PE IM+E++KNGPV  +  VY+DF  YK G+Y++ 
Sbjct: 216 TNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYT 275

Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           TG + G HAVK++GWG  DDG DYW+ AN W  SWG  G FKI+RG NECGIE  +  GL
Sbjct: 276 TGGLKGDHAVKIMGWG-EDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGL 334

Query: 209 P 209
           P
Sbjct: 335 P 335


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/191 (41%), Positives = 105/191 (54%), Gaps = 15/191 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS   +++C G    +GC+GG+  + WR+ V  G V+E C PY  S G + P C   
Sbjct: 173 NVDLSPQFMVSCSG--QNNGCNGGFFDATWRFLVSVGTVSEACVPYV-SFGGAVPACN-- 227

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
                 V+ C    Q    S  Y   + R      DIMA++  NGP++V+  VY DF  Y
Sbjct: 228 ------VKSCGVPGQ---KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFYSY 278

Query: 141 KSGVYKHITGDVMGGHAVKLIGWG-TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
           KSGVY H++G  +GGHAVK++GWG  S     YWI AN W   WG  GYF I RG  ECG
Sbjct: 279 KSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWGEDWGIKGYFWILRGRGECG 338

Query: 200 IEEDVVAGLPS 210
           I + V +G P+
Sbjct: 339 IGKMVWSGKPA 349


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 78/209 (37%), Positives = 116/209 (55%), Gaps = 16/209 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
           ++ +S  D L C   LCGDGC+GG P   W ++   G+V+         C  +     C 
Sbjct: 147 HVEVSAEDKLTC---LCGDGCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCK 203

Query: 74  HPGCEPAY----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
           H      Y     +PKC   C +  Q ++  KHY  S+Y I+   +DIM  IYKN  VE 
Sbjct: 204 HHIHGXPYVXTGDSPKCSMTC-EPGQTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEE 262

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +F+VY DF  YK   Y+ +TG++ GGHA+ ++G    ++   YW++AN WNR WG +G+F
Sbjct: 263 AFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKV-ENSTSYWLVANXWNRDWGDNGFF 321

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           KI RG +  GIE +VVA +P ++   ++I
Sbjct: 322 KILRGQDHYGIESEVVAEIPHTEQYWEKI 350


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 105/203 (51%), Gaps = 17/203 (8%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
           + + +S  D+L+CCG  CG GC  G P  A+ Y +  GV +         C PY     C
Sbjct: 143 KQVYVSETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPY-PFYPC 201

Query: 73  SHPGCEPAY--------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            +    P Y        PTP C + C     +  N      S   + +  E I  EI+ N
Sbjct: 202 GYHAHLPYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNN 261

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
           GP+  ++TVYEDFA+YK+G+Y    G   G HAVK+IGWG  ++G  YW++AN WN  WG
Sbjct: 262 GPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWG-EENGVKYWLIANSWNTDWG 320

Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
            +G+F++ RG+N C IE     G
Sbjct: 321 ENGFFRMLRGTNLCDIELSATGG 343


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 122/228 (53%), Gaps = 32/228 (14%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECD 64
           T+R  ++S+  V+     LS  D+ +C     GD GC+GG P S + Y+   G+V  +  
Sbjct: 26  TDRMCIASNGTVTTH---LSAQDVTSCDKL--GDMGCNGGIPSSVYSYWALSGIV--DGG 78

Query: 65  PYFDSTGC---------------SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 109
            Y D +GC                +P C      PKC RKC  +++ W  +K      Y 
Sbjct: 79  NYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKCESEDKDWTKAKVKGEKGYS 138

Query: 110 INSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLI 161
           +    E        + A+IY+NGP+   F V +DF  YKSGVY+  +    +GGHA+K++
Sbjct: 139 VCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIM 198

Query: 162 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           G+GT +DG+DYW++AN WN  WG DGYFKI RG N C IE+ V+ G P
Sbjct: 199 GFGT-EDGKDYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGGP 245


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 78/193 (40%), Positives = 108/193 (55%), Gaps = 12/193 (6%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
              S  DL+ CC   CG  C GGY   AW+Y+   G+V+     Y  S GC  P  +  +
Sbjct: 125 FEFSPEDLINCCE-TCGKKCKGGYSYYAWKYYTSTGLVSG--GDYNTSRGC-QPYSKSNF 180

Query: 82  ---PTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY-KNGPVEVSFTVYE 135
               +P+C + C   K    + N +H+    Y I  +   I  EI  + GPV   F VYE
Sbjct: 181 NDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGFDVYE 240

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRG 194
           DF  Y+ GVY H +G ++G HAVK+IGWGT ++G  YW++AN W + WGA  G FKI+RG
Sbjct: 241 DFKLYREGVYVHTSGALLGSHAVKIIGWGT-ENGWAYWLVANSWGKDWGALGGVFKIRRG 299

Query: 195 SNECGIEEDVVAG 207
           +NEC IE+ ++ G
Sbjct: 300 TNECKIEQSIITG 312


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/208 (40%), Positives = 112/208 (53%), Gaps = 31/208 (14%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH--- 74
           DLL C  F    GC GG P  AWR+F + GVVT          + C PY +   C H   
Sbjct: 301 DLLHCLSF----GCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSE 355

Query: 75  ---PGCEPAYP-TPKCVRKC-----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
              P CE   P  PKC + C       K + +++  H++ SAY +    + I  E+ +NG
Sbjct: 356 GPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENG 414

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
            +  +F VYEDF  YK GVY H+TG  MGGHAVK+IG+G ++DG DYW+  N WN  WG 
Sbjct: 415 TLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGD 473

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKN 213
            G FKI+ G  E GI+++   G P   N
Sbjct: 474 KGTFKIEMG--EAGIDKEFCGGEPKVPN 499


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 71/168 (42%), Positives = 96/168 (57%), Gaps = 12/168 (7%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGG+  S WR+    G  T EC PY   T  +   C    PT     KC    +L   S
Sbjct: 140 CDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL---S 187

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
              +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H++G V GGHAV++
Sbjct: 188 TVKAKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEM 247

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G+
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 85/208 (40%), Positives = 112/208 (53%), Gaps = 31/208 (14%)

Query: 28  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH--- 74
           DLL C  F    GC GG P  AWR+F + GVVT          + C PY +   C H   
Sbjct: 301 DLLHCLSF----GCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSE 355

Query: 75  ---PGCEPAYP-TPKCVRKC-----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
              P CE   P  PKC + C       K + +++  H++ SAY +    + I  E+ +NG
Sbjct: 356 GPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENG 414

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
            +  +F VYEDF  YK GVY H+TG  MGGHAVK+IG+G ++DG DYW+  N WN  WG 
Sbjct: 415 TLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGD 473

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKN 213
            G FKI+ G  E GI+++   G P   N
Sbjct: 474 KGTFKIEMG--EAGIDKEFCGGEPKVPN 499


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 77/186 (41%), Positives = 102/186 (54%), Gaps = 18/186 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           ++  DL++C  F   DGCDGG+   AW +   +G+ TEEC PY    G   P        
Sbjct: 112 IAPEDLVSCDIF--DDGCDGGFIDMAWDWCQENGLTTEECIPYKAGEGVPSP-------- 161

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
             C   C   + ++R      I +YR   D +DI  EIY+ GPV + F VY DF  YKSG
Sbjct: 162 --CPETCEDGSAIYRTP----IESYRY-IDADDIQGEIYEYGPVSMGFIVYSDFMSYKSG 214

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           VY H  G + GGHAV ++GWG  D+   YW++ N W   WG +G+FKI RGS+ C  E +
Sbjct: 215 VYVHQAGYIEGGHAVLIVGWGVEDE-VPYWLVQNSWGTDWGENGFFKILRGSDHCECESN 273

Query: 204 VVAGLP 209
           V AG P
Sbjct: 274 VTAGYP 279


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 119/208 (57%), Gaps = 14/208 (6%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R  ++++  V +Q   LS  DL+ CC + CG+ C GGY   AW YF+  G+V+     
Sbjct: 111 SDRLCIATNGKVKIQ---LSPEDLIDCCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GD 164

Query: 66  YFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
           Y  STGC  P  E  Y   TP C   C   K    + + KH+  S Y I  +   I  EI
Sbjct: 165 YNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEI 223

Query: 122 YKNG-PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
              G PV  +F VY DF  Y+ GVY + +G + G  AVK+IGWGT ++G  YW+ AN W 
Sbjct: 224 LSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGT-ENGWAYWLAANSWG 282

Query: 181 RSWGA-DGYFKIKRGSNECGIEEDVVAG 207
           + WGA  G+FKI+RG+NECG EE ++AG
Sbjct: 283 KDWGALGGFFKIRRGTNECGFEESIIAG 310


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 80/195 (41%), Positives = 106/195 (54%), Gaps = 13/195 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +N+ LS  ++L+C       GC+GG+  +AWRY    GVV E C PY       H     
Sbjct: 234 ENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCK 286

Query: 80  AYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                + +R   C K   + R+S +    AY +N +  DIMAEI+ +GPV+ +  V  DF
Sbjct: 287 IRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDF 345

Query: 138 AHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
             Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  GYF+I RG
Sbjct: 346 FAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRG 405

Query: 195 SNECGIEEDVVAGLP 209
           SNECGIEE V+A  P
Sbjct: 406 SNECGIEEYVLASWP 420


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/200 (40%), Positives = 115/200 (57%), Gaps = 13/200 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTG 71
           +N++++  DL+ CC   CG+GC+GG+   ++++Y+V  G+V       TE C PY     
Sbjct: 141 RNVAIAAEDLMGCCA-DCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPY-PFKP 198

Query: 72  CSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
           C +P  +     +PKC   C    ++ +   K +   AY +  D   I  EI  NGPVE 
Sbjct: 199 CLYPFTDCHREESPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEG 258

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
            F VYED   YKSGVY+H+ G+ +G HAV++IGWG  + G  YW+++N +   WG  GYF
Sbjct: 259 GFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWG-REGGIPYWLISNSYGEDWGDHGYF 317

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           KI RG N  GIE  V+ GLP
Sbjct: 318 KIVRGINHLGIESKVITGLP 337


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)

Query: 22  LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
           + LS     +CC  + C   GC+GG P  AWR+F   GVVT            C PY + 
Sbjct: 329 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 387

Query: 70  TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
             C+H      P C+       TPKC + C ++        +    H + SAY + S  +
Sbjct: 388 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 446

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
           D+  ++  +GPV  +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW  
Sbjct: 447 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 505

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            N WN  WG  G FKI  G  +CGI+ ++VAG
Sbjct: 506 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 535


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)

Query: 22  LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
           + LS     +CC  + C   GC+GG P  AWR+F   GVVT            C PY + 
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 384

Query: 70  TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
             C+H      P C+       TPKC + C ++        +    H + SAY + S  +
Sbjct: 385 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 443

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
           D+  ++  +GPV  +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW  
Sbjct: 444 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 502

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            N WN  WG  G FKI  G  +CGI+ ++VAG
Sbjct: 503 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)

Query: 22  LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
           + LS     +CC  + C   GC+GG P  AWR+F   GVVT            C PY + 
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 384

Query: 70  TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
             C+H      P C+       TPKC + C ++        +    H + SAY + S  +
Sbjct: 385 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 443

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
           D+  ++  +GPV  +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW  
Sbjct: 444 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 502

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            N WN  WG  G FKI  G  +CGI+ ++VAG
Sbjct: 503 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 77/203 (37%), Positives = 110/203 (54%), Gaps = 27/203 (13%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
           N  LS +DL++CC  +CG GC+GG+P +AW Y+   G+V       T+ C PY +   C 
Sbjct: 136 NFHLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCE 193

Query: 74  H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           H      P C     TP C  KC     + +   K++   +Y +  +  +I  EI  NGP
Sbjct: 194 HHVNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGP 252

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
           VE +FTVYED   YKSGVY+H  G  +GGHA++++GWG   + +  YW++ N WN  WG 
Sbjct: 253 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGD 312

Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
           +         + CGIE  + AGL
Sbjct: 313 N---------DHCGIESSISAGL 326


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 13/201 (6%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPYF-----DSTGCS 73
           + LS  +L++C     G  C  G+   +W Y++ +G+VT +   C PY        +  S
Sbjct: 135 VQLSAIELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNS 192

Query: 74  HPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
           +P C    Y  P C + C     + ++  KHY    Y +  +  DI  EI  NGPVE   
Sbjct: 193 YPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGI 252

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            V+ DF +YKSGVY+HITG ++  H+V++IGWG  +D   YW+ AN WN  WG +GYFKI
Sbjct: 253 FVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYFKI 311

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
            RGSNEC IE  V AG   +K
Sbjct: 312 LRGSNECEIESFVNAGKVDNK 332


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/168 (41%), Positives = 95/168 (56%), Gaps = 12/168 (7%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGG+  S WR+ V  G  T+EC PY         G   A  T  C  KC   ++L    
Sbjct: 140 CDGGWLPSVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL---P 187

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
            + +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H+ G   GGHAV++
Sbjct: 188 IYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEM 247

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 60/128 (46%), Positives = 83/128 (64%), Gaps = 2/128 (1%)

Query: 83  TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
           TPKC++ C     + +   K Y   +Y +      I  EI  NGPVE +FTVYED   YK
Sbjct: 41  TPKCIKHCQASYTVAYEQDKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYK 100

Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
            GVY+H+TG ++GGHA++++GWG  +D   YW++AN WN  WG +G+FKI RGS+ CGIE
Sbjct: 101 DGVYQHVTGKMLGGHAIRILGWGVEND-VPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159

Query: 202 EDVVAGLP 209
             + AG+P
Sbjct: 160 SQISAGIP 167


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/217 (39%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S    ++Q   LS  ++L+C       GC+GG+  +AWRY    GVV E C P
Sbjct: 225 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 279

Query: 66  Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
           Y           +S      GC P+                 R+S +    AY +N +  
Sbjct: 280 YTQHRDTCKIRHNSRSLKANGCRPSANVD-------------RDSFYTVGPAYTLNKE-S 325

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDY 172
           DIMAEIY +GPV+ +  VY DF  Y SGVY+      G   G H+VKL+GWG   +G+ Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           WI AN W   WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 101/176 (57%), Gaps = 18/176 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY    GC       
Sbjct: 73  KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAP 129

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TP CV+KC    ++ +    H   SAY + +D + I  EIY N
Sbjct: 130 CEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTN 189

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 190 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 245


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/217 (39%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S    ++Q   LS  ++L+C       GC+GG+  +AWRY    GVV E C P
Sbjct: 225 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 279

Query: 66  Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
           Y           +S      GC P+                 R+S +    AY +N +  
Sbjct: 280 YTQHRDTCKIRHNSRSLKANGCRPSANVD-------------RDSFYTVGPAYTLNKE-S 325

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDY 172
           DIMAEIY +GPV+ +  VY DF  Y SGVY+      G   G H+VKL+GWG   +G+ Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           WI AN W   WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 79/198 (39%), Positives = 109/198 (55%), Gaps = 13/198 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPYF-----DSTG 71
             + LS  +L++C     G  C  G+   +W Y++ +G+VT +   C PY        + 
Sbjct: 50  MKVQLSAIELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSS 107

Query: 72  CSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
            S+P C    Y  P C + C     + ++  KHY    Y +  +  DI  EI  NGPVE 
Sbjct: 108 NSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEA 167

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
              V+ DF +YKSGVY+HITG ++  H+V++IGWG  +D   YW+ AN WN  WG +GYF
Sbjct: 168 GIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYF 226

Query: 190 KIKRGSNECGIEEDVVAG 207
           KI RGSNEC IE  V AG
Sbjct: 227 KILRGSNECEIESFVNAG 244


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 80/208 (38%), Positives = 110/208 (52%), Gaps = 16/208 (7%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       +Q + +S  D+LACCG  CG GC+GG    AW Y    GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184

Query: 61  ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
               +E   C PY      +H G    C  + ++ TP C + C     + +   K Y  S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y H  G   G HAVK++GWG  
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
           ++G  YW +AN W+  WG DGYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 79/207 (38%), Positives = 112/207 (54%), Gaps = 25/207 (12%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
           LS  ++ AC   L   GC GG+P SAW +    G+ T             + C PY D  
Sbjct: 194 LSAGEMNACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPY-DFP 252

Query: 71  GCSHPGCEPAYPT-PKCVR---KCVKKNQ----LWRNSKHYSISAYRINSDPEDIMAEIY 122
            C+H   +P YP  PK  R   +CV K +    ++ + +++ + +   +   +D    I 
Sbjct: 253 PCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVVYFSDRYFMVESVPYHFSADDAKNAIR 312

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
            +GPV  +F VYEDF  YKSGVYKH +G ++G HAVK+IGWG  D GE YW++ N WN  
Sbjct: 313 TDGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWG-EDGGEAYWLVVNSWNEG 371

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
           WG  G FKI  G  +CGI+ +++ G P
Sbjct: 372 WGDHGLFKIALG--DCGIDNELLGGTP 396


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 117/232 (50%), Gaps = 31/232 (13%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
             VT    D L      +   L LS  ++ AC       GCDGGYP SAW +    G+ T
Sbjct: 92  FGVTEAFNDRLCVKSNGTFTEL-LSAGEMNACAPSY---GCDGGYPDSAWSWVHDEGIAT 147

Query: 61  -------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLW 97
                        + C PY D   C+H       P C + +Y TP CV +C   K +   
Sbjct: 148 GGDYVARGNLTKGDGCWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSL 206

Query: 98  RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
           +N +HY + +        +    I  +GPV  S+ VYEDF  YKSGVYKH +G  +GGHA
Sbjct: 207 KNDRHYMLESSPYQYSVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHA 266

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           VK+IGWG  ++GE YW++ N WN  WG  G FKI  G+  C I++D++ G P
Sbjct: 267 VKIIGWG-EENGEAYWLVVNSWNEDWGDHGLFKIALGN--CQIDDDLLGGTP 315


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/213 (38%), Positives = 113/213 (53%), Gaps = 25/213 (11%)

Query: 21  NLSLSVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 70
           N  LS  +++ACC         GC GG  ++AW +   HG+ TE        C PY +  
Sbjct: 110 NQLLSAGEMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPY-NFP 168

Query: 71  GCSH--------PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAE 120
            C+H        P  +  Y TP C+ +C   K        +H++  +  +    ++I  E
Sbjct: 169 KCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKE 228

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           I  NGP   +F+VYEDF  YKSGVYKH  G +MG H+V++IGWGT + G DYW++ N WN
Sbjct: 229 IMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGT-EKGVDYWLVMNSWN 287

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
             WG  G FKI +G  +CGI +D V G P + N
Sbjct: 288 EGWGDHGTFKIAQG--DCGI-DDAVLGSPPAMN 317


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 84/204 (41%), Positives = 115/204 (56%), Gaps = 14/204 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + L +S  DL+ACC   CG GC+GGYP +AW Y+V +G+ + +C PY     C H G + 
Sbjct: 139 KQLRISAADLMACCT-GCGGGCEGGYPDAAWEYYVSNGITSSQCQPY-PFPRCEHRGAQG 196

Query: 80  AYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
             P        TP C   C  K+      K+    +Y +  + ED   E+Y NGP  V F
Sbjct: 197 KKPPCSKYNFDTPTCNATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
            V+ DF  YKSGVY+H+ G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I
Sbjct: 254 QVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKM-NGTPYWKVANSWDTDWGMNGYFLI 312

Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
            RG+NEC IE    AG P +  L 
Sbjct: 313 LRGNNECNIEHLGFAGTPDTSQLT 336


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 117/232 (50%), Gaps = 31/232 (13%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
             VT    D L    + +   L LS  ++ AC       GC+GG+P SAW +    G+ T
Sbjct: 53  FGVTEAFNDRLCIKSHGTFTEL-LSAGEMNACAP---SHGCNGGFPNSAWSWVHDKGIAT 108

Query: 61  -------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLW 97
                        + C PY D   C+H       P C + +Y TP C  +C   K     
Sbjct: 109 GGDYVAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTL 167

Query: 98  RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
           R+ +H+ + +        D    I  +GPV  SFTVYEDF  YKSGVYKH +G+ +GGHA
Sbjct: 168 RDDRHFMVESSPYQYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHA 227

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           VK+IGWG  + G+ YW++ N WN  WG  G FKI  G+  CGI++ ++ G P
Sbjct: 228 VKIIGWG-EESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTP 276


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 83/208 (39%), Positives = 107/208 (51%), Gaps = 29/208 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDS 69
           + + LS  ++L+C       GCDGG+  +AWRY    GVV E C PY           +S
Sbjct: 234 ETVQLSAQNILSCTRRQ--QGCDGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNS 291

Query: 70  TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 GCE    TP  V          R++ +    AY +N +  DIMAEI+ +GPV+ 
Sbjct: 292 RSLRANGCE----TPVNVD---------RDTFYTVGPAYSLNREA-DIMAEIFNSGPVQA 337

Query: 130 SFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           +  V  DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  
Sbjct: 338 TMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEK 397

Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           GYF+I RGSNECGIEE V+A  P   N 
Sbjct: 398 GYFRILRGSNECGIEEYVLASWPYVYNF 425


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 109/208 (52%), Gaps = 28/208 (13%)

Query: 23  SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY  +  
Sbjct: 117 NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-KNRP 170

Query: 72  CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
           C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I  E
Sbjct: 171 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 230

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           I  +GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N WN
Sbjct: 231 IMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 290

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
            +WG DG FKI RG N C IE  V+AG+
Sbjct: 291 SNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       +Q + +S  D+LACCG  CG GC+GG    AW Y    GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184

Query: 61  ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
               +E   C PY      +H G    C  + ++ TP C + C     + +   K Y  S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y H  G   G HAVK++GWG  
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
           ++G  YW +AN W+  WG +GYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 100/172 (58%), Gaps = 17/172 (9%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 44  VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100

Query: 75  ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 C   Y TP C   C  KK  L +   + S     I S  E    E+  NGP EV
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSC----ILSGEESFKRELLLNGPFEV 156

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           SF+VY DF  Y  GVYKH+TG  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 157 SFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 71/152 (46%), Positives = 90/152 (59%), Gaps = 20/152 (13%)

Query: 63  CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
           C+PY     C H      P C    Y TP+C   C K     R    Y+   +R      
Sbjct: 162 CEPY-PFPKCEHFTKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQDKHRA----- 210

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
            I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW++
Sbjct: 211 -IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTPYWLI 268

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 269 ANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       +Q + +S  D+LACCG  CG GC+GG    AW Y    GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGSECGRGCNGGMDHKAWEYVKEFGVVT 184

Query: 61  ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
               +E   C PY      +H G    C  + ++ TP C + C     + +   K Y  S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y H  G   G HAVK++GWG  
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
           ++G  YW +AN W+  WG +GYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 112/217 (51%), Gaps = 32/217 (14%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S    ++Q   LS  ++L+C       GC+GG+  +AWRY    GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 277

Query: 66  Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
           Y           +S      GC+  Y                R++ +    AY +N +  
Sbjct: 278 YTQQRDTCKIRHNSRSLRANGCQTPYNVD-------------RDTFYTVGPAYSLNREA- 323

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGEDY 172
           DIMAEI+ +GPV+ +  V  DF  Y  GVY+    + M   G H+VKL+GWG   +GE Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKY 383

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           WI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 384 WIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 70/168 (41%), Positives = 93/168 (55%), Gaps = 12/168 (7%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L    
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL---P 187

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
            + +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H  G V GGHAV++
Sbjct: 188 IYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEM 247

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 18/204 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
           N  LS  +L  CC   CG GC GG P+ AW YF   GV T       E C PY      +
Sbjct: 135 NQLLSPEELTFCCK-DCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRN 193

Query: 69  STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
             G +    +P     +C + C  K  +   +++ + S Y INS  + I  +I   GPVE
Sbjct: 194 KQGENICDEQPMERNHQCPKTCYGKTTV--QNRYKTKSEYYINS-IKTIEQDIKTYGPVE 250

Query: 129 VSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
            SF  Y+D + YKSG+Y K       GGH++K+IGWG  +DG  YW+  N W++ WG  G
Sbjct: 251 ASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWG-QEDGTPYWLAVNSWSKFWGDHG 309

Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
            FKI +G NECGIE  V AG+PSS
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPSS 333


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 82/213 (38%), Positives = 105/213 (49%), Gaps = 28/213 (13%)

Query: 21  NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTE-------------- 61
           N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T               
Sbjct: 141 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSI 200

Query: 62  -ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
             CD  + +   S P   P Y TP C   C   N  W    +  KH+  + Y +     D
Sbjct: 201 YPCDKNYPNGTTSVPC--PGYHTPPCEDHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTD 257

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI  NGPV  SF +YEDF  YKSG+Y H  GD  GG   K+IGWG  D+G  YW+  
Sbjct: 258 IQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCV 316

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +QW   +G +G+ +I RG NE  IE  V+A LP
Sbjct: 317 HQWGTDFGENGFVRILRGVNEVNIEHQVLAALP 349


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 255 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASR 313

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YRI+S+  +IM EI +NGPV+    V
Sbjct: 314 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRISSNETEIMKEIMQNGPVQAIMQV 368

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYKSG+Y+H+            +  HAVKL+GWGT        E +WI AN W +
Sbjct: 369 HEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEKFWIAANSWGK 428

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 429 SWGENGYFRILRGVNESDIEKLIIAA 454


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 74/191 (38%), Positives = 106/191 (55%), Gaps = 19/191 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS   L++C       GC+GG P  AW Y   HG+ T+ C PY    G +       
Sbjct: 129 NVTLSPQALVSC-DIEFNQGCNGGIPQMAWEYLELHGIPTDSCFPYTSGNGTA------- 180

Query: 81  YPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
              P C ++C    K QL++  K +++   +  S    I A ++  GP+E +  VY+DF 
Sbjct: 181 ---PDCQKECSDGSKYQLYKG-KTFTL---KTCSSVAAIQANVFAYGPIEGTMDVYQDFM 233

Query: 139 HYKSGVYKHITGD-VMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
            Y SGVY    G  ++GGHA+K++GWGT S  G DYWI+ N W   WG +G+F I+RG+N
Sbjct: 234 SYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTN 293

Query: 197 ECGIEEDVVAG 207
            CGI+ D  AG
Sbjct: 294 MCGIDRDASAG 304


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       +Q + +S  D+LACCG  CG GC+GG    AW Y    GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184

Query: 61  ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
               +E   C PY      +H G    C  + ++ TP C + C     + +   K Y  S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            Y ++ D + I  E+ KNGPV+ +   YEDF+ Y+ G+Y H  G   G HAVK++GWG  
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGV- 303

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
           ++G  YW +AN W+  WG DGYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 108/208 (51%), Gaps = 28/208 (13%)

Query: 23  SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY  +  
Sbjct: 117 NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-KNRP 170

Query: 72  CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
           C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I  E
Sbjct: 171 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 230

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           I   GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N WN
Sbjct: 231 IMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 290

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
            +WG DG FKI RG N C IE  V+AG+
Sbjct: 291 SNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 106/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDS 69
           + + LS  ++L+C       GCDGG+  +AWR+    GVV + C PY           +S
Sbjct: 233 EAVRLSAQNILSCTRRQ--QGCDGGHLDAAWRFLHKKGVVDDSCYPYTQQRDTCKIRHNS 290

Query: 70  TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                 GC P+               + R+S +    AY +N +  DIMAEIY +GPV+ 
Sbjct: 291 RSLKANGCRPS-------------PNVDRDSFYTVGPAYTLNRE-GDIMAEIYHSGPVQA 336

Query: 130 SFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           +  VY DF  Y  G+Y+      G   G H+VKL+GWG   +G+ YWI AN W   WG  
Sbjct: 337 TMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGER 396

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           GYF+I RGSNECGIEE V+A  P
Sbjct: 397 GYFRILRGSNECGIEEYVLASWP 419


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/194 (41%), Positives = 103/194 (53%), Gaps = 17/194 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D L        + L LS  D+LACCG  CG GC+GGY   AW Y  + GV +
Sbjct: 5   VSAAETMSDRLCVQTNGRKKTL-LSDTDILACCGDFCGYGCNGGYSARAWLYARNSGVCS 63

Query: 61  ----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKHYSI 105
               +E   C PY      +      +  C +  Y TP C + C     + +   K Y+ 
Sbjct: 64  GGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKIYAX 123

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
            AYR++SD   I AEI+  GPV+ SF  YEDFAHYKSG+Y H  G   GGHAVK+IGWG 
Sbjct: 124 DAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIGWGV 183

Query: 166 SDDGEDYWILANQW 179
            ++G   WI+AN W
Sbjct: 184 -ENGTKXWIVANSW 196


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/174 (42%), Positives = 98/174 (56%), Gaps = 21/174 (12%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 44  VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100

Query: 75  ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 C   Y TP C   C  K      +R +  Y +S        E    E+  NGP 
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 154

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 155 EVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 106/201 (52%), Gaps = 22/201 (10%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG---CSHPGCE 78
           + LS  ++L+C       GC+GG+  +AWRY    GV+ E C PY  S G     H G  
Sbjct: 238 VQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSL 295

Query: 79  PAY---PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
            A+   P P      V ++ L+     YS+S         DI AEI+ +GPV+ +  VY 
Sbjct: 296 KAHGCRPAPG-----VDRDSLYTVGPAYSLSR------EADIKAEIFHSGPVQATMRVYR 344

Query: 136 DFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           DF  Y  G+Y+      G   G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I 
Sbjct: 345 DFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRIL 404

Query: 193 RGSNECGIEEDVVAGLPSSKN 213
           RGSNECGIE+ V+A  P   N
Sbjct: 405 RGSNECGIEDYVLASWPYVYN 425


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/172 (43%), Positives = 98/172 (56%), Gaps = 19/172 (11%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC+GGY   AW +   HGVV + C PY   +G +          P C  KC   +     
Sbjct: 141 GCNGGYMDMAWEFLDQHGVVADSCFPYSAGSGFA----------PACASKCADGSA---- 186

Query: 100 SKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
            K YS    + R +   E I +EI  +GPVE +FTVY DF +Y+SGVY   T DV GGHA
Sbjct: 187 EKKYSCVHGSIRQSQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHA 246

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +K++G+G  ++G  YW+ AN W  SWG  G+FKIK+G  ECGIE+ V +  P
Sbjct: 247 IKILGFGV-ENGTPYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDP 295


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 101/167 (60%), Gaps = 17/167 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
           LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY +   C H  
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205

Query: 75  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C+    TP C R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
            F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++A
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIA 311


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKV 380

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 159 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 217

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 218 SDGRGKRHATKPCPNNFEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 272

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 273 HEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGK 332

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 333 SWGENGYFRILRGVNESDIEKLIIAA 358


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 113/203 (55%), Gaps = 19/203 (9%)

Query: 9   DALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYF 67
           +A S    ++ +++ LS  DL++C      D GC+GGY   AW Y   HG  T+ C PY 
Sbjct: 109 EAFSDRFAINGKDVILSPEDLVSC---DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYS 165

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             +G +          P C  KC   + + R     + ++ R +     I +EI  +GPV
Sbjct: 166 AGSGFA----------PACSDKCADGSAMQRFK--CAPNSVRQSKGVAQIQSEIVSHGPV 213

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
           E +FTVY DF +Y+SGVY   T DV GGHA+K++G+G  ++G  YW+ AN W  +WG  G
Sbjct: 214 EGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWLCANSWGPAWGMSG 272

Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
           +FKIK+G  ECGIE+ V +  P 
Sbjct: 273 FFKIKQG--ECGIEDQVFSCDPQ 293


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/179 (40%), Positives = 94/179 (52%), Gaps = 12/179 (6%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQL 96
           +P  AWRY+V +G+ +  C PY     C H G +          + TPKC   C  K+  
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSI- 219

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+H+ GD +GG 
Sbjct: 220 -PLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGT 278

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 113/216 (52%), Gaps = 16/216 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S    ++Q   LS  ++L+C       GC+GG+  +AWRY    GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYP 277

Query: 66  YFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           Y       H          + +R   C     + R++ +    AY +N +  DIMAEI+ 
Sbjct: 278 YTQ-----HRDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFH 331

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWN 180
           +GPV+ +  V  DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W 
Sbjct: 332 SGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWG 391

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
             WG  GYF+I RGSNECGIEE V+A  P   N  K
Sbjct: 392 SWWGEHGYFRILRGSNECGIEEYVLASWPYVYNYYK 427


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 107/215 (49%), Gaps = 26/215 (12%)

Query: 21  NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTEE-------CDPYFD 68
           N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T         C PY  
Sbjct: 142 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSI 201

Query: 69  -------STGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDI 117
                  + G +   C P Y TP C   C   N  W    +  KH+  + Y +     DI
Sbjct: 202 YPCDKKYANGTTSVPC-PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDI 259

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
             EI  NGPV  SF +Y+DF  YK+G+Y H  GD  GG   K+IGWG  D+G  YW+  +
Sbjct: 260 QIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVH 318

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
           QW   +G +G+ +  RG NE  IE  V+A LP S+
Sbjct: 319 QWGTDFGENGFVRFLRGVNEVNIEHQVLAALPDSE 353


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/174 (42%), Positives = 98/174 (56%), Gaps = 21/174 (12%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 44  VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100

Query: 75  ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 C   Y TP C   C  K      +R +  Y +S        E    E+  NGP 
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 154

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 155 EVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/186 (38%), Positives = 102/186 (54%), Gaps = 18/186 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           +S  DL++C       GC+GGY    W +    G+ TE+C PY   +G            
Sbjct: 106 MSPQDLVSCESN--NMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSG----------RV 153

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           P C  KC   + + R+   +  S    NS  + +M E+  NGPV   F V+EDF +YKSG
Sbjct: 154 PTCPSKCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSG 208

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           +Y+H TG   G H V L+GWGT ++G  YW+L N W   WG  G+F+I+RG+N+C I+E 
Sbjct: 209 IYQHKTGKSKGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEI 267

Query: 204 VVAGLP 209
             +GLP
Sbjct: 268 FYSGLP 273


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/174 (43%), Positives = 100/174 (57%), Gaps = 21/174 (12%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
           +++L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H   
Sbjct: 44  VRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100

Query: 75  ----PGCEPAYPTPKCVRKCV-KKNQL--WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                 C   Y TP C   C  KK  L  +R +  Y +S        E    E+  NGP 
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPF 154

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 155 EVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 80/213 (37%), Positives = 106/213 (49%), Gaps = 28/213 (13%)

Query: 21  NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT--------------- 60
           N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T               
Sbjct: 144 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTI 203

Query: 61  EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
             CD  + +   S P   P Y TP C  +C   N  W    +  KH+  + Y +     D
Sbjct: 204 YPCDKKYPNGTTSVPC--PGYHTPVCEERCTS-NITWPISYKQDKHFGKAHYNVGKKMTD 260

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI +NGPV  SF +Y+DF  YKSG+Y H  GD  GG   K+IGWG  D+G  YW+  
Sbjct: 261 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCV 319

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +QW   +G +G+ +I RG NE  IE  V+A  P
Sbjct: 320 HQWGTDFGENGFVRILRGVNEVNIEHQVLAAQP 352


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 81/229 (35%), Positives = 114/229 (49%), Gaps = 18/229 (7%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       LQ + LS  D+L+CCG +CGDGC+GGY   AW +    GVVT
Sbjct: 125 VSAASTMSDRICVQTKGKLQTI-LSDTDILSCCGRMCGDGCEGGYDHLAWEWVQRFGVVT 183

Query: 61  E-------ECDPY-FDSTGCSHP---GC--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
                    C PY F   G  H     C  + ++ TP C   C     + +   K +  S
Sbjct: 184 GGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVKS 243

Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
            Y +++D + I  E+ KNGPV+ +F  YEDF+ YK G+Y H+ G   G HAVKLIGWG  
Sbjct: 244 TYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERGAHAVKLIGWGV- 302

Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           ++G  YW +AN W+  WG   +      S    +   +V      +NL+
Sbjct: 303 ENGTKYWTVANSWHDDWGGKRFLPYSTWSESLRVR--IVCRFRRIQNLI 349


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 109/207 (52%), Gaps = 28/207 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F +   ++ GC  A  
Sbjct: 264 NLSPQNLISCCP-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASR 322

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 323 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 377

Query: 134 YEDFAHYKSGVYKHITGDV---------MGGHAVKLIGWGT----SDDGEDYWILANQWN 180
           +EDF HYK+G+Y+HIT            +  HAVKL GWGT        E +WI AN W 
Sbjct: 378 HEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKEKFWIAANSWG 437

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAG 207
           +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 438 KSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 117/232 (50%), Gaps = 26/232 (11%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +SVT    D +  +   ++  L  S   L++CC   CG+GC GGY  +AWRY +  G+VT
Sbjct: 127 ISVTSAMNDRICIASQGNITAL-YSPQKLVSCCE-DCGNGCSGGYTAAAWRYILKKGIVT 184

Query: 61  -------EECDPYF-----DSTGCSHP----------GCEPAYPTPKCVRKCVKKNQLWR 98
                  E C P+       ST  + P          G +PA  TPKC   C       +
Sbjct: 185 GGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA-TTPKCDLSCYNARHEGK 243

Query: 99  NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
                  +      D       + K+GP  V+  VYEDF  YKSGVY H+TGD +G  +V
Sbjct: 244 YLDDIIKAKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSV 303

Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           ++IGWG  + G+ +W+LAN W  SWG  G+FKI+R  NEC IE    AG+P+
Sbjct: 304 RMIGWGL-EGGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPN 354


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 78/182 (42%), Positives = 95/182 (52%), Gaps = 19/182 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS------- 73
           N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC        
Sbjct: 136 NKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDYWRTHGIVTGGSKE--DPSGCRSYPFPKC 192

Query: 74  -------HPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
                  +P C    YPTP+CV+ C      +   K  +  +Y I +    IM EI   G
Sbjct: 193 DHHVQGHYPPCPRQIYPTPECVQDCDTPELGYLEDKTRANISYNIYASEISIMKEIMLRG 252

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           PVE  FTVYEDF  YKS VY H  G  M GHA++++GWG   D   YW++AN WN  WG 
Sbjct: 253 PVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEEGD-VPYWLIANSWNEDWGE 311

Query: 186 DG 187
            G
Sbjct: 312 KG 313


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 109/194 (56%), Gaps = 15/194 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCE 78
           +N+ LS   LL+C       GC GG+   AW +   HG+V E+C PY  S T C      
Sbjct: 237 ENMVLSPQTLLSC-NVRAQQGCHGGHIDVAWNFARGHGLVDEKCFPYKASVTRC------ 289

Query: 79  PAYPTPKCVRK-CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
           P  P    ++  C+   +  R +  Y +      S  +DIM +I ++GPV+   TVY+DF
Sbjct: 290 PFRPRGNLIQDGCMPLVK--RRTSRYKLGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDF 347

Query: 138 AHYKSGVYK---HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
            HY+ GVY+   H   ++ G H+V++IGWG  D G+ YW++AN W R WG +GYF+I RG
Sbjct: 348 FHYRDGVYRRSYHGNNELKGFHSVRIIGWG-EDRGDRYWVVANSWGRQWGENGYFRIARG 406

Query: 195 SNECGIEEDVVAGL 208
           SNE  IE  VV GL
Sbjct: 407 SNEADIESFVVTGL 420


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/189 (40%), Positives = 104/189 (55%), Gaps = 17/189 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           NL LS  D+++C       GC GGY   AW+Y    GV ++ C+PY      S  G +P+
Sbjct: 126 NLVLSPQDMVSC--DTSNFGCFGGYLDQAWQYLEQQGVSSDSCEPYK-----SGNGDQPS 178

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
            PT     + +KK +    S   +  A       E   + I ++GPVE  FTVY+DF +Y
Sbjct: 179 CPTKCSNGQAIKKYKCKAGSTKQAKGA-------EATKSLIQESGPVETGFTVYQDFYNY 231

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
            SGVY H+TGD  GGHAVK++GWG     E+YWI+AN W   WG  GYF I++G  + GI
Sbjct: 232 NSGVYHHVTGDAEGGHAVKILGWG-KQGLENYWIVANSWGEDWGEKGYFNIRQG--DSGI 288

Query: 201 EEDVVAGLP 209
           +E     +P
Sbjct: 289 DEATFGCIP 297


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/179 (39%), Positives = 95/179 (53%), Gaps = 12/179 (6%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQL 96
           +P  AWRY+V +G+ +  C PY     C H G +          + TPKC   C  K+  
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD++GG 
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQ 278

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           AV+++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVRIVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 72/186 (38%), Positives = 102/186 (54%), Gaps = 18/186 (9%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           +S  DL++C       GC+GGY    W +    G+ TE+C PY   +G            
Sbjct: 104 MSPQDLVSCESN--NMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSG----------RV 151

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           P C  KC   + + R+   +  S    NS  + +M E+  NGPV   F V+EDF +Y+SG
Sbjct: 152 PTCPSKCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSG 206

Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           VY+H TG   G H V L+GWGT ++G  YW+L N W   WG  G+F+I+RG+N+C I+E 
Sbjct: 207 VYQHKTGRSQGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEI 265

Query: 204 VVAGLP 209
             +GLP
Sbjct: 266 FYSGLP 271


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 80/209 (38%), Positives = 112/209 (53%), Gaps = 16/209 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S    ++Q   LS  ++L+C       GC+GG+  +AWRY    GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYP 277

Query: 66  YFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           Y       H          + +R   C     + R++ +    AY +N +  DIMAEI+ 
Sbjct: 278 YT-----QHRDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFH 331

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           +GPV+ +  V  DF  Y  GVY+    +   + G H+VKL+GWG   +GE YWI AN W 
Sbjct: 332 SGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWG 391

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLP 209
             WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 392 SWWGEHGYFRILRGSNECGIEDYVLASWP 420


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  130 bits (326), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 75/192 (39%), Positives = 100/192 (52%), Gaps = 23/192 (11%)

Query: 21  NLSLSVNDLLAC-CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           NL LS  D+L+C     C   C GGY  +AW+Y    GV ++ C+PY    G        
Sbjct: 126 NLVLSPQDMLSCDASNFC---CFGGYLDTAWQYLEQQGVGSDSCEPYKSGNG-------- 174

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
               P C  KC     +    K Y   A   +     E   + I ++GPVE  FT+YEDF
Sbjct: 175 --DQPSCPSKCSNGQAI----KKYKCKAGSTKQAKGAEATKSLIQQSGPVETGFTIYEDF 228

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
            +Y SG+Y H+TG  MGGHAVK++GWG     E+YWI+AN W   WG  GYF I++G  +
Sbjct: 229 LNYNSGIYHHVTGGNMGGHAVKILGWGKQGL-ENYWIVANSWGEDWGEKGYFNIRQG--D 285

Query: 198 CGIEEDVVAGLP 209
            GI+E     +P
Sbjct: 286 SGIDEATFGCIP 297


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 80/202 (39%), Positives = 109/202 (53%), Gaps = 27/202 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-----DSTGCSHPGC 77
           +LSV +L++C       GC+GG   SAWRY   HGVV+  C P F     + +G +H   
Sbjct: 272 NLSVQNLISC-DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYV 330

Query: 78  EPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
              Y       P P  + K    N+L+R + HY     R++S   +IM EI   GPV+  
Sbjct: 331 SSEYGKNYTNGPCPNALEK---SNRLYRCASHY-----RVSSKETNIMKEIMDKGPVQAI 382

Query: 131 FTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWG 184
             VYEDF  YK G+Y+H    G     H+VKL+GWG   D     + +WI AN W +SWG
Sbjct: 383 MKVYEDFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSWG 442

Query: 185 ADGYFKIKRGSNECGIEEDVVA 206
            +GYF+I RG NEC IE+ ++A
Sbjct: 443 ENGYFRILRGQNECDIEKLILA 464


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C+GG+  +AW++    G  T+EC PY   +      C    PT     KC   +     +
Sbjct: 141 CNGGWLPNAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----KCADGSSKVHLT 191

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
              S   Y +  D   +M  +   GP++V+F VY DF +Y+SGVY+H  G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEM 249

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 250 VGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 14/143 (9%)

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI-MAEIYKNGPVEVSFTVYE 135
           CEP Y               ++  KHY  S+Y ++            KNGPVE +FTVY 
Sbjct: 5   CEPGYSPS------------YKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAAFTVYS 52

Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           DF  YKSGVY+H+ GD+MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI RG 
Sbjct: 53  DFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQ 111

Query: 196 NECGIEEDVVAGLPSSKNLVKEI 218
           + CGIE ++VAG+P +    K I
Sbjct: 112 DHCGIESEIVAGIPCTDQYWKRI 134


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 105/206 (50%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     +  GC  A  
Sbjct: 267 NLSPQNLISCCS-KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATSNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSANKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 80/213 (37%), Positives = 103/213 (48%), Gaps = 28/213 (13%)

Query: 21  NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT--------------- 60
           N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T               
Sbjct: 144 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSI 203

Query: 61  EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
             CD  + +   S P   P Y TP C   C   N  W    +  KH+  + Y +     D
Sbjct: 204 YPCDKKYPNGTTSVPC--PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTD 260

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI  NGPV  SF +Y+DF  YKSG+Y H  GD  GG   K+IGWG  D G  YW+  
Sbjct: 261 IQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DSGVPYWLCV 319

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           +QW   +G +G+ +  RG NE  IE  V+A LP
Sbjct: 320 HQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 352


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
           SWG +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score =  129 bits (324), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 74/206 (35%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNDGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T              HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  129 bits (324), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 72/189 (38%), Positives = 104/189 (55%), Gaps = 17/189 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS  D+++C       GCDGGY   AW+Y    GV ++ C+PY  ++G +       
Sbjct: 126 NVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVASDSCEPYKSASGTA------- 176

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
              P C  KC    Q  +  K  + S  + N       + I ++GPVE  FTVY DF +Y
Sbjct: 177 ---PSCPSKCAN-GQAIKKYKCQAGSTKQANGAAA-TKSLIQQSGPVETGFTVYADFFNY 231

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSG+Y H++G   GGHAVK++GWG     E+YWI+AN W  SWG  G+F I++G  + GI
Sbjct: 232 KSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYWIVANSWGESWGEKGFFNIRQG--DSGI 288

Query: 201 EEDVVAGLP 209
           ++     +P
Sbjct: 289 DQATFGCIP 297


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+HIT           +  HAVKL GWGT        E +WI AN W  
Sbjct: 381 HEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGI 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++ GC  A  
Sbjct: 263 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASR 321

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 322 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 376

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+T           +  HA+KL GWGT        E +WI AN W +
Sbjct: 377 HEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGK 436

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 437 SWGENGYFRILRGVNESDIEKLIIAA 462


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY- 81
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 82  --------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                    T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRDATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANFWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ V+A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLVIAA 466


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC GG    AW Y    G+V+  C P F     ++ GC+ A  
Sbjct: 259 NLSPQNLISCC-VKNRHGCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASR 317

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 318 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 372

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYKSG+Y+HI            +  HAVKL GWG         E +WI AN W +
Sbjct: 373 HEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKFWIAANSWGK 432

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 433 SWGENGYFRILRGVNESDIEKLIIAA 458


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/189 (38%), Positives = 104/189 (55%), Gaps = 17/189 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS  D+++C       GCDGGY   AW+Y    GV ++ C+PY  ++G +       
Sbjct: 126 NVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVASDSCEPYKSASGTA------- 176

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
              P C  KC    Q  +  K  + S  + N       + I ++GPVE  FTVY DF +Y
Sbjct: 177 ---PSCPSKC-SNGQAIKKYKCKAGSTKQANGAAA-TKSLIQQSGPVETGFTVYADFFNY 231

Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
           KSG+Y H++G   GGHAVK++GWG     E+YWI+AN W  SWG  G+F I++G  + GI
Sbjct: 232 KSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYWIVANSWGESWGEKGFFNIRQG--DSGI 288

Query: 201 EEDVVAGLP 209
           ++     +P
Sbjct: 289 DQATFGCIP 297


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
           SWG +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC  G    AW Y    G+V+  C P+      ++  C  A  
Sbjct: 216 NLSPQNLISCCA-KNRHGCSSGSIDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASR 274

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI  NGPV+    V
Sbjct: 275 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIIHNGPVQAIMQV 329

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYKSG+Y+H+T           +  HAVKL GWGT        E +WI+AN W  
Sbjct: 330 HEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQGRKEKFWIVANSWGN 389

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 390 SWGENGYFRILRGVNESDIEKLIIAA 415


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 87/217 (40%), Positives = 119/217 (54%), Gaps = 23/217 (10%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R  ++++  V +Q   LS  DL+ CC + CG+ C GGY   AW YF+  G+V+     
Sbjct: 111 SDRLCIATNGKVKIQ---LSPEDLIDCCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GD 164

Query: 66  YFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
           Y  STGC  P  E  Y   TP C   C   K    + + KH+  S Y I  +   I  EI
Sbjct: 165 YNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEI 223

Query: 122 YKNG-PVEVSFTVYEDFAHYK---------SGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
              G PV  +F VY DF  Y+          GVY + +G + G  AVK+IGWGT ++G  
Sbjct: 224 LSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAVKIIGWGT-ENGWA 282

Query: 172 YWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 207
           YW+ AN W + WGA  G+FKI+RG+NECG EE ++AG
Sbjct: 283 YWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 319


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/207 (36%), Positives = 109/207 (52%), Gaps = 26/207 (12%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD----STGCSHP- 75
            ++LS  +L++CC      GC GG    AW Y    G+V+  C P F     + GC+   
Sbjct: 265 TVNLSPQNLISCC-LKHRYGCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMAS 323

Query: 76  ---GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
              G    + T  C     K N++++ S       YR++S+   IM EI KNGPV+    
Sbjct: 324 RSDGRGKRHATTPCPNNIEKSNRIYQCS-----PPYRVSSNETQIMKEIMKNGPVQAIMQ 378

Query: 133 VYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGT----SDDGEDYWILANQWN 180
           V+EDF +YK+G+Y+H+T  +        +  HAVKL GWGT        E +WI AN W 
Sbjct: 379 VHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFWIAANSWG 438

Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAG 207
           +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 439 KSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
 gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
          Length = 158

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 13/169 (7%)

Query: 43  GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 102
           GG+ ++ WR+    G  +E+C PY  S G + P C         ++ C    +    S  
Sbjct: 1   GGFLVATWRFLAAVGTASEQCVPYV-SFGGAVPACN--------IKSCAVSGE---KSPF 48

Query: 103 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 162
           Y + + R      D+MA++  NGP++ +  VY+DF  YKSGVY H++G ++G HA+K++G
Sbjct: 49  YKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVG 108

Query: 163 WGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           WG  S     YWI AN W   WG DGYF I RG  ECG+ + V +G P+
Sbjct: 109 WGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPA 157


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/212 (38%), Positives = 115/212 (54%), Gaps = 31/212 (14%)

Query: 22  LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
           + LS     +CC  + C   GC+GG P  AWR+F   GVVT            C PY + 
Sbjct: 220 MPLSTQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPY-EI 278

Query: 70  TGCSH------PGCEP---AYPTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
             C+H      P C+       TPKC + C +         +    H + S+Y + S  +
Sbjct: 279 PFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLRSR-D 337

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
            +  ++  +G V  +F VYEDF +YKSGVYKH+ G  +GGHA+K+IGWGT +DGE+YW  
Sbjct: 338 AVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGT-EDGEEYWHA 396

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            N WN  WG  G+FKI+ G  +CG++ ++VAG
Sbjct: 397 VNSWNTYWGDSGHFKIEMG--QCGVDNEMVAG 426


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 95/154 (61%), Gaps = 14/154 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GCDGGY  +AW +    G+ +++CDPY  ++G    G  P   T     K  K       
Sbjct: 148 GCDGGYLNNAWAFLAGTGIPSDKCDPY--TSGNGDVGSCPTSCTDGSAIKLYK------- 198

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
           +K  S++     S  +DI  +I  NGPV+ +F+VY+DF  YKSGVY+H++G + GGHA+K
Sbjct: 199 AKSSSVAQL---SSIDDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIK 255

Query: 160 LIGWGTSDDGED--YWILANQWNRSWGADGYFKI 191
           ++GWG + DG+D  YWI+AN WN +WG +G+F I
Sbjct: 256 IVGWGVTSDGKDTPYWIVANSWNTNWGQEGFFWI 289


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/175 (41%), Positives = 100/175 (57%), Gaps = 9/175 (5%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQL 96
           CDGGY    + Y+V +G+ +    PY    GC  +P     +      KC R+C     L
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPL 172

Query: 97  -WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
            +     +  S+Y +    E+ M AEIY+NGP+  SF VY DF  Y+SGVY+H+TG   G
Sbjct: 173 TYSQDLKHGASSYILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKG 232

Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            HAV++IGWG  ++G  YW+ AN WN  WG +G+FKI RG N  G+E+   AGLP
Sbjct: 233 SHAVRVIGWGV-ENGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLP 286


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 107/203 (52%), Gaps = 22/203 (10%)

Query: 25  SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGC 77
           S   +L+CC   CGDGC+GGY  +AW+Y++  G+VT       E C P+     C+H   
Sbjct: 141 SPQKMLSCCDD-CGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVM 198

Query: 78  EPAYP----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA-EIYKNGP 126
           +   P          TP+C   C   N      K  S    RI+     ++  E+ K+GP
Sbjct: 199 DERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDIS-KGIRIDWHCSGMIRNELKKHGP 257

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
                 VYEDF  YKSG+Y+H+TG ++G   VK+IGWG    G  YW+ AN W  SWG  
Sbjct: 258 ATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVY-RGVQYWLAANSWGTSWGDK 316

Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
           G+FKI+RG NEC  E+  ++G P
Sbjct: 317 GFFKIRRGYNECLFEDYFISGRP 339


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 71/187 (37%), Positives = 101/187 (54%), Gaps = 18/187 (9%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           ++SV DL++C        C+GG    A  Y V  G+ TE C  Y   +G           
Sbjct: 111 AMSVQDLVSC--DKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSG----------R 158

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
            P C  KC   +Q+ R    Y + +++ + +P +IM  + + GP+   F VY DF +Y+S
Sbjct: 159 VPACPSKCDNGSQIIR----YKLQSWK-SVEPSEIMQALMEYGPLSCGFMVYSDFMNYRS 213

Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
           GVY+H +G   GGHAV L GWG  ++G  YW++ N W  +WG  G+FKI RGSN C IE 
Sbjct: 214 GVYQHKSGYFEGGHAVLLCGWGV-ENGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIES 272

Query: 203 DVVAGLP 209
            V  G+P
Sbjct: 273 YVTLGVP 279


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C+GG+  + W++    G  T+EC PY   +      C    PT     KC   +     +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
              S   Y +  D   +M  +  +GP++V+F VY DF +Y+SGVY+H  G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEM 249

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 250 VGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 98/170 (57%), Gaps = 17/170 (10%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW-- 97
           GC GG  ++AWRY    G+  + C PY           +       C +KC  +++ +  
Sbjct: 131 GCGGGIEVNAWRYIDLRGLPLDSCQPY-----------DGNITKYNCSKKCTNESETYEA 179

Query: 98  RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
           + ++++S++ Y   +  E++   I   GPV  S  VY D  +YKSG+Y H  G+ +G HA
Sbjct: 180 QFTEYWSVARY---ASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHA 236

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           V++IGWGT  +G DYWI++N WN +WG +G F IKRG NEC IE+ V AG
Sbjct: 237 VEIIGWGTK-NGIDYWIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAG 285


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 77/184 (41%), Positives = 107/184 (58%), Gaps = 21/184 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS  DL++C       GC+GG   +AW Y  + G+VT+ C PY    G +          
Sbjct: 390 LSPEDLVSCD--RVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDA---------- 437

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           PKC   C K    W  +K+ + SAY +N   E++  EI  +GP++V+F VY+ F  YKSG
Sbjct: 438 PKCETSC-KDGSSW--TKYKAASAYAVNG-VENMQKEIMTHGPIQVAFNVYKSFMSYKSG 493

Query: 144 VYKHITGDVM--GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
           VY     ++M  GGHAVK++GWGT + G+DYW++AN WN SWG +GYFKI  G+    + 
Sbjct: 494 VYAKKWYELMPEGGHAVKIVGWGT-EGGKDYWLVANSWNTSWGDEGYFKIAVGAESISL- 551

Query: 202 EDVV 205
            DVV
Sbjct: 552 -DVV 554


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/180 (40%), Positives = 100/180 (55%), Gaps = 17/180 (9%)

Query: 45  YPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVK 92
           YPI AW+YF   GV T       E C PY     ++  G +  G +P     +C + C  
Sbjct: 158 YPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYG 217

Query: 93  KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGD 151
           K  +   +++ + S Y INS  + I  +I   GPVE SF VY+D + YKSG+Y+      
Sbjct: 218 KTTV--QNRYKTKSEYVINSI-KTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAK 274

Query: 152 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
             GGH++K+IGWG   +G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 275 YQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 114/218 (52%), Gaps = 41/218 (18%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
           LS  ++ AC       GC GG  + AW++    GVVT             + C PY D  
Sbjct: 193 LSPGNVAACSK---TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPY-DIP 248

Query: 71  GCSH-------PGC-EPAYPTPKCVRKCVKK--NQLWRNSKHY----SISAYRINSDPED 116
            C+H       P C +  Y  P C   C  K  +      +H+    S+SA R     + 
Sbjct: 249 PCAHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALR---SIDA 305

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI  NGPV  S+ VY+DF  YKSGVYK  + + +GGHAVK+IGW     GEDYW++ 
Sbjct: 306 IKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGW-----GEDYWLVV 360

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
           N WN++WG +G FKI  G  +CGIE++V+AG P + +L
Sbjct: 361 NSWNKNWGDNGMFKI--GCGQCGIEDNVLAGTPMTSSL 396


>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
          Length = 247

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 81/208 (38%), Positives = 108/208 (51%), Gaps = 23/208 (11%)

Query: 2   SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
           S T ++R  ++S       ++ LS  DL+AC G+    GC+GG    AW Y  + G V +
Sbjct: 61  SETLSDRICIASDKKT---DVILSPEDLVACDGW--NMGCNGGILPWAWSYLTNTGAVED 115

Query: 62  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
            C PY    G            P C +KC      +   K    S  +  S  + I AEI
Sbjct: 116 SCFPYSSDKG----------AVPTCAKKCQNDKDSFTKYKCKKNSVVQA-SGVDKIKAEI 164

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
            KNGP+E  FTVYEDF +Y+SGVY H TG+ +GGHAVK++G+     G+ YWI AN W+ 
Sbjct: 165 SKNGPMETGFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGYWICANSWSE 219

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WG  G+F I  G  ECGI+    A  P
Sbjct: 220 KWGEKGFFNI--GFGECGIDSAAYACTP 245


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 74/179 (41%), Positives = 101/179 (56%), Gaps = 24/179 (13%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSH 74
            + +S  D+++CC + CG GC+GG+PI AW+Y V  GVVT      +EC   ++   C +
Sbjct: 24  QVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEGVVTGGNFGRKECCRSYEIHPCGY 82

Query: 75  PGCEPAY-------PTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMAEIYK 123
            G EP Y        TP C ++C      ++NS    K Y  SAY + +    I  +I +
Sbjct: 83  HGNEPFYGHCHSMARTPPCKKRC---RPGYKNSYMMDKRYGTSAYELPNSVXAIQRDIME 139

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG---TSDDGEDYWILANQW 179
           NGPV   F VYEDF +YKSG+Y+H  G   GGHAVK+IGWG   T +    YWI+AN W
Sbjct: 140 NGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWIIANSW 198


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 115/219 (52%), Gaps = 23/219 (10%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++     SLS  +LL+C       GC GG    AW Y    GVV+E C 
Sbjct: 253 AAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWYLRRRGVVSEPCY 311

Query: 65  PY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISAYRINSDPEDIMA 119
           P+   ++ G S P    +    +  R+      NQ + +++ Y S  AYR+ S  +DIM 
Sbjct: 312 PFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMK 371

Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSD--DG 169
           E+Y+NGPV+    V+EDF  YKSG+Y+               G H+VK+ GWG     DG
Sbjct: 372 ELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDG 431

Query: 170 E--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           +   YW+ AN W R WG DGYF+I RG NEC IE  +V 
Sbjct: 432 QTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 470


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE---- 78
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     S+  C     
Sbjct: 265 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSK 323

Query: 79  -----PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 324 ADGRGKRHATRPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 378

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF HYK+G+Y+H+            +  HAVKL GWGT        E +WI AN W +
Sbjct: 379 HEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARGQKEKFWIAANSWGK 438

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 439 SWGENGYFRILRGVNESDIEKLIIAA 464


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 72/180 (40%), Positives = 101/180 (56%), Gaps = 17/180 (9%)

Query: 45  YPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVK 92
           YPI AW+YF   GV T       E C PY     ++  G +  G +P     +C + C  
Sbjct: 158 YPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYG 217

Query: 93  KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGD 151
           K  +   +++ + S Y +NS  + I  ++   GPVE SF VY+DF+ YKSG+Y+      
Sbjct: 218 KTTV--QNRYKTKSEYVMNSI-KTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAK 274

Query: 152 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
             GGH++K+IGWG   +G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 275 YQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P       ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLSKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +W+ AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWVAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/172 (42%), Positives = 98/172 (56%), Gaps = 20/172 (11%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
           + + +S  D+++CC + CG GC GG+ I AW YF   GVVT         C PY +   C
Sbjct: 23  KQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPY-EIHPC 80

Query: 73  SHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            +   EP Y        TP+C R+C +   + + + KHY  +AY++    E I  EI +N
Sbjct: 81  GYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRN 140

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED---YW 173
           GPV   FTVYEDFAHYK G+YKH +G   GGHAVK+IGWG+   G +   YW
Sbjct: 141 GPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score =  126 bits (317), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 78/221 (35%), Positives = 112/221 (50%), Gaps = 34/221 (15%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+      AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCAKK-RRGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
           SWG +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  126 bits (317), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 106/212 (50%), Gaps = 36/212 (16%)

Query: 23  SLSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +LS  +L++C     GD    GCDGG    AW + +  G+VT       E C PY  +  
Sbjct: 117 NLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPY-KNRP 170

Query: 72  CSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSDPED 116
           C H G      C     T    C  KCV KN        L++ S  Y  S     ++ + 
Sbjct: 171 CDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TNVKQ 226

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI   GPV     VYE+F  YK GVYK   G+++G H VKLIGWG  + G +YW+  
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAM 286

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           N WN +WG DG FKI RG N C IE  V+AGL
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIELLVMAGL 318


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 122/225 (54%), Gaps = 27/225 (12%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLL CC   CG GC+GGYP +AW ++   G+V+     
Sbjct: 117 SDRLCIHSNGKVSVE---ISSEDLLTCCDS-CGMGCNGGYPSAAWDFWTDVGLVSGGLYD 172

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP+C+ +C       ++  KHY  S+Y + 
Sbjct: 173 SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVP 232

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+K      S  GE+
Sbjct: 233 SDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAIK------SWLGEE 286

Query: 172 YWILAN--QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
              L      +  WG D       GS+ CGIE ++VAG+P +++ 
Sbjct: 287 VCSLLALCHSDTDWG-DMVSLSSAGSDHCGIESEIVAGIPITQSF 330


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 67/160 (41%), Positives = 89/160 (55%), Gaps = 17/160 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
            +++S +D+L+CCG  CG+GC+GGYPI AW+Y+V  G+ T         C PY     C 
Sbjct: 24  QVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTGICTGGSYESQSGCKPY-PIPPCG 82

Query: 74  H--------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
           H        P     Y TP C  KC+   +  + + KHY  SAY +      I  EI  N
Sbjct: 83  HHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTN 142

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
           GPVE ++TVYEDF  Y  GVY H  G  +GGHAV+++GWG
Sbjct: 143 GPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRILGWG 182


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 87/214 (40%), Positives = 112/214 (52%), Gaps = 27/214 (12%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ SS    +   +LS   LL+C       GC GG+   AW +    GVV+ +C P
Sbjct: 215 SDRLAIQSSGETGM---TLSPQHLLSC-NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYP 270

Query: 66  YF----DSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
           Y     D  G C  PG  P+     C     + N+L     H+S   YRI ++  +I  E
Sbjct: 271 YTSGDQDKKGVCMMPGKLPS----DCPTGRERNNEL-----HHSTPPYRIAANEREIQVE 321

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHA-----VKLIGWGTSDDGEDY 172
           I +NGPV+ SF V EDF  Y SGVY+H    + D    HA     VKL+GWG  ++G  Y
Sbjct: 322 IMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGV-ENGIKY 380

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           W+ AN W   WG DGYFKI RG NEC IE  VVA
Sbjct: 381 WLGANSWGTKWGEDGYFKILRGENECNIESYVVA 414


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++  C  A  
Sbjct: 271 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNDCAMASR 329

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 330 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 384

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           ++DF HYK G+Y+H+T           +  HA+KL GWGT        E +WI AN W +
Sbjct: 385 HDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQGRKEKFWIAANSWGK 444

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 445 SWGENGYFRILRGVNESDIEKLIIAA 470


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score =  126 bits (316), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 73/191 (38%), Positives = 103/191 (53%), Gaps = 24/191 (12%)

Query: 34  GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCE 78
           G +C DGC  G P +AW +   +G+ TE        C PY +   C H        P  E
Sbjct: 152 GHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPY-NFPKCGHHQQDSKYQPCPE 210

Query: 79  PAYPTPKCVRKCVKKN--QLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
             Y TP C+ +C  KN        +H++   S Y++    ++I  EI  NGP   +F++Y
Sbjct: 211 KNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT-DNIKKEIMTNGPTSAAFSMY 269

Query: 135 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
           +DF  Y+SGVYKH +G +MG H V++IGWGT   G DYW++ N WN  WG  G FKI +G
Sbjct: 270 DDFLSYESGVYKHTSGTLMGEHGVEIIGWGTK-QGVDYWLVMNSWNEGWGVHGTFKIAQG 328

Query: 195 SNECGIEEDVV 205
             +CGI +  +
Sbjct: 329 --DCGINDMAI 337


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  126 bits (316), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 66/172 (38%), Positives = 97/172 (56%), Gaps = 10/172 (5%)

Query: 48  SAWRYFVHHGVVTEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNS 100
           S + Y+   G+ T    PY D + C          C     TP C   C     +   + 
Sbjct: 346 SPFNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPISDD 403

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           K Y+   Y ++S+  +IM EIY +GPV   F VYEDF +Y SG+Y+  T   MGGHA+++
Sbjct: 404 KFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRI 463

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
           IGWG  ++G  YW++AN WN ++G  G+F+I+RG+NEC IE +V  G+P  +
Sbjct: 464 IGWG-EENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKLR 514



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 11/131 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 93
           GC  G   +A+ Y+   G+VT      + C   +  + C+   C P    PKC R C   
Sbjct: 69  GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTM--CRPYMLAPKCQRTCQAS 126

Query: 94  NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
             L  +  K+Y  S Y +N D  DIM EIY+ GPV   F VY DF +Y SG +  I G+ 
Sbjct: 127 YNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGNK 184

Query: 153 MGGHAVKLIGW 163
                  L  W
Sbjct: 185 RCEEEENLTSW 195


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score =  126 bits (316), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCT-KNRHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGCAMASR 325

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N +++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNVIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGTSDDG----EDYWILANQWNR 181
           +EDF HYK+G+Y+H+            +  HAVKL GWG         E +W+ AN W +
Sbjct: 381 HEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGK 440

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG DGYF+I RG NE  IE+ ++A 
Sbjct: 441 SWGEDGYFRILRGVNESDIEKLIIAA 466


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C+GG+  + W++    G  T+EC PY   +      C    PT     KC   +     +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
              S   Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEM 249

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 250 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC GG    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCARK-RHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATN-GCAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+   IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATTPCPNHIEKSNRIYQCS-----PPYRVSSNETQIMKEIMQNGPVQAIMKV 379

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF  YK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 HEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYFKI RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFKILRGVNESDIEKLIIAA 465


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score =  125 bits (315), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 112/215 (52%), Gaps = 20/215 (9%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++     SLS  +LL+C       GC GG    AW Y    GVV+E C 
Sbjct: 268 AAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWYLRRRGVVSEPCY 326

Query: 65  PY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISAYRINSDPEDIMA 119
           P+   ++ G S P    +    +  R+      NQ + +++ Y S  AYR+ S  +DIM 
Sbjct: 327 PFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMK 386

Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSDDGED 171
           E+Y+NGPV+    V+EDF  YKSG+Y+H              G H+VK+ G G       
Sbjct: 387 ELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITG-GRDGQTHK 445

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           YW+ AN W R WG DGYF+I RG NEC IE  +V 
Sbjct: 446 YWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 480


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score =  125 bits (315), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 76/104 (73%), Gaps = 1/104 (0%)

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
           S+Y +     DIM EI KNGPV+  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG 
Sbjct: 10  SSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV 69

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            ++G  YW++AN WN  WG  GYF+++RG+NECGIE  + AGLP
Sbjct: 70  -ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/244 (30%), Positives = 116/244 (47%), Gaps = 54/244 (22%)

Query: 19  LQNLSLSVNDLLACCG--FLCGDG------------------------------------ 40
           + N  LS  +LL+CC   F CG+G                                    
Sbjct: 129 MINTVLSAQELLSCCTGVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREK 188

Query: 41  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKC 86
           C GG    AW+Y+  HG+ T         C PY  S         + PGC      TP C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248

Query: 87  VRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 145
            +KC     +  +  +HY +S  ++ +   +I +++  NGP+  +  VY+DF  Y +G+Y
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIY 308

Query: 146 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 205
            H+TG+  G  +V+++GWG   +G  YW+LAN W + WG +G F++ RG NECG+E + V
Sbjct: 309 VHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCV 367

Query: 206 AGLP 209
           +G+P
Sbjct: 368 SGMP 371


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 255 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 312

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 313 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 367

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 368 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 427

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 428 SWGENGYFRILRGVNESDIEKLIIAA 453


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/201 (38%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           +LS  +L++C       GC+GG    AWRY   HGVV+  C P F +           Y 
Sbjct: 272 NLSAQNLISC-DTRNQHGCNGGSIDGAWRYLKTHGVVSYACYPSFWNKHLGPSAENQCYV 330

Query: 83  TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
           + +         C     K N+L+R + HY     R++S   DIM EI   GPV+    V
Sbjct: 331 SNEYGKNHTNGPCPNAFEKSNRLYRCASHY-----RVSSKETDIMKEIKDRGPVQAIMKV 385

Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWGADG 187
           YEDF  YK G+Y+H    G     H+VKL+GWG   D     + +WI AN W +SWG +G
Sbjct: 386 YEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENG 445

Query: 188 YFKIKRGSNECGIEEDVVAGL 208
           YF+I RG NEC IE+ ++A L
Sbjct: 446 YFRILRGQNECDIEKLILATL 466


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/152 (44%), Positives = 91/152 (59%), Gaps = 11/152 (7%)

Query: 62  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 117
           EC  + D+ G     C+   P+P C   C  +N  ++ S    +H++        + ++I
Sbjct: 229 ECSHHVDTKGME--PCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEI 284

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
             EI  NGPV  +FTVYEDF +YKSGVYKH+ G  +GGHAVK+IGWG  D  E YW++ N
Sbjct: 285 KREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGI-DQNEQYWLVMN 343

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WN +WG  G FKI  G  ECGI+ +V AG+P
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 373


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 73/185 (39%), Positives = 94/185 (50%), Gaps = 21/185 (11%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQ 95
           GC GG    AW Y   +G+V+  C P F      T C       A    + ++ C  +  
Sbjct: 290 GCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQTSCEMSSVFDAEGKRQAIQPCPNR-- 347

Query: 96  LWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI----- 148
            W  S H       YRI+S   DIM EI +NGPV+    VY+DF  YKSG+YKHI     
Sbjct: 348 -WEPSNHIYQCGLPYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEG 406

Query: 149 ---TGDVMGGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGIE 201
                     H++K++GWGT  D E     +WI AN W  SWG +GYF+I RG NEC IE
Sbjct: 407 KTQNRHQKKPHSIKIVGWGTLRDAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIE 466

Query: 202 EDVVA 206
           + V+A
Sbjct: 467 KTVIA 471


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
            EDF HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 57/92 (61%), Positives = 71/92 (77%), Gaps = 1/92 (1%)

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
           MAEI K GPVE +FTVY DF  YKSGVY+H TG+ +GGHA+K++GWG ++DG DYW++AN
Sbjct: 1   MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWG-NEDGHDYWLVAN 59

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            WN  WG  G+FKI RG +ECGIE  + AG P
Sbjct: 60  SWNEDWGDQGFFKILRGVDECGIESQITAGSP 91


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/163 (39%), Positives = 97/163 (59%), Gaps = 18/163 (11%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
           +N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC       
Sbjct: 77  KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAP 133

Query: 73  -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
                  +   C+    TPKCV+KC    ++ +    H+  SAY +++D + I  EIY N
Sbjct: 134 CEHHVNGTRGPCKEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTN 193

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
           GPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +
Sbjct: 194 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 106/195 (54%), Gaps = 13/195 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           QN++LS    L+C       GC+GGY   AW Y    GVV+EEC PY   T      C  
Sbjct: 127 QNVALSAQQFLSCNQHR-QKGCEGGYLDRAWWYIRKFGVVSEECYPYISGTTRKPEICYM 185

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                   R+C   +    NS+ Y  + +YR++S  +DIM+EI  NGPV+ +F V+ DF 
Sbjct: 186 QKSKHANGRQCPSGHP---NSRVYRTTPSYRVSSREQDIMSEILTNGPVQATFRVHGDF- 241

Query: 139 HYKSGVYKHITG---DVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIK 192
            + +GVYKH+     ++ G H+V+L+GWG   ++     YWI AN W  +WG +G F+I 
Sbjct: 242 -FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGENGTFRIL 300

Query: 193 RGSNECGIEEDVVAG 207
           RG N C IE  V+  
Sbjct: 301 RGENHCEIESFVIGA 315


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 72/200 (36%), Positives = 101/200 (50%), Gaps = 15/200 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +  + LS  DL++C        C GG+P   WR+ +++G V+EEC PY      ++  C 
Sbjct: 246 VDKVELSPQDLMSCLNGGRRVVCQGGHPDRGWRFLLNYGGVSEECYPYEGVHSSANATCR 305

Query: 79  -PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
            P    P    +C          KH+S   YR+ ++ EDIM EIY NGPV+    V EDF
Sbjct: 306 IPRRRDPIEDARCPTGRT---EQKHFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDF 362

Query: 138 AHYKSGVYKHI--------TGDVMGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGAD 186
             Y+SGVY+H              G H+V+++GWG          YW+ AN W   WG +
Sbjct: 363 FLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYRPIKYWLCANSWGHGWGEN 422

Query: 187 GYFKIKRGSNECGIEEDVVA 206
           GYF+I RG +E  IE  V+A
Sbjct: 423 GYFRIVRGEDESQIESFVLA 442


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 57/102 (55%), Positives = 71/102 (69%), Gaps = 1/102 (0%)

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
           SAY +      I  EI  NGPV   FT+YED   YKSGVY+H  G ++GGHA+K+IGWGT
Sbjct: 113 SAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 172

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
             +G  YW++AN W   WG +G+FKI+RG NECGIE +VVAG
Sbjct: 173 -QNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 213


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 76/201 (37%), Positives = 102/201 (50%), Gaps = 21/201 (10%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           +LSV +L++C       GC GG    AWRY   HGVV+  C P F       P     Y 
Sbjct: 272 NLSVQNLISC-DTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPSFWKHSLDSPSENHCYV 330

Query: 83  TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
           + +         C       N+L+R + HY     RI+S   DIM EI   GPV+    V
Sbjct: 331 SSEYGKNHTNGPCPNALEDSNRLYRCASHY-----RISSKETDIMEEIMAKGPVQAIMKV 385

Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADG 187
           YEDF  YK G+Y+H    G     H+VKL+GWG+    +   + +WI AN W + WG +G
Sbjct: 386 YEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENG 445

Query: 188 YFKIKRGSNECGIEEDVVAGL 208
           YF+I RG NEC IE+ ++  L
Sbjct: 446 YFRILRGQNECDIEKLILTTL 466


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/189 (39%), Positives = 102/189 (53%), Gaps = 18/189 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS  DL+ C  +    GC+GG P   + Y    G+V++ C PY    G +H  C P 
Sbjct: 50  NVVLSPQDLVTCSWY--SFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKC-PD 106

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED-------IMAEIYKNGPVEVSFTV 133
           +    C      K + +++ KH++   Y +    ED       I  EI  +GPV   F V
Sbjct: 107 F----CYNN---KTKSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMV 159

Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           Y DF  YKSGVY+H TG   G HAVK+IGWGT ++G DYW++AN W  ++G  G+FKI R
Sbjct: 160 YSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGT-ENGVDYWLIANSWGTTFGLQGFFKIVR 218

Query: 194 GSNECGIEE 202
           G     +EE
Sbjct: 219 GGKFIHLEE 227


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/201 (37%), Positives = 102/201 (50%), Gaps = 21/201 (10%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           +LSV +L++C       GC+GG    AWRY   HGVV+  C P F       P     Y 
Sbjct: 272 NLSVQNLISC-DTGNQRGCNGGSIDGAWRYLTTHGVVSYACYPSFWKHHLDSPSENQCYV 330

Query: 83  TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
           + +         C       N+L+R   HY     R++S   DIM EI   GPV+    V
Sbjct: 331 SSEYGKNHTNGPCPNALEDSNRLYRCGSHY-----RVSSKETDIMEEIMAKGPVQAIMKV 385

Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADG 187
           YEDF  YK G+Y+H    G     H+VKL+GWG+    +   + +WI AN W + WG +G
Sbjct: 386 YEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENG 445

Query: 188 YFKIKRGSNECGIEEDVVAGL 208
           YF+I RG NEC IE+ ++  L
Sbjct: 446 YFRILRGQNECDIEKLILTTL 466


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 99/177 (55%), Gaps = 22/177 (12%)

Query: 39  DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-- 96
           +GC+GG   +A+++    G+V++ C PY    G            P C   C     +  
Sbjct: 189 NGCNGGEFPTAFQFVETTGLVSDGCVPYQSGNGF----------VPPCPNSCANGEDINV 238

Query: 97  ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
               +NS+++ ++      D + + A I  NGPV   F VY DF +Y+SG YKH+ G ++
Sbjct: 239 RYRTKNSRNFDVN------DMKSVQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLV 291

Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
           GGHA+K++GWG +     YWI+AN W+  WG +GYF I RG+NEC IEE++   +P+
Sbjct: 292 GGHAIKVVGWGVTQSNVPYWIVANSWSDEWGMNGYFWILRGTNECSIEENMWETIPA 348


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 93/170 (54%), Gaps = 14/170 (8%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---- 74
           +++L +S  DLL+CC   CG GC+GG P  AW Y+V  G+V+E C PY     C+H    
Sbjct: 44  VRDLRISAGDLLSCCN-ACGLGCNGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNS 101

Query: 75  ---PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
                C   Y TP C   C       +     S S     S  ED   E++  GP EV+F
Sbjct: 102 THYTPCSVEYDTPFCNITCTNTIPPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAF 157

Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           TVYEDF  Y  GVYKH +G+ +GGHAV+L+GWG   +G  YW +AN WN 
Sbjct: 158 TVYEDFVAYSDGVYKHFSGNALGGHAVRLVGWGNL-NGTPYWKIANSWNH 206


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 68/166 (40%), Positives = 92/166 (55%), Gaps = 19/166 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
           L+  D L+CC + CG GC GGYP  AW Y++  G+VT         C P+   T C H G
Sbjct: 116 LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 173

Query: 77  -------CEP-AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                  C    YP P C R C    N+ +   K Y  S+Y +      IM EI KNGPV
Sbjct: 174 DSRKYSRCPHYTYPKPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 233

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
           EV+F +++DF  Y+SG+Y H+ G  +G HAV++IGWG  ++G +YW
Sbjct: 234 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYW 278


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 114/231 (49%), Gaps = 46/231 (19%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++      LS  +L++C     G GC GG    AW Y    GVVTE+C 
Sbjct: 259 AAVASDRISIQSMGHMTPRLSPQNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTEDCY 317

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----------------QLWRNSKHYSISA 107
           PY           +P + TP  V +C+ ++                 Q + N  + S   
Sbjct: 318 PY-----------QPPHQTPAEVGRCMMQSRSVGRGKRQATQRCPNTQNYHNDIYQSTPP 366

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
           YR++S+ ++IM EI  NGPV+    V+EDF  YK+G+YKH              G H+V+
Sbjct: 367 YRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVR 426

Query: 160 LIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           + GWG   +       YWI AN W ++WG +GYF+I RG NEC IE  V+ 
Sbjct: 427 ITGWGEDRNVDGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIG 477


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 112/208 (53%), Gaps = 20/208 (9%)

Query: 16  YVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 75
           Y   Q   LS  +L +CC   CG GC+GG+P+ A++Y+   GV T    PY   +GC   
Sbjct: 139 YKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKYWNEIGVPTG--GPYGSKSGCKPF 195

Query: 76  GCEP------AYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE---DIMAEIYKN 124
              P      A  TP C  KC+   K +L ++ ++Y  S Y I S  +    I  EI  +
Sbjct: 196 SIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYGESYYLITSSNQPVKTIQREIMDH 254

Query: 125 GPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
           GPV  +  ++E F +YKSGVY   K      +G HAVKLIGWG       YW++ N WN 
Sbjct: 255 GPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWGEQKR-IPYWLVVNSWNT 313

Query: 182 SWGADGYFKIKRGSNECGIEE-DVVAGL 208
           ++G  G FKI+RG+NECGIE   V AGL
Sbjct: 314 TFGEQGLFKIRRGTNECGIENLHVTAGL 341


>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
          Length = 351

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 72/192 (37%), Positives = 101/192 (52%), Gaps = 24/192 (12%)

Query: 21  NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           N  LS  D++ C    F    GC+GGY ++A  Y ++ GV  E C PY D T        
Sbjct: 168 NEELSPQDMVDCSHDNF----GCEGGYLMNALDYLMNEGVTKESCTPYKDKTN------- 216

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                 KC   C  K + +   KHY      R+ ++ E I  ++ +NGP+ V  TVYEDF
Sbjct: 217 ------KCQYTCQNKTEEFH--KHYCKPGTLRVLTNEEQIKRDLMQNGPLMVGLTVYEDF 268

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
            +Y +G YK + G+++GGHAVKL+GW T+  G+  W++ NQWN  WG  G+  I    NE
Sbjct: 269 INYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTSWLIQNQWNDDWGEQGFGYIL--ENE 326

Query: 198 CGIEEDVVAGLP 209
            GI+   V   P
Sbjct: 327 VGIDSIGVGCTP 338


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 92/156 (58%), Gaps = 15/156 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
           Q+  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT         C PY      
Sbjct: 139 QSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197

Query: 68  DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
             T   +P C    Y TP+C +KC K  +  +   KHY   +Y + S+ + I  EI  NG
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNG 257

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 161
           PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++I
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score =  123 bits (308), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++  C  A  
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKEQSTNNNSCAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YRI+S+  +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATRPCPNSFEKSNRIYQCS-----PPYRISSNETEIMREIIQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+H+            +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 71/166 (42%), Positives = 96/166 (57%), Gaps = 19/166 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           LS  D+++CC + CG GC+GG P  +W Y+   GVVT         C PY     CSH  
Sbjct: 116 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 173

Query: 75  --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
             PG  P     YPTPKC +KC    N+ +   K    S+Y +     DIM EI KNGPV
Sbjct: 174 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPV 233

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
           +  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG  ++G +YW
Sbjct: 234 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 103/200 (51%), Gaps = 18/200 (9%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPA 80
           SLS  +LL+C       GC GG    AW Y    GVVT+EC P+   DS   + P    +
Sbjct: 252 SLSPQNLLSC-DTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHS 310

Query: 81  YPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             T +  R+   +    Q   N  + S  AYR+    ++IM E+ +NGPV+    V+EDF
Sbjct: 311 RSTGRGKRQATARCPNPQTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDF 370

Query: 138 AHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGA 185
             YKSG+Y+H              G H+VK+ GWG     DG+   YW  AN W R+WG 
Sbjct: 371 FLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAANSWGRAWGE 430

Query: 186 DGYFKIKRGSNECGIEEDVV 205
           DG+F+I RG NEC +E  VV
Sbjct: 431 DGHFRIARGVNECEVESFVV 450


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 72/179 (40%), Positives = 92/179 (51%), Gaps = 18/179 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
           + +++S  D+L CC + CG GC GG+PI AW Y    G VT      + C        C 
Sbjct: 143 KQVNISATDILTCC-YKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCG 201

Query: 74  HPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
           H G E  Y        TPKC   C    KN  + + K     AY + +  + I  EI KN
Sbjct: 202 HHGNETYYGECGGRARTPKCRTSCTPGYKNS-YSDDKIRGKDAYELPNSVKAIQREIMKN 260

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
           GPV  +FTVY DF++YK G+YKH  G   G HAVK+IGWG   D   YWI+ N W+  W
Sbjct: 261 GPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGD-VPYWIVKNSWHNDW 318


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/170 (40%), Positives = 83/170 (48%), Gaps = 14/170 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
            C GGY   AW +    G   + C PY         G  PA        KC    Q   +
Sbjct: 197 ACQGGYLKYAWSFLERTGTTVDSCIPYASGRATFSSGTCPA--------KCKVSTQ---S 245

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
              Y     R  S   +I A I   G V+  FT+Y DF  Y+SGVYKH++   +GGHAV 
Sbjct: 246 MTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVA 305

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           LIGWG  + G +YW+  N W  +WG  GYFKI +G  ECGIE  V AG P
Sbjct: 306 LIGWGV-ESGTNYWLAVNSWGSNWGMSGYFKIAQG--ECGIENQVYAGEP 352


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++  C  A  
Sbjct: 266 NLSPQNLISCCAKK-RHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIRNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+H+            +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 102/192 (53%), Gaps = 24/192 (12%)

Query: 21  NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC 77
           N  LS  DL++C    F    GC GG    +  + ++ G+V+E+C PY +  T C     
Sbjct: 173 NEDLSPQDLVSCSYENF----GCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQ 228

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
               P  K    C +K+ L             I SD E+I  E+  NGP+ V  +VYED 
Sbjct: 229 NDKQPYTKYF--CEQKSML-------------ILSDIEEIQLELMTNGPMMVGLSVYEDL 273

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
            +YK GVY++ TG+ +GGHA+K+IGWG ++ GE +W   NQW + WG  GY  IK G  E
Sbjct: 274 MNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQNQWGKDWGMGGYINIKAG--E 331

Query: 198 CGIEEDVVAGLP 209
            G++  V+  +P
Sbjct: 332 LGMDTMVLGCMP 343


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/179 (37%), Positives = 90/179 (50%), Gaps = 12/179 (6%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
           +P  AWRY+V +G+ +  C PY     C H G +          + TP+C   C  K   
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
               K+    AY +    E+   E+Y NGP      VY D   YKSGVY+++ G  MG  
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/211 (36%), Positives = 101/211 (47%), Gaps = 38/211 (18%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+ LS   LL+C       GC+GGY   AW Y    GVV+E C PY +S     PG    
Sbjct: 232 NIPLSAQQLLSCNQHR-QRGCEGGYLDRAWWYIRKLGVVSELCYPY-ESGATQQPG---- 285

Query: 81  YPTPKCVRKCVKKNQLWRNSKH------------YSISA-YRINSDPEDIMAEIYKNGPV 127
                   +C      +R   H            Y ++  YR++S  +DIM EI  NGPV
Sbjct: 286 --------ECRIPKSAYRTGAHIDCPSGAADPSVYRMTPPYRVSSREQDIMTEIITNGPV 337

Query: 128 EVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG---TSDDGEDYWILA 176
           + +F VYEDF  Y  GVY+H+           V G H+V++IGWG   ++     YW+ A
Sbjct: 338 QATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGPQVKYWLAA 397

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           N W   WG DG F+I RG N C IE  V+  
Sbjct: 398 NSWGNEWGEDGLFRILRGENHCEIESFVIGA 428


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++  C  A  
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+H+            +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +L++CC      GC+ G    AW +    G+V+  C P F     ++  C  A  
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324

Query: 81  -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
                  + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQV 379

Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
           +EDF +YK+G+Y+H+            +  HAVKL GWGT        E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGK 439

Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
           SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 12/200 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + ++LS   LL+C        C+GGY   AW Y    G+V E+C PY      ++  C  
Sbjct: 126 EKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRI 180

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                     C     + R SK+    AYR+ ++  DIM EI  +GPV+ +  VY DF  
Sbjct: 181 PRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFT 239

Query: 140 YKSGVYKHI---TGDVMGGHAVKLIGWGT--SDDG-EDYWILANQWNRSWGADGYFKIKR 193
           YK G+Y+H    T D  G H+V+++GWG   S +G + YW +AN W   WG +GYF+I R
Sbjct: 240 YKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILR 299

Query: 194 GSNECGIEEDVVAGLPSSKN 213
           GSNEC IE  V+      +N
Sbjct: 300 GSNECEIESFVLGTWAEVEN 319


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/203 (37%), Positives = 99/203 (48%), Gaps = 31/203 (15%)

Query: 39  DGCDGGYPISAWRYFVHHGVVTEE------------CDPYFDSTGCSHPGCE-------- 78
           DGCDGG  I+ W Y    G VT              C  +F +  C H G          
Sbjct: 90  DGCDGGQIITPWTYVAKAGAVTGGQYNGTGPFGAGLCADWF-APHCHHHGPRGDDPYPAE 148

Query: 79  -----PAYPTPKCVRKC----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                P+  +P+  + C       +  +   KH      +  S    IMA I + GPVE 
Sbjct: 149 GDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVET 208

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
           +FTVYEDF +Y  G+Y H+TG+  GGHAVK +GWG  ++G  YW +AN WN  WG  GYF
Sbjct: 209 AFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGV-ENGTKYWKVANSWNPYWGEAGYF 267

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I RGSNE GIE+ V      +K
Sbjct: 268 RILRGSNEGGIEDQVTGSHADAK 290


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 112/231 (48%), Gaps = 46/231 (19%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++      LS  +L++C     G GC GG    AW Y    GVVTE+C 
Sbjct: 40  AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRLDGAWWYLRRRGVVTEDCY 98

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
           PY            P   TP  + +C+ +++                  ++N  + S   
Sbjct: 99  PY-----------RPPQQTPAELSRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPP 147

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
           YR+++  ++IM EI  NGPV+    V+EDF  Y SG+YKH              G H+VK
Sbjct: 148 YRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVK 207

Query: 160 LIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           + GWG     DG    YWI AN W ++WG +GYF+I RG NEC IE  V+ 
Sbjct: 208 ITGWGEERNFDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIG 258


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/179 (37%), Positives = 90/179 (50%), Gaps = 12/179 (6%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
           +P  AWRY+V +G+ +  C PY     C H G +          + TP+C   C  K   
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
               K+    AY +    E+   E+Y NGP      VY D   YKSGVY+++ G  MG  
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 88/157 (56%), Gaps = 14/157 (8%)

Query: 63  CDPYFDSTGCSH-------PGCEPA-YPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 112
           C PY D   C+H       P C    YPTP CV +C   K     R+ +H+ + +   + 
Sbjct: 3   CWPY-DFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
              D    I  +GPV  SFTVYEDF  Y+SGVYKH +G  +GGHAVK+IGWG    G+ Y
Sbjct: 62  SVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK-SGQAY 120

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           W+  N WN  WG  G FKI  G+  CGI++D++ G P
Sbjct: 121 WLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTP 155


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 112/236 (47%), Gaps = 49/236 (20%)

Query: 21  NLSLSVNDLLACCGFL--CGD-GCDGGYPISAWRYFVHHGVVT-------------EECD 64
           N  LS  ++LACC  +  C   GC GG   +AW +   HG+VT             + C 
Sbjct: 110 NQLLSAGEMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCW 169

Query: 65  PY------FDSTGCSHPGC---------------------EPAYPTPKCVRKCV--KKNQ 95
           PY       D     +  C                     +  Y TP C+ +C   K   
Sbjct: 170 PYSFPKCAHDQEDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGT 229

Query: 96  LWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
                +H++  A   +    ++I  EI  NGP   SF+ YEDF+ YKSGVYKH +G  +G
Sbjct: 230 PRDKDRHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLG 289

Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
            H+V++IGWGT + G DYW++ N WN  WG  G FKI +G  +CGI++ V   LP+
Sbjct: 290 DHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPA 342


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/178 (36%), Positives = 97/178 (54%), Gaps = 17/178 (9%)

Query: 18  SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDST 70
           S   + +S +D+L+CCG  CG GC GG+ I A+++                 C P   S 
Sbjct: 21  STIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRERCCYRWENTDRRVCKPVRPSI 80

Query: 71  GCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEI 121
              +   +P Y        PTPKC + C +K  + ++  KH++  AY + ++   I  EI
Sbjct: 81  RVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEI 140

Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
           YKNGPV  +F VY+DF++YK G+Y H  G   G HAVK++GWG  ++  DYW++AN W
Sbjct: 141 YKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWLIANSW 197


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 98/182 (53%), Gaps = 19/182 (10%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
           N  LS  ++  CC   CG+GC+GGYPI AW+ F +HG+VT       E C+PY      +
Sbjct: 78  NQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY 136

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
           D  G +    +P  P  KC +KC     +  N  H Y+   Y +      I  ++   GP
Sbjct: 137 DKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGP 194

Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
           +E SF VY+DF +YKSG+Y K      +GGH+VKLIGWG  + G  YW++ N WN  WG 
Sbjct: 195 IEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGD 253

Query: 186 DG 187
            G
Sbjct: 254 KG 255


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 114/231 (49%), Gaps = 46/231 (19%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++      LS  +L++C     G GC GG    AW +    GVVTE+C 
Sbjct: 237 AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCTGGRIDGAWWFLRRRGVVTEDCY 295

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
           PY            P   TP  + +C+ +++                  ++N  + S   
Sbjct: 296 PY-----------RPPQQTPAELGRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPP 344

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
           YR++++ ++IM EI  NGPV+    V+EDF  YKSG+YKH              G H+VK
Sbjct: 345 YRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVK 404

Query: 160 LIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           + GWG     DG    YWI AN W ++WG +GYF+I RG NEC IE  V+ 
Sbjct: 405 ITGWGEERNVDGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIG 455


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 95/170 (55%), Gaps = 20/170 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
            ++LS  DLL+CC   CG GC+GG P+SAW+++V  G+VT         C PY     C 
Sbjct: 24  QVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPY-PFPACE 81

Query: 74  H--------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           H        P     +PTPKC + C      + ++  K++  SAY + +  E I  EI  
Sbjct: 82  HHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIIT 141

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
            GPVEV+F VYEDF +Y  G+Y H  G + GGHAVK+IGWG  D+G  YW
Sbjct: 142 YGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKMIGWGI-DNGVPYW 190


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 103/200 (51%), Gaps = 17/200 (8%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--- 80
           LS+ +LLAC       GC+GG+   AW Y    GVV EEC PY          C+     
Sbjct: 236 LSMQNLLAC-NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRG 294

Query: 81  -YPTPKCV------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
              T KC       RK  + ++  R     S  AYRI    +DIM EI ++GPV+ +  V
Sbjct: 295 NLATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRV 354

Query: 134 YEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGED---YWILANQWNRSWGADG 187
           + DF  Y+ GVY++   +     G H+V+++GWG      +   YW++AN W R WG DG
Sbjct: 355 HPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDG 414

Query: 188 YFKIKRGSNECGIEEDVVAG 207
           YF+I RG NE  IE+ V+A 
Sbjct: 415 YFRIVRGENESDIEKFVLAA 434


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 106/194 (54%), Gaps = 26/194 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           LS  ++++C  +    GC+GG+P +   +Y    G+V EEC PY             AY 
Sbjct: 389 LSPQEIVSCSEY--SQGCEGGFPYLIGGKYAQDFGLVEEECFPY------------QAYD 434

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
           +P   +KC +    +  S+++ +  +    +   +  E+ +NGP+ V+F VY+DF HY++
Sbjct: 435 SPCTPKKCSR----YYTSEYHYVGGFYGGCNEALMKHELIQNGPLTVAFEVYDDFIHYRT 490

Query: 143 GVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGS 195
           G+Y H           +  HAV L+G+GT +  GEDYWI+ N W  SWG +GYF+I RG+
Sbjct: 491 GIYHHTGLRDNFNPFELTNHAVLLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGT 550

Query: 196 NECGIEEDVVAGLP 209
           +EC IE   VA  P
Sbjct: 551 DECAIESIAVAATP 564


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 15/207 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + + L+   +++C       GC GG+  +AW Y    G V EEC PY  +    H  C+ 
Sbjct: 232 ETVQLAPQQIVSCVRR--SQGCSGGHLDTAWSYLRKVGTVNEECYPYISA----HNVCKI 285

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                     C    ++ R + +    A+ +N++  DIM EI K+GPV+    V+ DF  
Sbjct: 286 RPSDTLITANCELPMKVDRTNMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFS 344

Query: 140 YKSGVYKHITGDV-----MGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
           YKSG+Y+H           G H+V+LIGWG    G +   YWI  N W   WG +G F+I
Sbjct: 345 YKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTWWGENGRFRI 404

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
            RGSNEC IE  V+A LP     VK++
Sbjct: 405 LRGSNECEIESYVLASLPYVHQQVKDL 431


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 45/224 (20%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
           LS  ++ AC       GCDGG P  AW +  + G+ T             + C PY D  
Sbjct: 196 LSAGEMNACAPSF---GCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPY-DFP 251

Query: 71  GCSH-------PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
            C+H       P C + +Y TP C  +C   K     R+ +H+ + +        D    
Sbjct: 252 PCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNA 311

Query: 121 IYKNGPV---------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
           I  +GPV                 SF VYEDF  Y+SGVYKH +G  +GGHAVK+IGWG 
Sbjct: 312 IRTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWG- 370

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            + G+ YW++ N WN  WG +G FKI  G+  C I++D++ G P
Sbjct: 371 EETGQAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTP 412


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 12/200 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + ++LS   LL+C        C+GGY   AW Y    G+V E+C PY      ++  C  
Sbjct: 252 EKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRI 306

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                     C     + R SK+    AYR+ ++  DIM EI  +GPV+ +  VY DF  
Sbjct: 307 PRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGNE-TDIMYEILHSGPVQATMKVYHDFFT 365

Query: 140 YKSGVYKHI---TGDVMGGHAVKLIGWGT--SDDG-EDYWILANQWNRSWGADGYFKIKR 193
           YK G+Y+H    T D  G H+V+++GWG   S +G + YW +AN W   WG +GYF+I R
Sbjct: 366 YKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILR 425

Query: 194 GSNECGIEEDVVAGLPSSKN 213
           GSNEC IE  V+      +N
Sbjct: 426 GSNECEIESFVLGTWAEVEN 445


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 76/203 (37%), Positives = 107/203 (52%), Gaps = 22/203 (10%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
           +LS  +LL+C       GC+GG    AW +    GVVT+EC P F +   +H    PA  
Sbjct: 304 ALSPQNLLSC-NTRHQQGCNGGRIDGAWWFLRRRGVVTDECYP-FSNQETNHSPNAPACM 361

Query: 81  ---YPTPKCVRKCVKKNQLWR---NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
                T +  R+ + +    R   N  + S  AYR++S+ ++IM E+ +NGPV+    V+
Sbjct: 362 MHSRSTGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVH 421

Query: 135 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DG--EDYWILANQWNRS 182
           EDF  Y++G+Y+H              G H+VK+ GWG     DG  + YWI AN W + 
Sbjct: 422 EDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWIAANSWGKD 481

Query: 183 WGADGYFKIKRGSNECGIEEDVV 205
           WG  GYF+I RG NEC IE  VV
Sbjct: 482 WGEHGYFRITRGENECEIETFVV 504


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 90/179 (50%), Gaps = 12/179 (6%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
           +P  AW Y+V +G+ +  C PY     C H G +          + TPKC   C  K+  
Sbjct: 162 FPGFAWLYYVEYGIASSGCQPY-PFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIP 220

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD +GG 
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
           AV+++GWG   +G  YW +AN W+  WG +GY  I  G+NEC IE     G P    L 
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFPDPSQLT 336


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 94/186 (50%), Gaps = 9/186 (4%)

Query: 37  CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 96
           C  GCDGGYP  A+R+    G+  E C  Y    G     C         V +C   +  
Sbjct: 176 CSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECTATSNA 232

Query: 97  WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HITGDVMG 154
             N        Y  +SD E I  +I ++GPV  S+ V+EDF  Y SGVY       D +G
Sbjct: 233 TVNGDR---CYYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIG 289

Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
            HAV ++GWG  +D   YW++ N W   +G DGYFKI RG+NEC IE  +V  L +++ +
Sbjct: 290 WHAVIIVGWGV-EDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSLVNTEGV 348

Query: 215 VKEITS 220
           V   TS
Sbjct: 349 VFASTS 354


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 54/102 (52%), Positives = 73/102 (71%), Gaps = 1/102 (0%)

Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
            AY++ +  + I  +I KNGPV  ++TVYEDFAHY+SG+YKH  G   G HAVK+IGWG 
Sbjct: 2   KAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG- 60

Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            + G  YWI+AN W+  WG +G+F++ RGSN+CG EE + AG
Sbjct: 61  EEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 102


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 68/171 (39%), Positives = 93/171 (54%), Gaps = 19/171 (11%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +    LS  DL++CC + CG+GC GG P +AW Y+  +G+VT         C PY     
Sbjct: 111 MMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQ 168

Query: 72  CSHPGCEPA--------YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
           C HPG            YPTP C   C    ++ +   K Y  ++Y ++     IM EI 
Sbjct: 169 CRHPGSRSQLNPCPGYIYPTPSCYPYCQAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIM 228

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
           KNGPVE  F VY DFA YKSG+Y H++G   G HA+++IGWG  ++G +YW
Sbjct: 229 KNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVNYW 278


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score =  119 bits (298), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 72/213 (33%), Positives = 106/213 (49%), Gaps = 15/213 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
           + L+   +++C       GC GG+  +AW Y    G V +EC PY  +       C+   
Sbjct: 235 VQLAPQQIISCVRR--SQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQN----ACKIRP 288

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
                   C    ++ R + +    A+ +N++  DIM EI K+GPV+    V+ DF  YK
Sbjct: 289 SDTLITANCDLPTKVDRTNMYKMGPAFSLNNE-TDIMIEIKKHGPVQAILRVHRDFFSYK 347

Query: 142 SGVYKHIT----GDVMGG-HAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKR 193
           SG+Y+H      GD   G H+V+LIGWG   +G +   YW+  N W R WG +G F+I R
Sbjct: 348 SGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVR 407

Query: 194 GSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
           G NEC IE  V+A LP     VK +      ++
Sbjct: 408 GQNECEIESYVLASLPYVHQQVKPMRQVGELQE 440


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/157 (41%), Positives = 89/157 (56%), Gaps = 14/157 (8%)

Query: 63  CDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 112
           C PY D   C+H       P C + +Y TP CV +C   K     +N +HY + +     
Sbjct: 373 CWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYMLESSPYQY 431

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
              +    I  +GP+  S+ VYEDF  YKSGVYKH +G  +GGHAVK+IGWG  ++GE Y
Sbjct: 432 SVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EENGEAY 490

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           W++ N WN  WG  G FKI  G+  C I++D++ G P
Sbjct: 491 WLVVNSWNEDWGDQGLFKIALGN--CEIDDDLLGGTP 525


>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 305

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 105/186 (56%), Gaps = 17/186 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGC 77
           Q +SLSV  +++C     G+ GC GG   S+W +    GVV  +C PY    TG S    
Sbjct: 128 QAVSLSVQHMVSCDN---GEAGCLGGEFESSWAFLETEGVVKSDCLPYTSGETGNSG--- 181

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                  +C   C +   L  ++ HY  ++    ++  +IM  +  +GPV+  F V+EDF
Sbjct: 182 -------ECPMMC-QDGTLVEDAFHYKAASASPLNNYNEIMVSLLADGPVQTGFYVHEDF 233

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
            +Y  G+Y  + G  +GGHAV ++G+G+ +D  DYWI+ N W   WG +GYF+I RG+NE
Sbjct: 234 LYYVGGIYHKVYGSSLGGHAVLIVGYGSMND-HDYWIVRNSWGPDWGENGYFRILRGTNE 292

Query: 198 CGIEED 203
           CGIE++
Sbjct: 293 CGIEKN 298


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/172 (39%), Positives = 89/172 (51%), Gaps = 16/172 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWR 98
           GC GG   S W +   HG  T EC PY D+    S P          C   C   +++ R
Sbjct: 143 GCAGGLSFSVWTFLTEHGTTTLECVPYTDANKDISSP----------CPDACADGSEI-R 191

Query: 99  NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
             K      Y  N     IM  +  +GPV+ S  VY DF +Y+SGVY+H+ G  +  HAV
Sbjct: 192 LVKADGCLDYSGNVTA--IMQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAV 249

Query: 159 KLIGWGTSDDGED--YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           ++IG+G +DD +   YWI+ N     WG +GYF I RGSNEC IE  V +GL
Sbjct: 250 EIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 115/233 (49%), Gaps = 50/233 (21%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++      LS  +L++C     G GC GG    AW Y    GVVTE C 
Sbjct: 234 AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTENCY 292

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
           PY           +P    P  V +C+ +++                  + N  + S   
Sbjct: 293 PY-----------QPPQQAPAEVGRCMMQSRAVGRGKRQATQRCPNTYNYHNDIYQSTPP 341

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----------MGGHA 157
           Y+++S+ ++IM EI +NGPV+    V+EDF  YK+G+YKH   DV           G H+
Sbjct: 342 YKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNGIYKHT--DVSSTKPPQYRKHGTHS 399

Query: 158 VKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           V++ GWG   D       YWI AN W ++WG +G+F+I RG+NEC IE  V+ 
Sbjct: 400 VRITGWGEDKDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFVIG 452


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 69/160 (43%), Positives = 91/160 (56%), Gaps = 14/160 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------F 67
           N SLS  DLL+CC   CG GCDGG+P  AW ++  HG+VT    EE   C PY       
Sbjct: 19  NKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQH 77

Query: 68  DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
            S G   P     YPTPKCV+ C      ++  K  + ++Y ++     IM EI  NGPV
Sbjct: 78  HSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPV 137

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
           E +F V+EDF  YKSG+Y H  G  +GGHA++++GWG  +
Sbjct: 138 EATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 91/169 (53%), Gaps = 15/169 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GCDGG     W +    G  T EC  Y D          P      C   C   +Q+   
Sbjct: 144 GCDGGDFWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI--- 191

Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHA 157
            + Y    Y +++ + + IM  +   GPV+    VY D ++Y+SGVYKH  G + +G HA
Sbjct: 192 -QLYKAHGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHA 250

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           ++++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 251 LEMVGYGTTDDGTDYWIIRNSWGADWGENGYFRIVRGVNECRIEDEIYA 299


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/202 (36%), Positives = 102/202 (50%), Gaps = 27/202 (13%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           +S   T  D +       +Q + +S  D+LACCG  CG GC+GG    AW Y    GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184

Query: 61  ----EE---CDPYFDSTGCSHPGCE-----------PAYPTPKCVRKC-VKKNQLWRNSK 101
               +E   C PY       HP CE            ++ TP C + C     + +   K
Sbjct: 185 GGRYQEKGVCKPYH-----LHP-CEITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDK 238

Query: 102 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 161
            Y  S Y ++ D + I  E+ KNGPV+ +FT YEDF+ Y+ G+Y H  G   G HAVK++
Sbjct: 239 SYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVV 298

Query: 162 GWGTSDDGEDYWILANQWNRSW 183
           GWG  ++G  YW +AN W+  W
Sbjct: 299 GWGV-ENGTKYWNVANSWSTDW 319


>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
 gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
          Length = 238

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/202 (35%), Positives = 100/202 (49%), Gaps = 19/202 (9%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +  + LS  DLL+C        C GG+    WR+   H  V+E+C PY      +   C 
Sbjct: 13  VDKVELSPQDLLSCLNGGRRVTCQGGHVDRGWRFLGRHAGVSEDCYPYESGYSNASTTCR 72

Query: 79  PA---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
            A    PT   +    +++Q     K++S   YR+ ++ EDIM EIY NGPV+    V E
Sbjct: 73  IARRRVPTEDPICPTGRQDQ-----KYFSTPPYRVPANEEDIMQEIYANGPVQALMLVKE 127

Query: 136 DFAHYKSGVYKHIT--------GDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWG 184
           DF  Y SGVYKH                H+V+++GWG   T    + YW+ AN W   WG
Sbjct: 128 DFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRTQYRPQKYWLCANSWGSGWG 187

Query: 185 ADGYFKIKRGSNECGIEEDVVA 206
            +GYF+I RG +E  IE  V+A
Sbjct: 188 ENGYFRIVRGEDESQIESFVLA 209


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 108/227 (47%), Gaps = 62/227 (27%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE---- 61
           ++R  + ++ +VS++   +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+     
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174

Query: 62  ---ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY     C H      P C     TPKC + C    +  ++  KHY  ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
           +  +DIMAEIYKN                                            G  
Sbjct: 234 NSEKDIMAEIYKN--------------------------------------------GTP 249

Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
           YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 250 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 296


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 62/134 (46%), Positives = 82/134 (61%), Gaps = 8/134 (5%)

Query: 81  YPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYED 136
           Y TP C   C   K    +   +HY+ S +  R  S    I  EI  NGP   +F+VYED
Sbjct: 3   YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKKEIMTNGPTSAAFSVYED 61

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           F  YKSGVYKH +G  +GGHAV++IGWGT + G DYW++ N WN  WG  G FKI +G  
Sbjct: 62  FLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG-- 118

Query: 197 ECGIEEDVVAGLPS 210
           +CGI++ ++AG P+
Sbjct: 119 DCGIDDMILAGTPA 132


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/154 (38%), Positives = 88/154 (57%), Gaps = 10/154 (6%)

Query: 63  CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 114
           C PY     C H      P C    Y TP+C + C K  +  +   K +   +  + ++ 
Sbjct: 188 CQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEGSSNVQNNE 246

Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 174
           +    +I   GPVE +F VYEDF + KSG+ +H+TG ++GGH +++IGWG  + G  YW+
Sbjct: 247 KVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGV-EKGNPYWL 305

Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 306 IANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 183

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 96/167 (57%), Gaps = 21/167 (12%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------- 66
           N+ LS  DLL+CC   CG GC GG+   AW Y+  +G+VT         C PY       
Sbjct: 19  NVQLSARDLLSCC-TSCGFGCVGGWIGDAWDYWRDNGIVTGGDYQDKSTCLPYPFPPSHH 77

Query: 67  FDSTGCS---HPGCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSISAYRINSDPEDIMAEIY 122
             S G     +P  +  YPTP CV KC +     +   K +++S+Y+I+ +  +I  EI 
Sbjct: 78  LVSKGTPFEIYP--QTLYPTPPCVSKCQEGYPGEYEKDKIFALSSYKIDRNATEIQKEIL 135

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 169
            NGPVE    VY DF +YK+GVY+H TG+++GGHA++L+GWG + DG
Sbjct: 136 INGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWGKTKDG 182


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 18/195 (9%)

Query: 17  VSLQNLSLSVNDLLACCGFLCGDGCDG--GYPISAWRYFVHHGVVTEECDPYFDSTGCSH 74
           V  +    S   +L+C      +GC    G  + +W +    G+  E C  Y D     +
Sbjct: 119 VDQEATRYSAQYILSCA---TTNGCLAFPGQGVVSWDFIATTGIPLESCVKYTD-----Y 170

Query: 75  PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTV 133
              E +YP P     C   + L      Y    Y  +  +PE +   I   GP++  FTV
Sbjct: 171 DKTESSYPCPSL---CNDNSSL----VLYKSDGYEGVGFNPEKLRRAIALRGPMQAMFTV 223

Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           YEDFA+Y  G+Y H+ G   G  +V+++G+GTSD+G+DYWI+ N W  +WG DGYF+I R
Sbjct: 224 YEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGTSDEGQDYWIVKNYWGSNWGEDGYFRIVR 283

Query: 194 GSNECGIEEDVVAGL 208
           G NEC IEE V   +
Sbjct: 284 GQNECQIEEAVYGAI 298


>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
          Length = 464

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/214 (36%), Positives = 108/214 (50%), Gaps = 32/214 (14%)

Query: 11  LSSSPYVSLQN---LSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
           L S   V+ +N   ++LS  D+++C  +    GC+GG+P + A +Y   HGVV EEC PY
Sbjct: 266 LESRLRVATKNQVQVNLSPQDIVSCSAY--SQGCEGGFPYLIAGKYAQDHGVVAEECYPY 323

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              TG     C  A    KC R  V        +K+  +  Y    + E +   + ++GP
Sbjct: 324 ---TG-RDSACSAA---KKCQRSYV--------AKYRYVGGYYGACNEELMKMSLVESGP 368

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV----------MGGHAVKLIGWGT-SDDGEDYWIL 175
           + VSF VY DF HY  GVY    G            +  HAV L+G+GT S   E YWI+
Sbjct: 369 LSVSFEVYSDFMHYAGGVYHRTDGLFNKINEFNPFELTNHAVLLVGYGTDSQTKEKYWIV 428

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
            N W   WG DG+F+I+RG +ECGIE   V   P
Sbjct: 429 KNSWGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/245 (32%), Positives = 111/245 (45%), Gaps = 66/245 (26%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
           LS  ++ AC       GC+GG+P SAW +    G+ T             + C PY D  
Sbjct: 591 LSAGEMNACAP---SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPY-DFP 646

Query: 71  GCSH-------PGC----------------------EPAYPTPKCVRKC--VKKNQLWRN 99
            C+H       P C                      + +Y TP C  +C   K     R+
Sbjct: 647 PCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRD 706

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPV---------------EVSFTVYEDFAHYKSGV 144
            +H+ + +        D    I  +GPV                 SF+VYEDF  YKSGV
Sbjct: 707 DRHFMLESSPYQYSVNDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYEDFLAYKSGV 766

Query: 145 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 204
           YKH +G+ +GGHAVK+IGWG  + G+ YWI+ N WN  WG  G FKI  G+  CGI++++
Sbjct: 767 YKHTSGEYLGGHAVKIIGWG-EESGQAYWIVVNSWNEDWGDHGLFKIALGN--CGIDDNL 823

Query: 205 VAGLP 209
           + G P
Sbjct: 824 LGGTP 828


>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
          Length = 463

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 73/204 (35%), Positives = 105/204 (51%), Gaps = 31/204 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C+ K   +R   S+++ +  +    +   +  E+  NGP+ V+F VY D
Sbjct: 331 -----------CMLKEDCFRYYTSEYHYVGGFYGGCNEALMKLELVHNGPMAVAFEVYND 379

Query: 137 FAHYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGY 188
           F HY+ G+Y H TG         +  HAV L+G+GT    G DYWI+ N W  +WG DGY
Sbjct: 380 FLHYQEGIYHH-TGLTDPFNPFELTNHAVLLVGYGTDPATGMDYWIVKNSWGTAWGEDGY 438

Query: 189 FKIKRGSNECGIEEDVVAGLPSSK 212
           F+I+RG++EC IE   VA  P  K
Sbjct: 439 FRIRRGTDECAIESIAVAATPIPK 462


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 108/221 (48%), Gaps = 34/221 (15%)

Query: 21  NLSLSVNDLLACCGFL--C-GDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
           N  LS  +LLACC     C   GC GG    AW +   HG+ T             + C 
Sbjct: 134 NQLLSAGELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCW 193

Query: 65  PYFDSTGCSH--------PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISA--YRINS 112
           PY +   C+H        P  + +Y TP C+ +C   K        +H++  A  Y  N 
Sbjct: 194 PY-NFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG 252

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
               I  EI K+GP   SF  YEDF  YKSGVYK+ +G  +  H V+LIGWGT + G DY
Sbjct: 253 I-RSIKKEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGT-EKGVDY 310

Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
           W+  N WN  W   G FKI +G  +CGI  D+V G P++ N
Sbjct: 311 WLAKNDWNEEWADLGTFKIAQG--DCGI-NDLVLGAPAALN 348


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 110/211 (52%), Gaps = 14/211 (6%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
           ++   ++R A+ S+     + + LS   LL+C       GC GG+   AW +   HG+V 
Sbjct: 217 IATVASDRFAIQSN---GAERMVLSPQVLLSC-NIRRQQGCRGGHIDVAWNFARGHGLVD 272

Query: 61  EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
           EEC PY  +T        P  P    +    +     R S+ Y +      +   DIM +
Sbjct: 273 EECFPYKAATTSC-----PFRPKANLIEDGCRPPVRQRTSR-YKVGPPGKLATENDIMYD 326

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWGTSDDGEDYWILAN 177
           I ++GPV    TV++DF HY  G+Y+    GD  + G H+V+++GWG  D G+ YW++AN
Sbjct: 327 IMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWG-EDRGDKYWVVAN 385

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
            W   WG +GYF+I RGSNE GIE  VV  L
Sbjct: 386 SWGCDWGENGYFRIARGSNESGIESFVVTVL 416


>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
          Length = 68

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 54/69 (78%), Positives = 61/69 (88%), Gaps = 1/69 (1%)

Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           FAHYKSGVYK+I GD+MGGHAVKL+GWGT + G DYW++AN WN +WG DGYFKI RGSN
Sbjct: 1   FAHYKSGVYKYIKGDLMGGHAVKLVGWGT-EGGTDYWLVANSWNTAWGEDGYFKIARGSN 59

Query: 197 ECGIEEDVV 205
           ECGIEEDVV
Sbjct: 60  ECGIEEDVV 68


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 88/156 (56%), Gaps = 17/156 (10%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
           ++ + +S  D+++CC + CG GCDGG+PI AW++F   GVVT         C PY + T 
Sbjct: 22  VKQVLISAQDMVSCCSY-CGYGCDGGWPIKAWQFFAREGVVTGGNYGRQGCCRPY-EITP 79

Query: 72  CSHPGCEPAY-------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
           C H G EP Y        TP+C RKC       ++  K Y   AY++ +  + I  EI  
Sbjct: 80  CGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMM 139

Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
           +GPV   +TVYEDF++Y  G+YKH  G   GGHAVK
Sbjct: 140 HGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175


>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain 1; AltName: Full=Dipeptidyl peptidase I
           heavy chain 1; Contains: RecName: Full=Dipeptidyl
           peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
           peptidase I heavy chain 2; Contains: RecName:
           Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
           Full=Dipeptidyl peptidase I heavy chain 3; Contains:
           RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
           AltName: Full=Dipeptidyl peptidase I heavy chain 4;
           Contains: RecName: Full=Dipeptidyl peptidase 1 light
           chain; AltName: Full=Dipeptidyl peptidase I light chain;
           Flags: Precursor
 gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
          Length = 435

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY    G   P C+
Sbjct: 252 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 305

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P      C R        + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF 
Sbjct: 306 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 353

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY+ G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I
Sbjct: 354 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 413

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 414 RRGTDECAIESIAVAATPIPK 434


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + +S S   L++C   L   GCDGG     W +    G  T EC  Y D       G   
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           A P P          QL++   +  +S     S P  IM  +   GP++    VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231

Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           Y+SGVYKH  G + +G HA++++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291

Query: 199 GIEEDVVA 206
            IE+++ A
Sbjct: 292 RIEDEIYA 299


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 104/188 (55%), Gaps = 20/188 (10%)

Query: 4   TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
           + ++R  + S   +S++   LS  +LL+CC   CG GC+GG P  AW Y+   G+VT   
Sbjct: 128 SMSDRICIHSKGRISIE---LSAVNLLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGS 183

Query: 61  ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAY 108
                 C PY        ST  +H  CE  Y  TP+C + C     + + N K+Y  S+Y
Sbjct: 184 NETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSY 243

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
            + SD   IM EI  NGPVE +F VY+DF +YK+GVYK++TG ++GGHA++ I W     
Sbjct: 244 YVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIR-ITWLGCIH 302

Query: 169 GEDYWILA 176
            E Y IL 
Sbjct: 303 IESYTILV 310


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + +S S   L++C   L   GCDGG     W +    G  T EC  Y D       G   
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           A P P          QL++   +  +S     S P  IM  +   GP++    VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231

Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           Y+SGVYKH  G + +G HA++++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291

Query: 199 GIEEDVVA 206
            IE+++ A
Sbjct: 292 RIEDEIYA 299


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + +S S   L++C   L   GCDGG     W +    G  T EC  Y D       G   
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           A P P          QL++   +  +S     S P  IM  +   GP++    VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231

Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
           Y+SGVYKH  G + +G HA++++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291

Query: 199 GIEEDVVA 206
            IE+++ A
Sbjct: 292 RIEDEIYA 299


>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
          Length = 459

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY    G   P C+
Sbjct: 276 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 329

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P      C R        + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF 
Sbjct: 330 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 377

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY+ G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I
Sbjct: 378 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 437

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 438 RRGTDECAIESIAVAATPIPK 458


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 76/215 (35%), Positives = 104/215 (48%), Gaps = 15/215 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + + L+   +LAC       GC GG+  +AW+Y    GVV EEC PY  +        + 
Sbjct: 234 EMVQLAPQQMLACVRR--QQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDD 291

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
              T  C    VK N   R   +    A+ +N++  DIMAEI   G V+    VY DF  
Sbjct: 292 TLITANCELP-VKVN---RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFS 346

Query: 140 YKSGVYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
           Y+SG+Y+H        +    H+V+LIGWG    G D   YWI  N W + WG +G F+I
Sbjct: 347 YRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGENGRFRI 406

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
            RGSNEC IE  V+A  P     V+ I      ++
Sbjct: 407 LRGSNECDIESYVLASNPYVHEHVQAIRKVGELQE 441


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 70/191 (36%), Positives = 100/191 (52%), Gaps = 15/191 (7%)

Query: 17  VSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG 76
           +  + +S S   L++C   L   GCDGG     W +    G  T EC  Y D       G
Sbjct: 89  IDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------G 140

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
              A P P          QL++   +  +S     S P  IM  +   GP++    VY D
Sbjct: 141 HTVASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYAD 194

Query: 137 FAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
            ++Y+SGVYKH  G + +G HA++++G+GT+DDG DYWI+ N W   WG +GYF+I RG 
Sbjct: 195 LSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGV 254

Query: 196 NECGIEEDVVA 206
           NEC IE+++ A
Sbjct: 255 NECRIEDEIYA 265


>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
          Length = 463

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSSQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score =  116 bits (290), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 103/187 (55%), Gaps = 17/187 (9%)

Query: 20  QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGC 77
           Q +SLSV  +++C     G+ GC GG   S+W +    G V  +C PY    TG S    
Sbjct: 128 QAVSLSVQHMVSCDS---GEAGCQGGEFESSWAFLETEGAVKSDCLPYTSGETGKSG--- 181

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                  +C   C     +  ++ HY  ++    S+  +IM  +  +GPV+  F V+EDF
Sbjct: 182 -------ECPTTCQDGTPV-ESAFHYKAASASRLSNYNEIMVSLLADGPVQTGFYVHEDF 233

Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
            +Y  G+Y  + G  +GGHAV ++G+G+ ++  DYWI+ N W   WG +GYF+I RG+NE
Sbjct: 234 LYYVGGIYHKVYGTSLGGHAVLIVGYGSMNN-HDYWIVRNSWGSDWGENGYFRILRGTNE 292

Query: 198 CGIEEDV 204
           CGIE++ 
Sbjct: 293 CGIEKNA 299


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 107/220 (48%), Gaps = 45/220 (20%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           +N  +S   +++CC +LCG GCDGG    +W Y+  HG V+       + C PY      
Sbjct: 110 KNPIMSAQQIISCC-YLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPY------ 162

Query: 73  SHPGCE------PAYP--------TPKCVRKCVKKNQLWR------NSKHYSISAYRINS 112
           + P C+      P +         TP C +KC   N            K+Y +S Y    
Sbjct: 163 TIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSPYMA-- 220

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWGTSDDG 169
                M +I+ NGP+   F +Y D   YKSGVY++      D    H+VK+ GWG  ++G
Sbjct: 221 -----MKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWG-EENG 274

Query: 170 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
             YW++AN +   WG +G FKI RG++ C  +E + AGLP
Sbjct: 275 VPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 314


>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
          Length = 463

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 72/201 (35%), Positives = 105/201 (52%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    G+V E C PY   TG   P C+
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP-CK 332

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P      CVR        + +S+++ +  +    +   +  E+  +GP+ V+F VY DF 
Sbjct: 333 PK---EDCVR--------YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYNDFL 381

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY+ G+Y H           +  HAV L+G+GT    G DYWI+ N W   WG DGYF+I
Sbjct: 382 HYRKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDPVSGMDYWIVKNSWGIGWGEDGYFRI 441

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 442 RRGTDECAIESIAVAATPIPK 462


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 96/177 (54%), Gaps = 22/177 (12%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 78
           LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y D TGC    +P CE
Sbjct: 25  LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCKPYPYPPCE 82

Query: 79  -----------PA--YPTPKCVRKCVKKN--QLWRNSKHY-SISAYRINSDPEDIMAEIY 122
                      P+  YPT +      K +    +    H+ +I     + +   I   I 
Sbjct: 83  HHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIK 142

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
            +G +    TV+EDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW++AN W
Sbjct: 143 THGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLIANSW 198


>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
          Length = 446

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 96/193 (49%), Gaps = 27/193 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
            S  D++ CC +    GCDGG+P +   +Y    G+V E CDPY                
Sbjct: 272 FSPQDIVDCCQY--SQGCDGGFPYLVGGKYAEDFGLVDESCDPYVGED------------ 317

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
                RKC   +   R +  Y        +  E  M    + GP+ VSF VY+DF HYKS
Sbjct: 318 -----RKCKSTSCSRRYATRYRYVGGYYGACNEQEMKLALQRGPLSVSFMVYDDFMHYKS 372

Query: 143 GVYKH--ITGDV----MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
           GVY+H  +T       +  HAV L+G+G +D+G  YWI+ N W + WG +GYF+I RG++
Sbjct: 373 GVYRHSGLTDKYNPFEITNHAVLLVGYG-ADEGTKYWIVKNSWGKGWGEEGYFRILRGAD 431

Query: 197 ECGIEEDVVAGLP 209
           EC IE   V   P
Sbjct: 432 ECAIESIAVETFP 444


>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
          Length = 463

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
 gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
 gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
          Length = 455

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 100/201 (49%), Gaps = 26/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           Q    S   +++C  +    GCDGG+P    +Y    G+V E+C PY   TG   P   P
Sbjct: 272 QQPVFSPQQVVSCSQY--SQGCDGGFPYLIGKYIQDFGIVEEDCFPY---TGSDSPCNLP 326

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
           A        KC K    +  S ++ +  +        +M E+ KNGP+ V+  VY DF +
Sbjct: 327 A--------KCTK----YYASDYHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMN 374

Query: 140 YKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           YK G+Y H TG         +  HAV L+G+G     GE YWI+ N W   WG +G+F+I
Sbjct: 375 YKEGIYHH-TGLRDANNPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRI 433

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 434 RRGTDECAIESIAVAATPIPK 454


>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
          Length = 463

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
          Length = 462

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 100/197 (50%), Gaps = 26/197 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           LS   +++C  +    GCDGG+P + A +Y    GVV E C PY    G   P C P   
Sbjct: 283 LSTQQIVSCSEY--SQGCDGGFPYLIAGKYVQDFGVVEENCFPYL---GHDSP-CSPK-- 334

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
              C R  V        S ++ +  +    +   +  E+ +NGP+ V+F VY DF HY+ 
Sbjct: 335 --NCTRYYV--------SDYHYVGGFYGACNEALMKLELVENGPMAVAFEVYNDFIHYQK 384

Query: 143 GVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGS 195
           GVY H           +  HAV L+G+GT +  GE YWI+ N W   WG DGYF+I RG+
Sbjct: 385 GVYHHTGLRDSFNPFEITNHAVLLVGYGTDEKTGEHYWIVKNSWGSYWGEDGYFRILRGT 444

Query: 196 NECGIEEDVVAGLPSSK 212
           +ECGIE   V+  P  K
Sbjct: 445 DECGIESIAVSATPIPK 461


>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 76/195 (38%), Positives = 100/195 (51%), Gaps = 26/195 (13%)

Query: 30  LACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTEE-------CDPYF----DST--- 70
           L+CC  L   CGDG  CDG +P    +++  HG+ T         C PY     D T   
Sbjct: 2   LSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPN 61

Query: 71  GCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGP 126
           G +   C P Y TP C  +C   N  W    +  KH+  + Y +     DI  EI +NGP
Sbjct: 62  GTTSVPC-PGYHTPVCEERCTS-NITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGP 119

Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
           V  SF +Y+DF  YKSG+Y H  GD  GG   K+IGWG  D+G  YW+  +QW   +G +
Sbjct: 120 VIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGEN 178

Query: 187 GYFKIKRGSNECGIE 201
           G+ +I RG NE  IE
Sbjct: 179 GFMRILRGVNEVHIE 193


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 101/194 (52%), Gaps = 24/194 (12%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           +S  D+++C  +    GC GG+P + A +Y    G+V E C PY    G   P  E    
Sbjct: 287 MSPQDVVSCSEY--AQGCAGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPCKETK-- 339

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
             KC R           + +Y +  +    +   +M E+ KNGP+ +SF VY DF HYK 
Sbjct: 340 -SKCRRHST--------TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKG 390

Query: 143 GVYKHI-TGD-----VMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           G+Y+H   GD      +  HAV L+G+GT    G+DYWI+ N W   WG +G+F+I RG 
Sbjct: 391 GIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGV 450

Query: 196 NECGIEEDVVAGLP 209
           +EC IE + VA  P
Sbjct: 451 DECSIENEAVAVTP 464


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 90/176 (51%), Gaps = 17/176 (9%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCVKKNQLW 97
           C GGY   +W++F++ G+  E C PY   +          Y      +C   C   + L 
Sbjct: 154 CQGGYGYYSWKFFMNTGIPLESCVPYTKDS--------LVYGNTTNAQCRSTCTDGSPL- 204

Query: 98  RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGH 156
               + + SAY I S   +   EI  NGPVE  F VY DF  YKSG+Y+   G   +GGH
Sbjct: 205 --KLYKAASAYYIYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGH 262

Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN--ECGIEEDVVAGLPS 210
           AVK++GW +  +G  YWI  NQW  SWG  GYF I RG++   C  +  ++AG  S
Sbjct: 263 AVKVLGWASDSNGTPYWIAQNQWGTSWGMGGYFYIYRGNSTLNCKFDNYMIAGTVS 318


>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
          Length = 463

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 106/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLSVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 112/211 (53%), Gaps = 14/211 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC 77
           L  + LS   LL+C       GC GG+   AW +    G+V + C P+  + T C  P  
Sbjct: 236 LTKVDLSPQHLLSCNKGQ--RGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPK- 292

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
            P +     +      + L R+  +    AY+I  D +DIM EI ++GPV+ +  VY+DF
Sbjct: 293 RPNFDALSSICPPSLGSNL-RSELYRVGPAYKIQ-DEKDIMEEIMQSGPVQATMKVYQDF 350

Query: 138 AHYKSGVYKHITGDV----MGGHAVKLIGWGTSDD--GE--DYWILANQWNRSWGADGYF 189
             YKSGVY     +      G H+VK++GWG   +  G+   YW+ AN W + WG +G+F
Sbjct: 351 FSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIKYWLAANSWGQQWGENGFF 410

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITS 220
           KI+RG+NEC IEE V+A    + +  +EI +
Sbjct: 411 KIRRGTNECEIEEFVLAAWAETNDPSREIIT 441


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 96/203 (47%), Gaps = 24/203 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPG 76
           N  LS   LL+C       GC GGY   AW +    G V+  C PY     + T      
Sbjct: 245 NPRLSEQHLLSC-NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSGLDEDTIMQKLR 303

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
           C  AY + +C  + V  +       + S   YRI +   DIM EIY+NGPV+ +F V  D
Sbjct: 304 CRVAYGSSQCPERGVTSDL------YLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKND 357

Query: 137 FAHYKSGVYKHIT---------GDVMGGHAVKLIGWGTSD----DGEDYWILANQWNRSW 183
           F  Y  GVY+++           D  G H+VK++GWG       +   YW+  N W R+W
Sbjct: 358 FFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNW 417

Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
           G  G F+I RG NEC IE  V+ 
Sbjct: 418 GEQGMFRIVRGVNECEIESFVLG 440


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/211 (35%), Positives = 102/211 (48%), Gaps = 36/211 (17%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           +LS  +LL+C       GC GG    AW +    G+V+  C P+ +     H G  PA P
Sbjct: 252 ALSPQNLLSC-NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEG---DHNGAAPAAP 307

Query: 83  ---------------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                          T  C       N +++     +   YR++S  +DIM E+ +NGPV
Sbjct: 308 CMMHSRHMGRGKRQATAHCPNSRTHANHIYQ-----ATPPYRLSSHEKDIMKELMENGPV 362

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWIL 175
           +    V+EDF  YKSG+YKH    +         G H+VK+ GWG     DG+   YW  
Sbjct: 363 QALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYWTA 422

Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           AN W  +WG +GYF+I RG+NEC IE  VV 
Sbjct: 423 ANSWGPTWGENGYFRIVRGANECDIESFVVG 453


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 92/183 (50%), Gaps = 21/183 (11%)

Query: 37  CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQ 95
           C +GC GG+   A    ++ G+V++EC  Y  S   S P  C+   P             
Sbjct: 117 CNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPI------------ 164

Query: 96  LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 155
              N+  Y  ++ R     +D   EI  NGPV  +F +Y DF  +K  VY   +   +  
Sbjct: 165 --SNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVES 222

Query: 156 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV------AGLP 209
           HAV+++GWGT+ DG DYWI AN W   WG  GYFKI+RGS+E   EE  +      A +P
Sbjct: 223 HAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASVP 282

Query: 210 SSK 212
           +S+
Sbjct: 283 TSQ 285


>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 206

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 49/57 (85%), Positives = 52/57 (91%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 77
           ++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGC
Sbjct: 150 DVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206


>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
          Length = 301

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/175 (34%), Positives = 91/175 (52%), Gaps = 15/175 (8%)

Query: 30  LACCGFLCGDG-CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 88
           +  C F  GDG C+GG+  + W++    GV   +C  YF                  C+ 
Sbjct: 133 VVSCDF--GDGACNGGWLSNVWKFLTKTGVPKLDCLKYFSGMTGDRE---------SCIT 181

Query: 89  KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
            C   + +      + I+      D + +M  +  +GP++V+F VY DF +Y SGVY+H+
Sbjct: 182 HCTDGSPVELYQASHVIN---YGMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQHV 238

Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
            G + GGHAV+++G+G  + G  YWI+ N W   WG  GYF+I R  NECGIEE 
Sbjct: 239 NGMMEGGHAVEMVGYGIDESGLKYWIIRNSWGPDWGEGGYFRIIRRVNECGIEEQ 293


>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
 gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/190 (37%), Positives = 103/190 (54%), Gaps = 18/190 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N++LS   L+AC   +   GC+GG P  AW Y    G+ T EC PY    G         
Sbjct: 143 NVTLSPQALVAC-DDIGNQGCNGGVPQLAWEYMEWKGLPTFECYPYTAGNGTDG------ 195

Query: 81  YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                C R+C   + + +  +K +S++     +    I  EI   GPV  +  VY+DF  
Sbjct: 196 ----TCQRQCADGSAMTYYRAKPFSMTTC---NSVACIQNEIITYGPVVGTMMVYQDFMS 248

Query: 140 YKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA-DGYFKIKRGSN 196
           Y SGVY +  T +++GGHA++++GWGT    + DYWI+ N W+ +WG  DGYF I+RG+N
Sbjct: 249 YSSGVYVYDGTAELLGGHAIEIVGWGTDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTN 308

Query: 197 ECGIEEDVVA 206
            CGI+ D  A
Sbjct: 309 MCGIDHDASA 318


>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
          Length = 463

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
 gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
 gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
 gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
 gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
 gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
          Length = 463

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
           leucogenys]
          Length = 463

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 105/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF
Sbjct: 380 FLHYEKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 103/194 (53%), Gaps = 13/194 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           +++ LS   LL+C       GC GGY   AW +    G+V +EC P+   TG  +  C  
Sbjct: 250 EDVELSAQHLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPW---TG-RNDQCRL 304

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
              +   V  C K     R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  
Sbjct: 305 RKRSNLNVAGCRKPPNPLRQELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 363

Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
           YK+GVY+H     +   G H++++IGWG           YW++AN W R WG +G F+I+
Sbjct: 364 YKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQ 423

Query: 193 RGSNECGIEEDVVA 206
           RG+NEC IE  V+A
Sbjct: 424 RGTNECEIESYVLA 437


>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
          Length = 463

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 106/203 (52%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY++G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I RG++EC IE   VA  P  K
Sbjct: 440 RIHRGTDECAIESIAVAATPIPK 462


>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
          Length = 469

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 101/202 (50%), Gaps = 27/202 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYF-DSTGCSHPGC 77
           Q   LS   +++C  +    GCDGG+P + A +Y    GVV E+C PY    T C     
Sbjct: 285 QTPILSTQQIVSCSEY--SQGCDGGFPYLIAGKYTQDFGVVEEDCFPYTARDTQC----- 337

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
               P  +C R        +  S +  +  +    +   +  E+ ++GP+ V+F VY DF
Sbjct: 338 ---VPKKECPR--------YYASDYQYVGGFYGGCNEALMKLELVRHGPMAVAFEVYNDF 386

Query: 138 AHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
            HY+ GVY H           +  HAV L+G+GT    G DYWI+ N W  +WG DGYF+
Sbjct: 387 LHYREGVYHHTGLRDPFNPFELTNHAVLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFR 446

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I+RGS+EC IE   VA  P  +
Sbjct: 447 IRRGSDECAIESIAVAATPIPR 468


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 90/164 (54%), Gaps = 17/164 (10%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
           Q++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C
Sbjct: 61  QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 118

Query: 73  SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
            H      P C +  Y TP+C RKC K     + + KHY   +  +  +   I  EI   
Sbjct: 119 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMY 178

Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
           GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG  ++
Sbjct: 179 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENE 222


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 84/170 (49%), Gaps = 15/170 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC GG     W +    G  T EC  Y D        C    PT      C   +Q+   
Sbjct: 144 GCSGGDFFPTWSFLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI--- 191

Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HA 157
            + Y    Y +++     IM  +   GPV+    VY D  +Y  GVY+H  G +  G HA
Sbjct: 192 -QFYKAHGYGQVSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHA 250

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           ++++G+GT+DDG DYW + N W   WG DGYF+I RG NEC IE+++ A 
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
 gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 29/205 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
           +I+RG++EC IE   VA  P  K L
Sbjct: 440 RIRRGTDECAIESIAVAATPIPKLL 464


>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
 gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
 gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
 gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 29/205 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
           +I+RG++EC IE   VA  P  K L
Sbjct: 440 RIRRGTDECAIESIAVAATPIPKLL 464


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 84/170 (49%), Gaps = 15/170 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC GG     W +    G  T EC  Y D        C    PT      C   +Q+   
Sbjct: 144 GCSGGDFFPTWSFLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI--- 191

Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HA 157
            + Y    Y +++     IM  +   GPV+    VY D  +Y  GVY+H  G +  G HA
Sbjct: 192 -QFYKAHGYGQLSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHA 250

Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           ++++G+GT+DDG DYW + N W   WG DGYF+I RG NEC IE+++ A 
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
          Length = 463

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 106/201 (52%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   T    P C 
Sbjct: 279 QTPVLSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TATDSP-C- 331

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K  + C +    + +S+++ +  +    +   +  E+  +GPV VSF VY+DF 
Sbjct: 332 ------KVKKDCFR----YYSSEYHYVGGFYGGCNEALMKLELVNHGPVVVSFEVYDDFI 381

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY  G+Y H           +  HAV L+G+GT S  G DYWI+ N W+ +WG DGYF+I
Sbjct: 382 HYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGLDYWIVKNSWSATWGEDGYFRI 441

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++ECGIE   +   P  K
Sbjct: 442 RRGTDECGIESIALTATPIPK 462


>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
          Length = 316

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 132 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 183

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 184 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 232

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 233 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 292

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 293 RIRRGTDECAIESIAVAATPIPK 315


>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
          Length = 345

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 73/185 (39%), Positives = 104/185 (56%), Gaps = 22/185 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+  S  D+++C   L    C+GGY  S+ +Y    GVV+E+C  Y  + G S       
Sbjct: 168 NMQFSRQDMVSCD--LGNAACNGGYLSSSVQYLQTEGVVSEQCLAYASADGNS------- 218

Query: 81  YPTPKCVRKCVKKNQLWRN--SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
              P+C  +C  K+  ++    K+ S+   +I +  EDI  EIY NGPV V F VY+DF+
Sbjct: 219 --VPRCNYRCDDKSLEYKKYGCKYNSM---KILTTYEDIKEEIYTNGPVMVGFVVYDDFS 273

Query: 139 HYKSGVYKHITGDVM--GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
            Y +G+Y+ +T D +  GGHAV L GWG  D+G  YWI  NQW  +WG  G+F+I  G  
Sbjct: 274 SYSTGIYE-VTPDSVEEGGHAVTLNGWGY-DNGRLYWIGQNQWQNTWGESGFFRIYAG-- 329

Query: 197 ECGIE 201
           E GI+
Sbjct: 330 EAGID 334


>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
          Length = 463

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSKY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ G+Y H           +  HAV L+G+GT S  G  YWI+ N W  SWG DGYF
Sbjct: 380 FLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 69/159 (43%), Positives = 85/159 (53%), Gaps = 18/159 (11%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
           N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC     P C
Sbjct: 19  NKSLSAVDLLSCCEN-CGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 75

Query: 78  E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
           E              YPTP+CV++C   +  +   K  +  +Y I +    IM EI   G
Sbjct: 76  EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 135

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
           PVE  FT+YEDF  Y SGVY H  G  M GHAV+++GWG
Sbjct: 136 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWG 174


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 64/155 (41%), Positives = 90/155 (58%), Gaps = 16/155 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S  DLL+CC   CG GC GG+P  AW +++ +G+VT         C  Y     CSH  
Sbjct: 22  ISATDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGGSKENPSGCRSY-PFPRCSHHG 79

Query: 75  ----PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
               P C +  + TP CV  C K +  +   K ++ S+Y + S+   IM EI +NGPVE 
Sbjct: 80  KGKYPPCPKTIFDTPNCVDHCDKPDIDYAADKTHAKSSYNVQSNERVIMKEIMRNGPVEA 139

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
           +F VYEDF  YKSG+Y H  G ++GGHA++++GWG
Sbjct: 140 AFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWG 174


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 107/224 (47%), Gaps = 31/224 (13%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ ++      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSMGHMTPVLSPQNLLSCDTHH-QQGCQGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVK-KNQLWR---------NSKHYSISAYRINSDP 114
           P+   +G       PA P     R   + K Q  R         N  +    AYR+ SD 
Sbjct: 294 PF---SGHEQAEAGPATPCMMHSRAMGRGKRQATRRCPNSHDDANEIYQVTPAYRLGSDE 350

Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTS 166
           ++IM E+ +NGPV+    VYEDF  YKSG+Y H    +         G H+VK+ GWG  
Sbjct: 351 KEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEE 410

Query: 167 --DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
              DG    YW  AN W  SWG  GYF+I RGSNEC IE  V+ 
Sbjct: 411 MLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLG 454


>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
 gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
 gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
 gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
 gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
 gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
 gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
          Length = 463

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|1582221|prf||2118248A prepro-cathepsin C
          Length = 463

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
           C
          Length = 441

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 255 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 306

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 307 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 355

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 356 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 415

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 416 RIRRGTDECAIESIAVAATPIPK 438


>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 262 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 313

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 314 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 362

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 363 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 422

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 423 RIRRGTDECAIESIAVAATPIPK 445


>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 166

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 49/57 (85%), Positives = 52/57 (91%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 77
           ++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGC
Sbjct: 110 DVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 166


>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
          Length = 463

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
          Length = 455

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 105/201 (52%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P C 
Sbjct: 271 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP-C- 323

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K    C +    + +S+++ +  +    +   +  E+   GP+ V+F VY DF 
Sbjct: 324 ------KLKEGCFR----YYSSEYHYVGGFYGGCNEALMKLELVHRGPMAVAFEVYNDFL 373

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY+ GVY H           +  HAV L+G+GT +  G DYWI+ N W  SWG DGYF+I
Sbjct: 374 HYRQGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGEDGYFRI 433

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A  P  K
Sbjct: 434 RRGTDECAIESIALAATPIPK 454


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 14/199 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N SLS   LL+C       GC+GGY   AW Y    GVV + C PY  S     PG    
Sbjct: 306 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 363

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                  R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  
Sbjct: 364 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 423

Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
           Y  GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG DGY
Sbjct: 424 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 483

Query: 189 FKIKRGSNECGIEEDVVAG 207
           FKI RG N C IE  V+  
Sbjct: 484 FKILRGENHCEIESFVIGA 502


>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
 gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
          Length = 461

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY  +         
Sbjct: 277 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPYMGAD-------F 327

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C R        + +S ++ +  +    +   +  E+  +GP+ V+F VY+DF 
Sbjct: 328 PCKPKKDCFR--------YYSSDYHYVGGFYGGCNEALMKLELVHHGPIAVAFQVYDDFF 379

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY++G+Y H           +  HAV L+G+GT +  G DYWI+ N W   WG +GYF+I
Sbjct: 380 HYRTGIYYHTGLRDPFNPFELTNHAVLLVGYGTDTASGMDYWIVKNSWGAGWGENGYFRI 439

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA  P  K
Sbjct: 440 RRGTDECAIESIAVAATPVPK 460


>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
 gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
          Length = 528

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 99/198 (50%), Gaps = 28/198 (14%)

Query: 25  SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
           S  ++++C  +    GCDGG+     ++    G++ E+CDPY   TG  H          
Sbjct: 347 SPENIISCSFY--SQGCDGGFAYLISKWGEDFGIIAEQCDPY---TGTPH---------- 391

Query: 85  KC-VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
           KC + +     Q W N ++     Y      E++  ++ K GP+ VS  VY D  +Y SG
Sbjct: 392 KCNLNQACSTRQYWTNYRY--TGGYYGAVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSG 449

Query: 144 VYKHITGDVMG----------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           +Y+H++   +            H V ++GWG ++ GE YWI+ N W  S+G DGYF I R
Sbjct: 450 IYRHVSSSKLTSPVPNPFELTNHVVLIVGWGENEKGEKYWIVKNSWGTSFGMDGYFLIAR 509

Query: 194 GSNECGIEEDVVAGLPSS 211
           G +EC IE +  + +P+ 
Sbjct: 510 GVDECAIESENASAIPTQ 527


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 100/209 (47%), Gaps = 34/209 (16%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS  +LL+C   L   GC GG+   AW +    GVV++ C P+             A P 
Sbjct: 255 LSPQNLLSC-DTLHQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAE------AGPA 307

Query: 84  PKCV--------------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
           P C+              R+C   +    N  +    AYR+ SD ++IM E+ +NGPV+ 
Sbjct: 308 PPCMMHSRAMGRGKRQATRRCPNSHTD-ANDIYQVTPAYRLGSDEKEIMKELMENGPVQA 366

Query: 130 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWILAN 177
              V+EDF  YK G+Y H    +         G H+VK+ GWG  T  DG    YW  AN
Sbjct: 367 LMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAAN 426

Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVA 206
            W  SWG  G+F+I RGSNEC IE  V+ 
Sbjct: 427 SWGPSWGERGHFRILRGSNECDIESFVLG 455


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 72/198 (36%), Positives = 100/198 (50%), Gaps = 14/198 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N SLS   LL+C       GC+GGY   AW Y    GVV + C PY  S     PG    
Sbjct: 246 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 303

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                  R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  
Sbjct: 304 PKRDYTDRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 363

Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
           Y  GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG DGY
Sbjct: 364 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 423

Query: 189 FKIKRGSNECGIEEDVVA 206
           FKI RG N C IE  V+ 
Sbjct: 424 FKILRGDNHCEIESFVIG 441


>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
          Length = 463

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCAGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CTVKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
           F HY+ G+Y H           +  HAV L+G+GT    G DYWI+ N W  SWG DGYF
Sbjct: 380 FLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 103/195 (52%), Gaps = 30/195 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           +N+ +S   LL+C   L G  GC+GG    A+ +   HG+V+E+C PY            
Sbjct: 232 ENVRMSSQTLLSC--HLKGQRGCNGGNLDIAFDFVKTHGLVSEQCFPY------------ 277

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                       V + ++  + + Y +      S  EDIM +I  +GP     TVY+DF 
Sbjct: 278 ---------EGAVTQCRIGNDCRRYRVGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFF 328

Query: 139 HYKSGVYKHIT-GDVM--GGHAVKLIGWGTSDDGED-YWILANQWNRSWGADGYFKIKRG 194
           HY+ G+Y+H   GD +  G H+V+++GWG  +D ED YWI+AN W  SWG  GYF+I RG
Sbjct: 329 HYREGIYRHTRHGDQLMRGLHSVRIVGWG--EDAEDKYWIVANSWGTSWGEKGYFRIARG 386

Query: 195 SNECGIEEDVVAGLP 209
            +  GIE  V+  LP
Sbjct: 387 HSGTGIESSVLTVLP 401


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 112/227 (49%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG+  SAW +    GVV++ C 
Sbjct: 114 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCY 172

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P F   G +  G     P P+C+          R+   +   +Q+  N  +    AYR+ 
Sbjct: 173 P-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLG 226

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S  ++IM E+ +NGPV+    V+EDF  Y++G+Y H    +         G H+VK+ GW
Sbjct: 227 SSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGW 286

Query: 164 GTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G     DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 287 GEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 333


>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
          Length = 455

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 97/202 (48%), Gaps = 28/202 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGC 77
           Q+  LS   +++C  +    GCDGG+P    +Y    G+V E C PY   DS       C
Sbjct: 272 QSPVLSPQQVVSCSEY--SQGCDGGFPYLTGKYVQDFGIVDESCFPYMGKDSPCGISQSC 329

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
              Y                  +++  +  +        +M E+ KNGP+ V+  VY DF
Sbjct: 330 RRGYA-----------------AEYKYVGGFYGGCSEAAMMVELVKNGPMAVALEVYSDF 372

Query: 138 AHYKSGVYKH--ITGDV----MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
             YK G+Y H  +T  V    +  HAV L+G+G     G+ YWI+ N W  SWG DGYF+
Sbjct: 373 MSYKGGIYHHTGLTDHVNPFELTNHAVLLVGYGRCHMTGQKYWIVKNSWGSSWGEDGYFR 432

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I+RGS+EC IE   VA  P  K
Sbjct: 433 IRRGSDECAIESIAVAASPIPK 454


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 72/227 (31%), Positives = 109/227 (48%), Gaps = 38/227 (16%)

Query: 10  ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  +S+Q++      LS  +L++C      DGC GG    AW +    GVVT++C 
Sbjct: 232 AAVASDRISIQSMGHMTPQLSPQNLISC-DTRHQDGCAGGRIDGAWWFMRRRGVVTQDCY 290

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKC-------------VKKNQLWRNSKHYSISAYRIN 111
           P+        P  + A    +C+ +                 +  + N  + S   YR++
Sbjct: 291 PF-------SPPEQSAVEVARCMMQSRAVGRGKRQATAHCPNSHSYHNDIYQSTPPYRLS 343

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGW 163
           ++  +IM EI  NGPV+    V+EDF  YKSG+++H   +            H+V++ GW
Sbjct: 344 TNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403

Query: 164 GTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G   D       YWI AN W ++WG DGYF+I RG NEC IE  V+ 
Sbjct: 404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450


>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
          Length = 462

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 107/201 (53%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P C 
Sbjct: 278 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEESCFPY---TGTDAP-C- 330

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K  + C++    + +S+++ +  +    +   +  E+  +GP+ V+F VY+DF 
Sbjct: 331 ------KMKKDCIR----YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
           HY+ G+Y+H           +  HAV L+G+GT    G DYWI+ N W  SWG DG+F+I
Sbjct: 381 HYQKGIYQHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGFFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG +EC IE   +A  P  K
Sbjct: 441 RRGIDECSIESIAMAATPIPK 461


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 103/203 (50%), Gaps = 20/203 (9%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCE 78
           +LS  +LL+C       GC GG    AW +    G+V+  C P+     D+T  + P   
Sbjct: 251 ALSPQNLLSC-DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMM 309

Query: 79  PAYPTPKCVRKCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
            +    +  R+       ++   N  + +   YR++SD +DIM E+ +NGPV+    V+E
Sbjct: 310 HSRSMGRGKRQATAHCPNSRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHE 369

Query: 136 DFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSW 183
           DF  YKSG+YKH    +         G H+VK+ GWG     DG+   YW  AN W  +W
Sbjct: 370 DFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWTAANSWGPTW 429

Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
           G  G+F+I RG+NEC IE  VV 
Sbjct: 430 GEKGHFRILRGANECDIESFVVG 452


>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
 gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
          Length = 569

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 101/202 (50%), Gaps = 30/202 (14%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           L+V D+++C  +     C GG P +  R+     +V E C PY  S   +          
Sbjct: 374 LAVQDIVSCSPY--AQKCHGGIPYAVGRHLRDFNLVPESCFPYKGSENVA---------- 421

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
             C  KC     + + +K+  +S Y   S+  ++M EIY++GP+  S+ +Y DF +Y  G
Sbjct: 422 --CSSKCKNPEYIVKVTKYRYVSDYYGGSNYANMMKEIYEHGPISASYLIYPDFKYYSKG 479

Query: 144 VYKH-----------ITGDVMG----GHAVKLIGWGTS-DDGEDYWILANQWNRSWGADG 187
           +YKH           I  ++ G     H+V + GWG     GE YW + N W+ SWG +G
Sbjct: 480 IYKHSGKGYPMKTDRINREMNGWEPTTHSVVITGWGEDPKTGEKYWNVLNSWSESWGENG 539

Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
            F+IKRG++EC IE + VA  P
Sbjct: 540 RFRIKRGNDECAIEAEGVAFYP 561


>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
           niloticus]
          Length = 455

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/199 (34%), Positives = 100/199 (50%), Gaps = 28/199 (14%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAY 81
           +LS   +++C  +    GCDGG+P    +Y    G+V E C PY   +T C  P      
Sbjct: 275 TLSPQQVVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTPCGVP------ 326

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
                     +K Q    +++  +  +        +M E+ KNGP+ V+F VY DF +YK
Sbjct: 327 ----------QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAFEVYPDFMNYK 376

Query: 142 SGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKR 193
            G+Y H TG         +  HAV L+G+G     G++YWI+ N W   WG +GYF+I+R
Sbjct: 377 EGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRR 435

Query: 194 GSNECGIEEDVVAGLPSSK 212
           G++EC IE   VA  P  K
Sbjct: 436 GNDECAIESIAVAANPIPK 454


>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
          Length = 460

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P   
Sbjct: 276 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP--- 327

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 328 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 376

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY  G+Y H           +  HAV L+G+GT S  G  YWI+ N W  SWG DGYF
Sbjct: 377 FLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYF 436

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 437 RIRRGTDECAIESIAVAATPIPK 459


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 97/195 (49%), Gaps = 17/195 (8%)

Query: 24  LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-EECDP---YFDSTGC--- 72
           LSV    +CC    G     GC GG  +    +  +HG+VT +E  P      + GC   
Sbjct: 91  LSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPY 150

Query: 73  SHPGCEPA-YPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
             P C+ A Y +P C  KC  K      +   H + S  R+ + P++I  EI+ NGPV  
Sbjct: 151 PFPKCKHAGYSSPACQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEIFTNGPVIG 210

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
             ++YED   YK+GVY H TG   G H +K+IGWG  + G+DYW+  N WN  WG  G  
Sbjct: 211 MLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV-ESGQDYWLAVNSWNEEWGDHGMI 269

Query: 190 KIKRGSNECGIEEDV 204
           K+  G    GIE  V
Sbjct: 270 KLAVGRT--GIENSV 282


>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
           niloticus]
          Length = 461

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/199 (34%), Positives = 100/199 (50%), Gaps = 28/199 (14%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAY 81
           +LS   +++C  +    GCDGG+P    +Y    G+V E C PY   +T C  P      
Sbjct: 281 TLSPQQVVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTPCGVP------ 332

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
                     +K Q    +++  +  +        +M E+ KNGP+ V+F VY DF +YK
Sbjct: 333 ----------QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAFEVYPDFMNYK 382

Query: 142 SGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKR 193
            G+Y H TG         +  HAV L+G+G     G++YWI+ N W   WG +GYF+I+R
Sbjct: 383 EGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRR 441

Query: 194 GSNECGIEEDVVAGLPSSK 212
           G++EC IE   VA  P  K
Sbjct: 442 GNDECAIESIAVAANPIPK 460


>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
 gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
          Length = 463

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ GVY H           +  HAV L+G+GT +  G DYWI+ N W  SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   +A  P  K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 101/201 (50%), Gaps = 20/201 (9%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC--- 77
           N SLS   LL+C       GC+GGY   AW Y    GVV + C PY          C   
Sbjct: 250 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIP 308

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYED 136
           +  Y   + +R C   +Q   +S  + ++  Y+++S  EDI  E+  NGPV+ +F V+ED
Sbjct: 309 KRDYTNRQGLR-CPSGDQ---DSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHED 364

Query: 137 FAHYKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGA 185
           F  Y  GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG 
Sbjct: 365 FFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGE 424

Query: 186 DGYFKIKRGSNECGIEEDVVA 206
           DGYFKI RG N C IE  V+ 
Sbjct: 425 DGYFKILRGENHCEIESFVIG 445


>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
           gigas]
          Length = 464

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/163 (39%), Positives = 81/163 (49%), Gaps = 15/163 (9%)

Query: 49  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 108
           AW +    G++TEEC PY  S G     C     T      C   N        Y    Y
Sbjct: 269 AWWFVKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLYVTPPY 318

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIGWG--- 164
           R+  D EDI AEIY+NGPV+ +F V  DF  Y+SGVY+H   D+     +V++IGWG   
Sbjct: 319 RVRQDEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKT 378

Query: 165 -TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
                   YWI  N W   WG  G F+I RG N  GIEE+V+A
Sbjct: 379 NKKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421


>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
          Length = 356

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 105/201 (52%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P C 
Sbjct: 172 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEGCFPY---TGTDSP-C- 224

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K  + C +    + +S +Y +  +    +   I  E+  +GP+ V+F VY DF 
Sbjct: 225 ------KLKKDCFR----YYSSDYYYVGGFYGGCNEALIKLELVHHGPMAVAFEVYNDFL 274

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY  G+Y H           +  HAV L+G+GT S  G+DYWI+ N W  SWG DGYF+I
Sbjct: 275 HYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRI 334

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A  P  K
Sbjct: 335 RRGTDECAIESIAMAATPIPK 355


>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
 gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
          Length = 463

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ GVY H           +  HAV L+G+GT +  G DYWI+ N W  SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   +A  P  K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 94/200 (47%), Gaps = 13/200 (6%)

Query: 21  NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           N  LS  +L+ C G      G   G  +  W Y   HG+V+     Y  + GC      P
Sbjct: 136 NQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPP 191

Query: 80  AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
               P       C  +C   N +     H  +S Y      EDI  E+   GPV V F V
Sbjct: 192 IGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRV 251

Query: 134 YEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           Y+DF  YKSGVY      + +  H  KLIGWG  ++G DYW+L N W   WG +G FKIK
Sbjct: 252 YDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNSWGNEWGQNGLFKIK 310

Query: 193 RGSNECGIEEDVVAGLPSSK 212
           RG+NE  +E+ V AG P  K
Sbjct: 311 RGTNEVHVEDYVYAGEPEIK 330


>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
          Length = 458

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P   
Sbjct: 274 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 325

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+   GP+ V+F VY+D
Sbjct: 326 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 374

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HY+ GVY H           +  HAV L+G+GT +  G DYWI+ N W  SWG +GYF
Sbjct: 375 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 434

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   +A  P  K
Sbjct: 435 RIRRGTDECAIESIALAATPIPK 457


>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
          Length = 333

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/171 (37%), Positives = 89/171 (52%), Gaps = 26/171 (15%)

Query: 58  VVTEECDPYFDSTGCSHPGCEPAY---------PTPKCVRKCVKKNQLWRNSKHYSISAY 108
           +V+  C P F     ++ GC  A           T  C     K N++++ S       Y
Sbjct: 158 LVSHACYPLFKDQNATNNGCAMASRSDGRGKRDATKPCPNNVEKSNRIYQCS-----PPY 212

Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKL 160
           R++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  HAVKL
Sbjct: 213 RVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 272

Query: 161 IGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
            GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ V+A 
Sbjct: 273 TGWGTRRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 323


>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C       GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 279 QTPILSPQEVVSCSQH--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
          Length = 463

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 104/203 (51%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P   
Sbjct: 279 QTPILSPQEIVSCSQY--AQGCNGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 189
           F HY  G+Y H           +  HAV L+G+GT    G DYWI+ N W  SWG +GYF
Sbjct: 380 FLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDPATGVDYWIVKNSWGTSWGENGYF 439

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   VA  P  K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 88/171 (51%), Gaps = 14/171 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC GG   + W +   HG  T EC  Y D+       C PA        + VK +     
Sbjct: 169 GCAGGTSFNVWTFLTEHGTTTLECVRYTDADKDLSSPC-PALCDDGSEIQLVKADGCLDY 227

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
           S + +            IM  +  +GPV+   +VY DF +Y+ GVYKH+ G  +  HAV+
Sbjct: 228 SGNVTA-----------IMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVE 276

Query: 160 LIGWGTSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
           +IG+GT+DD E   YWI+ N    +WG +GYF I RGSNEC IE  V +GL
Sbjct: 277 IIGYGTTDDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSGL 327


>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
          Length = 478

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 105/201 (52%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P C 
Sbjct: 294 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEGCFPY---TGTDSP-C- 346

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K  + C +    + +S +Y +  +    +   I  E+  +GP+ V+F VY DF 
Sbjct: 347 ------KLKKDCFR----YYSSDYYYVGGFYGGCNEALIKLELVHHGPMAVAFEVYNDFL 396

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY  G+Y H           +  HAV L+G+GT S  G+DYWI+ N W  SWG DGYF+I
Sbjct: 397 HYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRI 456

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A  P  K
Sbjct: 457 RRGTDECAIESIAMAATPIPK 477


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/190 (35%), Positives = 99/190 (52%), Gaps = 13/190 (6%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS   LL+C       GC GGY   AW +    G+V +EC P+       +  C+    +
Sbjct: 254 LSAQQLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGK----NDQCKLRKRS 308

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
                 C K +   R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  YKSG
Sbjct: 309 TLKAAGCRKPSHPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFIYKSG 367

Query: 144 VYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSN 196
           +Y+H     +   G H+V++IGWG           YW++AN W  +WG +G FKI++G+N
Sbjct: 368 IYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTN 427

Query: 197 ECGIEEDVVA 206
           EC IE  V+A
Sbjct: 428 ECEIESYVLA 437


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 204 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCY 262

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDI 117
           P+     D  G +      + P  +  R+   +   NQ+  N  +    AYR+ S+ ++I
Sbjct: 263 PFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEI 322

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GWG  T  
Sbjct: 323 MKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLP 382

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 383 DGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 112/227 (49%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG+  SAW +    GVV++ C 
Sbjct: 234 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCY 292

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P F   G +  G     P P+C+          R+   +   +Q+  N  +    AYR+ 
Sbjct: 293 P-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLG 346

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S  ++IM E+ +NGPV+    V+EDF  Y++G+Y H    +         G H+VK+ GW
Sbjct: 347 SSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGW 406

Query: 164 GTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G     DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 407 GEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 453


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDI 117
           P+     D  G +      + P  +  R+   +   NQ+  N  +    AYR+ S+ ++I
Sbjct: 294 PFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEI 353

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GWG  T  
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLP 413

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 71/213 (33%), Positives = 105/213 (49%), Gaps = 32/213 (15%)

Query: 8   RDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
           R  LS  P +S Q +       ++C  +    GC+GG+P + A +Y   +G+V E   PY
Sbjct: 269 RSQLSQKPILSPQQV-------VSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY 319

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              TG   P          C  K     Q +  ++++ +  +    +   +  E+   GP
Sbjct: 320 ---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGP 364

Query: 127 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQW 179
           + V+F VY+DF HY+SGVY H           +  HAV L+G+GT    GE YWI+ N W
Sbjct: 365 LSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSW 424

Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
             SWG  GYF+I+RG++EC IE   V+  P  K
Sbjct: 425 GESWGEKGYFRIRRGTDECAIESIAVSAEPIIK 457


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 70/191 (36%), Positives = 99/191 (51%), Gaps = 14/191 (7%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
           LS   LL+C       GCDGGY   AW +    G+V E+C P+       +  C+    T
Sbjct: 248 LSAQHLLSC-NKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGV----YEQCKLQKRT 302

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
                 C       R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SG
Sbjct: 303 NLEAAGCRAPANPLRKELYKVGPAYRLGNE-TDIMREILTSGPVQATMKVYQDFFSYESG 361

Query: 144 VYKHITGDVM---GGHAVKLIGWG---TSDDGE--DYWILANQWNRSWGADGYFKIKRGS 195
           +Y H     +   G H+V++IGWG   ++D G    YW++ N W + WG +G F+I+RG 
Sbjct: 362 IYMHTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEWGENGLFRIRRGI 421

Query: 196 NECGIEEDVVA 206
           NEC IE  VVA
Sbjct: 422 NECDIESFVVA 432


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 13/194 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           ++  LS   LL+C       GC GGY   AW +    G+V ++C P+    G     C+ 
Sbjct: 308 EDAELSAQHLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNG----QCKL 362

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                     C K     R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  
Sbjct: 363 RKRNNLQAAGCRKPPNPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 421

Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
           YK+G+Y+H     +   G H+V++IGWG           YW++ N W  +WG +G FKI+
Sbjct: 422 YKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVVNSWGYNWGENGLFKIQ 481

Query: 193 RGSNECGIEEDVVA 206
           RG+NEC IE  V+A
Sbjct: 482 RGTNECEIESYVLA 495


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 94/200 (47%), Gaps = 13/200 (6%)

Query: 21  NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           N  LS  +L+ C G      G   G  +  W Y   HG+V+     Y  + GC      P
Sbjct: 136 NQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPP 191

Query: 80  AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
               P       C  +C   N +     H  +S Y      EDI  E+   GPV V F V
Sbjct: 192 IGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRV 251

Query: 134 YEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           Y+DF  YKSGVY      + +  H  KLIGWG  ++G DYW+L N W   WG +G FKIK
Sbjct: 252 YDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNFWGNEWGQNGLFKIK 310

Query: 193 RGSNECGIEEDVVAGLPSSK 212
           RG+NE  +E+ V AG P  K
Sbjct: 311 RGTNEVHVEDYVYAGEPEIK 330


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)

Query: 1   MSVTR--TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 58
           +S TR  ++R AL S       ++ LS   LL+C        C GGY   AW Y    G+
Sbjct: 231 ISATRVASDRFALMSK---GADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLYMRKFGL 286

Query: 59  VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 118
           V E+C P+  +       C+    T      C       R   +    AYR+ ++  DIM
Sbjct: 287 VDEDCYPWEGTNA----QCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGNE-TDIM 341

Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGE----- 170
            EI  +GPV+ +  VY+DF  Y+SG+YKH         G H+V++IGWG           
Sbjct: 342 YEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNL 401

Query: 171 --DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
              YW++ N W + WG  G F+I+RG+NEC IE  VVA
Sbjct: 402 PIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
          Length = 466

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 104/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q+  LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P C 
Sbjct: 282 QSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP-C- 334

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K    C++    +  S+++ +  +    +   +  E+  +GP+ V+F VY+DF 
Sbjct: 335 ------KMKEDCIR----YYTSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFL 384

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
           HY  G+Y H           +  HAV L+G+GT    G DYWI+ N W  SWG  GYF+I
Sbjct: 385 HYNQGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPKTGLDYWIVKNSWGTSWGEQGYFRI 444

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A  P  K
Sbjct: 445 RRGTDECAIESIAMAATPIPK 465


>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
           Cysteine Protease Of The Papain Family
          Length = 438

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 254 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 304

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 305 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 356

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G DYWI+ N W   WG  GYF+I
Sbjct: 357 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 416

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A +P  K
Sbjct: 417 RRGTDECAIESIAMAAIPIPK 437


>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
 gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
 gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
          Length = 462

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G DYWI+ N W   WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A +P  K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 98/203 (48%), Gaps = 20/203 (9%)

Query: 23  SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
           SLS  +LL+C       GC+GG    AW +    G+V+++C P       + P    + P
Sbjct: 107 SLSPQNLLSC-NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYPLASQNSIAEPCRMYSRP 165

Query: 83  TPKCVRKCV-------KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
             +  R+           +  + N  + S   YR++S+ +DIM EI +NGPV+    V+E
Sbjct: 166 MGRGKRQATGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHE 225

Query: 136 DFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSW 183
           DF  YK G+Y+H              G H+VK+ GWG     +G    +W  AN W  +W
Sbjct: 226 DFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRRVKFWRAANSWGPTW 285

Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
           G  G F+I RG NEC IE  VV 
Sbjct: 286 GEGGSFRILRGCNECDIESFVVG 308


>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
           Peptide, 462 aa]
          Length = 462

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S++Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G DYWI+ N W   WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   +A +P  K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461


>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 305

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 87/166 (52%), Gaps = 13/166 (7%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWR 98
           GC GG   ++W +    G +  +C PY    TG S           +C   C +   L  
Sbjct: 146 GCQGGGFNTSWAFLETEGAIMRDCLPYVSGETGLS----------GECPTTC-QDGTLLN 194

Query: 99  NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
           ++ HY   +     +  +IM  +   GPV+  F V+EDF +Y  G+Y    G  +GGHAV
Sbjct: 195 DTIHYKAVSASHLKNYNEIMTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAV 254

Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 204
            ++G+G+ ++  DYWI+ N W   WG +GYF+I RG+NECGIE + 
Sbjct: 255 LIVGYGSMNN-HDYWIVRNSWGSDWGENGYFRILRGTNECGIENNA 299


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/158 (39%), Positives = 89/158 (56%), Gaps = 16/158 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
           +S  DLL+CC   CG GC GG+P  AW +++ +G+VT         C  Y     C+H  
Sbjct: 22  ISSTDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGGSKENPSGCRSY-PFPKCNHHG 79

Query: 75  -----PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
                P  E  +PTP C + C      +   K  + S+Y + +  + IM EI +NGPVE 
Sbjct: 80  KGPDAPCPEKIFPTPACNKTCDTPEVNYILDKTKAKSSYNVPNSEKAIMKEIMQNGPVEA 139

Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
           +F VYEDF HY+SGVY H  G ++GGHA++++GWG  +
Sbjct: 140 AFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEEN 177


>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
 gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
          Length = 218

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 28/180 (15%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC  GY  +A+++  + G+VTE C P+    G            P C +KC+  N     
Sbjct: 54  GCSYGYFDTAFQFVENQGIVTENCFPFVSGEGNY---------IPPCPKKCLAYNPF--- 101

Query: 100 SKHYSISAYRINSD----PEDIMA---EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
                 + +++N+     P+DI      I   G +  S  +Y DF  Y+ GVY+H+ G+ 
Sbjct: 102 ------TLFKVNNSRAFLPQDIQGMQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNY 155

Query: 153 MGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           M  H+V+++GWG +   +    YWI  N W   WG  G+F I RGSNEC IE DV    P
Sbjct: 156 MFTHSVRIVGWGITSPQQGSIPYWICGNNWTEEWGMQGWFWILRGSNECNIELDVWETTP 215


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 68/193 (35%), Positives = 100/193 (51%), Gaps = 15/193 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
           + LS   LL+C       GC GG+   AW +    G+V E C P+  ST      C    
Sbjct: 249 VELSAQHLLSC-NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTE----TCRLRK 303

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
            T      C       R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+
Sbjct: 304 RTDLRSAGCAPPPNPLRTELYKVGPAYRLANE-TDIMQEILTSGPVQATMRVYQDFFSYE 362

Query: 142 SGVYKH-ITGDVMGG--HAVKLIGWG------TSDDGEDYWILANQWNRSWGADGYFKIK 192
           SGVYKH +T ++     H+V++IGWG      + +    YW++AN W + WG +G F+I+
Sbjct: 363 SGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQ 422

Query: 193 RGSNECGIEEDVV 205
           +G+NEC IE  V+
Sbjct: 423 KGTNECEIESFVL 435


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 106/218 (48%), Gaps = 21/218 (9%)

Query: 1   MSVTR--TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 58
           +S TR  ++R AL S       ++ LS   LL+C        C GGY   AW Y    G+
Sbjct: 231 ISTTRVASDRFALMSK---GADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLYMRKFGL 286

Query: 59  VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 118
           V E+C P+  +    +  C+    T      C       R   +    AYR+ ++  DIM
Sbjct: 287 VDEDCYPWEGT----NVQCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGNE-TDIM 341

Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGE----- 170
            EI  +GPV+ +  VY+DF  Y+SG+YKH         G H+V++IGWG           
Sbjct: 342 YEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNL 401

Query: 171 --DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
              YW++ N W + WG  G F+I+RG+NEC IE  VVA
Sbjct: 402 PIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
          Length = 483

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 299 QTPILSPQEVVSCSMY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 349

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        +  S +Y +  +    +   +  E+ ++GP+ V+F V +DF 
Sbjct: 350 PCKPKENCLR--------YYTSGYYYVGGFYGGCNEALMKLELVQHGPMAVAFEVQDDFL 401

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G   D G DYW + N W   WG  GYF+I
Sbjct: 402 HYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRI 461

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 462 RRGTDECAIESIAVAAIPIPK 482


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 100/194 (51%), Gaps = 13/194 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + + LS   LL+C       GC GGY   AW +    G+V EEC P+   TG  +  C  
Sbjct: 250 ETVELSAQHLLSC-NNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPW---TG-RNDQCRL 304

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
              +      C       R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  
Sbjct: 305 RKRSNLKTAGCQNPPNSLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 363

Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
           Y+SGVY+H     +   G H+V++IGWG           YW++AN W  +WG +G F+I+
Sbjct: 364 YQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQ 423

Query: 193 RGSNECGIEEDVVA 206
           +G+NEC IE  V+A
Sbjct: 424 KGTNECEIESYVLA 437


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 100/198 (50%), Gaps = 14/198 (7%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N +LS   LL+C       GC+GGY   AW Y    GVV + C PY  S     PG    
Sbjct: 232 NSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 289

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                  R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  
Sbjct: 290 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 349

Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
           Y  GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG DGY
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409

Query: 189 FKIKRGSNECGIEEDVVA 206
           FK+ RG N C IE  V+ 
Sbjct: 410 FKVLRGENHCEIESFVIG 427


>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
           [Cricetulus griseus]
          Length = 470

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY  +         
Sbjct: 286 QTPILSPQEVVSCSMY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 336

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        +  S +Y +  +    +   +  E+ ++GP+ V+F V +DF 
Sbjct: 337 PCKPKENCLR--------YYTSGYYYVGGFYGGCNEALMKLELVQHGPMAVAFEVQDDFL 388

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G   D G DYW + N W   WG  GYF+I
Sbjct: 389 HYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRI 448

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 449 RRGTDECAIESIAVAAIPIPK 469


>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 95/185 (51%), Gaps = 27/185 (14%)

Query: 38  GDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 96
             GC+GG+P + A +Y    G+V E C PY   TG   P              C  K   
Sbjct: 295 AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--------------CKMKEDC 337

Query: 97  WR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------I 148
           +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y H       
Sbjct: 338 FRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPF 397

Query: 149 TGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
               +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA 
Sbjct: 398 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAA 457

Query: 208 LPSSK 212
            P  K
Sbjct: 458 TPIPK 462


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 69/215 (32%), Positives = 102/215 (47%), Gaps = 15/215 (6%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           + + L+   LLAC        C GG+  +AW+Y    GVV +EC PY  +       C+ 
Sbjct: 343 EQVQLAPQQLLACVRR--QQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKN----QCKI 396

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
                     C     + R + +    AY +N++  DIM EI + G V+    VY DF  
Sbjct: 397 NDGDTLVSANCELPANVNRTAMYRMGPAYSLNNE-TDIMTEIKERGTVQAILRVYRDFFS 455

Query: 140 YKSGVYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
           Y++G+Y+H        +    H+V+LIGWG    G D   YWI  N W   WG +G F+I
Sbjct: 456 YQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGENGRFRI 515

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
            RG+NEC IE  V+A  P     V+ + +    ++
Sbjct: 516 LRGTNECEIESYVLASNPYVHQHVQTVRNVGDLQE 550


>gi|437323|gb|AAB00354.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 133

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 65/159 (40%), Positives = 80/159 (50%), Gaps = 51/159 (32%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
           LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y D TGC        Y
Sbjct: 25  LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCK------PY 76

Query: 82  PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
           P P                                         P EV+FTVYEDF HY 
Sbjct: 77  PYP-----------------------------------------PFEVAFTVYEDFEHYS 95

Query: 142 SGVYKHITGDVM-GGHAVKLIGWGTSDDGEDYWILANQW 179
            GVY H  G  + GGHAVK++GWG  D+G  YW++AN W
Sbjct: 96  GGVYVHTAGASLGGGHAVKMLGWGV-DNGTPYWLIANSW 133


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 108/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 196 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 254

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
           P+      S  G + A P P C+       +  R             N  +    AYR+ 
Sbjct: 255 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 308

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GW
Sbjct: 309 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 368

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 369 GEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 415


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 99/197 (50%), Gaps = 14/197 (7%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           ++ + LS   L++C       GC GGY   AW +    GVV E+C P+          C 
Sbjct: 280 IEKVQLSGQHLISC-NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSG---RSDKCR 335

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                      C ++N     ++ Y +  AYR+ ++  DIM EI  +GPV+ +  V+ DF
Sbjct: 336 IPRRGKLSDAGCQRRNSYNLRNEMYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVHRDF 394

Query: 138 AHYKSGVYKH---ITGDVMGGHAVKLIGWGTSDDGED-----YWILANQWNRSWGADGYF 189
            HY+SG+Y H         G H+V+++GWG      +     +W +AN W R WG DGYF
Sbjct: 395 FHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYF 454

Query: 190 KIKRGSNECGIEEDVVA 206
           +I RG+NEC IE  V+ 
Sbjct: 455 RIVRGNNECEIESFVLG 471


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 74/208 (35%), Positives = 97/208 (46%), Gaps = 31/208 (14%)

Query: 24  LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------------EECDPY 66
           LS+  L +CC    G    +GC  G       +  +HG+VT             + C PY
Sbjct: 18  LSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEKLGNDDGCWPY 77

Query: 67  FDSTGCSH-PGCEPAYP-------TPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPED 116
                C+H PG E  YP        P C   C  K      +   H + S  R+   PE 
Sbjct: 78  -PFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEK 136

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI+ NGPV    T+YEDF +YKSGVY H TG ++  H +KLIGWG  + G++YW+  
Sbjct: 137 IKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGV-ESGQEYWLAM 195

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDV 204
           N WN  WG  G  K+  G    G+E  V
Sbjct: 196 NAWNEEWGDHGMIKLAVGKT--GLEHQV 221


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 92/195 (47%), Gaps = 29/195 (14%)

Query: 24  LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------EE------CDPY 66
           LS+  L +CC    G    +GC  G       +  +HG+VT       EE      C PY
Sbjct: 199 LSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPY 258

Query: 67  FDSTGCSH-PGCEPAYP-------TPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPED 116
                C+H PG E  YP        P C   C  K      +   H + S  R+   PE 
Sbjct: 259 -PFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEK 317

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI+ NGPV    T+YEDF  YKSGVY H TG ++  H +KLIGWG  + G++YW+  
Sbjct: 318 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGV-ESGQEYWLAV 376

Query: 177 NQWNRSWGADGYFKI 191
           N WN  WG  G  K+
Sbjct: 377 NAWNEEWGDHGMIKL 391


>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
 gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
          Length = 488

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 98/192 (51%), Gaps = 27/192 (14%)

Query: 25  SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
           S  D++ C  +    GCDGG+     +Y   +G+  E CDPY         G +      
Sbjct: 310 SPQDIVECSAY--SQGCDGGFMYLVSKYAEDYGLAEESCDPY--------KGVDSVCKKD 359

Query: 85  KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 144
           +C ++    N  +    + + +A       +++M E+Y  GP+ ++F VY+DF +YK GV
Sbjct: 360 QCPKRAYGTNYAYTGGFYGATNA-------KNMMYELYHGGPLAIAFEVYDDFFNYKGGV 412

Query: 145 YKHIT---------GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           Y H T         G     HAV L+GWG  ++G  YW++ N W  SWG +G+FKIKRG+
Sbjct: 413 YTHSTALKTKIAEPGWEETNHAVLLVGWG-EENGVPYWLVKNSWGTSWGINGFFKIKRGT 471

Query: 196 NECGIEEDVVAG 207
           +EC  E + V+ 
Sbjct: 472 DECDCESEAVSA 483


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 70/187 (37%), Positives = 94/187 (50%), Gaps = 28/187 (14%)

Query: 23  SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
           +LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY +   
Sbjct: 79  NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRP- 132

Query: 72  CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
           C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I  E
Sbjct: 133 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 192

Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
           I   GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N WN
Sbjct: 193 IMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 252

Query: 181 RSWGADG 187
            +WG DG
Sbjct: 253 SNWGNDG 259


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 94/164 (57%), Gaps = 20/164 (12%)

Query: 45  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 104
           Y  +AW Y+++ G+ +     Y  S GC  P  E ++   +   +CVK            
Sbjct: 150 YIKNAWDYYINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECVK------------ 193

Query: 105 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
              Y + ++   I  EI  NGPV   + V+EDFA +KSGVY + +G  +G H+VK+IGWG
Sbjct: 194 --FYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWG 251

Query: 165 TSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 207
           T ++G  YW++AN W   WG   G+FK++RG+NEC IE+++ AG
Sbjct: 252 T-EEGIPYWLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAG 294


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 130 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCY 188

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+      S  G + A P P C+          R+   +   + +  N  +    AYR+ 
Sbjct: 189 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 242

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GW
Sbjct: 243 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 302

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 303 GEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 349


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 64/91 (70%), Gaps = 1/91 (1%)

Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
           I  EI K GPVE +F VYEDF +YKSG+YKHITG +   HA+++IGWG  ++   YW++ 
Sbjct: 3   IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWG-EENNTPYWLIP 61

Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           N WN  WG +G F+I RG +EC IE +V AG
Sbjct: 62  NSWNEDWGENGNFRILRGRHECSIESEVTAG 92


>gi|37905530|gb|AAO64478.1| cathepsin C precursor [Fundulus heteroclitus]
          Length = 450

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 94/197 (47%), Gaps = 26/197 (13%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYP 82
           LS   +++C  +    GCDGG+P    +Y    G+V E C PY  + + C  P       
Sbjct: 271 LSPQQVVSCSEY--SQGCDGGFPYLIGKYVQDFGIVDESCFPYIAADSPCGVP------- 321

Query: 83  TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
              C R           +++  +  +        +  E+ KNGP+ V+  VY DF HYK 
Sbjct: 322 -QNCGRM--------YTAEYRYVGGFYGGCSETAMKLELVKNGPMAVALEVYPDFMHYKE 372

Query: 143 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 195
           G+Y H      +    +  HAV L+G+G     G+ YWI+ N W   WG DGYF+I+RGS
Sbjct: 373 GIYHHTGFRDSVNPFELTNHAVLLVGYGRCHKTGQKYWIVKNSWGSGWGEDGYFRIRRGS 432

Query: 196 NECGIEEDVVAGLPSSK 212
           +EC IE   VA  P  K
Sbjct: 433 DECAIESIAVAAKPIPK 449


>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
          Length = 447

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 263 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 313

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 314 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 365

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 366 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 425

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 426 RRGTDECAIESIAVAAIPIPK 446


>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
          Length = 463

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 105/202 (51%), Gaps = 27/202 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY    G   P C 
Sbjct: 279 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEESCFPY---KGIDVP-C- 331

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K  + CV+    +  S+++ +  +    +   +  E+ ++GP+ V+F VY+DF 
Sbjct: 332 ------KVKKDCVR----YYTSEYHYVGGFYGGCNEALMKLELVQHGPMAVAFEVYDDFL 381

Query: 139 HYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
           HY  G+Y H TG         +  HAV L+G+GT    G DYWI+ N W   WG DGYF+
Sbjct: 382 HYHKGIY-HRTGLRDPFNPFELTNHAVLLVGYGTDPVSGRDYWIVKNSWGTGWGEDGYFR 440

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I RG++EC IE   +A  P  K
Sbjct: 441 ILRGTDECAIESIAMAATPIPK 462


>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
 gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
 gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
 gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
 gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
 gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
 gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
          Length = 462

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 104/215 (48%), Gaps = 17/215 (7%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
           ++R A+ S  +  ++   LS   L++C  F   +G  G      W Y    GVV+  C P
Sbjct: 324 SDRLAIQSKNFTVVE---LSPQHLVSC--FSSHEG-RGERLDRTWWYLRKKGVVSTVCYP 377

Query: 66  YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
              S      G             C   N +  N  + +   YR++S+ E+IM EI++NG
Sbjct: 378 ESRSKSTQGIGSCGLVAHSSGAHICPNGNVISSNEIYKTSPVYRVSSNEENIMKEIFENG 437

Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG---TSDDGEDYWI 174
           PV+    V  DF  YKSGVY     D +          H+VK+IGWG   +  +   YWI
Sbjct: 438 PVQAVMRVQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWI 497

Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           + N W  +WG  GYF+I++G NECGIEE ++A  P
Sbjct: 498 VQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWP 532


>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
          Length = 462

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461


>gi|291236490|ref|XP_002738176.1| PREDICTED: cathepsin C-like [Saccoglossus kowalevskii]
          Length = 438

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 24/196 (12%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N+++S  D++ CC +    GC GG+P    +Y    G V E C PY    G       P 
Sbjct: 258 NITISPQDVVQCCNY--SQGCSGGFPYLVSKYSEDFGFVEETCLPYTAQDG-------PC 308

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
               KC R           +K+  +  +    +   +  E+ KNGP+ V+F VY+DF  Y
Sbjct: 309 VSEIKCKRH--------YGTKYRYVGDFYGGCNEALMKIELVKNGPMAVAFMVYDDFMSY 360

Query: 141 KSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKIKR 193
           + G+Y H           +  HAV L+G+G   D  E +WI+ N W   WG +GYF+I+R
Sbjct: 361 QGGIYHHTGLQDKFNPFEITNHAVLLVGYGYDHDTKEKFWIVKNSWGTGWGEEGYFRIRR 420

Query: 194 GSNECGIEEDVVAGLP 209
           G++EC IE   V   P
Sbjct: 421 GNDECSIESIAVESTP 436


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 108/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 302 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 360

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
           P+      S  G + A P P C+       +  R             N  +    AYR+ 
Sbjct: 361 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 414

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GW
Sbjct: 415 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 474

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 475 GEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 521


>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
          Length = 462

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/193 (36%), Positives = 99/193 (51%), Gaps = 13/193 (6%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC--EPA 80
           LS   LL+C   L   GC GG+   AW +    G++TEEC P+    + C+ P    E  
Sbjct: 246 LSPQHLLSC-NNLNQQGCQGGHLTRAWNWIRKFGLITEECYPWQGRMSTCAVPKKKKETM 304

Query: 81  YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
              P  VR     ++  +   H     YR+ ++ E IM EI  +GPV+    V  DF  Y
Sbjct: 305 AQCPSRVRS--NNDRTTKTRLHRVGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMY 361

Query: 141 KSGVYK---HITGDVMGGHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRG 194
           KSGVYK     +G   G H+V+++GWG    G     YWI +N W   WG +GYF+I +G
Sbjct: 362 KSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKG 421

Query: 195 SNECGIEEDVVAG 207
            +EC IE+ V+A 
Sbjct: 422 VDECEIEDFVIAA 434


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 29/200 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY--FDSTGCSHPG 76
           Q   LS  ++++C  +    GC+GG+P + A +Y    GVV EEC PY   DS+      
Sbjct: 285 QQFVLSPQEIVSCGKY--SQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSR 342

Query: 77  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
           C   Y T                  +  +  +    + E +  E+ KNGP+ V+F VY D
Sbjct: 343 CGRGYAT-----------------NYRYVGGFYGGCNEELMQLELVKNGPMAVAFEVYSD 385

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
           F HYK GVY+H           +  HAV L+G+G   + G  +W + N W   WG +G+F
Sbjct: 386 FMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWTVKNSWGEKWGEEGFF 445

Query: 190 KIKRGSNECGIEEDVVAGLP 209
           +I+RG++EC IE   VA  P
Sbjct: 446 RIRRGTDECAIESIAVAADP 465


>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
          Length = 191

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 7   QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 57

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C+R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 58  PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMELELVKHGPMAVAFEVHDDFL 109

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 110 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 169

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 170 RRGTDECAIESIAVAAIPIPK 190


>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 470

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 102/206 (49%), Gaps = 32/206 (15%)

Query: 8   RDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
           R  LS  P +S Q +       ++C  +    GC+GG+P + A +Y   +G+V E   PY
Sbjct: 269 RSQLSQKPILSPQQV-------VSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY 319

Query: 67  FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
              TG   P          C  K     Q +  ++++ +  +    +   +  E+   GP
Sbjct: 320 ---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGP 364

Query: 127 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQW 179
           + V+F VY+DF HY+SGVY H           +  HAV L+G+GT    GE YWI+ N W
Sbjct: 365 LSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSW 424

Query: 180 NRSWGADGYFKIKRGSNECGIEEDVV 205
             SWG  GYF+I+RG++EC IE   V
Sbjct: 425 GESWGEKGYFRIRRGTDECAIESIAV 450


>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
          Length = 202

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 91/176 (51%), Gaps = 20/176 (11%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-FDSTGCSHP 75
           +S  D+L+CCG  CG GC GG  I AW++ + +GV T         C PY F   G    
Sbjct: 27  ISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNGVCTGGPCGYKYGCRPYAFHPCGVHKD 86

Query: 76  GC------EPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
                     +Y TP+C + C +      +   ++Y+ SAY + +D + IM EI + GPV
Sbjct: 87  QVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRYYAASAYFVKNDTKAIMREIMRGGPV 146

Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED----YWILANQW 179
             ++  Y DF  YK GVY+H  G+  GGH++K++GWG           YW++AN W
Sbjct: 147 HGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIMGWGNYKHPNGTVIPYWLVANSW 202


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 101/199 (50%), Gaps = 15/199 (7%)

Query: 21  NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
           N  LS  +L++C G    + G    Y +  W Y  +HG+V+     Y  + GC      P
Sbjct: 140 NQLLSTEELISCSGIKEDEFGSVNDYYV--WEYLKNHGLVS--GGKYNTNNGCQPSKIPP 195

Query: 80  AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
               P       C ++C   N +  N  H  I  +  + + EDI  E+   GPV ++F V
Sbjct: 196 IGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIEYEDIQREVQNYGPVSMAFKV 254

Query: 134 YE-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           ++ DF  YKSGVY+  T  + +     KLIGWG  ++G DYW+L N W   WG +G FKI
Sbjct: 255 FDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVDYWLLVNFWGYEWGQNGLFKI 313

Query: 192 KRGSNECGIEEDVVAGLPS 210
           KRG++EC IE  V AG P 
Sbjct: 314 KRGTDECNIETFVHAGEPQ 332


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/198 (35%), Positives = 99/198 (50%), Gaps = 13/198 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N  LS  +L++C G +  D          W Y  +HG+V+     Y  + GC      P 
Sbjct: 140 NQLLSTEELISCSG-IKEDEFGSVNDDYVWEYLKNHGLVS--GGKYNTNNGCQPSKIPPI 196

Query: 81  YPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
              P       C ++C   N +  N  H  I  +  + + EDI  E+   GPV ++F V+
Sbjct: 197 GNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIEYEDIQREVQNYGPVSMAFRVF 255

Query: 135 E-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
           + DF  YKSGVY+  T  + +     KLIGWG  ++G DYW+L N W   WG +G FKIK
Sbjct: 256 DNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVDYWLLVNSWGYEWGQNGLFKIK 314

Query: 193 RGSNECGIEEDVVAGLPS 210
           RG++EC IE  V AG P 
Sbjct: 315 RGTDECNIETFVHAGEPQ 332


>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
 gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
          Length = 457

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 93/196 (47%), Gaps = 24/196 (12%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
            S   +++C  +    GCDGG+P    +Y    G+V E C PY    G   P   P    
Sbjct: 278 FSPQQVVSCSQY--SQGCDGGFPYLIGKYVQDFGIVEESCYPY---AGTDSPCDVPD--- 329

Query: 84  PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
             C+R           S +  +  +        +M E+ KNGP+ V+F VY DF HYK G
Sbjct: 330 -GCLRH--------YTSDYSYVGGFYGGCSESAMMLELVKNGPMGVAFEVYPDFMHYKEG 380

Query: 144 VYKHI------TGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSN 196
           +Y H           +  HAV L+G+G     G+ +W++ N W   WG +G+FK++RGS+
Sbjct: 381 IYHHTGLHDSYNPFELTNHAVLLVGYGQCHVTGQKFWVVKNSWGTKWGEEGFFKVRRGSD 440

Query: 197 ECGIEEDVVAGLPSSK 212
           EC IE   VA  P  K
Sbjct: 441 ECAIESIAVAAKPIPK 456


>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
          Length = 575

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 102/201 (50%), Gaps = 31/201 (15%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A ++    G+V E C PY   TG   P   
Sbjct: 391 QTPILSPQEVVSCSQY--AQGCEGGFPYLVAGKHAQDFGLVEEACFPY---TGTDAP--- 442

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K    R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 443 -----------CTMKEGCRRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 491

Query: 137 FAHYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGY 188
           F HY  G+Y H TG         +  HAV L+G+GT S  G  YWI+ N W   WG DGY
Sbjct: 492 FLHYHRGIYHH-TGLTDPFNPFELTNHAVLLVGYGTDSATGIQYWIVKNSWGTGWGEDGY 550

Query: 189 FKIKRGSNECGIEEDVVAGLP 209
           F+I+RG++EC IE   VA  P
Sbjct: 551 FRIRRGTDECAIESIAVAATP 571


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 84/155 (54%), Gaps = 11/155 (7%)

Query: 63  CDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPE 115
           C+PY       +  G S    +P     +C R C     L  N  H ++   Y +     
Sbjct: 15  CEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG-- 72

Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWI 174
            I  ++   GP+E SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW+
Sbjct: 73  SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWL 131

Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           + N W+  WG +G FKI+RG++ECGI+    AG+P
Sbjct: 132 MVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 166


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/168 (39%), Positives = 85/168 (50%), Gaps = 14/168 (8%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
            C GGY   +W +  + G   + C PY    G         + +  C  +C  K      
Sbjct: 205 ACQGGYLKYSWTFLENTGTPLDSCIPYASGRG--------TFSSGTCPTQC--KIASMSM 254

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
           SK+ + +   I S   +I   I   G V+  FTVY D   YKSGVYKHI   V+GGHAV 
Sbjct: 255 SKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVA 313

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           LIG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 314 LIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 358


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LLAC       GC GG    AW +    GVV++ C 
Sbjct: 90  AAVASDRVSIHSLGHMTPVLSPQNLLACDTH-HQQGCRGGRLDGAWWFLRRRGVVSDHCY 148

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+   +  N    N+  Y ++  YR+ S+ ++I
Sbjct: 149 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEI 208

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GWG  T  
Sbjct: 209 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 268

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 269 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 309


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 16/164 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC+GG P++A+ +  + G V   C  Y          C       KC      +N +   
Sbjct: 208 GCNGGEPVNAFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV--- 259

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
               + S  +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+
Sbjct: 260 ----ATSGAKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVE 311

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           ++G+G +D G DYW + N W   WG DGYF+I RG +ECGIE++
Sbjct: 312 IVGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEQE 355


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 82/167 (49%), Gaps = 14/167 (8%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C GGY   +W +  + G   + C PY    G    G  P     +C    +  ++     
Sbjct: 169 CQGGYLKYSWTFLENTGTPLDTCIPYASGRGTFSSGTCPT----QCKIASMSMSK----- 219

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
             Y     R  +   +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV L
Sbjct: 220 --YKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 277

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           IG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 278 IGFGV-EGGSNYWLAANSWGANWGMSGYFKIAQG--EGGIENQVYAG 321


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/167 (37%), Positives = 82/167 (49%), Gaps = 14/167 (8%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C GGY   +W +  + G   + C PY    G         + +  C  +C   +    + 
Sbjct: 69  CQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPTQCKIASM---SM 117

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
             Y     R  +   +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV L
Sbjct: 118 SKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 177

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           IG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 178 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 221


>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
          Length = 460

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/207 (35%), Positives = 101/207 (48%), Gaps = 37/207 (17%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q    S   +++C  +    GCDGG+P + A +Y    GVV E+C PY            
Sbjct: 276 QKPVFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY------------ 321

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-----NSDPEDIMA-EIYKNGPVEVSFT 132
            A  TP C+ K        R+  HY  S Y        +  E +M  E+  +GP+ V+F 
Sbjct: 322 TAKDTP-CLFK--------RSCYHYYTSEYHYVGGFYGACNEALMKLELVLSGPMAVAFE 372

Query: 133 VYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGA 185
           VY DF  YK G+Y H           +  HAV L+G+G   + GE +WI+ N W  SWG 
Sbjct: 373 VYNDFMFYKEGIYHHTGLKDEFNPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGE 432

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSK 212
           DGYF+I+RG++EC IE   VA  P  K
Sbjct: 433 DGYFRIRRGTDECAIESIAVAATPIPK 459


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 86/169 (50%), Gaps = 14/169 (8%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C GGY   +W +  + G   + C PY    G         + +  C  +C  K      S
Sbjct: 154 CQGGYLKYSWTFLENTGTPLDTCIPYASGGG--------TFSSGTCPTQC--KIASMSMS 203

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           K+ + +   I S   +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV L
Sbjct: 204 KYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVAL 262

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           IG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG P
Sbjct: 263 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 109/221 (49%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG+   AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPILSPQNLLSCNTHH-QQGCRGGHLDGAWWFLRRRGVVSDHCY 293

Query: 65  PYF----DSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G   P    +  T +  R+      N    N+  Y ++ AYR+ S+  +I
Sbjct: 294 PFLGRERDKAGPVPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 353

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+VK+ GWG  T  
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWP 413

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 414 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 101/190 (53%), Gaps = 20/190 (10%)

Query: 24  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
            S  +LL CC   C   C GGY   AW Y+++ G+V+     Y  S GC  P  + ++  
Sbjct: 130 FSPENLLTCCED-CRLECVGGYTAKAWDYYINEGIVSG--GDYNSSEGC-QPYSKASFQY 185

Query: 83  --TPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
               KCV+ C   K +  + + KHY  S Y + ++   I  EI  NGPV  +F V+ED  
Sbjct: 186 AVASKCVKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNVFEDII 245

Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG-ADGYFKIKRGSNE 197
           +YKSG+             V ++ WGT ++G  YW++AN W   WG   G+ KIKRG+NE
Sbjct: 246 YYKSGIQL---------SNVSILRWGT-EEGVPYWLIANSWGTWWGDLGGFIKIKRGTNE 295

Query: 198 CGIEEDVVAG 207
           C IE+++ AG
Sbjct: 296 CAIEQEMAAG 305


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 78/227 (34%), Positives = 106/227 (46%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCQGGRLDGAWWFLRRRGVVSDHCY 188

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
           P+       H   E A P P+C+       +  R             N  +    AYR+ 
Sbjct: 189 PF-----SGHERNE-AGPAPRCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 242

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKLIGW 163
           S+ +DIM E+ +NGPV+    V+EDF  Y+SG+Y H              G H+VK+ GW
Sbjct: 243 SNEKDIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGW 302

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 303 GEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLG 349


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 100/203 (49%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS   +++C  +    GCDGG+P + A +Y    G+V E   PY    G   P   
Sbjct: 274 QKPILSPQQVVSCSNY--SQGCDGGFPYLIAGKYLNDFGIVEESDFPYI---GSDSP--- 325

Query: 79  PAYPTPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K+  Q +  ++++ +  +    +   +  E+   GP+ V+F VY+D
Sbjct: 326 -----------CTLKDSYQRYYTAEYHYVGGFYGGCNEAYMKLELVLGGPLSVAFEVYDD 374

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYF 189
           F HY+SGVY H           +  HAV L+G+GT    GE YWI+ N W  SWG  G+F
Sbjct: 375 FIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFF 434

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RGS+EC IE   V+  P  K
Sbjct: 435 RIRRGSDECAIESIAVSANPIIK 457


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 88/164 (53%), Gaps = 16/164 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC+GG P++A+ +  + G V   C  Y          C       KC      +N +   
Sbjct: 205 GCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGSAVENVV--- 256

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
               + S  +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+
Sbjct: 257 ----ATSGSKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVE 308

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           +IG+G +D G DYW + N W   WG DGYF+I RG +ECGIE +
Sbjct: 309 IIGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEHE 352


>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
          Length = 453

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 96/203 (47%), Gaps = 30/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHP-GC 77
           Q    S   +++C  +    GCDGG+P    +Y    G+V E C PY    + C  P  C
Sbjct: 270 QTPVFSPQQVVSCSEY--SQGCDGGFPYLIGKYSQDFGIVEESCFPYIAKDSPCGVPQNC 327

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
             AY                  +++  +  +        +M E+  +GP+ V+F VY DF
Sbjct: 328 GRAY-----------------TAEYKYVGGFYGGCSEMAMMKELVHHGPMAVAFEVYPDF 370

Query: 138 AHYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
            HY  G+Y H TG         +  HAV L+G+G     GE YWI+ N W  SWG +G+F
Sbjct: 371 MHYAGGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGEKYWIVKNSWGTSWGENGFF 429

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RGS+EC IE   VA  P  K
Sbjct: 430 RIRRGSDECSIESIAVAATPIPK 452


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 105/217 (48%), Gaps = 15/217 (6%)

Query: 19  LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGC 77
           L   +LS   LL+C   L   GC GG+  SAW + +  G+VTEEC P+   +T C+    
Sbjct: 268 LMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNWVMTFGLVTEECYPWDGRATDCAVSNQ 326

Query: 78  EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
                      +  K + L R    Y ++        E IM EI   G V+    V ++F
Sbjct: 327 RSNNNLIVTCPRSAKTSPLRRVGLMYRVAT------EEGIMYEIMNWGSVQAMMKVSKEF 380

Query: 138 AHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKI 191
             Y+SGVYK    D+    G H V+++GWG          YWI++N W   WG  GYF+I
Sbjct: 381 FMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 228
            +G+NEC IE+ VVA +P   N    I+     E+AS
Sbjct: 441 LKGTNECQIEDFVVAAMPDIDNFCN-ISDQSFRENAS 476


>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
          Length = 462

 Score =  107 bits (266), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 101/201 (50%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GCDGG+P + A +Y    GVV E C PY            
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P  P   C R        + +S +Y +  +    +   +  E+ K+GP+ V+F V++DF 
Sbjct: 329 PCKPRENCHR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
           HY SG+Y H           +  HAV L+G+G     G +YWI+ N W  +WG  GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
           +RG++EC IE   VA +P  K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 183 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 241

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+             A PTP+C+          R+   +    Q+  N  +    AYR+ 
Sbjct: 242 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 295

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H              G H+VK+ GW
Sbjct: 296 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 355

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 356 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 402


>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
 gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
          Length = 257

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 87/179 (48%), Gaps = 15/179 (8%)

Query: 30  LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 89
           L  C      GC+GG P  AW Y   HG+ T  C PY    G              CV+ 
Sbjct: 87  LVSCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKN 136

Query: 90  CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 149
               N+ +   +   ++  +  +  E I  +I K GP++ +  VY DF  Y SGVY    
Sbjct: 137 SCVDNEQYTLYRAKPLT-LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTP 195

Query: 150 G-DVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  ++GGHA+K++GWG      ++YWI+AN W  SWG DG+F I    ++CGI  D  A
Sbjct: 196 GSSLLGGHAIKIVGWGFDQASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACA 252


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 292

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+             A PTP+C+          R+   +    Q+  N  +    AYR+ 
Sbjct: 293 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 346

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H              G H+VK+ GW
Sbjct: 347 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453


>gi|325180819|emb|CCA15230.1| cathepsinlike cysteine protease putative [Albugo laibachii Nc14]
          Length = 660

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 99/206 (48%), Gaps = 8/206 (3%)

Query: 5   RTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           + +R+    SP   L+ + L+   LL C       GC GG P+SA+RY   +G+  E C 
Sbjct: 94  QRSRNRKEKSPVDVLREVVLAPQVLLNC--DTADGGCHGGDPLSAFRYIHENGIPDESCQ 151

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISAYRINSDPEDIMAEIY 122
            Y ++TG  H       P   C   C      W  ++ + Y +S +        + AEIY
Sbjct: 152 RY-EATG--HDTGNQCRPQDVC-ENCAPSRGCWAQKSYEKYYVSEFGTVRGEHQMKAEIY 207

Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
             G +  +  V + F +Y+ GV+   T  V   HA+ ++GWG   DG  YW++ N W   
Sbjct: 208 ARGSIVCTVDVTDAFLNYEGGVFDDKTHAVSMDHAISVVGWGEMKDGTKYWVVRNSWGSF 267

Query: 183 WGADGYFKIKRGSNECGIEEDVVAGL 208
           WG DG+F+I RG N  GIE +   G+
Sbjct: 268 WGEDGWFRIVRGVNNLGIESECTFGV 293



 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 8/191 (4%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPG-CEP 79
           ++LS   L+ C G   G  C GG P   + Y   HG+  + C  Y   +  C+    CE 
Sbjct: 444 IALSPQVLINCHG---GGSCAGGNPGLVYEYAHRHGIPDQTCQAYQAQNLNCNEFAICET 500

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
            + T         + +  +  K Y +S Y   S  + + AEI+K GP+       ++F  
Sbjct: 501 CWSTNTSFTP--GRCEAIKKFKKYYVSEYGKVSGVDRMKAEIFKRGPIGCGIHATKNFVA 558

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNEC 198
           Y  G+Y       +  H + + GWG  +D + +YWI  N W   WG  G+F+IK   +  
Sbjct: 559 YTGGIYSESVIWPIPNHEISVAGWGFDEDTQTEYWIGRNSWGTYWGEHGWFRIKMHHSNL 618

Query: 199 GIEEDVVAGLP 209
           GIE D   G+P
Sbjct: 619 GIESDCDWGVP 629


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 75/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 222 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQRGCHGGRLDGAWWFLRRRGVVSDHCY 280

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
           P+           + A P P+C+       +  R             N  +    AYR+ 
Sbjct: 281 PFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHAHANDIYQVTPAYRLG 334

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GW
Sbjct: 335 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 394

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 395 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 441


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 57/141 (40%), Positives = 84/141 (59%), Gaps = 4/141 (2%)

Query: 70  TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK-HYSISAYRINSDPEDIMAEIYKNGPVE 128
           +   +P     + TP+C  +C   +   R  K ++  + YRI       M EIY+NGP+ 
Sbjct: 7   SAVENPCSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQYRIPG--YTAMKEIYENGPIT 64

Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
            SF +Y+DF +Y+SGVY   +G  +   AVK++GWG  ++G  YW+ AN +N  WG +G+
Sbjct: 65  ASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWGDNGF 123

Query: 189 FKIKRGSNECGIEEDVVAGLP 209
            KI RG+NEC IEE + AGLP
Sbjct: 124 VKILRGANECYIEEFMYAGLP 144


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 95/203 (46%), Gaps = 26/203 (12%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
             VT    D L      +   L LS  ++ AC  F    GC GG P SAW +    G+ T
Sbjct: 11  FGVTEAFNDRLCIKSDGAFTEL-LSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIAT 66

Query: 61  -------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 107
                        + C PY D   C+H   +  YP  KC +     +      +H+ + +
Sbjct: 67  GGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYP--KCPKVSCSGDD-----RHFMLES 118

Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
              +    D    I  +GPV  SFTVYEDF  Y+SGVYKH +G  +GGHAVK+IGWG   
Sbjct: 119 SPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK- 177

Query: 168 DGEDYWILANQWNRSWGADGYFK 190
            G+ YW+  N WN  WG  G F+
Sbjct: 178 SGQAYWLAVNSWNEDWGDHGLFR 200


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 85/167 (50%), Gaps = 14/167 (8%)

Query: 41  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
           C GGY   +W +  + G   + C PY    G         + +  C  +C  K      S
Sbjct: 154 CQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPTQC--KIASMSMS 203

Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
           K+ + +   I S   +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV L
Sbjct: 204 KYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 262

Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
           IG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 263 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 306


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/197 (36%), Positives = 96/197 (48%), Gaps = 16/197 (8%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGY--PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           N  LS  +L++C G       + GY   +  W YF  HG+V+     Y  + GC      
Sbjct: 136 NQLLSTEELISCSGI---KEREDGYVNRVLVWEYFKTHGLVS--GGKYNTNEGCQPSKVP 190

Query: 79  PAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
             Y +        CV  C  K+ +  N  H  +S +      +DI  E+   GPV V F 
Sbjct: 191 TVYNSQTKIYKRTCVEYCYGKDTINYNHDHVKVSNHYF-IRIKDIQKEVQTYGPVSVFFD 249

Query: 133 VYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
           +++D   YKSGVY K         H  KLIGWG  ++G DYW+L N W   WG +G FKI
Sbjct: 250 LHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGV-ENGVDYWLLVNSWGYEWGQNGLFKI 308

Query: 192 KRGSNECGIEEDVVAGL 208
           KRG++EC +E  V AGL
Sbjct: 309 KRGTDECSVESHVYAGL 325


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 84/164 (51%), Gaps = 16/164 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC+GG P  A+ +    G V   C  Y          C      PK          ++  
Sbjct: 141 GCNGGEPTKAFDFLHSTGTVLTSCVDYTAGADNVVKFC------PKTCDDGSAVENVFAA 194

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
           S   S SA  +          +  +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+
Sbjct: 195 SGSKSGSAIDV----------LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVE 244

Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           ++G+G +D G DYW + N W   WG DGYF+I RGS+ECGIE++
Sbjct: 245 VVGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGSDECGIEQE 288


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 73/199 (36%), Positives = 93/199 (46%), Gaps = 12/199 (6%)

Query: 21  NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
           N  LS  +L++C G    +G       S W Y   HGVV+     Y  + GC      P 
Sbjct: 115 NKLLSTEELISCSGIKENNGSVPS-ERSIWEYLKSHGVVS--GGKYNSNDGCQPFKFPPI 171

Query: 81  YPTPKCVRK------CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
              PK + K      C   + +  N  H  +  Y       DI  E+   GPV V F V 
Sbjct: 172 ANIPKHLHKHTCDDHCYGNSTINYNHDHVRVRNY-YTIRTRDIQKEVQTYGPVVVRFMVC 230

Query: 135 EDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
           +DF  YKSGVY K      +     KLIGWG  ++G DYW++ N W   WG  G FKIK 
Sbjct: 231 DDFFLYKSGVYAKSDKAKGIRTQYAKLIGWGV-ENGVDYWLVINSWGHEWGQKGLFKIKS 289

Query: 194 GSNECGIEEDVVAGLPSSK 212
           G+N+CG+E  V AGLP  K
Sbjct: 290 GTNQCGVESFVYAGLPEIK 308


>gi|327269233|ref|XP_003219399.1| PREDICTED: dipeptidyl peptidase 1-like [Anolis carolinensis]
          Length = 467

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 98/202 (48%), Gaps = 27/202 (13%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q  +LS   +++C  +    GCDGG+P + A +Y    GVV E+C PY  +         
Sbjct: 283 QTPTLSPQKVVSCSQY--SQGCDGGFPYLIAGKYAQDFGVVEEDCFPYTATD-------S 333

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
           P   T  C                    A         +  E+ K+GP+ V+F VY DF 
Sbjct: 334 PCNFTHSCYHYYATNYYYVGGFYGGCNEAL--------MKLELVKHGPMAVAFEVYSDFM 385

Query: 139 HYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFK 190
           HY+ G+Y H TG +       +  HAV L+G+GT  + GE +WI+ N W  +WG  GYF+
Sbjct: 386 HYRGGIYHH-TGLMDPFNPFELTNHAVLLVGYGTDPETGEPFWIVKNSWGPAWGEQGYFR 444

Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
           I+RG++EC IE   VA  P  K
Sbjct: 445 IRRGTDECAIESIAVASTPIPK 466


>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
          Length = 412

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 102/203 (50%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q    S   +++C  +    GCDGG+P + A +Y    GVV E+C PY   T    P   
Sbjct: 228 QKPIFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAQDSP--- 279

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C+ K   +    S+++ +  +    +   +  E+  +GP+ V+F VY D
Sbjct: 280 -----------CLFKRSCYHYYTSEYHYVGGFYGGCNEALMKLELVLHGPMAVAFEVYND 328

Query: 137 FAHYKSGVYKH--ITGDV----MGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
           F HYK G+Y H  +  D     +  HAV L+G+GT    GE +WI+ N W   WG +GYF
Sbjct: 329 FIHYKEGIYHHTGLRDDFNPFELTNHAVLLVGYGTDPQSGEKFWIVKNSWGILWGENGYF 388

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   V+  P +K
Sbjct: 389 RIRRGTDECAIESIAVSATPIAK 411


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 66/163 (40%), Positives = 92/163 (56%), Gaps = 18/163 (11%)

Query: 6   TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
           ++R  + S+  VS++   +S  DLLACC   CG GC+GGYP +AW ++   G+V+     
Sbjct: 13  SDRLCIHSNGKVSVE---ISSEDLLACCD-SCGMGCNGGYPSAAWDFWTDVGLVSGGLYD 68

Query: 63  ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
               C PY          G   P       TP+C+ +C       ++  KHY  S+Y + 
Sbjct: 69  SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVP 128

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
           SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +G
Sbjct: 129 SDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 109/227 (48%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDKHN-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PYFDSTGCSHPGCEPAYPTPKCVRKCV-----KKNQLWRNSKH-------YSIS-AYRIN 111
           P+             A P P+C+         K+  + R   H       Y ++ AYR+ 
Sbjct: 294 PFSGQER------NEAGPEPRCMMHSRAMGRGKRQAIARCPNHHVHANDIYQVTPAYRLG 347

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +         G H+VK+ GW
Sbjct: 348 SNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGW 407

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLG 454


>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
          Length = 462

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q   LS  ++++C  +    GC+GG+P + A +Y    G V E C PY   TG   P C 
Sbjct: 278 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGFVEESCFPY---TGTDAP-C- 330

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
                 K    C++    +  S+++ +  +    +   +  E+ ++GP+ V+F V +DF 
Sbjct: 331 ------KMKEDCMR----YYTSEYHYVGGFYGGCNEALMKLELVQHGPMAVAFEVCDDFM 380

Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
           HY  G+Y H           +  HAV L+G+GT S +G DYWI+ N W  SWG  GYF+I
Sbjct: 381 HYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRI 440

Query: 192 KRGSNECGIEEDVVAGLPSSK 212
            RG++EC IE   +A  P  K
Sbjct: 441 LRGTDECAIESIAMAATPIPK 461


>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
          Length = 420

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 99/207 (47%), Gaps = 37/207 (17%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q    S   +++C  +    GCDGG+P + A +Y    GVV E+C PY   T    P   
Sbjct: 236 QKPVFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAQDSP--- 287

Query: 79  PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-----NSDPEDIMA-EIYKNGPVEVSFT 132
                  C+ K        R+  HY  S Y        +  E +M  E+  +GP+ V+F 
Sbjct: 288 -------CLFK--------RSCYHYYTSEYHYVGGFYGACNEALMKLELVLSGPMAVAFE 332

Query: 133 VYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGA 185
           VY DF  YK G+Y H           +  HAV L+G+G     GE +WI+ N W  SWG 
Sbjct: 333 VYNDFMFYKEGIYHHTGLKDNFNPFELTNHAVLLVGYGKDPKSGEKFWIVKNSWGTSWGE 392

Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSK 212
           DGYF+I+RG++EC IE   VA  P  K
Sbjct: 393 DGYFRIRRGTDECAIESIAVAATPIPK 419


>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 315

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 87/171 (50%), Gaps = 22/171 (12%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC GG       +    G+ T+ C PY D         E A+  P C   CV  + + R 
Sbjct: 146 GCTGGTMEDVGDFLRDTGIATDTCVPYVD---------EDAHWEP-CPVSCVDGSPI-RT 194

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
            +   +   R + + E +M  I  NGP+  S  +YEDF +Y+SG+Y  I G   G HA++
Sbjct: 195 VQ--LMDFVRYDGNLEAMMEAIAMNGPIHASMMIYEDFMYYQSGIYHFIYGSGCGMHAIE 252

Query: 160 LIGWGTSDDGE---------DYWILANQWNRSWGADGYFKIKRGSNECGIE 201
           L+G+GT   G+         DYWI  N W   WG +GYF+I RG+NECGIE
Sbjct: 253 LVGYGTDISGDSEAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIE 303


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 106/221 (47%), Gaps = 19/221 (8%)

Query: 1   MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
            S +    D LS      L+++ LS   L++C       GC+GG+   AW      G V+
Sbjct: 242 FSTSTVAADRLSIHSGGELKDM-LSAQYLISCTTDHHQKGCEGGHVDRAWWQLRRVGTVS 300

Query: 61  EECDPYFDSTGCSHPG--CEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDI 117
           ++C PY  S   + PG      Y  PK   +C     +   SK Y  S  YRI +   +I
Sbjct: 301 KDCYPY-TSGDTNDPGKCLMSKYKLPKKNIECPVGQGI--TSKLYQASPPYRIAAKEREI 357

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG---------HAVKLIGWGTSDD 168
           M EI  NGPV+    V +DF  Y+ GVYKH                 H+V++IGWGT   
Sbjct: 358 MNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYT 417

Query: 169 GED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G+D   YW+ AN W R WG  G+F+I RGS+E  IE  VV 
Sbjct: 418 GDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+   +  N    N+  Y ++  YR+ S+ ++I
Sbjct: 189 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEI 248

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GWG  T  
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
          Length = 460

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 29/203 (14%)

Query: 20  QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
           Q+  LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P   
Sbjct: 276 QSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEETCFPY---TGTDSP--- 327

Query: 79  PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
                      C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+D
Sbjct: 328 -----------CKLKENCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 376

Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 189
           F HY  G+Y H           +  HAV L+G+GT    G +YW + N W  SWG +GYF
Sbjct: 377 FLHYHKGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPASGLNYWTVKNSWGTSWGENGYF 436

Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
           +I+RG++EC IE   +A  P  K
Sbjct: 437 RIRRGTDECAIESIAMAATPIPK 459


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 83/165 (50%), Gaps = 15/165 (9%)

Query: 40  GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
           GC GG   +   YF  +GVVTE+C+ Y             A     C   C         
Sbjct: 100 GCGGGRLDTPLAYFRDNGVVTEKCESY------------KATQASSCSNTCDDGTSFSNT 147

Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 158
           +K++S   YR++S  E   A+IY NGP+   F +Y D  +YKSGVY K  +      HA 
Sbjct: 148 TKYHSKDCYRLSS-IEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAG 206

Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
           ++IGWG  +DG  YW+ AN W   WG  G FKI+ G+NE G E +
Sbjct: 207 RVIGWGV-EDGVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFEAN 250


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 109/227 (48%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
           P+           + A P P+C+          R+   +   + +  N  +    AYR+ 
Sbjct: 294 PFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHVHANDIYQVTPAYRLG 347

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           ++ ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +         G H+VK+ GW
Sbjct: 348 TNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 204 AAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 262

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+      N    N+  Y ++ AYR+ S+  +I
Sbjct: 263 PFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 322

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+VK+ GWG  T  
Sbjct: 323 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRP 382

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 383 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 87/174 (50%), Gaps = 13/174 (7%)

Query: 39  DGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 97
           +GC GG+P++A++Y   HGV  E C  Y   +  C+              R C  +   +
Sbjct: 112 NGCQGGHPLTAFKYMHDHGVPEEGCMRYMAKNMECT---------DINICRDCDSEKGCF 162

Query: 98  --RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 155
             +N   Y +  Y   +  +++M EIY  GP+  S  V +D   YK G+Y+  TG     
Sbjct: 163 AVKNYTKYYVDEYGSVAGEKNMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLD 222

Query: 156 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
           HA+ ++GWG  +DG+ YWI  N W   WG  G+F+I RG N  GIE D    +P
Sbjct: 223 HAISVVGWG-EEDGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275



 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 78/175 (44%), Gaps = 13/175 (7%)

Query: 22  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEP 79
           + LS  +++ C        CDGG     + Y  + G+  + C  Y   D        C  
Sbjct: 381 VELSAQEVINCSN---AGTCDGGSDADVFEYAFNEGIPDQTCQVYEAIDKECNDMARCMD 437

Query: 80  AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
             P   C           ++ K Y +S Y       +I AEI+  GPV  S  V E+F  
Sbjct: 438 CPPGEDCYPV--------KDYKRYKVSEYGEVKGEMEIKAEIFARGPVSCSMIVTEEFLA 489

Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
           Y+ G++    G ++G HAV++ GWG ++DG  YWI  N W   WG  G+F++  G
Sbjct: 490 YQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGWFRMIVG 544


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/226 (33%), Positives = 109/226 (48%), Gaps = 35/226 (15%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PYF----DSTGCSHP--------GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 112
           P+     D  G + P        G      T +C    V  N +++ +      AYR+ S
Sbjct: 294 PFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVT-----PAYRLGS 348

Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG 164
           + ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +         G H+VK+ GWG
Sbjct: 349 NEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWG 408

Query: 165 --TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
             T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 409 EETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLG 454


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+      N    N+  Y ++ AYR+ S+  +I
Sbjct: 294 PFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 353

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+VK+ GWG  T  
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRP 413

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 414 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 78/227 (34%), Positives = 109/227 (48%), Gaps = 37/227 (16%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188

Query: 65  PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNSKHYSIS-AYRIN 111
           P+      S    + A PTP C+          R+      N    N+  Y ++  YR+ 
Sbjct: 189 PF------SGRERDEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLG 242

Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
           S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GW
Sbjct: 243 SNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGW 302

Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           G  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 303 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 10  ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
           A  +S  VS+ +L      LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188

Query: 65  PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
           P+     D  G + P    +    +  R+      N    N+  Y ++  YR+ S+ ++I
Sbjct: 189 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEI 248

Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
           M E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GWG  T  
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308

Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
           DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.135    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,172,401,015
Number of Sequences: 23463169
Number of extensions: 191520933
Number of successful extensions: 385834
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4779
Number of HSP's successfully gapped in prelim test: 1867
Number of HSP's that attempted gapping in prelim test: 372648
Number of HSP's gapped (non-prelim): 7581
length of query: 229
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 91
effective length of database: 9,121,278,045
effective search space: 830036302095
effective search space used: 830036302095
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)