BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018707
         (351 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  575 bits (1481), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 265/327 (81%), Positives = 292/327 (89%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS LKL+S ILQDSI+K+VN NPKAGWKA  N  FSNYTV QFK+LLGVKPTPK  L G+
Sbjct: 31  VSDLKLNSRILQDSILKKVNGNPKAGWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGI 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV +H KSL+LP+ FDAR+AWPQCSTI +ILDQGHCGSCWAFGAVE+LSDRFCIH+GMN+
Sbjct: 91  PVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNI 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GC+GGYPISAWRYFVHHGVVTEECDPYFD  GCSHPGCEP YP
Sbjct: 151 SLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYP 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV KNQLW+ SKHY +  YRI+SDPE IMAEIYKNGPVEV+FTVYEDFAHYKS
Sbjct: 211 TPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAHYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG +MGGHAVKLIGWGTS+DGE YW+LANQWNR WG DGYFKI+RG+NECGIE 
Sbjct: 271 GVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECGIEG 330

Query: 325 DVVAGLPSSKNLVKEITSADMFEDASA 351
           DVVAGLPS++NLV+E+ S D  EDASA
Sbjct: 331 DVVAGLPSTRNLVREVVSVDAREDASA 357


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  570 bits (1469), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 266/345 (77%), Positives = 299/345 (86%), Gaps = 19/345 (5%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 85
           SKLKL+S ILQ+SIIK+VNENP AGW+AA NPQ SN+TVGQFK+LLG KPTPK  L+GVP
Sbjct: 32  SKLKLNSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVP 91

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQ-----------------GHCGSCWAFGA 128
           + +H K+LKLPK FDAR+AWP CSTI +IL Q                 GHCGSCWAFGA
Sbjct: 92  MISHPKTLKLPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGA 151

Query: 129 VEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
           VE+LSDRFCIHFGMN+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFVHHGVVTEECDPY
Sbjct: 152 VESLSDRFCIHFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPY 211

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
           FD+ GCSHPGCEP +PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP D+MAE+YKNGP
Sbjct: 212 FDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDVMAEVYKNGP 271

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VEVSFTVYEDFAHYKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+LANQWNR WG D
Sbjct: 272 VEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDD 331

Query: 309 GYFKIKRGSNECGIEEDVVAGLPSSKN--LVKEITSADMFEDASA 351
           GYFKI+RG+NECGIE+D VAGLPS++N  LV+E+ S D  EDA A
Sbjct: 332 GYFKIRRGTNECGIEDDAVAGLPSARNLDLVREVASMDALEDAFA 376


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  563 bits (1451), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 256/328 (78%), Positives = 289/328 (88%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           +C      +AE  V K KLD+ ILQ+SI++ VNE+P+AGWKA  NP+FSNY+V QFK+LL
Sbjct: 18  VCTFHHQVYAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLL 77

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           GVK TP+  L   PV +H KSLKLPKSFDAR AWPQC +I  ILDQGHCGSCWAFGAVE+
Sbjct: 78  GVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVES 137

Query: 132 LSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 191
           LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+
Sbjct: 138 LSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDT 197

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAYR+  DP DIMAE+YKNGPVEV
Sbjct: 198 TGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEV 257

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYF
Sbjct: 258 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYF 317

Query: 312 KIKRGSNECGIEEDVVAGLPSSKNLVKE 339
           KI+RG+NECGIEEDVVAGLPS+KN+ +E
Sbjct: 318 KIRRGTNECGIEEDVVAGLPSTKNIARE 345


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  563 bits (1450), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 255/321 (79%), Positives = 287/321 (89%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
            +AE  V K KLD+ ILQ+SI++ VNE+P+AGWKA  NP+FSNY+V QFK+LLGVK TP+
Sbjct: 26  VYAEEQVLKFKLDADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE 85

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
             L   PV +H KSLKLPKSFDAR AWPQC +I  ILDQGHCGSCWAFGAVE+LSDRFCI
Sbjct: 86  KDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCI 145

Query: 139 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG 198
           HF MN++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+TGCSHPG
Sbjct: 146 HFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPG 205

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           CEPAYPTP+CVR CV KNQ+WR +KHY +SAYR+  DP DIMAE+YKNGPVEVSFTVYED
Sbjct: 206 CEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYED 265

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           FAHYKSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+N
Sbjct: 266 FAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTN 325

Query: 319 ECGIEEDVVAGLPSSKNLVKE 339
           ECGIEEDVVAGLPS+KN+ +E
Sbjct: 326 ECGIEEDVVAGLPSTKNIARE 346


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 252/321 (78%), Positives = 285/321 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+TVGQFK LLGVK  PK  LL  
Sbjct: 33  LNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLST 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGSCWAFGAVE+L DRFCIHF MN+
Sbjct: 93  PVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNI 152

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 153 SLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQ 212

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKS
Sbjct: 213 TPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 272

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIE+
Sbjct: 273 GVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIED 332

Query: 325 DVVAGLPSSKNLVKEITSADM 345
           DV AGLPS+KN+V+E+T  D+
Sbjct: 333 DVTAGLPSTKNIVREVTDMDV 353


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 255/326 (78%), Positives = 288/326 (88%), Gaps = 2/326 (0%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+SHILQ+S  KE+NENP+AGW+AA NP+FSNYTV QFK LLGVKP PK  L   
Sbjct: 31  LTSLKLNSHILQESTAKEINENPEAGWEAAINPRFSNYTVEQFKRLLGVKPMPKKELRST 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P  +H K+LKLPK+FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +N+
Sbjct: 91  PAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNI 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 151 SLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYR 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCV+KCV  NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+YKS
Sbjct: 211 TPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGIEE
Sbjct: 271 GVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEE 330

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           DV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 331 DVTAGLPSTKNLVREVT--DMDADAA 354


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 252/321 (78%), Positives = 285/321 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+TVGQFK LLGVK  PK  LL  
Sbjct: 31  LNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLST 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGSCWAFGAVE+L DRFCIHF MN+
Sbjct: 91  PVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNI 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 151 SLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQ 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKS
Sbjct: 211 TPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIE+
Sbjct: 271 GVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIED 330

Query: 325 DVVAGLPSSKNLVKEITSADM 345
           DV AGLPS+KN+V+E+T  D+
Sbjct: 331 DVTAGLPSTKNIVREVTDMDV 351


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  553 bits (1425), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 250/321 (77%), Positives = 283/321 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+TVGQFK LLGVK  PK  LL  
Sbjct: 33  LNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNFTVGQFKRLLGVKQAPKKELLST 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGSCWAFGAVE+L DRFC HF MN+
Sbjct: 93  PVVTHPKSLKLPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCSHFDMNI 152

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGG PI AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 153 SLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQ 212

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIM E+YKNGPVEV+FTV+EDFAHYKS
Sbjct: 213 TPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKS 272

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIE+
Sbjct: 273 GVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIED 332

Query: 325 DVVAGLPSSKNLVKEITSADM 345
           DV AGLPS+KN+V+E+T  D+
Sbjct: 333 DVTAGLPSTKNIVREVTDMDV 353


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score =  553 bits (1424), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 252/313 (80%), Positives = 283/313 (90%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS+LK ++ ILQ+S+++ +N NPKAGWKAA NP+FSNY+VGQF HLLGVKPT +  L GV
Sbjct: 31  VSQLKFNTKILQESMVELINANPKAGWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGV 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV TH K+LKLPK FDAR+AWPQCSTI +ILDQGHCGSCWAFGAVE+LSDRFCIHFGMN+
Sbjct: 91  PVITHPKTLKLPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFGMNI 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP YP
Sbjct: 151 SLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEPGYP 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCVRKC  +NQLWR +K Y  SAYRI+SDP  IMAE+YKNGPVEV+FTVYEDFAHY+S
Sbjct: 211 TPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYES 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY++ TGDVMGGHAVKLIGWGT+DDGEDYWILANQWNR+WG DGYF I+RG NECGIEE
Sbjct: 271 GVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIEE 330

Query: 325 DVVAGLPSSKNLV 337
            VVAGLPSSKNL+
Sbjct: 331 GVVAGLPSSKNLM 343


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  552 bits (1423), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 253/326 (77%), Positives = 287/326 (88%), Gaps = 2/326 (0%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+S ILQ+SI KE+NENP+AGW+AA NP FSNYTV QFK LLGVKPTPK  L   
Sbjct: 30  LTSLKLNSPILQESIAKEINENPEAGWEAAINPHFSNYTVEQFKRLLGVKPTPKKELRST 89

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P  +H KSLKLPK+FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +N+
Sbjct: 90  PAISHPKSLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNI 149

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGGYP+ AW+Y  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 150 SLSVNDLLACCGFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPAYR 209

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCV+KCV  NQ+W+ SKHYS++AYR++SDP DIM E+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 210 TPKCVKKCVSGNQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKS 269

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGT++DGEDYW+LANQWNR WG DGYFKI+RG+NECGIEE
Sbjct: 270 GVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEE 329

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           DV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 330 DVTAGLPSTKNLVREVT--DMDADAA 353


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 255/331 (77%), Positives = 287/331 (86%)

Query: 21  AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 80
           AE  VSKLKL+S ILQDSI+++VNENPKAGW+A  NPQFSNY+VG+FK+LLGVK TP+  
Sbjct: 9   AEEPVSKLKLNSRILQDSIVQKVNENPKAGWEATMNPQFSNYSVGEFKYLLGVKQTPRKE 68

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
           L GVP+  H KS+KLP  FDAR+AWP CSTI RILDQGHCGSCWAFGAVE+LSDRFCIH+
Sbjct: 69  LRGVPLLRHPKSMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHY 128

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 200
           GMNLSLSVNDLLACCG++CG GCDGG PI AWRYFV  GVVTEECDPYFD  GCSHPGCE
Sbjct: 129 GMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCE 188

Query: 201 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
           P +PTPKC RKC  KN+LW  SKH+S++AYRI+SDP  IMAE+  NGPVEV+FTVYEDFA
Sbjct: 189 PGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVYEDFA 248

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
           HYKSGVYKHITGD MGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKIKRG+NEC
Sbjct: 249 HYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIKRGTNEC 308

Query: 321 GIEEDVVAGLPSSKNLVKEITSADMFEDASA 351
           GIE  VVAGLPS++NLV+E+   D  E A+A
Sbjct: 309 GIEGAVVAGLPSTRNLVREVAGIDGHEHATA 339


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 252/320 (78%), Positives = 284/320 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +SK KL S ILQ+ I+KEVNENP AGWKA+ N +F+N TV +FK LLGVKPTPK   LGV
Sbjct: 36  LSKQKLTSWILQNEIVKEVNENPNAGWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGV 95

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+ +HD SLKLPK FDAR+AW QC++I RILDQGHCGSCWAFGAVE+LSDRFCI + MN+
Sbjct: 96  PIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNV 155

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 156 SLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP 215

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV  NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 216 TPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKS 275

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE 
Sbjct: 276 GVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEH 335

Query: 325 DVVAGLPSSKNLVKEITSAD 344
            VVAGLPS +N+VK IT++D
Sbjct: 336 GVVAGLPSDRNVVKGITTSD 355


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  549 bits (1415), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 251/320 (78%), Positives = 283/320 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +SK KL S ILQ+ I+KEVNENP AGWKAA N +F+N TV +FK LLGVKPTPK   LGV
Sbjct: 34  LSKQKLTSWILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGV 93

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+ +HD SLKLPK FDAR+AW QC+++ RILDQGHCGSCWAFGAVE+LSDRFCI + MN+
Sbjct: 94  PIVSHDISLKLPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNI 153

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 154 SLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP 213

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV  NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 214 TPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKS 273

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE 
Sbjct: 274 GVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEH 333

Query: 325 DVVAGLPSSKNLVKEITSAD 344
            VVAGLPS +N+ K IT++D
Sbjct: 334 GVVAGLPSDRNVFKGITTSD 353


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 250/327 (76%), Positives = 286/327 (87%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +SK KL+S ILQ+ I+K+VN+NP AGWKAA N +FSN TV +FK LLGVKPTPK   LGV
Sbjct: 36  LSKQKLNSKILQEEIVKKVNQNPDAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 95

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+ +HD+SLKLPK FDAR+AWPQC++I  ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 96  PIVSHDRSLKLPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNI 155

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD TGCSHPGCEPAYP
Sbjct: 156 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAYP 215

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC+RKCV  NQLW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 216 TPKCMRKCVSGNQLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 275

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGT+D+GEDYW+LANQWNRSWG DGYF I+RG+NECGIE+
Sbjct: 276 GVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIED 335

Query: 325 DVVAGLPSSKNLVKEITSADMFEDASA 351
           + VAGLPSS+N+ K IT +D    AS 
Sbjct: 336 EPVAGLPSSRNVFKVITGSDDLSVASV 362


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 250/334 (74%), Positives = 288/334 (86%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 76
               AE  +S+ K +S ILQDSI+K+VNEN KAGWKAA NP+FSN+TV QFK LLGVKPT
Sbjct: 22  LQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPT 81

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
            KG L G+P+ TH K L+LP+ FDAR AWP CSTI RILDQGHCGSCWAFGAVE+LSDRF
Sbjct: 82  RKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRF 141

Query: 137 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
           CIH+G+N+SLS NDLLACCGFLCGDGCDGGYP+ AW+YFV  GVVT+ECDPYFD+ GCSH
Sbjct: 142 CIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSH 201

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           PGCEPAYPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM E+YKNGPVEVSFTVY
Sbjct: 202 PGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVY 261

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDFAHYKSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG
Sbjct: 262 EDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRG 321

Query: 317 SNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           ++EC IE++VVAGLPS++NL  E+  +D F DA+
Sbjct: 322 TDECEIEDEVVAGLPSARNLNMELDVSDAFLDAA 355


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  546 bits (1406), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 251/350 (71%), Positives = 291/350 (83%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           M  T L +     +      AE  +S+ K +S ILQDSI+K+VNEN KAGWKAA NP+FS
Sbjct: 6   MSLTTLFLLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFS 65

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           N+TV QFK LLGVKPT KG L G+P+ TH K L+LP+ FDAR AW  CSTI RILDQGHC
Sbjct: 66  NFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWSNCSTIGRILDQGHC 125

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAFGAVE+LSDRFCIH+G+N+SLS NDL ACCGFLCGDGCDGGYP+ AW+YFV  GV
Sbjct: 126 GSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGV 185

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 240
           VT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW  SKH+ ++AY I+SDP  IM
Sbjct: 186 VTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSDPHSIM 245

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
            E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGD+MGGHAVKLIGWGTS+DGEDYW+LANQ
Sbjct: 246 TEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQ 305

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           WNR WG DGYFKI+RG+NEC IE++VVAGLPS++NL  E+  +D F DA+
Sbjct: 306 WNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDVSDAFLDAA 355


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 250/350 (71%), Positives = 287/350 (82%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           M PT L +  +  +  F       +S++KL+SHILQ+SI +++NENP+AGW+A  NP+FS
Sbjct: 1   MTPTILSLATLFLVFFFGEAKTYELSEVKLNSHILQESIARQINENPEAGWEATINPRFS 60

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           N+TVGQFK LLGVK TP+  L   PV TH KSLKLPK FDAR+AW QCSTI RILDQGHC
Sbjct: 61  NFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRILDQGHC 120

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW Y  HHGV
Sbjct: 121 GSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGV 180

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 240
           VTEECDPYFD  GCSHPGCEP Y TPKCV+KCV  NQLW  SKHYS+ AY +NSDP+DIM
Sbjct: 181 VTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIM 240

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
           AE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKL+GWGTS +GEDYW+LANQ
Sbjct: 241 AEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLANQ 300

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           WN +WG DGYFKIKRG+NECGIE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 301 WNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 350


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  543 bits (1398), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 247/326 (75%), Positives = 282/326 (86%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +S LKL+S ILQ+SI KE+NENP AGW+AA +P+FSNYTV QFK LLGVKP+PK  L   
Sbjct: 31  LSTLKLNSRILQESIAKEINENPGAGWEAAISPRFSNYTVAQFKRLLGVKPSPKKELRST 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV +H +SLKLPKSFDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIH  +N+
Sbjct: 91  PVVSHPRSLKLPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLDVNV 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 151 SLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQ 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCVRKCVK NQ+W+ SK++S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 211 TPKCVRKCVKGNQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEE
Sbjct: 271 GVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEE 330

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           DV AGLPS+KN+ + +   D   D S
Sbjct: 331 DVTAGLPSTKNMGRWVMDMDADADVS 356


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 255/356 (71%), Positives = 290/356 (81%), Gaps = 7/356 (1%)

Query: 1   MEPTKLIMDPILCLTCFA---TFAEGV---VSKLKLDSHILQDSIIKEVNENPKAGWKAA 54
           M PT L +   L L  FA    F E     +S++KL+SHILQ+SI +++NENP+AGW+A 
Sbjct: 1   MTPTILSL-ATLFLVFFAPYLRFGEAKTYELSEVKLNSHILQESIARQINENPEAGWEAT 59

Query: 55  RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 114
            NP+FSN+TVGQFK LLGVK TP+  L   PV TH KSLKLPK FDAR+AW QCSTI RI
Sbjct: 60  INPRFSNFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRI 119

Query: 115 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 174
           LDQGHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW Y
Sbjct: 120 LDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIY 179

Query: 175 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 234
             HHGVVTEECDPYFD  GCSHPGCEP Y TPKCV+KCV  NQLW  SKHYS+ AY +NS
Sbjct: 180 LAHHGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNS 239

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKL+GWGTS +GEDY
Sbjct: 240 DPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDY 299

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           W+LANQWN +WG DGYFKIKRG+NECGIE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 300 WLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 355


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 246/326 (75%), Positives = 286/326 (87%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++K KL+S ILQD I+K+VN+NP AGWKAA N +FSN TV +FK LLGVKPTPK   LGV
Sbjct: 33  LTKQKLNSKILQDEIVKKVNQNPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV +HD SLKLPK+FDAR+AWPQC++I +ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93  PVVSHDPSLKLPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 152

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TP+C+RKCV  N+LW  SKHYS+S Y +NS P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPRCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTS++GEDYW++ANQWNR WG DGYF I+RG+NECGIE+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIED 332

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           + VAGLPSS+N+ K  T ++    AS
Sbjct: 333 EPVAGLPSSRNVFKVDTGSNDLPVAS 358


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  539 bits (1388), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 247/326 (75%), Positives = 283/326 (86%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK   LGV
Sbjct: 33  LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+ +HD SLKLPK+FDAR+AWPQC++I  ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93  PIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 152

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV  N+LW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECGIE+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIED 332

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           + VAGLPSSKN+ +  T ++    AS
Sbjct: 333 EPVAGLPSSKNVFRVDTGSNDLPVAS 358


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score =  537 bits (1384), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 246/310 (79%), Positives = 280/310 (90%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++K KL+S ILQ+ I+K+VNE+P AGWKAA N +FSN TV +FK LLGVKPTPK LLLGV
Sbjct: 34  LTKQKLNSKILQEEIVKKVNEHPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKLLLGV 93

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           PV +HD+SLKLPKSFDAR+ WPQC++I +ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 94  PVVSHDQSLKLPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 153

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           +LSVNDLLACCGF CGDGCDGGYPISAW+YF + GVVTEECDPYFD TGCSHPGCEPAY 
Sbjct: 154 TLSVNDLLACCGFRCGDGCDGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHPGCEPAYN 213

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TP+C+RKCV +NQLW  SKHYSI+ Y + S+P+DIMAEIYKNGPVEVSFTVYEDFAHYKS
Sbjct: 214 TPQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIYKNGPVEVSFTVYEDFAHYKS 273

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNRSWG DGYF I+RG+NECGIE+
Sbjct: 274 GVYKHITGSNIGGHAVKLIGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIED 333

Query: 325 DVVAGLPSSK 334
           + VAGLPSSK
Sbjct: 334 EPVAGLPSSK 343


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  534 bits (1375), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 252/349 (72%), Positives = 283/349 (81%), Gaps = 36/349 (10%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS+LK ++ ILQ+S+++ +N NPKAGWKAA NP+FSNY+VGQF HLLGVKPT +  L GV
Sbjct: 29  VSQLKFNTKILQESMVELINANPKAGWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGV 88

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRIL----------------------------- 115
           PV TH K+LKLPK FDAR+AWPQCSTI +IL                             
Sbjct: 89  PVITHPKTLKLPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHL 148

Query: 116 -------DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYP 168
                  DQGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP
Sbjct: 149 LVPFYIKDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYP 208

Query: 169 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 228
           + AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC  +NQLWR +K Y  S
Sbjct: 209 LYAWRYFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQS 268

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           AYRI+SDP  IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+
Sbjct: 269 AYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTT 328

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           DDGEDYWILANQWNR+WG DGYF I+RG NECGIEE VVAGLPSSKNL+
Sbjct: 329 DDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLM 377


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  533 bits (1373), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 245/326 (75%), Positives = 281/326 (86%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK   LGV
Sbjct: 33  LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+ +HD SLKLPK+FDAR+AWPQC++I  IL  GHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93  PIVSHDPSLKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFGMNI 152

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV  N+LW  SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITG  +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECGIE+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIED 332

Query: 325 DVVAGLPSSKNLVKEITSADMFEDAS 350
           + VAGLPSSKN+ +  T ++    AS
Sbjct: 333 EPVAGLPSSKNVFRVDTGSNDLPVAS 358


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  529 bits (1363), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 249/350 (71%), Positives = 285/350 (81%), Gaps = 6/350 (1%)

Query: 5   KLIMDPILCLTCFATF----AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           K ++ P+L    F       AE  +S+ KL+S ILQDSI+K VNEN +AGWKAA NPQ S
Sbjct: 6   KSLITPLLLGAFFILILQVAAEKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLS 65

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           N+TV QFK LLGVKP  +G L G+PV TH +  +LPK FDAR AWPQCSTI +ILDQGHC
Sbjct: 66  NFTVSQFKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHC 125

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAFGAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF   GV
Sbjct: 126 GSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGV 185

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 240
           VTEECDPYFD+TGCSHPGCEP YPTPKC RKCVK N LWR SKHY ++AYR++ DP+ IM
Sbjct: 186 VTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIM 245

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
           AE+YKNGPVEVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+ GEDYW++ N 
Sbjct: 246 AEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNS 305

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           WNR WG DGYFKI+RG+NECGIE  VVAGLPS++NL  E+   D   DAS
Sbjct: 306 WNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL--GDAVLDAS 353


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 238/320 (74%), Positives = 274/320 (85%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 83
           +++K    S I+QD IIK +N++P AGW AARNP F+NYT  QFKH+LGVKPTP  +L  
Sbjct: 31  LMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLND 90

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           VPVKT+ +SL LPK FDARSAW QC+TI  ILDQGHCGSCWAFGAVE L DRFCIHF MN
Sbjct: 91  VPVKTYPRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMN 150

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 203
           +SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPAY
Sbjct: 151 ISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAY 210

Query: 204 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           PTP C +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYK
Sbjct: 211 PTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYK 270

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           SGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIE
Sbjct: 271 SGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIE 330

Query: 324 EDVVAGLPSSKNLVKEITSA 343
           EDVVAG+PS+KN+V+   SA
Sbjct: 331 EDVVAGMPSTKNMVRNYDSA 350


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 235/297 (79%), Positives = 264/297 (88%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++ LKL+SHILQ+S  KE+NENP+AGW+AA NP+FSNYTV QFK LLGVKP PK  L   
Sbjct: 31  LTSLKLNSHILQESTAKEINENPEAGWEAAINPRFSNYTVEQFKRLLGVKPMPKKELRST 90

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P  +H K+LKLPK+FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +N+
Sbjct: 91  PAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNI 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCGFLCG GCDGGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY 
Sbjct: 151 SLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYR 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKCV+KCV  NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+YKS
Sbjct: 211 TPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           GVYKHITG  +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 271 GVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECG 327


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  519 bits (1337), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 240/337 (71%), Positives = 277/337 (82%), Gaps = 2/337 (0%)

Query: 1   MEPTK-LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF 59
           ME  K L++   + L      A   V+  ++D  ILQD I+K VNENP+AGWKA  NP+F
Sbjct: 1   METIKTLLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRF 60

Query: 60  SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 119
           S++TV QFK LLGVK  PK LL   PV TH K ++LPK+FDAR+AWPQC +I+ ILDQGH
Sbjct: 61  SDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGH 120

Query: 120 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           CGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF   G
Sbjct: 121 CGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTG 180

Query: 180 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           VVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW  SKH+S++AYR+NSD   I
Sbjct: 181 VVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSI 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
           M E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+DGEDYW+LAN
Sbjct: 241 MTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLAN 300

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           QWNRSWG DGYFKI RG+NECGI EDV AG+PS+KNL
Sbjct: 301 QWNRSWGDDGYFKIIRGTNECGI-EDVTAGMPSTKNL 336


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  519 bits (1336), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 240/337 (71%), Positives = 276/337 (81%), Gaps = 2/337 (0%)

Query: 1   MEPTK-LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF 59
           ME  K L++   + L      A   V+  ++D  ILQD I+K VNENP+AGWKA  NP+F
Sbjct: 1   METIKTLLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRF 60

Query: 60  SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 119
           S++TV QFK LLGVK  PK LL   PV TH K ++LPK+FDAR+AWPQC +I+ ILDQGH
Sbjct: 61  SDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGH 120

Query: 120 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           CGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF   G
Sbjct: 121 CGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTG 180

Query: 180 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           VVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW  SKH+S++AYR+NSD   I
Sbjct: 181 VVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSI 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
           M E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+DGEDYW+LAN
Sbjct: 241 MTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLAN 300

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           QWNRSWG DGYFKI RG+NECGI EDV AG PS+KNL
Sbjct: 301 QWNRSWGGDGYFKIIRGTNECGI-EDVTAGTPSTKNL 336


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score =  518 bits (1335), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 237/315 (75%), Positives = 268/315 (85%), Gaps = 2/315 (0%)

Query: 31  DSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 88
           D+H  I+Q+ II+ VN +P AGW A+RNP FSNYT+ QFKH+LGVKP P+  L  VPVKT
Sbjct: 27  DNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKT 86

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 148
           + +SL+LPK FDARSAW +CSTI  ILDQGHCGSCWAFGAVE L DRFCIH  M++ LSV
Sbjct: 87  YSRSLELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSV 146

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
           NDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPAYPTPKC
Sbjct: 147 NDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKC 206

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK
Sbjct: 207 EKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK 266

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           HITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEE VVA
Sbjct: 267 HITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVA 326

Query: 329 GLPSSKNLVKEITSA 343
           G+PS+KN+V     A
Sbjct: 327 GMPSTKNMVPNFGGA 341


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 236/315 (74%), Positives = 268/315 (85%), Gaps = 2/315 (0%)

Query: 31  DSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 88
           D+H  I+Q+ II+ VN +P AGW A+RNP FSNYT+ QFKH+LGVKP P+  L  VPVKT
Sbjct: 27  DNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKT 86

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 148
           + +SL+LPK FDARSAW +CSTI  IL+QGHCGSCWAFGAVE L DRFCIH  M++ LSV
Sbjct: 87  YSRSLELPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHLNMSILLSV 146

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
           NDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPAYPTPKC
Sbjct: 147 NDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKC 206

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK
Sbjct: 207 EKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK 266

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           HITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEE VVA
Sbjct: 267 HITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVA 326

Query: 329 GLPSSKNLVKEITSA 343
           G+PS+KN+V     A
Sbjct: 327 GMPSTKNMVPNFGGA 341


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 237/320 (74%), Positives = 273/320 (85%), Gaps = 2/320 (0%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +SK KL S ILQ+ I+KEVNENP AGWKAA N +F+N TV +FK LLGV  TPK   LGV
Sbjct: 33  LSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+  HD SLKLPK FDAR+AW  C++I RIL  GHCGSCWAFGAVE+LSDRFCI + +N+
Sbjct: 93  PIVRHDLSLKLPKEFDARTAWSHCTSIRRIL--GHCGSCWAFGAVESLSDRFCIKYNLNV 150

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YP
Sbjct: 151 SLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYP 210

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKCV +NQLW  SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 211 TPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKS 270

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYK+ITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+
Sbjct: 271 GVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQ 330

Query: 325 DVVAGLPSSKNLVKEITSAD 344
            VVAGLPS KN+ K IT++D
Sbjct: 331 SVVAGLPSEKNVFKGITTSD 350


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  513 bits (1322), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 231/303 (76%), Positives = 262/303 (86%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q+ II+ VN++P AGW A  NP F+NYT+ QFKH+LGVKPTP GLL GVP+KTH KS 
Sbjct: 40  IIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKTHPKSA 99

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            LPK FDAR+ W  CSTI  ILDQGHCG+CWAF AVE+L DRFCIH  M++SLSVNDLLA
Sbjct: 100 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLA 159

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC 
Sbjct: 160 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCK 219

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +NQ+W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG 
Sbjct: 220 VENQVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 279

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+
Sbjct: 280 VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPST 339

Query: 334 KNL 336
           KN+
Sbjct: 340 KNM 342


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  513 bits (1320), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 231/303 (76%), Positives = 261/303 (86%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q+ II+ VN++P AGW A  NP F+NYT+ QFKH+LGVKPTP GLL GVP+KTH KS 
Sbjct: 40  IIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKTHPKSA 99

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            LPK FDAR+ W  CSTI  ILDQGHCG+CWAF AVE+L DRFCIH  M++SLSVNDLLA
Sbjct: 100 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLA 159

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC 
Sbjct: 160 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCK 219

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +NQ+W+ +KH S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG 
Sbjct: 220 VENQVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 279

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+
Sbjct: 280 VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMPST 339

Query: 334 KNL 336
           KN+
Sbjct: 340 KNM 342


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 228/305 (74%), Positives = 262/305 (85%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q+ II+ +N +P AGW A +N  F+NYT+ QFKH+LGVKPTP GLL GVP KT+ +S 
Sbjct: 37  IIQNDIIETINNHPNAGWTAGQNSYFANYTIAQFKHILGVKPTPPGLLRGVPTKTYSRST 96

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            LPK FDARS W  CSTI  ILDQGHCGSCWAFGAVE L DRFCIH  MN+SLSVNDL+A
Sbjct: 97  DLPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVA 156

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGF+CGDGCDGGYPISAW+Y V +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC 
Sbjct: 157 CCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCK 216

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +NQ+W+  KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVY+HITG+
Sbjct: 217 VQNQVWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGE 276

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           +MGGHAVKLIGWGTS DG+DYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+
Sbjct: 277 MMGGHAVKLIGWGTSADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPST 336

Query: 334 KNLVK 338
           KN V+
Sbjct: 337 KNTVR 341


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score =  508 bits (1309), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 239/365 (65%), Positives = 275/365 (75%), Gaps = 45/365 (12%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV------------------- 64
           +++K    S I+QD IIK +N++P AGW AARNP F+NYTV                   
Sbjct: 31  LMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTVNNNTLLLLFSFFFLRGHLP 90

Query: 65  --------------------------GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
                                      QFKH+LGVKPTP  +L  VPVKT+ +SL LPK 
Sbjct: 91  VVVSIAYIKTFISCLFGGLNNPPVQTAQFKHILGVKPTPHSVLNDVPVKTYPRSLMLPKE 150

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFL 158
           FDARSAW QC+TI  ILDQGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+
Sbjct: 151 FDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFM 210

Query: 159 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 218
           CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+
Sbjct: 211 CGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQV 270

Query: 219 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
           W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGH
Sbjct: 271 WLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGH 330

Query: 279 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 338
           AVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+
Sbjct: 331 AVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVR 390

Query: 339 EITSA 343
              SA
Sbjct: 391 NYDSA 395


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 245/324 (75%), Positives = 282/324 (87%)

Query: 21  AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 80
           A G +SK KL S ILQ+ I+KEVNENP AGWKA+ N +F+N TV +FK LLGVKPTPK  
Sbjct: 29  AAGNLSKQKLTSLILQNEIVKEVNENPNAGWKASLNDRFANATVAEFKRLLGVKPTPKTA 88

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
            LGVP+  HD SLKLPK FDAR+AW QC++I RILDQGHCGSCWAFGAVE+LSDRFCI +
Sbjct: 89  YLGVPIVRHDLSLKLPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCIKY 148

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 200
            +N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVTEECDPYFD+TGCSHPGCE
Sbjct: 149 NLNVSLSANDVVACCGLLCGLGCNGGFPMGAWLYFKYHGVVTEECDPYFDNTGCSHPGCE 208

Query: 201 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
           P YPTPKCVRKCV +NQLW  SKHY +SAYRIN DP+DIMAE+YKNGPVEV+FTVYEDFA
Sbjct: 209 PGYPTPKCVRKCVSENQLWGESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFA 268

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
           HYKSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NEC
Sbjct: 269 HYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNEC 328

Query: 321 GIEEDVVAGLPSSKNLVKEITSAD 344
           GIE  VVAGLPS +N+ K++T++D
Sbjct: 329 GIEHGVVAGLPSDRNVFKDVTTSD 352


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score =  506 bits (1304), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 237/340 (69%), Positives = 273/340 (80%), Gaps = 20/340 (5%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +SK KL S ILQ+ I+KEVNENP AGWKAA N +F+N TV +FK LLGV  TPK   LGV
Sbjct: 33  LSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILD--------------------QGHCGSCW 124
           P+  HD SLKLPK FDAR+AW  C++I RIL                      GHCGSCW
Sbjct: 93  PIVRHDLSLKLPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCW 152

Query: 125 AFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           AFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+E
Sbjct: 153 AFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQE 212

Query: 185 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
           CDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + AYRIN DP+DIMAE+Y
Sbjct: 213 CDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVY 272

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
           KNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTSDDGEDYW+LANQWNRS
Sbjct: 273 KNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRS 332

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           WG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 333 WGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 372


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score =  506 bits (1304), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 230/304 (75%), Positives = 256/304 (84%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q  II+ VN +P AGW A  NP  +NYT+ QFKH+LGVKPTP GLL GV  KTH +S 
Sbjct: 35  IIQKGIIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLLAGVRTKTHPRSE 94

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
           +LPK FDARS W  CSTI +ILDQGHCGSCWAFGAVE L DRFCIH  MN+SLS NDL+A
Sbjct: 95  QLPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVA 154

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGF+CGDGCDGGYPISAW+YFV +GVVTEECDPYFD  GC HPGCEPAYPTP C +KC 
Sbjct: 155 CCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPAYPTPVCEKKCK 214

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +NQ+W+  KH+SI AY++NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG 
Sbjct: 215 VQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 274

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS 
Sbjct: 275 VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSM 334

Query: 334 KNLV 337
           KN+ 
Sbjct: 335 KNIA 338


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 230/286 (80%), Positives = 257/286 (89%)

Query: 59  FSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 118
           F+N TV +FK LLGVKPTPK   LGVP+ +HD SLKLPK FDAR+AW QC++I RILDQG
Sbjct: 1   FANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQG 60

Query: 119 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 178
           HCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HH
Sbjct: 61  HCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHH 120

Query: 179 GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED 238
           GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  NQLWR SKHY +SAY++ S P+D
Sbjct: 121 GVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDD 180

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           IMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSDDGEDYW+LA
Sbjct: 181 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 240

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           NQWNRSWG DGYFKI+RG+NECGIE  VVAGLPS +N+VK IT++D
Sbjct: 241 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 286


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 226/310 (72%), Positives = 259/310 (83%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q+ II+ +N++P AGW A  NP F+NYT+ QFKH+LGVKPTP  LL GVP K++ +S+
Sbjct: 36  IIQNDIIETINKHPNAGWTAGHNPYFANYTITQFKHILGVKPTPPALLAGVPTKSYSRSM 95

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
           KLP  FDARS W  CSTI  ILDQGHCGSCWAFGAVE L DRFCIH  MN+SLSVNDLLA
Sbjct: 96  KLPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLLA 155

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGFLCG GC+GGYPISAWRYF   GVVT+ECDPYFD  GC HPGCEPAY TPKC +KC 
Sbjct: 156 CCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPYFDQVGCKHPGCEPAYRTPKCEKKCK 215

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +N++W+  KH+S+ AYR++S+P DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG 
Sbjct: 216 VQNEVWKEQKHFSVDAYRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGG 275

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+
Sbjct: 276 VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPST 335

Query: 334 KNLVKEITSA 343
           KN+ +    A
Sbjct: 336 KNMARNYDDA 345


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 229/310 (73%), Positives = 258/310 (83%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q  II+ +N++P AGW A  N   +NYT+ QFKH+LGVKPTP GLL GVP KT+ KS 
Sbjct: 33  IIQKDIIETINKHPNAGWTAGHNAYLANYTIEQFKHILGVKPTPPGLLAGVPTKTYSKSE 92

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
           +LPK FDARS W  CSTI  ILDQGHCGSCWAFGAVE L DRFCIH  +N+SLS NDL+A
Sbjct: 93  ELPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVA 152

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGF+CGDGCDGGYPI AW+YFV  GVVTEECDPYFD  GC HPGCEPAY TPKC +KC 
Sbjct: 153 CCGFMCGDGCDGGYPIKAWQYFVQSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCK 212

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
            +NQ+W   KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG 
Sbjct: 213 VQNQVWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGG 272

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEE+VVAG+PS+
Sbjct: 273 VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMPST 332

Query: 334 KNLVKEITSA 343
           KN+     SA
Sbjct: 333 KNMAGNHGSA 342


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 237/327 (72%), Positives = 265/327 (81%), Gaps = 31/327 (9%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VSKLKL+S ILQDSI+++VNENP AGW+A  NPQFSNY+VG+FK+LLGVKPTP   L GV
Sbjct: 30  VSKLKLNSRILQDSIVQKVNENPNAGWEATMNPQFSNYSVGEFKYLLGVKPTPGKELRGV 89

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
           P+                               GHCGSCWAFGAVE+LSDRFCIH+GMNL
Sbjct: 90  PL-------------------------------GHCGSCWAFGAVESLSDRFCIHYGMNL 118

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
           SLSVNDLLACCG++CGDGCDGGYPI AWRYFV  GVVTEECDPYFD  GCSHPGCEP +P
Sbjct: 119 SLSVNDLLACCGWMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEPGFP 178

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TPKC RKC  KN+LW  SKH+S++AYRI+SDP  IMAE+  NGPVEV+FTVYEDFAHYKS
Sbjct: 179 TPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAHYKS 238

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVYKHITGDVMGGHAVKLIGWGTSDDGEDYW+LANQWNR WG DGYFKI+RG+NECGIEE
Sbjct: 239 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEE 298

Query: 325 DVVAGLPSSKNLVKEITSADMFEDASA 351
           DVVAGLPS++NLV+E+   D  E ASA
Sbjct: 299 DVVAGLPSTRNLVREVAKIDAHEHASA 325


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 223/299 (74%), Positives = 251/299 (83%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
           II+ VN +P AGW A  NP  +NYT+ QFKH+LGVKPTP GL   V  KTH +S +LPK 
Sbjct: 1   IIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRSEQLPKV 60

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFL 158
           FDARS W  CSTI +ILDQGHCGSCWAFGAVE L DRFCIH  MN++LS NDL+ACCGF+
Sbjct: 61  FDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFM 120

Query: 159 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 218
           CGDGCDGGYPISAW+YFV +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+
Sbjct: 121 CGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQV 180

Query: 219 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
           W   KH+SI+AY++NSDP DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGH
Sbjct: 181 WEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGH 240

Query: 279 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           AVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+ 
Sbjct: 241 AVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNIA 299


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 225/307 (73%), Positives = 256/307 (83%), Gaps = 3/307 (0%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q  II+ VN++P AGW A  NP F+NYT+ QFKH+LGVKPTP GLL GVP+K H + +
Sbjct: 37  IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            LPK FDAR+ W  CSTI  ILDQGHCG+CWAF AVEAL DRFCIH  M++SLSVNDLLA
Sbjct: 96  DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLA 155

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC 
Sbjct: 156 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCK 215

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHIT 271
            +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT  +  DFAHYKSGVYKHIT
Sbjct: 216 VENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHIT 275

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGIE DV AG+P
Sbjct: 276 GGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMP 335

Query: 332 SSKNLVK 338
           S+KN  +
Sbjct: 336 STKNTAR 342


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 223/335 (66%), Positives = 265/335 (79%), Gaps = 7/335 (2%)

Query: 11  ILCLTCFATFAEGVVSKL------KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           + CLT     A  + + L      K    IL++ I++E+N +P AGWKA  N +FSN+TV
Sbjct: 6   LFCLTVLVAMAATLQASLLESFPAKNQDRILKEPIVEEINRHPNAGWKAGMNSRFSNHTV 65

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           GQFK LLGV PTP+  L  VPV T+ K + LPK FDAR AWPQC+++  ILDQGHCGSCW
Sbjct: 66  GQFKRLLGVLPTPRNFLENVPVITYPKGMNLPKQFDAREAWPQCTSVQTILDQGHCGSCW 125

Query: 125 AFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           AFGAVEALSDRFCIH  +N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+  GVVT E
Sbjct: 126 AFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAE 185

Query: 185 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
           CDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AYRI+S P DIMAE+Y
Sbjct: 186 CDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVY 245

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPVEVSF+VYEDFAHYKSGVYK+  GD MGGHAVKL+GWGT +DG DYW++AN WN +
Sbjct: 246 TNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTA 304

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 339
           WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 222/306 (72%), Positives = 255/306 (83%), Gaps = 2/306 (0%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q+ II+ VN +P AGW A  NP  +NYT+ QFKH+LGVKPTP GLL GVP KT+ +S 
Sbjct: 34  IIQEDIIRTVNSHPNAGWTAGHNPYLANYTIEQFKHILGVKPTPPGLLAGVPTKTYSRSE 93

Query: 94  K--LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 151
           K  LPK FDARS W  CSTI +ILDQGHCG+CWAFGAVE L DRFCIH  +N+SLSVNDL
Sbjct: 94  KAELPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDL 153

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +ACCGFLCGDGCDGGYPI AW+YFV +GVVT+ECDP+FD  GC HPGCEPAYPTP C +K
Sbjct: 154 VACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQHPGCEPAYPTPVCEKK 213

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C  +NQ+W   KH+SI AY++NSDP DIMAE+YKNGPVEVSF +YEDFAHYKSGVYK IT
Sbjct: 214 CKVQNQVWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQIT 273

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G ++GGHA KLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG+NECGIE DV AG+P
Sbjct: 274 GRMVGGHAAKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMP 333

Query: 332 SSKNLV 337
           S+KN+ 
Sbjct: 334 STKNIA 339


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 223/335 (66%), Positives = 265/335 (79%), Gaps = 7/335 (2%)

Query: 11  ILCLTCFATFAEGVVSKL------KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           + CLT     A  + + L      K    IL++ I++E+N +P AGWKA  N +FSN+TV
Sbjct: 6   LFCLTVLVAMAATLQASLLESFPAKNQDRILKEPIVEEINRHPNAGWKAGMNSRFSNHTV 65

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           GQFK LLGV PTP+  L  VPV T+ K + LPK FDAR AWPQC+++  ILDQGHCGSCW
Sbjct: 66  GQFKRLLGVLPTPRNFLENVPVITYPKGINLPKQFDAREAWPQCTSVQTILDQGHCGSCW 125

Query: 125 AFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           AFGAVEALSDRFCIH  +N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+  GVVT E
Sbjct: 126 AFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAE 185

Query: 185 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
           CDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AYRI+S P DIMAE+Y
Sbjct: 186 CDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVY 245

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPVEVSF+VYEDFAHYKSGVYK+  GD MGGHAVKL+GWGT +DG DYW++AN WN +
Sbjct: 246 TNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTA 304

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 339
           WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 223/338 (65%), Positives = 269/338 (79%), Gaps = 3/338 (0%)

Query: 4   TKLIMDPILCLTCFATFAEGVVSKLKLDSH--ILQDSIIKEVNENPKAGWKAARNPQFSN 61
           ++L+   ++ +   AT    +V      S   IL++ I++E+N +PKAGWKA  N +FSN
Sbjct: 3   SRLLFCLMVLVAMAATPQASLVESFPAQSQDRILKEPIVEEINRHPKAGWKAGMNSRFSN 62

Query: 62  YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 121
           +TVGQFK LLGV PTP+ LL  VPV+T+ K L LPK FDAR AWPQC+++  ILDQGHCG
Sbjct: 63  HTVGQFKRLLGVLPTPRNLLENVPVRTYPKGLNLPKQFDARKAWPQCTSVRTILDQGHCG 122

Query: 122 SCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           SCWAFGAVEALSDRFCIH+ +N++LS NDL+ACCGF CGDGCDGGYP+SAW+YF+  GVV
Sbjct: 123 SCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDGCDGGYPLSAWQYFISTGVV 182

Query: 182 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 241
           T ECDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AYRI S P DIMA
Sbjct: 183 TAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNSKRFSATAYRITSKPYDIMA 242

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+Y  GPVEV F VYEDFAHYKSGVYK+ITGD +GGHAVKLIGWGT ++G DYW++AN W
Sbjct: 243 EVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGT-ENGTDYWLVANSW 301

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 339
           N +WG DGYFKI RGSNEC IEEDVVAG+PS+KNLV +
Sbjct: 302 NTAWGEDGYFKIARGSNECSIEEDVVAGMPSTKNLVMD 339


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 216/341 (63%), Positives = 263/341 (77%), Gaps = 1/341 (0%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           M  T L +   + L C           L+    ILQ S ++ +N++P AGWKAA + +FS
Sbjct: 1   MATTILTVFTTVLLACIKVSGLESFHSLESQRPILQKSFVEHINKHPNAGWKAAMSTRFS 60

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           NYTV +F HLLGV PTP+ LL  VPV+ + K LKLP  FDAR AWP C++   ILDQGHC
Sbjct: 61  NYTVREFAHLLGVLPTPQKLLETVPVRVYPKGLKLPSKFDARKAWPHCTSTRSILDQGHC 120

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAF AVEALSDRFCIHF +N +LS NDL+ACCGF CG GC+GG+P+SAWRYF   GV
Sbjct: 121 GSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWRYFSRRGV 180

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 240
           VT+ECDPYFD+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI SDP +IM
Sbjct: 181 VTDECDPYFDNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIKSDPYNIM 239

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
           AE++ NGPVEVSF+VYEDFAHY++GVYKH+ G  +GGHAVKLIGWGT+DDG DYW++AN 
Sbjct: 240 AEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLIANS 299

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 341
           WN +WG  GYFKI RG NECGIE D VAG+PS+KNL+++ T
Sbjct: 300 WNTAWGEGGYFKIARGVNECGIERDPVAGMPSAKNLIQDPT 340


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 203/275 (73%), Positives = 231/275 (84%), Gaps = 3/275 (1%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           I+Q  II+ VN++P AGW A  NP F+NYT+ QFKH+LGVKPTP GLL GVP+K H + +
Sbjct: 37  IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            LPK FDAR+ W  CSTI  ILDQGHCG+CWAF AVEAL DRFCIH  M++SLSVNDLLA
Sbjct: 96  DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLA 155

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CCGFLCG GC+GGYPISAWRYF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC 
Sbjct: 156 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCK 215

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHIT 271
            +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT  +  DFAHYKSGVYKHIT
Sbjct: 216 VENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHIT 275

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           G VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG
Sbjct: 276 GGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWG 310


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 202/315 (64%), Positives = 240/315 (76%), Gaps = 3/315 (0%)

Query: 27  KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV-P 85
           KL L   +LQ SI+  VN +P AGWKA  N +F N+TV  FK L GV P     +  + P
Sbjct: 30  KLDLGRPLLQKSIVDIVNNDPNAGWKAGFNERFINHTVRDFKRLCGVLPKSSEEVQPLRP 89

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
           +++H ++L LPK FDAR AWPQCS+I  ILDQGHCGSCWAFGAVEAL+DRFCI    N+S
Sbjct: 90  LRSHPRTLDLPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVS 149

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           LS NDL+ACC   CG GCDGGYP +AW YF   GVVT +CDPYFD  GC HPGCEP Y T
Sbjct: 150 LSENDLVACCS-SCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDT 208

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           P CV++CV  N+ WR+SKH+++  Y +NSD  DI AEIYKNGPVEVS+TVYEDFAHYKSG
Sbjct: 209 PVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 267

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH+ G+V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F+I RGSNECGIE +
Sbjct: 268 VYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFFQISRGSNECGIESE 327

Query: 326 VVAGLPSSKNLVKEI 340
            VAG+P  K    +I
Sbjct: 328 PVAGIPLKKTGFSDI 342


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 204/329 (62%), Positives = 245/329 (74%), Gaps = 7/329 (2%)

Query: 17  FATFAEGV----VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 72
           F+  A+GV      KL L   +LQ SI+  VN +P AGWKA  N +F N+TV  FK L G
Sbjct: 5   FSAVAQGVRVAESGKLDLGRPLLQKSIVDIVNNDPNAGWKAGFNERFINHTVRDFKRLCG 64

Query: 73  VKPTPKGLLLGV-PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           V P     +  + P+++H ++L LPK FDAR AWPQC++I  ILDQGHCGSCWAFGAVEA
Sbjct: 65  VLPKSSEEVQPLRPLRSHPRTLDLPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEA 124

Query: 132 LSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 191
           L+DRFCI    N+SLS NDL+ACC   CG GC+GGYP +AW YF   GVVT +CDPYFD 
Sbjct: 125 LTDRFCILNNENVSLSENDLVACCS-SCGFGCEGGYPYAAWEYFAQTGVVTSQCDPYFDG 183

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
            GC HPGCEP Y TP CV++CV  N+ WR+SKH+++  Y +NSD  DI AEIYKNGPVEV
Sbjct: 184 KGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEV 242

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           S+TVYEDFAHYKSGVYKH+ G V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F
Sbjct: 243 SYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFF 302

Query: 312 KIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           +I RGSNECGIE + VAG+P  K    +I
Sbjct: 303 QISRGSNECGIESEPVAGIPLKKTGFSDI 331


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 195/339 (57%), Positives = 234/339 (69%), Gaps = 7/339 (2%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHIL-QDSIIKEVNENPKAGWKAARNPQF 59
           M+P  L++   LC    A  A  V   L     ++ Q  ++ +VN +P+A WKA  N +F
Sbjct: 1   MKPISLLL---LCSVILAAQAARVEPDLLESKRLIHQQLLVDKVNAHPRATWKAGFNDRF 57

Query: 60  SNYTVGQFKHLLGVKPTPKGLLL-GVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQ 117
             +T+   K + G K TP   L   +   TH  K L LPK FDAR  W  CSTI  ILDQ
Sbjct: 58  EGHTIEHLKKICGAKMTPANELEPSIERVTHKHKKLVLPKEFDARKHWGHCSTIGAILDQ 117

Query: 118 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           GHCGSCWAFGA E+L+DRFCIH   ++SLS NDLLACCGF CGDGCDGGYPI AWRYF  
Sbjct: 118 GHCGSCWAFGAAESLTDRFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKR 177

Query: 178 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 237
            GVVT +CDPYFD  GC HPGC P Y TPKCV+ CV  ++LW  SKH S++AY ++ +PE
Sbjct: 178 TGVVTSKCDPYFDQIGCGHPGCYPTYRTPKCVKHCV-DDELWVKSKHLSVNAYEVSKEPE 236

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
           D+MAE+Y NGP+EVSF V+EDFAHYK+GVYKH+ G  +GGHAVKLIGWGT+DDG DYW +
Sbjct: 237 DLMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTI 296

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
            N WN +WG  G F+I RG NECGIE   VAGLP  K L
Sbjct: 297 VNSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDKGL 335


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 179/226 (79%), Positives = 202/226 (89%)

Query: 118 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           GHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV 
Sbjct: 1   GHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVR 60

Query: 178 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 237
           +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+S++AYR+NSDP 
Sbjct: 61  NGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPH 120

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
           DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+L
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 180

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 343
           ANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+   SA
Sbjct: 181 ANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 226


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 182/311 (58%), Positives = 230/311 (73%), Gaps = 3/311 (0%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL-GVPV 86
           L+ +  I Q S++ ++N +P A WKA  N +F+ +TV   K + G K TP   +   +  
Sbjct: 32  LENNRLIHQQSLVDKINAHPGATWKAGLNDRFAKHTVEHLKKMCGAKMTPANEVEPSIER 91

Query: 87  KTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
            TH  K+L LP  FDAR  W  CSTI  ILDQGHCGSCWAFGAVE+L+DRFCIH   ++S
Sbjct: 92  VTHKHKNLDLPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCIHLNESVS 151

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           LS NDLLACCGF CGDGC+GGYPI AW+YF   GVVT +CDPYFD  GC HPGC P Y T
Sbjct: 152 LSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGCGHPGCYPTYDT 211

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           PKC ++CV  ++LW +SKH  +SAY ++ +PE++MAE++ NGP+EV+F V+EDFAHYK+G
Sbjct: 212 PKCFKRCV-DDELWVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAFDVFEDFAHYKTG 270

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH+ G  +GGHAVKL+GWGT+DDG DYW + N WN +WG DG F+I RG +ECGIE +
Sbjct: 271 VYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECGIESN 330

Query: 326 VVAGLPSSKNL 336
            VAGLPS+K L
Sbjct: 331 AVAGLPSNKGL 341


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 181/305 (59%), Positives = 222/305 (72%), Gaps = 3/305 (0%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL-GVPVKTHD-K 91
           I Q +++ +VN +P A W A  N +F+ +T+   K + G   TP   L   +   +H  K
Sbjct: 40  IHQQALVDKVNAHPGATWTAGFNERFAKHTIEHLKKMCGAILTPANKLEPSIETISHKHK 99

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 151
            L LPK FDAR  W  C TI  IL QGHCGSCWAFGAVE+L+DRFCIH   ++SLS NDL
Sbjct: 100 KLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDL 159

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           LACCGF CG GC+GGYPI AW+YF H GVVT +CDPYFD  GC+HPGC P Y TPKC ++
Sbjct: 160 LACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYPTYETPKCEKQ 219

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           CV  ++ W  SKH  ++AY ++ +PED+MAE+Y NGPVEV+F VYEDFAHYK+GVYKH+ 
Sbjct: 220 CV-DDEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLF 278

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G  MGGHAVKLIGWGT+DDG DYW + N WN +WG DG F+I RG++ECGIE + VAGLP
Sbjct: 279 GGFMGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEDGLFRIVRGNDECGIESNAVAGLP 338

Query: 332 SSKNL 336
           S K L
Sbjct: 339 SRKGL 343


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 160/202 (79%), Positives = 178/202 (88%)

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 201
           M++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 1   MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           AYPTPKC +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61  AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 180

Query: 322 IEEDVVAGLPSSKNLVKEITSA 343
           IEE VVAG+PS+KN+V     A
Sbjct: 181 IEEGVVAGMPSTKNMVPNFGGA 202


>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
           unguiculata]
          Length = 195

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 155/189 (82%), Positives = 173/189 (91%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           VPV +H KSLKLP +FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +N
Sbjct: 7   VPVISHPKSLKLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVN 66

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 203
           +SLSVNDLLACCGFLCG GC+GGYP+SAWRY  +HGVVTEECDPYFD TGCSHPGCEPAY
Sbjct: 67  ISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPAY 126

Query: 204 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
            TPKCV+KCV  NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHYK
Sbjct: 127 RTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHYK 186

Query: 264 SGVYKHITG 272
           SGVYKH+TG
Sbjct: 187 SGVYKHVTG 195


>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
          Length = 201

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 154/192 (80%), Positives = 173/192 (90%)

Query: 83  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 142
            + V +H KSLKLP +FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +
Sbjct: 6   ALTVISHPKSLKLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDV 65

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 202
           N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY  +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66  NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125

Query: 203 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
           Y TPKCV+KCV  NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185

Query: 263 KSGVYKHITGDV 274
           KSGVYKH+TG V
Sbjct: 186 KSGVYKHVTGYV 197


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 164/200 (82%)

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
            L    F  G    GGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVR
Sbjct: 9   FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68

Query: 211 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
           KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69  KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           TG  +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGL
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGL 188

Query: 331 PSSKNLVKEITSADMFEDAS 350
           PS+KN+ + +   D   D S
Sbjct: 189 PSTKNMGRWVMDMDADADVS 208


>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 218

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 147/194 (75%), Positives = 165/194 (85%)

Query: 21  AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 80
           AE  +S+ KL+S ILQDSI+K VNEN +AGWKAA NPQ SN+TV QFK LLGVKP  +G 
Sbjct: 24  AEKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQFKRLLGVKPAREGD 83

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
           L G+PV TH +  +LPK FDAR AWPQCSTI +ILDQGHCGSCWAFGAVE+LSDRFCIH+
Sbjct: 84  LEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHY 143

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 200
            +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF   GVVTEECDPYFD+TGCSHPGCE
Sbjct: 144 NLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCE 203

Query: 201 PAYPTPKCVRKCVK 214
           P YPTPKC RKCVK
Sbjct: 204 PLYPTPKCHRKCVK 217


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 140/176 (79%), Positives = 158/176 (89%)

Query: 169 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 228
           + AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + 
Sbjct: 1   MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTS
Sbjct: 61  AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           DDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 121 DDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 134/166 (80%), Positives = 150/166 (90%)

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1   CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
           KHYS+  Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG  +GGHAVKL
Sbjct: 61  KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120

Query: 283 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            GWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIEEDV A
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166


>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 206

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 134/164 (81%), Positives = 144/164 (87%)

Query: 36  QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           Q+SI KEVNENP AGWKAA NP+FSN TVGQFK LLGVK TP+  L  +PV TH KSL L
Sbjct: 43  QESIAKEVNENPGAGWKAAINPRFSNSTVGQFKRLLGVKQTPRNELSSIPVVTHPKSLNL 102

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 155
           PK FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACC
Sbjct: 103 PKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACC 162

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 199
           GFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGC
Sbjct: 163 GFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 200/326 (61%), Gaps = 28/326 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 313

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 314 RGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 199/326 (61%), Gaps = 28/326 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW +    G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 313

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 314 RGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 200/326 (61%), Gaps = 28/326 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 313

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 314 RGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 152/345 (44%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A+   ++ +   H L D ++  VN+     W+A  N  F N  V   K L G 
Sbjct: 8   LCCLLALAD---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
             DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 ERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 200/326 (61%), Gaps = 28/326 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 313

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 314 RGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H + D ++  VN+     W+A  N  F N  +G  K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMGYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H + D ++  VN+     W+A  N  F N  +G  K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMGYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGA 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 206/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP+SFDAR  WPQC T+  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 150/311 (48%), Positives = 191/311 (61%), Gaps = 19/311 (6%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 96
           ++I+K VN+     WKA+ N   + Y     K L GVK    G         + + +K+P
Sbjct: 55  NAIVKTVNK-ANTTWKASLNFDPTYYVPEDLKLLCGVKEDKHGYSKLETSYHNLEGIKIP 113

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 154
             FD+R  WP C +IS I DQG CGSCWAFGAVEA+SDR+CI     + + +S  DLL+C
Sbjct: 114 NQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLLSC 173

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEP 201
           CGF CGDGC+GG+P SAW+Y+   G+VT         C PY     C H      P C  
Sbjct: 174 CGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPY-QIKPCEHHVPGDRPKCSE 232

Query: 202 AYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
              TP CV KC     +  N  KHY +S+Y + SDP  I  EI  +GPVE +FTVY DF 
Sbjct: 233 GGGTPSCVSKCKGNTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFP 292

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            YKSGVYKH+TG V+GGHA++++GWG S++G  YW++AN WN  WG  GYFKI RGS+EC
Sbjct: 293 TYKSGVYKHVTGGVLGGHAIRILGWG-SENGVAYWLVANSWNTDWGDKGYFKILRGSDEC 351

Query: 321 GIEEDVVAGLP 331
           GIE  VVAG+P
Sbjct: 352 GIESSVVAGIP 362


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 205/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 205/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H + D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 204/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H L D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGP E +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 196/319 (61%), Gaps = 28/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  +   K L G     P P   ++       
Sbjct: 8   HPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTFLGGPKPPQRVM------F 58

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 59  TEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 118

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 119 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 178

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 179 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 238

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 239 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKIL 297

Query: 315 RGSNECGIEEDVVAGLPSS 333
           RG + CGIE +VVAG+P +
Sbjct: 298 RGQDHCGIESEVVAGIPRT 316


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 203/328 (61%), Gaps = 33/328 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-----KPTPKGLLLGVPVKTH 89
           L D ++  VN+     WKA  N  F N  +   K L G      K  P+ ++L       
Sbjct: 26  LSDEMVNYVNK-LNTTWKAGHN--FRNVDMSYVKKLCGTVMGGAKQLPQRVMLA------ 76

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D  +KLP++FDAR  WP+C TI  I DQG CGSCWAFGAVEA+SDR C+H    + + +S
Sbjct: 77  DDDMKLPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHTNGYITIEVS 136

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PG 198
             DLL+CCG  CG+GC+GG+P  AW+Y++  G+V+         C PY     C H   G
Sbjct: 137 AEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNG 195

Query: 199 CEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
             PA       TPKC +KC    +  +++ KHY  +AY + S  ++IMAEIYKNGPVE +
Sbjct: 196 SRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGA 255

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VY DF  YKSGVY+H+TGD++GGHA++++GWG  +DG  YW+ AN WN  WG +G+FK
Sbjct: 256 FIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGV-EDGVPYWLAANSWNTDWGDNGFFK 314

Query: 313 IKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           I RG + CGIE ++VAG+P ++   K+I
Sbjct: 315 ILRGKDHCGIESEMVAGIPRTEQYWKKI 342


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 204/345 (59%), Gaps = 31/345 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H + D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH   ++S+ V+  DLL CCG  CGDGC+GGYP  AW ++   G+V+      
Sbjct: 116 AISDRICIHTNAHVSVEVSAEDLLTCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175

Query: 185 ---CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              C PY           S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 176 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 295 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 147/327 (44%), Positives = 195/327 (59%), Gaps = 30/327 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 90
           H L D ++  VN+     W+A  N  F N  +   K L G         LG P       
Sbjct: 15  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDLSYLKRLCGT-------FLGGPKPPQRVK 64

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 148
             + L LP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V
Sbjct: 65  FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 124

Query: 149 N--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           +  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 125 SAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNG 184

Query: 195 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TPKC + C    +  ++  KHY   +Y ++++  DIMAEIYKNGPVE +F
Sbjct: 185 SRPPCTGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAF 244

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI
Sbjct: 245 SVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKI 303

Query: 314 KRGSNECGIEEDVVAGLPSSKNLVKEI 340
            RG + CGIE +VVAG+P +    + I
Sbjct: 304 LRGQDHCGIESEVVAGIPRTDQYWRNI 330


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 151/343 (44%), Positives = 203/343 (59%), Gaps = 32/343 (9%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL    +  +      K   H L D +I  +N+     W+A RN  F N  +   K 
Sbjct: 7   PLSCLLALTSAHD------KPSFHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKK 57

Query: 70  LLG-VKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
           L G V   PK     +P +    + + LP+SFDAR  W  C TI++I DQG CGSCWAFG
Sbjct: 58  LCGTVLGGPK-----LPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFG 112

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE- 184
           AVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+   
Sbjct: 113 AVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGV 172

Query: 185 ------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 232
                 C PY           S P C     TPKC + C    +  ++  KHY  ++Y +
Sbjct: 173 YNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSV 232

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G 
Sbjct: 233 SDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGV 291

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
            YW++AN WN  WG +G+FKI RG N CGIE ++VAG+P +++
Sbjct: 292 PYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQD 334


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 201/344 (58%), Gaps = 36/344 (10%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL    +  +      K  SH L D +I  +N+     W+A RN  F N  +   K 
Sbjct: 7   PLSCLLALTSAHD------KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKK 57

Query: 70  LLGVKPTPKGLLLGVPVKTH----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           L G        +LG P         + + LP+SFDAR  W  C TI++I DQG CGSCWA
Sbjct: 58  LCGT-------VLGGPNLPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWA 110

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+ 
Sbjct: 111 FGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSG 170

Query: 184 E-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAY 230
                   C PY           S P C     TPKC + C    +  ++  KHY  ++Y
Sbjct: 171 GVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSY 230

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++
Sbjct: 231 SVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-EN 289

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG+P ++
Sbjct: 290 GVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQ 333


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 156/348 (44%), Positives = 207/348 (59%), Gaps = 31/348 (8%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
            +P+  +       E ++  L+ D+    D II +VN +    WKA  N   SNY     
Sbjct: 15  FNPLNWIENVGKRVEKLIENLEHDNF---DDIIAKVN-SADLSWKAGANFN-SNYAP--- 66

Query: 68  KHLLGVKPTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
           KH+ G+  T  G   L V    +D  L+LP +FD+R AWP C +IS + DQG CGSCWAF
Sbjct: 67  KHVAGLCGTIMGDDRLPVNHLLNDADLELPANFDSREAWPDCPSISEVRDQGSCGSCWAF 126

Query: 127 GAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GA EA+SDR CIH        LS  DLL+CCG++CG+GC+GG+P +AW Y+V +G+V+  
Sbjct: 127 GASEAISDRTCIHSNAAFTFDLSSEDLLSCCGYVCGNGCNGGFPQAAWEYWVQNGLVS-- 184

Query: 185 CDPYFDSTGCSHPGCEPAY---------------PTPKCVRKCVKK-NQLWRNSKHYSIS 228
               +  TGC     EP                  TPKC  KCV      +   KHY   
Sbjct: 185 -GGLYHGTGCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSV 243

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           AYRI ++ + IM EIYKNGPVE +F VYEDF  YKSGVY H TG  +GGHA++++GWG  
Sbjct: 244 AYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWG-E 302

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           ++GE YW+  N WN  WG +G+FKIKRG NECGIE ++V G+P+S++L
Sbjct: 303 ENGEKYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIPASESL 350


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 192/320 (60%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 90
           H L + ++  VN+     W+A  N  F N  +   K L G         LG P       
Sbjct: 36  HPLSEELVNYVNKQ-NTTWQAGHN--FYNVDLSYLKRLCGT-------FLGGPKPPQRVK 85

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 148
             + L LP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V
Sbjct: 86  FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 145

Query: 149 N--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           +  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 146 SAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNG 205

Query: 195 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TPKC + C       ++  KHY  ++Y +++   DIMAEIYKNGPVE +F
Sbjct: 206 SRPPCTGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF 265

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI
Sbjct: 266 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKI 324

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG + CGIE +VVAG+P +
Sbjct: 325 LRGQDHCGIESEVVAGIPRT 344


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 202/342 (59%), Gaps = 32/342 (9%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL    +  +      K   H L D +I  +N+     W+A RN  F N  +   K 
Sbjct: 7   PLSCLLALTSAHD------KPSFHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKK 57

Query: 70  LLG-VKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
           L G V   PK     +P +    + + LP+SFDAR  W  C TI++I DQG CGSCWAFG
Sbjct: 58  LCGTVLGGPK-----LPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFG 112

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE- 184
           AVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++   G+V+   
Sbjct: 113 AVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGV 172

Query: 185 ------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 232
                 C PY           S P C     TPKC + C    +  ++  KHY  ++Y +
Sbjct: 173 YNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSV 232

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G 
Sbjct: 233 SDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGV 291

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
            YW++AN WN  WG +G+FKI RG N CGIE ++VAG+P ++
Sbjct: 292 PYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQ 333


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 149/346 (43%), Positives = 199/346 (57%), Gaps = 30/346 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           LT  +       ++ +L    L D ++  VN+     WKA  N  F N      + L G 
Sbjct: 5   LTTLSCLVMLTGAQSRLPFRALSDELVDYVNKR-NTTWKAGHN--FHNVDPSYLRRLCGT 61

Query: 74  KPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                   LG P         K+L LP+SFDAR  WP C TI  I DQG CGSCWAFGAV
Sbjct: 62  -------FLGGPKLPQRVQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 184
           EA+SDR CI     +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 115 EAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYD 174

Query: 185 ----CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINS 234
               C PY           S P C     TPKC + C    +  ++  KHY  S+Y ++ 
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSD 234

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+H+TG++MGGHAV+++GWG  +DG  Y
Sbjct: 235 NEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPY 293

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           W++ N WN  WG +G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 294 WLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 187/317 (58%), Gaps = 25/317 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL 93
             D +I+ VNE   A WKAAR+ +FSN  V  FK HL  +  TP+      P   HD S 
Sbjct: 26  FSDELIRFVNEESGASWKAARSTRFSN--VDHFKLHLGALSETPEERNALRPTIKHDISK 83

Query: 94  K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDARS WPQC TIS I DQ  CGSCWA  A  A+SDR CIH    M   L+  D
Sbjct: 84  NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAAD 143

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 201
            L+CC + CG GC GGYP  AW Y++  G+VT         C P+   T C H G     
Sbjct: 144 PLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKY 201

Query: 202 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                  YPTP C R C    N+ +   K Y  S+Y +      IM EI KNGPVEV+F 
Sbjct: 202 SRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFA 261

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           +++DF  Y+SG+Y H+ G  +G HAV++IGWG  ++G +YW++AN WN  WG +GYF++ 
Sbjct: 262 IFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMV 320

Query: 315 RGSNECGIEEDVVAGLP 331
           RG NECGIE +VVAG+P
Sbjct: 321 RGRNECGIESEVVAGMP 337


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 154/351 (43%), Positives = 204/351 (58%), Gaps = 38/351 (10%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILC+      A  V     L S ++    I ++N      WKA  N  F N  +   K L
Sbjct: 7   ILCVLVAFANARSVPYYRPLSSDLVNH--INKLNTT----WKAGHN--FYNTDMSYVKQL 58

Query: 71  LGVKPTPKGLLLGVPVKTHDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            G         LG P K  ++      ++LP SFD+R+ WP C TIS I DQG CGSCWA
Sbjct: 59  CGT-------FLGGP-KLPERVDFAGDMELPDSFDSRTQWPNCPTISEIRDQGSCGSCWA 110

Query: 126 FGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR C+H    +S+ V+  DLL+CCGF CG GC+GGYP  AWRY+   G+V+ 
Sbjct: 111 FGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTEKGLVSG 170

Query: 184 E-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 229
                   C PY          G   P       TP+C R C    +  ++  KHY I++
Sbjct: 171 GLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITS 230

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+H+TG+ +GGHA++L+GWG  D
Sbjct: 231 YGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGV-D 289

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           +G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+PS++   K +
Sbjct: 290 NGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGIPSTERYWKRV 340


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 195/320 (60%), Gaps = 28/320 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKT-HD 90
           H L D +I  +N+     W+A RN  F N  +   K L G V   PK     +P +    
Sbjct: 7   HPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPK-----LPERVGFS 58

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           + + LP+SFDAR  W  C TI++I DQG CGS WAFGAVEA+SDR CIH    +N+ +S 
Sbjct: 59  EDINLPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVNVEVSA 118

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 196
            DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C H     
Sbjct: 119 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGA 177

Query: 197 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FT
Sbjct: 178 RPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFT 237

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           V+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 238 VFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKIL 296

Query: 315 RGSNECGIEEDVVAGLPSSK 334
           RG N CGIE ++VAG+P ++
Sbjct: 297 RGENHCGIESEIVAGIPRTQ 316


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 146/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  ILC+    TF E  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 1   MLTSILCIASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 60

Query: 67  FKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +   +  P       P   H D ++++P SFD+R  WP+C +I+ I DQ  CGSCWA
Sbjct: 61  IQ-MGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWA 119

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG    AW Y+V  G+VT 
Sbjct: 120 FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDYWVKEGIVTG 178

Query: 184 E-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C+PY        T   +P C    Y TP+C + C KK +  +   KH   S+
Sbjct: 179 SSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSS 238

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE  FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +
Sbjct: 239 YNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-E 297

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +   YW++AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 298 NKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 337


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  270 bits (691), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 199/347 (57%), Gaps = 34/347 (9%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L+C A       ++ +L+   L D ++  VN+     WKA  N  F N  +   K L G 
Sbjct: 8   LSCLAVL---TTARSRLEFQPLSDELVNYVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGT 61

Query: 74  KPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           K       LG P      SL     LP+SFDAR  WPQC TI  I DQG CGSCWAFGAV
Sbjct: 62  K-------LGGPKLPQRLSLAGDIALPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAV 114

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 184
           EA+SDR CI      N+ +S  DLL CCGF CG+GC+GG+P  AW ++   G+V+     
Sbjct: 115 EAISDRICIRSNGLQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYD 174

Query: 185 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 233
               C PY          G   P       TPKC + C    +  ++  KH+    Y + 
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVP 234

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
           SD ++IM EIYKNGPVE +F+VY DF  YKSGVY+H+TG+++GGHAV+++GWG  ++G  
Sbjct: 235 SDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGV-ENGTP 293

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P + +  + I
Sbjct: 294 YWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTGHYSERI 340


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 196/318 (61%), Gaps = 22/318 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
           H L D +I  +N+     WKA RN   S  ++   + L+GV P  K   L  P   H++ 
Sbjct: 26  HPLSDQMINFINK-INTTWKAGRNFDKS-ISMSYIRGLMGVNPKSKEYRL--PEFVHEEI 81

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP+SFDAR  W  C++I+ I DQ  CGSCWAFGA EA+SDR CIH   G+ +++S  
Sbjct: 82  PDDLPESFDAREKWSHCASINLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAE 141

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           DLL CC   CG GCDGGYP +AW Y+   G+V++        C PY        T  S P
Sbjct: 142 DLLDCCDS-CGAGCDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLP 200

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C    PTPKCV  C K   + +++ KH+    Y I+S+ + I  EI+KNGPVE  FTVY
Sbjct: 201 NCTGTVPTPKCVHLCRKGYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVY 260

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H +GDV+GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG
Sbjct: 261 ADFLSYKSGVYQHHSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRG 319

Query: 317 SNECGIEEDVVAGLPSSK 334
            +ECGIE+D+ AG+P  +
Sbjct: 320 KDECGIEDDINAGIPKDE 337


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 132/256 (51%), Positives = 173/256 (67%), Gaps = 16/256 (6%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--D 150
           LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+  D
Sbjct: 1   LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 60

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPG 198
           LL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S P 
Sbjct: 61  LLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPP 120

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY 
Sbjct: 121 CTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 180

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG 
Sbjct: 181 DFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQ 239

Query: 318 NECGIEEDVVAGLPSS 333
           + CGIE +VVAG+P +
Sbjct: 240 DHCGIESEVVAGIPRT 255


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 196/342 (57%), Gaps = 23/342 (6%)

Query: 8   MDPILCLTCFATF-AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           MD I  L  +A   AE   ++       L D I+  +N      WKAA+  +F   T+  
Sbjct: 1   MDSIWTLIMYALLCAESFRAEYIPSFESLSDEIVHYINHKANTTWKAAKYQRFK--TISD 58

Query: 67  FKHLLGVKPTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +LG  P P G  L    + +  +  +LP+SFDAR  WP CS+I+ I DQ +CGSCWA
Sbjct: 59  VRRVLGAVPDPNGFGLEKRCLLSTIREQELPESFDAREKWPYCSSIAEIRDQSNCGSCWA 118

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 182
           FGA  A+SDR CI  G      +S  DL+ CC   CG GC GGYP  AW Y+V +G+VT 
Sbjct: 119 FGAAGAISDRICIASGGKHQPRISPEDLVDCCA-DCGMGCQGGYPAQAWEYWVRNGLVTG 177

Query: 183 ------EECDPYFDSTGCSHPGCEPAYP------TPKCVRKCVKK-NQLWRNSKHYSISA 229
                 + C PY     C H    P  P      TP+CV+KC  +  + + N K Y + A
Sbjct: 178 DLYNTTDTCRPY-SFPPCEHHVVGPRKPCTGDPTTPQCVKKCQPEYPKTYENDKWYGLKA 236

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y I+SD E IM ++   GP+EV F VY DF  Y SGVY+H+ G ++GGHAV+L+GWG  +
Sbjct: 237 YSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGV-E 295

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           DG DYW++AN WN  WG  GYFKI+RG NECGIE D  AG P
Sbjct: 296 DGADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHP 337


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 194/318 (61%), Gaps = 22/318 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
           H L D +I  +N+     WKA RN   S  ++   + L+GV P  K   L   V  HD+ 
Sbjct: 26  HPLSDQMINFINK-INTTWKAGRNFDKS-ISMSYIRGLMGVHPKSKEYRLAEFV--HDEI 81

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
              LP+SFDAR  W  C++I  I DQ  CGSCWAFGA EA+SDR CIH    + + +S  
Sbjct: 82  PDDLPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAE 141

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHP 197
           DLL CC   CG GC+GGYP +AW Y+   G+VT       + C PY        T  S P
Sbjct: 142 DLLDCCDS-CGAGCNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLP 200

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C    PTPKCV  C K   + +++ KH+    Y I+SD + I  EI+KNGPVE  FTVY
Sbjct: 201 NCTGTVPTPKCVHLCRKGYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVY 260

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H +GDV+GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG
Sbjct: 261 ADFLSYKSGVYQHQSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRG 319

Query: 317 SNECGIEEDVVAGLPSSK 334
            +ECGIE+D+ AG+P ++
Sbjct: 320 KDECGIEDDINAGIPKNE 337


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 188/321 (58%), Gaps = 31/321 (9%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 90
           L D ++  VN+     WKA  N  F N  +   K L G        +LG P         
Sbjct: 49  LSDELVNYVNKR-NTTWKAGHN--FHNVDLSYVKRLCGT-------ILGGPKLPQRVWLA 98

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           + L LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CI  +  +N+ +S 
Sbjct: 99  EDLVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSA 158

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCS 195
            DLL CCGF CG+GC+GG+P  AW ++   G+V+         C PY          G  
Sbjct: 159 EDLLTCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSR 218

Query: 196 HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P       TPKC R C       ++  KH+  S+Y + S   +IMAEIYKNGPVE +F+
Sbjct: 219 PPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFS 278

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+TG++MGGHAV+++GWG  +DG  YW++ N WN  WG  G+FKI 
Sbjct: 279 VYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDSGFFKIL 337

Query: 315 RGSNECGIEEDVVAGLPSSKN 335
           RG + CGIE ++VAGLP ++ 
Sbjct: 338 RGQDHCGIESEIVAGLPCTEQ 358


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  ILC+    TF E  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 1   MLTSILCIASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 60

Query: 67  FKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +   +  P       P   H D ++++P +FD+R  WP C +I+ I DQ  CGSCW+
Sbjct: 61  IQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWS 119

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CI  G   N+ LS  DLL CC   CG GC+GG    AW Y+V  G+VT 
Sbjct: 120 FGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTA 178

Query: 184 E-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C+PY        T   +P C    Y TP+C + C +K +  +   KH   S+
Sbjct: 179 SSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSS 238

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +
Sbjct: 239 YNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-E 297

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +   YW++AN WN  WG +GYF+I RG +EC IE +V+AG
Sbjct: 298 NKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/339 (45%), Positives = 194/339 (57%), Gaps = 28/339 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKH 69
           +LC    AT A   +S   L+ H L D  I  +N + K  WKA RN  F  +T +   K 
Sbjct: 5   LLCAVVLATIA---LSYGGLNPHPLSDEFINAIN-SKKTTWKAGRN--FDIHTPLANIKK 58

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQC-STISRILDQGHCGSCWAFG 127
           LLGV P  K     + +K H   +  +P+SFDAR AWP+C S I  I DQ  CGSCWAFG
Sbjct: 59  LLGVLPK-KANARQLELKVHSVDVNAIPESFDAREAWPECASIIGDIRDQASCGSCWAFG 117

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           A EA+SDR CIH    + +S+S  DL  CC + CGDGC+GG+P  AW Y+   G+VT   
Sbjct: 118 AAEAMSDRICIHSNATVKVSISTEDLNTCC-YECGDGCNGGWPAEAWAYWAETGIVTGGK 176

Query: 183 ----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 232
               + C  Y     C H      P C    PTP+C ++C     +   S     SAY+ 
Sbjct: 177 YETKDGCKAY-TVPPCEHHTEGDLPACGDIVPTPQCKKECDAGVDIEYKSDLRKGSAYQT 235

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +SD   I  EI  NGPVE  F VYEDF +YKSGVY+  TG+  GGHA+K++GWG  +DG 
Sbjct: 236 SSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGV-EDGT 294

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            YW+ AN WN  WG  GYFKI RG NECGIE D++ G+P
Sbjct: 295 PYWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIP 333


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/327 (45%), Positives = 195/327 (59%), Gaps = 22/327 (6%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
            ++ K     L   +I  +N      WKAA +P+F   +V   + +LG  P P G  L  
Sbjct: 23  ANRHKFMHQPLSSELIHFINHEANTTWKAAPSPRFK--SVSDIRRMLGALPDPNGGHLPT 80

Query: 85  PVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GM 142
               +  SL +LPK FDAR  WP C +IS I DQ  CGSCWAFGAVEA+SDR CI   G+
Sbjct: 81  LCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGL 140

Query: 143 NLS-LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 194
           +   LS  +L+ACC   CG GC+GG+P SAW Y+   G+VT +       C PY +   C
Sbjct: 141 HKPFLSAENLVACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPY-EFPPC 198

Query: 195 SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            H      P CE    TPKC   C    N  +   K Y  + YR++S+ E IM E+ ++G
Sbjct: 199 EHHVVGPRPSCEGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEHG 258

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++G  YW++AN WN  WG 
Sbjct: 259 PVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWGD 317

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +GYFKI RG NECGIE DV AG+P  K
Sbjct: 318 NGYFKIIRGRNECGIESDVNAGIPKLK 344


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 132/262 (50%), Positives = 175/262 (66%), Gaps = 16/262 (6%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DL 151
           KLP SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DL
Sbjct: 1   KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 60

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGC 199
           L CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S P C
Sbjct: 61  LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 120

Query: 200 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY D
Sbjct: 121 TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD 180

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG +
Sbjct: 181 FLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQD 239

Query: 319 ECGIEEDVVAGLPSSKNLVKEI 340
            CGIE +VVAG+P +    ++I
Sbjct: 240 HCGIESEVVAGIPRTDQYWEKI 261


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 194/322 (60%), Gaps = 33/322 (10%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 90
           H L D ++  VN+     W+A RN  F N  +   K L G         LG P       
Sbjct: 24  HPLSDELVNYVNK-LNTTWQAGRN--FHNVDISYVKRLCGT-------YLGGPRLPQRVQ 73

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + L LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 74  FAEDLDLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--- 196
           S  DLL+CCG LCG+GC+GGYP  AW+Y+   G+V+         C PY     C H   
Sbjct: 134 SAEDLLSCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCEHHVN 192

Query: 197 ---PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
              P C      TPKC + C    +  ++  K+Y  S+Y + S  ++IMAEIYKNGPVE 
Sbjct: 193 GTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEA 252

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F+V+ DF  YKSGVYKH+ G+V+GGHA++++GWG  ++G  YW++ N WN  WG +G+F
Sbjct: 253 AFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDNGFF 311

Query: 312 KIKRGSNECGIEEDVVAGLPSS 333
           KI RG + CGIE +VVAG+P +
Sbjct: 312 KILRGEDHCGIESEVVAGIPRT 333


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 156/362 (43%), Positives = 212/362 (58%), Gaps = 44/362 (12%)

Query: 4   TKLIMDPILC-LTCFATFAEGVVSKLKLD--SHILQD---SIIKEVNENPKAGWKAARNP 57
           T LI+  +L  L  F  +     SK   D  +   Q+   +I K+VN + K  W+A  N 
Sbjct: 3   TALILTLVLSSLIGFGVYVYSKHSKFTFDEPNQAYQNKLGNIAKKVN-SLKTTWQAGENQ 61

Query: 58  QFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW-PQCSTISRIL 115
           ++ N  +   K  +GV + +  G+ L    K       LPK+FD+R  W  +C +++ + 
Sbjct: 62  RWQNMDIAGIKAHMGVLRESKSGINLE---KVSTVVENLPKNFDSRKQWGSKCPSLNEVR 118

Query: 116 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 175
           DQ  CGSCWAF A E+LSDR CIH G ++ LS  +L++CC   CGDGC+GGYP +A +YF
Sbjct: 119 DQSTCGSCWAFAAAESLSDRICIHTGEDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYF 177

Query: 176 VHHGVVTEECDPYFDSTGCS---------------HPGCEPAYPTPKCVRKC-----VKK 215
           V  G+VT   D + D+  C                +P C+   PTP+C +KC     VK+
Sbjct: 178 VKTGLVTG--DLFGDNNFCQAYSFPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKR 235

Query: 216 ---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
                L++  K YS+S     SDP+ IM EI  NGPVEV+FTVYEDF  YKSGVY+H+TG
Sbjct: 236 PYNEDLYKGQKSYSVS-----SDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTG 290

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           + +GGHAVK+IGWG  +D   YW++ N WN +WG  G FKI RGSNECGIE++VV  LP 
Sbjct: 291 EQLGGHAVKMIGWGVEND-TPYWLIVNSWNETWGDQGTFKILRGSNECGIEDEVVTALPQ 349

Query: 333 SK 334
            K
Sbjct: 350 KK 351


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 202/341 (59%), Gaps = 32/341 (9%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL   A+      +  K   H L D +I  +N+     W+A RN  F N  +   K 
Sbjct: 7   PLSCLLALAS------AHNKPSFHPLSDDLINYINKR-NTTWQAGRN--FHNVDISYLKR 57

Query: 70  LLG-VKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
           L G +   PK     +P +    + ++LP++FDAR  W  C TI +I DQG CGSCWAFG
Sbjct: 58  LCGTIMGGPK-----LPERVAFAEDMELPENFDAREQWSNCPTIKQIRDQGSCGSCWAFG 112

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE- 184
           AV A+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW +++  G+V+   
Sbjct: 113 AVGAMSDRLCIHTNGHVNVEVSAEDLLTCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGL 172

Query: 185 ------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 232
                 C PY           S P C     TPKC + C    +  ++  KHY  ++Y +
Sbjct: 173 YNSHVGCLPYTIPPCEHHVNGSRPQCTGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSV 232

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +++ ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++  
Sbjct: 233 SNNEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGV-ENSV 291

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            YW++AN WN  WG +G FKI RG + CGIE ++VAG+P +
Sbjct: 292 PYWLVANSWNVDWGDNGLFKILRGEDHCGIESEIVAGIPRT 332


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 132/265 (49%), Positives = 177/265 (66%), Gaps = 16/265 (6%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 149
           + LKLP SFDAR  WPQC TI  I DQG CGS WAFGAVEA+SDR CIH   ++S+ V+ 
Sbjct: 3   EDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSA 62

Query: 150 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 196
            DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY      +H     
Sbjct: 63  EDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122

Query: 197 PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+V
Sbjct: 123 PPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI R
Sbjct: 183 YSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILR 241

Query: 316 GSNECGIEEDVVAGLPSSKNLVKEI 340
           G + CGIE +VVAG+P +    ++I
Sbjct: 242 GQDHCGIESEVVAGIPRTDQYWEKI 266


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 142/330 (43%), Positives = 201/330 (60%), Gaps = 34/330 (10%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-----KPTPKGL 80
           +K +L    L D ++  +N+     W+A  N  F N  +   K L G      K  P+ +
Sbjct: 17  AKSRLSIPPLSDEMVNHINK-LNTTWQAGHN--FLNADMSYVKKLCGTFMGGAKLLPQRM 73

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
           +L         ++KLP++FDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR C+H 
Sbjct: 74  ILA-------DNMKLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHS 126

Query: 141 G--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS 191
               N+ +S  DLL+CCG  CGDGC+GG+P  AW ++   G+V+         C PY   
Sbjct: 127 NGNANVEVSAEDLLSCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SI 185

Query: 192 TGCSH--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 243
             C H   G  PA       TP C +KC +  +  +++ K+Y  ++Y + S  ++IMAEI
Sbjct: 186 PPCEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEI 245

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           YKNGPVE +F+VYEDF HYKSGVY+H+ G+++GGHA++++GWG  ++G  YW+ AN WN 
Sbjct: 246 YKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGV-ENGIRYWLAANSWNI 304

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            WG +G+FK  RG N CGIE +++AG+P +
Sbjct: 305 DWGDNGFFKFLRGKNHCGIESEIIAGIPRT 334


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 152/335 (45%), Positives = 192/335 (57%), Gaps = 22/335 (6%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 76
           + T  E    + K     L   +I  +N      WKAA   +F   TV   + +LG  P 
Sbjct: 19  YGTLNEIDARRHKRMYQPLSMELINFINYEANTTWKAAPTTRFR--TVSDIRRMLGALPD 76

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P G  L   + T   S +LPKSFDAR  WP C +IS I DQ  CGSCWAFGAVEA+SDR 
Sbjct: 77  PNGEQLET-LCTGYISDELPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRI 135

Query: 137 CIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDP 187
           CI         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C P
Sbjct: 136 CIKSKGKHKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQP 194

Query: 188 YFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIM 240
           Y +   C H      P C+    TP C   C    N  +   K Y    YRI+S+PE IM
Sbjct: 195 Y-EFPPCEHHVIGPLPSCDGDVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNPEAIM 253

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
            E+ +NGPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN 
Sbjct: 254 LELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANS 312

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           WN  WG  GYFKI RG NECGIE DV AG+P  KN
Sbjct: 313 WNSDWGDKGYFKIVRGKNECGIESDVNAGIPKIKN 347


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 151/315 (47%), Positives = 187/315 (59%), Gaps = 23/315 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           H L + +I+ VN      WKA RN      T+   + LLGV        L  P   H   
Sbjct: 25  HPLSEKMIEYVN-FMNTTWKAGRNFH-EGVTMKYIRGLLGVHKDNHKYRL--PSIRHAVP 80

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFD+R  WP C TIS I DQG CGSCWAFGA EA+SDR CIH    +N+ +S  D
Sbjct: 81  GDLPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAED 140

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------P 197
           LL CC   CG GC+GG+P SAW Y+V  G+VT         C PY  ++ C H      P
Sbjct: 141 LLTCCD-SCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIAS-CEHHTKGKLP 198

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TP+CV  C K  N  +R  K++   +Y I+   + I  EI  NGPVE +FTVY
Sbjct: 199 PCGDIVDTPQCVHMCEKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVY 258

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H+TG+ MGGHAV+++GWGT + G  YW++AN WN  WG  GYFKI RG
Sbjct: 259 ADFVTYKSGVYRHVTGEEMGGHAVRILGWGT-ESGTPYWLVANSWNTDWGDKGYFKILRG 317

Query: 317 SNECGIEEDVVAGLP 331
           S+ECGIE  +VAGLP
Sbjct: 318 SDECGIESSIVAGLP 332


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 189/323 (58%), Gaps = 29/323 (8%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT-PKGLLLGVPV 86
           L LD+      I+  VN      W A  N +F+  T+   K+L G K   PK     +PV
Sbjct: 152 LGLDAPAQSRDIVDFVNA-LGTTWTAGHNKRFTYNTLRHVKNLCGAKKGGPK-----LPV 205

Query: 87  KTHDKSLKLPKSFDAR--SAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 141
           K   K + LP SFD R  S WP C  +++ + DQG CGSCWAFGA EA++DR CI  +  
Sbjct: 206 KRIPKKMALPTSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQ 265

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  DL +CC   CG GC+GGYP +AW YF   G+VT       + C PY      
Sbjct: 266 NNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACD 324

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
              TG   P C    PTP C   C + N  W + KH+  S+Y + +D + IM EIY NGP
Sbjct: 325 HHVTGKYQP-CGDIQPTPACANSC-QNNATWSSDKHFGASSYSVGTDQQSIMTEIYTNGP 382

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE S+ VY DF  YKSGVY+H+TGD +GGHAVK+IGWG  D    YWI+AN WN  WG +
Sbjct: 383 VEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGV-DGSTPYWIVANSWNNDWGNN 441

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G+F I RGS+ECGIE+ +VAG+P
Sbjct: 442 GFFNILRGSDECGIEDGIVAGIP 464


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 192/322 (59%), Gaps = 24/322 (7%)

Query: 28  LKLDSHILQ-DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 86
           L+    IL  +SI  ++N     GWKA  N +F N T+   +  +G +   +G  + + V
Sbjct: 24  LRFAHDILGLESIANDINAR-NVGWKAGVNERFVNVTMDYIRKQMGTRL--EGSPVTLDV 80

Query: 87  KTHDKSLKLPKSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           K  +    LP SFD+R+ W   C ++  + DQ +CGSCWAFGAVEA++DR CI       
Sbjct: 81  KHVEVPADLPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQT 140

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 190
             +S  DLL CC F CGDGC+GGYP +AW Y+ + G+VT       + C PY        
Sbjct: 141 PHISAEDLLTCCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHH 200

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
           +TG   P C    PTP C R C +  N  + N KH+  S+Y +    + I  EI  NGPV
Sbjct: 201 TTGPYKP-CGDIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVRG-VDQIATEIMTNGPV 258

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           E +FTVY DF  YKSGVY+H +G  +GGHA+K+IGWG   DG DYWI+AN WN SWG DG
Sbjct: 259 EAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQ-DGTDYWIVANSWNDSWGNDG 317

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
           +F IK+G++ECGIE  VVAGLP
Sbjct: 318 FFWIKKGTDECGIESQVVAGLP 339


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 191/320 (59%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 130/254 (51%), Positives = 171/254 (67%), Gaps = 16/254 (6%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLL 152
           LP SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL
Sbjct: 1   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 200
            CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S P C 
Sbjct: 61  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 201 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
               TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 180

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + 
Sbjct: 181 LLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDH 239

Query: 320 CGIEEDVVAGLPSS 333
           CGIE +VVAG+P +
Sbjct: 240 CGIESEVVAGIPRT 253


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 150/337 (44%), Positives = 196/337 (58%), Gaps = 24/337 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           + C T     A   VS+ +   H L   +I  +N+     WKA  N  F +   G  K+L
Sbjct: 1   MWCQTLLVLAASLSVSRGRPHIHPLSSDMINYINK-LNTTWKAGHN--FHDVDYGYVKNL 57

Query: 71  LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
            G     KG  L + V++    +KLPK FDAR  WP+C T+  I DQG CGSCWAFGA E
Sbjct: 58  CGT--LLKGPKLPIMVQSAG-GMKLPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAE 114

Query: 131 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR CIH    +++ +S  DLL CC   CG GC+GGYP +AW ++   G+VT      
Sbjct: 115 AISDRICIHTKGKVSVEISSQDLLTCCDS-CGMGCNGGYPANAWEFWTEQGLVTGGLYNS 173

Query: 185 ---CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINS 234
              C PY          G   P       TP+CV +C       ++  KHY  ++Y + S
Sbjct: 174 HIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPS 233

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           + E I +EIYKNGPVE +F VYEDF  YKSGVY+H+TG  +GGHA+K+IGWG  ++G  Y
Sbjct: 234 EEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWG-EENGVPY 292

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           W+ AN WN  WG +G+FKI RGSN CGIE +VVAG+P
Sbjct: 293 WLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIP 329


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 194/343 (56%), Gaps = 35/343 (10%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L+C         ++  L    L D ++  +N+     W A  N  F N  +   K L G 
Sbjct: 8   LSCLVLLTS---ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT 61

Query: 74  KPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                   LG P      +      LPKSFDAR  WP C TI  I DQG CGSCWAFGAV
Sbjct: 62  -------FLGGPKLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 184
           EA+SDR CI     +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYD 174

Query: 185 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 233
               C PY     C H      P C     TPKC + C       ++  KH+  S+Y I+
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSIS 233

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
            + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  
Sbjct: 234 RNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTP 292

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 293 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 190/314 (60%), Gaps = 24/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L D +I  +N+     WKA RN    N  V   K L+GV P  K   L  P+  H+   K
Sbjct: 27  LSDEMINFINK-LNTTWKAGRNFD-KNTPVSYLKGLMGVHPDSKNYRL--PLFYHEDIPK 82

Query: 95  -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            LP+SFDAR  W  C++I  I DQ  CGSCWAFGA EA+SDR CIH    + +++S  DL
Sbjct: 83  DLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDL 142

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           L CC   CG GC+GGYP +AW ++   G+VT       + C PY+    C H      P 
Sbjct: 143 LTCCD-SCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CEHHTVGPLPN 200

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTP+CVR C K   + +   KHY+   Y +++D   I  EI+KNGPVE  FTVY 
Sbjct: 201 CTGIKPTPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEADFTVYA 260

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVY+  + D +GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG+
Sbjct: 261 DFVSYKSGVYQRHSDDALGGHAIRILGWGT-ENGVPYWLVANSWNEDWGDKGYFKILRGN 319

Query: 318 NECGIEEDVVAGLP 331
           +ECGIE+D+ AG+P
Sbjct: 320 DECGIEDDINAGIP 333


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 183/315 (58%), Gaps = 19/315 (6%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHD 90
           + +L    + E        WKA  N +F   +    +  +GV +  P  L + +P K   
Sbjct: 16  AELLNQQDMSEYINKLGTTWKAGVNKRFEGLSEVDIRRQMGVLQGGP--LDIKLPEKDIT 73

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 150
               +P  FDAR  WP C TI  I DQG CGSCWAFGAVE++SDRFCIHF  +  +S  D
Sbjct: 74  PLKDVPDMFDARMQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHFNQSAHISAED 133

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST------GCSHP 197
           L+ACC   CG GC+GGY  +AWRYF H G+VT       E C PY  ++      G   P
Sbjct: 134 LMACCE-TCGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP 192

Query: 198 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                  TP+C + C     + +   KH+  SAY + S  E I  EI  NGPVE +FTVY
Sbjct: 193 CASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY 252

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H +G ++GGHA++++GWGT ++G  YW++AN WN  WGA GYFKI RG
Sbjct: 253 ADFPTYKSGVYQHTSGAMLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGAMGYFKIIRG 311

Query: 317 SNECGIEEDVVAGLP 331
            ++CGIE  + AG+P
Sbjct: 312 KDDCGIESQITAGMP 326


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 149/321 (46%), Positives = 188/321 (58%), Gaps = 34/321 (10%)

Query: 35  LQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQF---KHLLGVK---PTPKGLLLGVPVK 87
           + + +I  +N   P A WKA  N  F      +    K L G K   P P      +PVK
Sbjct: 34  MSEEMINFLNMPGPGATWKAGNNFPFIRNLDDKLLYAKRLCGTKLNNPNP------LPVK 87

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 145
             +    LP +FDAR+ WP C T+  + DQG CGSCWAFGAVEA+SDR CI  +  +N  
Sbjct: 88  NIEPLRDLPTNFDARTQWPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAE 147

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 196
           +S  DLLACC   CG+GC GG+P  AWRY+   G+VT       + C PY     C H  
Sbjct: 148 ISAEDLLACCSS-CGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYM-IPACDHHV 205

Query: 197 -----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                P  +    TPKC +KC    N  +++ KHY  ++Y ++S  E IM EI  NGPVE
Sbjct: 206 VGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDS-VEKIMTEIMTNGPVE 264

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            +FTVYEDF  YKSGVY+H TG  +GGHAVK++GWG  D+G  YWI+AN WN  WG  G+
Sbjct: 265 AAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWG-EDNGTPYWIVANSWNPDWGNQGF 323

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           F I RG +ECGIE  +VAGLP
Sbjct: 324 FNILRGKDECGIESQIVAGLP 344


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 193/316 (61%), Gaps = 29/316 (9%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--K 94
           D +I+ +N      W+A RNP F +      + LLGV P      L  P +  D S    
Sbjct: 37  DKMIQYINY-LNTTWQAGRNPGFED--PAYVRGLLGVSPENHRYRL--PERRLDLSSLGP 91

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG----MNLSLSVND 150
           LP++FD+R  WP+C+TI  I DQG CGSCWAFGAVEA+SDR CIH        + LS +D
Sbjct: 92  LPENFDSRENWPECTTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPSGGPKRVHLSADD 151

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------- 196
           LL+CC   CG+GC+GG+P SAW ++V  G+VT       + C PY     C H       
Sbjct: 152 LLSCC-RTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPY-PIKACDHHVNGTLG 209

Query: 197 PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P  +   PTP+CV  C K   + + + KHY  S+Y + S+ + I AEI  NGPVE  FTV
Sbjct: 210 PCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTV 269

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF HYKSGVY+  T + +GGHA++L+GWG  ++G  YW+ AN WN  WG  G+FKI R
Sbjct: 270 YSDFVHYKSGVYQRHTDEALGGHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILR 328

Query: 316 GSNECGIEEDVVAGLP 331
           GS+ECGIE+DVVAGLP
Sbjct: 329 GSDECGIEDDVVAGLP 344


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 147/316 (46%), Positives = 189/316 (59%), Gaps = 29/316 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHD-K 91
           +L   +I  +N+     W A +N  F N      K L G     PK     +P   H+ +
Sbjct: 22  LLSSEMIDFINK-VNTTWTAGQN--FHNVDSSYVKGLCGTFLKGPK-----LPQVLHNTE 73

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSL--SVN 149
            ++LP SFDAR  WP C TI +I DQG CGSCWAFGA EA+SDR CIH G  +SL  S  
Sbjct: 74  GIRLPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAE 133

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------ 196
           DLL+CC   CG GC GGYP SAW ++   G+VT         C PY  +  C H      
Sbjct: 134 DLLSCCD-ECGMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAP-CEHHVNGTR 191

Query: 197 PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P C+    TPKC +KC+      +   KH+   +Y + S  E IM E+YKNGPVE +FTV
Sbjct: 192 PPCQGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTV 251

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF  YK+GVY+H+TG+V+GGHA+K++GWG  + G  YW+ AN WN  WG  G+FKIKR
Sbjct: 252 YADFLLYKTGVYQHVTGEVLGGHAIKILGWG-EESGTPYWLAANSWNGDWGDKGFFKIKR 310

Query: 316 GSNECGIEEDVVAGLP 331
           G++ECGIE ++VAG P
Sbjct: 311 GNDECGIESEMVAGTP 326


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 144/346 (41%), Positives = 198/346 (57%), Gaps = 37/346 (10%)

Query: 14  LTCFATF-AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 72
           +T FA F  E VV   +   +I   +++     N            F N  +   K L G
Sbjct: 3   MTFFAEFHVEAVVIATQWKKNISTKTLVVRAGHN------------FHNVDMSYLKKLCG 50

Query: 73  VK-PTPKGLLLGVPVK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
                PK     +P +      ++LP SFD+R  WP C TI+ I DQG CGSCWAFGAVE
Sbjct: 51  TYLHGPK-----LPERFAFADDVELPDSFDSRKQWPSCPTINEIRDQGSCGSCWAFGAVE 105

Query: 131 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A+SDR C+H    +N+ +S  DLL+CCGF CG GC+GGYP  AW+Y+   G+V+      
Sbjct: 106 AISDRVCVHTNGKVNVEISAEDLLSCCGFECGMGCNGGYPSGAWKYWTEKGLVSGGLYDS 165

Query: 185 ---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINS 234
              C PY        + G   P       TP+CV+KC       ++  KHY +++Y I  
Sbjct: 166 HVGCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPR 225

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
             ++IMAEIYKNGPVE +F VY DF  YKSGVY+H++G+ +GGHA++++GWG  D+G  Y
Sbjct: 226 SEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWGV-DNGTPY 284

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           W+ AN WN  WG DG+F+I RG + CGIE ++VAG+P +    K +
Sbjct: 285 WLAANSWNTDWGEDGFFRILRGQDHCGIESEIVAGIPKTSEYWKML 330


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 189/315 (60%), Gaps = 28/315 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDEMVNYVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  +D   YW++ N WN  WG  G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRG 315

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE ++VAG+P
Sbjct: 316 QDHCGIESEIVAGMP 330


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 203/340 (59%), Gaps = 32/340 (9%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS-NYTVGQFK 68
           PIL + C A       + L +  H L    I+++NE  ++ WKA   P F+ N  +   +
Sbjct: 5   PILTIICTA-------ASLSVAVHPLSKEFIQQINEK-QSTWKAG--PNFAENVPMSYIR 54

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L+GV P  K  +  V     D ++++P  FDAR  WP C TI  I DQG CGSCWAFGA
Sbjct: 55  RLMGVPPNSKYHMPSVKRHLLD-AMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGA 113

Query: 129 VEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA+SDR CIH    +N+ LS +DL++CC + CG GC+GG+P +AW Y+V+ G+V+    
Sbjct: 114 VEAMSDRVCIHSKGAVNVRLSADDLVSCC-YSCGMGCNGGFPGAAWHYWVNKGIVSGGSF 172

Query: 183 ---EECDPYFDSTGCSH--PGCEPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYR 231
              + C PY +   C H   G  P        TP C ++C K  N  ++  K++   AY 
Sbjct: 173 GSNQGCRPY-EIAPCEHHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNFGKEAYS 231

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
           I+S+ + I  EI  NGPVE +F VYED   YK GVY+H+ G+ +GGHA++++GWGT + G
Sbjct: 232 ISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGT-EKG 290

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++AN WN  WG +G FKI RG + CGIE  +VAG+P
Sbjct: 291 TPYWLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAGIP 330


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 189/315 (60%), Gaps = 28/315 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDEMVNYVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  +D   YW++ N WN  WG  G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRG 315

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE ++VAG+P
Sbjct: 316 QDHCGIESEIVAGMP 330


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 149/349 (42%), Positives = 201/349 (57%), Gaps = 33/349 (9%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           + CL+C    A G  S+      +L D ++  VN+     WKA  N  F N      + L
Sbjct: 5   LACLSCLVVLA-GAQSRPPF--QLLSDELVNYVNKR-NTTWKAGHN--FHNVDPSYLRRL 58

Query: 71  LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            G         LG P         +++ LP++FDAR  WP C TI  I DQG CGSCWAF
Sbjct: 59  CGT-------FLGGPKLPQRVWFAENMVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAF 111

Query: 127 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GAVEA+SDR CI     +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+  
Sbjct: 112 GAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGG 171

Query: 185 -------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYR 231
                  C PY           S P C     TPKC + C       ++  KHY  S+Y 
Sbjct: 172 LYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYS 231

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
           ++S  ++IMAEIYKNGPVE +FTVY DF  YKSGVY+H+TG++MGGHAV+++GWG  ++G
Sbjct: 232 VSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-ENG 290

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
             YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 291 TPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 192/342 (56%), Gaps = 33/342 (9%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L+C         ++  L    L D ++  +N+     W A  N  F N  +   K L G 
Sbjct: 8   LSCLVLLTS---ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT 61

Query: 74  KPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                   LG P      +      LPK FDAR  WP C TI  I DQG CGSCWAFGAV
Sbjct: 62  -------FLGGPKLPQRAAFAADMILPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 184
           EA+SDR CI     +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+     
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYD 174

Query: 185 ----CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINS 234
               C PY           S P C     TPKC + C       ++  KH+  S+Y I+ 
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISR 234

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  Y
Sbjct: 235 NEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPY 293

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           W++ N WN  WG +G+FKI RG + CGIE ++VAG+P + + 
Sbjct: 294 WLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTPHF 335


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 190/320 (59%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TV+ DF  YKSGVYKH  GD+MGGHA++++ WG  ++G  YW+ AN WN  WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 191/316 (60%), Gaps = 23/316 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-K 91
           H L D  I  +N   K+ W A RN    + ++     L+GV P  K  +   PV TH  +
Sbjct: 18  HPLSDEFINSINA-AKSTWTAGRNFA-QDKSMDYIIKLMGVLPDHKNYM--PPVLTHKLE 73

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +L++P  FDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH     N   S +
Sbjct: 74  ALEIPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSD 133

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHP 197
           DL++CC + CG GC+GGYP +AW Y+V  G+V+       + C PY        T  S P
Sbjct: 134 DLVSCC-WTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRP 192

Query: 198 GCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
            C+ +   TPKC + C    ++ + N  H+   AY I+SD + I AEI +NGPVE +F+V
Sbjct: 193 ACDASEGNTPKCAKSCESNYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSV 252

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF +YK+GVY+HI G  +GGHA+++ GWG  ++   YW++AN WN  WG  G FKI R
Sbjct: 253 YADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENN-TPYWLIANSWNTDWGDSGTFKILR 311

Query: 316 GSNECGIEEDVVAGLP 331
           GS+ CGIE  +VAGLP
Sbjct: 312 GSDHCGIESGIVAGLP 327


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/333 (45%), Positives = 192/333 (57%), Gaps = 28/333 (8%)

Query: 15  TCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK 74
            C         ++ +L +H   D +I  +N   ++ W A  N  F N      K L G  
Sbjct: 4   VCVFVLLSVTCARPQLHTH---DEMISFINA-ARSTWTAGVN--FDNVPKEYLKSLCGT- 56

Query: 75  PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 134
              KG  L   VK H  ++KLP SFD R  WP C T+S+I DQG CGSCWAFGAVE++SD
Sbjct: 57  -VLKGPRLPHTVK-HSTNVKLPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISD 114

Query: 135 RFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------C 185
           R CIH     S  +S  DLL+CC   CG GC GG+P  AW Y+   G+VT         C
Sbjct: 115 RICIHSKGKQSPEISAEDLLSCCD-QCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGC 173

Query: 186 DPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPED 238
            PY     C H      P C     TPKC   C+ K  + ++  KH+    Y + SD + 
Sbjct: 174 RPY-SIAPCEHHVNGTRPPCSGEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQQQ 232

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           IM E+Y NGPVE +FTVYEDF  YKSGVY+H+TG  +GGHAVK++GWG  ++G  +W++A
Sbjct: 233 IMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWG-EENGTPFWLVA 291

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N WN  WG +GYFKI RG +ECGIE ++VAGLP
Sbjct: 292 NSWNSDWGDNGYFKILRGHDECGIESEMVAGLP 324


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 192/317 (60%), Gaps = 24/317 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSL 93
           L   I+  VN      WKA    ++S  +V + K+L G    P G  L  P+  H  +++
Sbjct: 65  LSQEIVDYVNTKADTTWKAEVTSKWS--SVAEVKNLCGSLKDPNGSRL--PIMRHKLEAV 120

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP  FDAR  W  C TI  + DQG CGSCWAFGAVEA+SDR CI    N+   +S  DL
Sbjct: 121 NLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSEDL 180

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHP------G 198
           L+CC   CG GC+GG+P +AW YF   G+V+       + C PY  +  C H        
Sbjct: 181 LSCCSS-CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAP-CEHHVNGTRLP 238

Query: 199 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTPKC R C K  ++ + + K++  +AY +++D + IM EI  NGPVE +FTVY 
Sbjct: 239 CSGEGPTPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVYA 298

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVY+H++G  +GGHA++++GWG  +DG  YW++AN WN  WG +G+FKI RG 
Sbjct: 299 DFPTYKSGVYQHVSGGELGGHAIRVLGWGV-EDGTPYWLVANSWNSDWGDNGFFKILRGQ 357

Query: 318 NECGIEEDVVAGLPSSK 334
           NECGIE ++VAGLP  +
Sbjct: 358 NECGIEGEIVAGLPKKQ 374


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 190/333 (57%), Gaps = 22/333 (6%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
           T  E    + K     L   +I  +N      WKA    +F   TV   + +LG  P P 
Sbjct: 20  TLNENDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77

Query: 79  GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           G  L      ++ +L +LPKSFDAR  W  C +IS I DQ  CGSCWAFGAVEA+SDR C
Sbjct: 78  GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137

Query: 138 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 188
           I         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196

Query: 189 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
            +   C H      P C+    TP C R C    N  + N K Y    YR+ S+ E IM 
Sbjct: 197 -EFPPCEHNTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN W
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSW 314

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           N  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 315 NTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 347


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 201/340 (59%), Gaps = 20/340 (5%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  +LC+    T  +  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 6   MLTSVLCIASLITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 65

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +   +  P       P   H++ ++++P +FD+R  WP C +I+ I DQ  CGSCWA
Sbjct: 66  IQ-MGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWA 124

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG    AW ++V  G+VT 
Sbjct: 125 FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCE-SCGLGCEGGILGPAWDFWVKEGIVTG 183

Query: 184 E-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C+PY        T   +P C    Y TP+C + C KK +  +   KH   S+
Sbjct: 184 SSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSS 243

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +
Sbjct: 244 YNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-E 302

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +   YW++AN WN  WG +GYF+I RG +EC IE +V+AG
Sbjct: 303 NKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAG 342


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 130/260 (50%), Positives = 171/260 (65%), Gaps = 18/260 (6%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           + + LP+SFDAR  W  C TI++I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S 
Sbjct: 3   EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 62

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 196
            DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C H     
Sbjct: 63  EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGA 121

Query: 197 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FT
Sbjct: 122 RPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFT 181

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           V+ DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI 
Sbjct: 182 VFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKIL 240

Query: 315 RGSNECGIEEDVVAGLPSSK 334
           RG N CGIE ++VAG+P ++
Sbjct: 241 RGENHCGIESEIVAGIPRTQ 260


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 186/318 (58%), Gaps = 32/318 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHD 90
           +L   +I+ +N      WKA +N  F N  +   + L G    KPT       +P   H 
Sbjct: 24  LLSSEMIQYINR-LNTTWKAGQN--FYNVDLSYVQGLCGTLQNKPT-------LPELEHP 73

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
             +KLP +FDAR  WP C TI  I DQG CGSCWAFGA EA+SDR CIH    + + +S 
Sbjct: 74  AGVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEISA 133

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----- 196
            DLL+CC   CG GC GGYP +AW Y+   G+VT       + C PY     C H     
Sbjct: 134 EDLLSCCE-ECGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCEHHVNGT 191

Query: 197 -PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C+    TPKC  KC+      +   K++    Y + S  E IM E+YKNGPVE +F+
Sbjct: 192 RPPCQGEGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFS 251

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VYEDF  YKSGVY+H+TGD++GGHA+K++GWG  ++   YW+ AN WN  WG  G+FKI 
Sbjct: 252 VYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENN-TPYWLAANSWNTDWGNQGFFKIL 310

Query: 315 RGSNECGIEEDVVAGLPS 332
           RG +ECGIE +VVAG+P 
Sbjct: 311 RGGDECGIESEVVAGIPQ 328


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 190/333 (57%), Gaps = 22/333 (6%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
           T  E    + K     L   +I  +N      WKA    +F   TV   + +LG  P P 
Sbjct: 20  TLNENDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77

Query: 79  GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           G  L      ++ +L +LPKSFDAR  W  C +IS I DQ  CGSCWAFGAVEA+SDR C
Sbjct: 78  GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137

Query: 138 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 188
           I         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196

Query: 189 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
            +   C H      P C+    TP C R C    N  + N K Y    YR+ S+ E IM 
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN W
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSW 314

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           N  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 315 NTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 347


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 189/320 (59%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TP+C + C    +  ++  KH+  ++Y +++  ++IMAEIYKN PVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TV+ DF  YKSGVYKH  GD+MGGHA++++GWG   +G  YW+ AN WN  WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVG-NGVPYWLAANSWNLDWGDNGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 194/320 (60%), Gaps = 21/320 (6%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           +  ++H L D  IK + ++  + W+A RN    + ++  F+ L+GV P  K  + G    
Sbjct: 15  VSANNHFLSDKFIKML-QSEDSTWEAGRNFN-RHLSIRYFRRLMGVHPDSKYHMPGYEAH 72

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 145
              ++  +PK FD+R+AWP C TI  I DQG CGSCWAFGAVE +SDR CIH     N  
Sbjct: 73  KIPENFDMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFH 132

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH-- 196
            S  +L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C H  
Sbjct: 133 YSSENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHV 190

Query: 197 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
               P C     TPKCV++C     + + +  H+   AY I  D + I  EI KNGPVE 
Sbjct: 191 PGPRPKCSEGGGTPKCVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEG 250

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +G F
Sbjct: 251 AFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWG-EENGTPYWLCANSWNTDWGDNGLF 309

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI RGS+ CGIE ++ AGLP
Sbjct: 310 KILRGSDHCGIESEISAGLP 329


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/317 (46%), Positives = 189/317 (59%), Gaps = 22/317 (6%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 93
           L   +I  +N      WKAA + +F   +V   + +LG  P P G  L      +  SL 
Sbjct: 33  LSSELIHFINHEANTTWKAAPSSRFK--SVSDIRRMLGALPDPNGGYLPTLCTGYTPSLD 90

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDL 151
           +LPK FDAR  WP C +IS I DQ  CGSCWAFGAVEA+SDR CI   G++   LS  +L
Sbjct: 91  ELPKEFDARKHWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENL 150

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PG 198
           +ACC   CG GC+GG+P SAW Y+   G+VT +       C PY +   C H      P 
Sbjct: 151 VACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPY-EFPPCEHHVVGPRPS 208

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TPKC   C    N  +   K Y  + YR++S+ E IM E+  +GPVEV F VY 
Sbjct: 209 CGGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYA 268

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF +YKSGVY+H++G ++GGHAV+L+GWG  ++G  YW++AN WN  WG +GYFKI RG 
Sbjct: 269 DFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWGDNGYFKIIRGR 327

Query: 318 NECGIEEDVVAGLPSSK 334
           NECGIE DV AG+P  K
Sbjct: 328 NECGIESDVNAGIPKLK 344


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 190/320 (59%), Gaps = 30/320 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RN  F N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     T +C + C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TV+ DF  YKSGVYKH  GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSS 333
            RG N CGIE ++VAG+P +
Sbjct: 313 LRGENHCGIESEIVAGIPRT 332


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 129/260 (49%), Positives = 170/260 (65%), Gaps = 16/260 (6%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           + + LP+SFDAR  W  C TI++I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S 
Sbjct: 8   EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 67

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 196
            DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           S 
Sbjct: 68  EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 127

Query: 197 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P C     TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV
Sbjct: 128 PPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTV 187

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           + DF  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI R
Sbjct: 188 FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILR 246

Query: 316 GSNECGIEEDVVAGLPSSKN 335
           G N CGIE ++VAG+P ++ 
Sbjct: 247 GENHCGIESEIVAGIPRTQQ 266


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 190/333 (57%), Gaps = 22/333 (6%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
           T  E    + K     L   +I  +N      WKA    +F   TV   + +LG  P P 
Sbjct: 20  TLNENDARRHKHMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77

Query: 79  GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           G  L      ++ +L +LPKSFDAR  W  C +IS I DQ  CGSCWAFGAVEA+SDR C
Sbjct: 78  GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137

Query: 138 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 188
           I         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196

Query: 189 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
            +   C H      P C+    TP C R C    N  + N K Y    YR+ S+ E IM 
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN W
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSW 314

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           N  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 315 NTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 347


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 147/318 (46%), Positives = 191/318 (60%), Gaps = 22/318 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
           H L D +I  +N+     WKA RN   S  ++   + L+GV P  K   L   V  HD+ 
Sbjct: 26  HPLSDQMINFINK-INTTWKAGRNFDKS-ISMSYIRGLMGVHPKSKEYRLAEFV--HDEI 81

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
              LP+SFDAR  WP C++I  I DQ  CGSCWAFGA EA+SDR CIH    + +++S  
Sbjct: 82  PDDLPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAE 141

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHP 197
           DLL CC   CG GC+GG P +AW Y+   G+VT       + C PY        T  S P
Sbjct: 142 DLLDCCDS-CGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLP 200

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C    PTPKCV  C K   + +++ KH+    Y I+SD + I  EI+KNGPVE  F V 
Sbjct: 201 NCTGTVPTPKCVHLCRKGYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVL 260

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H + DV+GGHA++++GWGT ++G  YW+ AN WN  WG  GYFKI RG
Sbjct: 261 ADFLSYKSGVYQHHSDDVIGGHAIRILGWGT-ENGTPYWLAANSWNEDWGDHGYFKILRG 319

Query: 317 SNECGIEEDVVAGLPSSK 334
            +ECGIEED+ AG+P ++
Sbjct: 320 KDECGIEEDINAGIPKNR 337


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 148/352 (42%), Positives = 202/352 (57%), Gaps = 31/352 (8%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M  +L   C         S+L      L D ++  VN+     W+A  N  F +  +   
Sbjct: 1   MWQLLATLCCLVVLTSAQSRLYFKP--LSDELVNHVNK-LNTTWQAGHN--FYDVDMSYV 55

Query: 68  KHLLGVKPTPKGLLLG--VPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           K L G       LL G  +P + H  + + LP++FDAR  WP C TI  I DQG CGSCW
Sbjct: 56  KRLCGT------LLNGPKLPQRVHLAEEMDLPENFDARENWPNCPTIKEIRDQGSCGSCW 109

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGAVEA+SDR CIH    +N+ +S  DLL CC   CGDGC+GG+P  AW ++   G+V+
Sbjct: 110 AFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWTKKGLVS 169

Query: 183 EE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSIS 228
                    C PY           S P C+     TPKC + C    +  ++  KHY  S
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYS 229

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y + S  ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG+ +GGHA++++GWG  
Sbjct: 230 SYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGV- 288

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P +    K+I
Sbjct: 289 ENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTDQYWKKI 340


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/350 (41%), Positives = 197/350 (56%), Gaps = 36/350 (10%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILC+      A  +     L S ++    I ++N      WKA  N  F N  +   K L
Sbjct: 7   ILCVLVAFANARSIPYYPPLSSDLVNH--INKLNTT----WKAGHN--FHNTDMSYVKKL 58

Query: 71  LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            G         LG P           + LP +FD+R  WP C TIS I DQG CGSCWAF
Sbjct: 59  CGT-------FLGGPKLPERVDFAADIDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAF 111

Query: 127 GAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GAVEA+SDR C+H    +S+ V+  DLL+CCGF CG GC+GGYP  AWRY+   G+V+  
Sbjct: 112 GAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGG 171

Query: 185 -------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 230
                  C PY          G   P       TP+C R C    +  ++  KHY I++Y
Sbjct: 172 LYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSY 231

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  ++
Sbjct: 232 GVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-EN 290

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P ++     +
Sbjct: 291 GTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGVPRTEQYWTRV 340


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 138/317 (43%), Positives = 190/317 (59%), Gaps = 28/317 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CC   CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query: 317 SNECGIEEDVVAGLPSS 333
            + CGIE ++VAG+P +
Sbjct: 316 QDHCGIESEIVAGMPCT 332


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 130/256 (50%), Positives = 169/256 (66%), Gaps = 18/256 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP+SFDAR  W  C TI++I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL
Sbjct: 1   LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGC 199
            CCG  CGDGC+GGYP  AW ++   G+V+         C PY     C H      P C
Sbjct: 61  TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPC 119

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TPKC + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ D
Sbjct: 120 TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 179

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N
Sbjct: 180 FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGEN 238

Query: 319 ECGIEEDVVAGLPSSK 334
            CGIE ++VAG+P ++
Sbjct: 239 HCGIESEIVAGIPRTQ 254


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 140/327 (42%), Positives = 191/327 (58%), Gaps = 30/327 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
            +L D ++  VN+     WKA  N  F +      + L G        +LG P      S
Sbjct: 24  QLLSDELVDYVNKR-NTTWKAGHN--FYHVEPSYLRRLCGT-------ILGGPKLPQRVS 73

Query: 93  ----LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 146
               + LP++FDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CI  +  +N+ +
Sbjct: 74  FAEDMVLPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           
Sbjct: 134 SAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TPKC + C       ++  KHY  ++Y +++  ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           +V+ DF  YKSGVY+H+TG++MGGHAV+++GWG  +D   YW++ N WN  WG  G+FKI
Sbjct: 254 SVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVEND-TPYWLVGNSWNTDWGDHGFFKI 312

Query: 314 KRGSNECGIEEDVVAGLPSSKNLVKEI 340
            RG + CGIE +VVAG+P ++   K I
Sbjct: 313 LRGRDHCGIESEVVAGIPCTEQYWKRI 339


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 149/352 (42%), Positives = 200/352 (56%), Gaps = 27/352 (7%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDS-HILQDSIIKEVNENPKAGWKAARNPQF 59
           M+   LI+   + L     F      + K +  H     II++VN +  + WKA  N ++
Sbjct: 1   MKHQALIITAGILLATLTGFVAFEAFRYKQEKYHDKLKQIIQKVNSS-NSTWKAGENTKW 59

Query: 60  SNYTVGQFKHLLGVKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAW-PQCSTISRILDQ 117
            N  +   K  +GVK    G   G+ ++T   ++  LP+ FDAR  W  +CS++  + DQ
Sbjct: 60  INSDIAGVKAHMGVK---LGQESGIKLETVSAQANGLPEEFDARVQWGDKCSSLWEVRDQ 116

Query: 118 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
             CGSCWAFGA E+LSDR CIH G ++ LS  +LL CC   CGDGCDGG+P +A  Y+V+
Sbjct: 117 STCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYYVN 175

Query: 178 HGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL---WR 220
            G+VT +       C  Y  +  C+H       P C    PTP C+  C   +     + 
Sbjct: 176 TGLVTGDLYGNNSWCQAYTFAP-CAHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPYS 234

Query: 221 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 280
              H    AY I  D + IMAEIYKNGP+EV+ TVYEDF  YK+GVY+H+TGD +GGHAV
Sbjct: 235 KDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAV 294

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           K++GWG  ++G  YW + N WN SWG  G FKI RG NECGIE   V  LP+
Sbjct: 295 KMVGWGV-ENGTPYWTIVNSWNESWGDKGTFKILRGKNECGIESSCVTALPA 345


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 191/317 (60%), Gaps = 23/317 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 92
           L D +I  +N+ P   WKA R  +F+  ++   K ++GV      +  L    +  +D +
Sbjct: 32  LSDELITFINKQPNIEWKADRTTRFT--SIHHAKSMMGVLLNSVDQHKLHHPIIHHNDIN 89

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +KLPK FD+R  W  CS+I  I DQ  CGSCWAFGAVE++SDR CIH    +++ LS  +
Sbjct: 90  IKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVN 149

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 197
           LL+CC   CG GC+GG P  AW Y+   G+VT         C PY        ST  +H 
Sbjct: 150 LLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHS 208

Query: 198 GCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
            CE   Y TP+C + C     + + N K+Y  S+Y + SD   IM EI  NGPVE +F V
Sbjct: 209 SCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYV 268

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIK 314
           ++DF +YK+GVYK++TG ++GGHA+++IGWG S  +   YW+ AN WN+ WG  GYFKI 
Sbjct: 269 FDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKIL 328

Query: 315 RGSNECGIEEDVVAGLP 331
           RGSNECGIE  V AGLP
Sbjct: 329 RGSNECGIESMVTAGLP 345


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 140/339 (41%), Positives = 200/339 (58%), Gaps = 20/339 (5%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  +LC+    T  +  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 1   MLTSVLCIASLITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 60

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            + +   +  P       P   H++ ++++P +FD+R  WP C +I+ I DQ  CGSCWA
Sbjct: 61  IQ-MGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWA 119

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG    AW ++V  G+VT 
Sbjct: 120 FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCE-SCGLGCEGGILGPAWDFWVKEGIVTG 178

Query: 184 E-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C+PY        T   +P C    Y TP+C + C KK +  +   KH   S+
Sbjct: 179 SSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSS 238

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  +
Sbjct: 239 YNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-E 297

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           +   YW++AN WN  WG +GYF+I RG +EC IE +V+A
Sbjct: 298 NKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 149/348 (42%), Positives = 198/348 (56%), Gaps = 33/348 (9%)

Query: 4   TKLIMDPIL--CLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 61
           T  I+  +L   LT F T+      + K    + Q  + +EVN N    WKA  N ++ N
Sbjct: 5   TIFIVAALLSAALTGFYTYEALKHKEFKYSDRLKQ--LAEEVN-NANTTWKAGENIKWIN 61

Query: 62  YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW-PQCSTISRILDQGHC 120
             +   K  LG      G  L  PV    K+  LP +FDAR  W  +C+++  + DQ +C
Sbjct: 62  ADIAGVKAHLGALEGDNGENL--PVSNAVKA-DLPTAFDARQQWGDKCTSLWEVRDQSNC 118

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           GSCWAFGAVE+L+DR CIH G ++ LS  ++L CC   CG GC+GGYP SA  Y+V  G+
Sbjct: 119 GSCWAFGAVESLTDRHCIHLGQDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYVKTGL 177

Query: 181 VTEECDPYFDSTG---------CSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSK 223
           VT +    +++TG         C+H       P C    PTPKC + C     Q +  + 
Sbjct: 178 VTGD---LYNTTGWCQAYSFAPCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY--TV 232

Query: 224 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
           H    AY +    E IM EI  NGPVE +FTVYEDF +YKSGVYKH+TG  +GGHA+K++
Sbjct: 233 HKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIV 292

Query: 284 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           GWG  ++   YWI+ N WN++WG +G FKI RG NECGIE  VV  LP
Sbjct: 293 GWGVENN-TPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALP 339


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 196/345 (56%), Gaps = 28/345 (8%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           M  +L +            KLK + +   L D  I  +N + K+ WKA RN    N+ +G
Sbjct: 1   MKIVLSIIFAVVLVTSQAKKLKSNKYFNPLSDEFINHIN-SMKSTWKAGRNFG-KNFPMG 58

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGHCGS 122
               ++GV P     L   P+K   +   +  +P++FDAR  WP C TI  I DQG CGS
Sbjct: 59  ALTQMMGVHPDSN--LYMPPLKNVSQMYSNQAIPEAFDAREQWPDCPTIQEIRDQGSCGS 116

Query: 123 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           CWAFGAVEA+SDR CIH    +N  LS  +L++CC + CG GC+GG+P +AW ++V  G+
Sbjct: 117 CWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCC-YTCGFGCNGGFPGAAWSHWVKKGI 175

Query: 181 VT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           VT       + C PY     C H      P C     TPKC++ C     + +    HY 
Sbjct: 176 VTGGNFNSSQGCQPYIIPA-CEHHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYTQDLHYG 234

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
            S+Y ++   EDI  EI  NGPVE + TVYEDF  YKSGVY+H+ G  +GGHA++++GWG
Sbjct: 235 ASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWG 294

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             ++G  YW++AN WN  WG +GY K+ RG + CGIE  + AGLP
Sbjct: 295 V-EEGVPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLP 338


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/301 (47%), Positives = 186/301 (61%), Gaps = 28/301 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQC 108
           W A +N  F N      K L G +   PK     +P   HD + +KLP SFD R  WP C
Sbjct: 40  WTAGQN--FHNKDSSFVKGLCGTILKGPK-----LPELAHDVEGIKLPDSFDPREQWPNC 92

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 166
            T+ +I DQG+CGSCWAFGA EA+SDR CI  G  ++L +S  DLL CC   CG GC GG
Sbjct: 93  PTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLLTCCD-ECGMGCFGG 151

Query: 167 YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCV 213
           +P +AW ++ + G+VT         C PY  +  C H      P C+    TPKCV +C 
Sbjct: 152 FPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCVTQCN 210

Query: 214 KKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
               L +   KH+   +Y I S  E IM E+YKNGPVE +F+VY DF  YK+GVY+H+TG
Sbjct: 211 NGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTG 270

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           D++GGHAVK++GWG  ++G  YW++AN WN  WG  G+FKIKRG++ECGIE ++VAG P 
Sbjct: 271 DMLGGHAVKILGWG-EENGTPYWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAGAPL 329

Query: 333 S 333
           S
Sbjct: 330 S 330


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 200/343 (58%), Gaps = 22/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVCIVSFFALLKAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+V
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCKGGFPGQAWDYWVKRGIV 177

Query: 182 T---EE----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T   EE    C PY        T   +P C    Y TP+C + C K  +  +   KHY  
Sbjct: 178 TGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGD 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
             Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG 
Sbjct: 238 QRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 298 -EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 144/344 (41%), Positives = 200/344 (58%), Gaps = 23/344 (6%)

Query: 7   IMDPILCL-TCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+ +  +     +++  ++    L D +I  +N++P AGW A+R+ +F   +V 
Sbjct: 1   MMNTVLCIVSLMSILTAHILTDNEVQFEPLSDEMIAYINQHPDAGWTASRSDRFK--SVE 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG     + L       V   + SL++P SFD+R  W QC +IS I DQ  CG C
Sbjct: 59  DARILLGAMSEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+V
Sbjct: 119 WAFAAVEAMSDRICIQSKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEEGIV 177

Query: 182 TEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T         C PY        T   +P C E  Y TPKC +KC K  +  ++  K+Y  
Sbjct: 178 TGSSKENHTGCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGK 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
            +Y + S  + I  EI  +GPVE +FTVY DF +YKSG+YKH+ G V+GGHAV++IGWG 
Sbjct: 238 LSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            +    YW++AN WN  WG  GYF+I RG + CGIE  V AGLP
Sbjct: 298 -EKKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGLP 340


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 146/335 (43%), Positives = 198/335 (59%), Gaps = 25/335 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLG 72
           L  F+    G+ S   + +  L D  I  +N + +  W+A RN  F+  T  ++ K L G
Sbjct: 4   LIPFSLLICGIFSA-SIPTDPLSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYLKSLAG 59

Query: 73  VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
           V          +P +     + LPK FDAR  WP C++I+ I DQG CGSCWAFGAVEA+
Sbjct: 60  VHKDANNAFT-LPKRQVSLDVTLPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGAVEAM 118

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 183
           SDR CIH    + + LS  +L++CC   CG GCDGGYP SAW Y+ + G+V+       +
Sbjct: 119 SDRICIHSNGKLQVHLSAENLVSCCDS-CGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQ 177

Query: 184 ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 236
            C PY  +  C H      P C     TP C  +C K++ + +    +Y  SAY +  + 
Sbjct: 178 GCQPYSIAP-CEHHVPGPRPACSGEGSTPDCRNQCDKRSGISYDKDLYYGESAYSLEDEA 236

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           + I AEI KNGPVE +FTVYED  +YK GVY+H+ G V+GGHA+K++GWG  +D   YW+
Sbjct: 237 KQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVEND-TPYWL 295

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +AN WN  WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 296 VANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLP 330


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 191/328 (58%), Gaps = 46/328 (14%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 88
           H L D +I  +N+     W+A RNP   N  +   K L    LG    P  +  G     
Sbjct: 24  HPLSDDLINYINKQ-NTTWQAGRNPY--NVDISYLKKLCGTVLGGPKLPGRVAFG----- 75

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP++FDAR  W  C TI +I DQG CGSCWAFGAVEA+SDR CIH    +N+ +
Sbjct: 76  --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
           S  DLL CCG  CGDGC+GGYP  AW ++   G+V+     Y+DS    H GC P Y  P
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP-YTIP 185

Query: 207 KC----------------VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMAEIYK 245
            C                 R+C K  +      ++  KH+  ++Y +++  + IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYK 245

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPVE +FTV+ DF  YKSGVYKH  GD+MGGHA++++ WG  ++G  YW  AN WN  W
Sbjct: 246 NGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWAAANSWNLDW 304

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G +G+FKI RG N CGIE ++VAG+P +
Sbjct: 305 GDNGFFKILRGENHCGIESEIVAGIPRT 332


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 148/303 (48%), Positives = 182/303 (60%), Gaps = 29/303 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQC 108
           WKA  N  + N        LLGV+P      L  P +T D S    LP++FDAR  WP C
Sbjct: 76  WKAGHNSGYDNPE--DVIPLLGVRPENSRYRL--PERTLDVSALRVLPENFDAREHWPDC 131

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF-----GMNLSLSVNDLLACCGFLCGDGC 163
            TI  I DQG CGSCWAFGAVEA+SDR CIH       +   L+ +D+L+CC   CG GC
Sbjct: 132 PTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDVLSCC-TECGAGC 190

Query: 164 DGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCV 209
           +GG+P SAW Y+VH G+VT       E C PY     C H       P  +   PTP+CV
Sbjct: 191 NGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKTIPPTPRCV 249

Query: 210 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
           R C K   + + + KHY   AY + +  + I AEI  NGPVE  FTVYEDF HYKSGVY+
Sbjct: 250 RMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQ 309

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
             T   +GGHA++L+GWG  ++G  YW+ AN WN  WG  G+FKI RGS+ECGIE D+VA
Sbjct: 310 RHTDSALGGHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIESDIVA 368

Query: 329 GLP 331
           GLP
Sbjct: 369 GLP 371


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/338 (42%), Positives = 195/338 (57%), Gaps = 28/338 (8%)

Query: 11  ILCLTCFATF-AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           ++ +T FA F A+G       +   L   +I  VN      WKA  N  F+   V   K+
Sbjct: 5   LIVITLFAVFSAQGAYFP---NHQPLSQDLIDYVNL-VSTSWKAGTN--FAGLPVSYVKY 58

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           L G    P    L  P+  H+ +  LPKSFD+R  W  C +I  I DQG CGSCW+FGAV
Sbjct: 59  LCGALEDPNHFQL--PIHVHEDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGAV 116

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 182
           E+++DR CIH    + + +S  DL+ CC   CG GC+GG+   AW Y+V++G+VT     
Sbjct: 117 ESITDRICIHSNGKVKVHISAEDLMTCCT-SCGMGCNGGFLPQAWHYWVNNGIVTGGQYH 175

Query: 183 --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 233
             + C PY +   C H        C    PTPKC +KC    N+ +   KH+   +Y I 
Sbjct: 176 SHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPGYNKTFNQDKHFGKKSYSIT 234

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
           ++ + I  EI  NGPVE +FTVY DF  YKSGVY+H TG  +GGHAVK++GWGT ++   
Sbjct: 235 NNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENN-TP 293

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           YW++AN WN +WG  GYFKI RG +ECGIE  +VAG+P
Sbjct: 294 YWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMP 331


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 192/320 (60%), Gaps = 21/320 (6%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           +   SH L D  I+++ ++  + W+A RN    + ++  F+ L+GV P  K  +      
Sbjct: 14  VNASSHFLSDKFIRQL-QSEDSTWEAGRNFN-KHLSIKYFRRLMGVHPDSKFHMPKYEAH 71

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 145
              ++ ++PK FD+R+AWP C TI  I DQG CGSCWAFGAVE +SDR CIH     N  
Sbjct: 72  QIPENFEMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFH 131

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH-- 196
            S  +L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C H  
Sbjct: 132 YSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHV 189

Query: 197 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
               P C     TPKC + C K   + + +  H+   AY I  D + I  EI  NGPVE 
Sbjct: 190 SGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEG 249

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +FTVY DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +G F
Sbjct: 250 AFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDNGLF 308

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI RGS+ CGIE ++ AGLP
Sbjct: 309 KILRGSDHCGIESEISAGLP 328


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 191/315 (60%), Gaps = 21/315 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           H L D  IK + ++  + W+A RN    + ++  F+ L+GV P  K  +    V    ++
Sbjct: 19  HFLSDKFIKLL-QSEDSTWEAGRNFN-KHLSIRYFRRLMGVHPDSKYHMPKYEVHQIPEN 76

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 150
            +LPK FD+R+AWP C TI  I DQG CGSCWAFGAVE +SDR CIH     N   S  +
Sbjct: 77  FELPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAEN 136

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH------P 197
           L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C H      P
Sbjct: 137 LVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRP 194

Query: 198 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C K   + + +  H+   AY I  D + I  EI KNGPVE +FTVY
Sbjct: 195 KCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVY 254

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF HYKSGVY+H  G  +GGHA++++GWG  ++G  YW+ AN WN  WG +G FKI RG
Sbjct: 255 VDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDNGLFKILRG 313

Query: 317 SNECGIEEDVVAGLP 331
           S+ CGIE ++ AGLP
Sbjct: 314 SDHCGIESEISAGLP 328


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/322 (43%), Positives = 192/322 (59%), Gaps = 33/322 (10%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 90
           H L D ++  +N+     W+A  N  F N  +   K L G         LG P       
Sbjct: 24  HPLSDELVNYINKQ-NTTWQAGHN--FHNVHLSYVKRLCGT-------YLGGPRLPQRIK 73

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
             + + LP+SFDAR  WP C TI  I DQG CGSCWAFGAV A+SDR CIH    +N+ +
Sbjct: 74  FAEIVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEV 133

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--- 196
           S  DLL+CCG  CGDGC+GGYP +AW+Y+   G+V+         C PY     C H   
Sbjct: 134 SAEDLLSCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVN 192

Query: 197 ---PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
              P C      TPKC + C    +  ++  KH+   +Y ++S+ ++IMAEIYKNGPVE 
Sbjct: 193 GTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEG 252

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +FTV+ DF  YK+GVYKH+ G+++GGHA++++GWG  ++G  YW++ N WN  WG  G+F
Sbjct: 253 AFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDSGFF 311

Query: 312 KIKRGSNECGIEEDVVAGLPSS 333
           KI RG + CGIE ++VAG+P +
Sbjct: 312 KIVRGEDHCGIESEIVAGIPRT 333


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/333 (43%), Positives = 188/333 (56%), Gaps = 23/333 (6%)

Query: 13  CLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 72
           CL      A  + S   LD   L D +I  VN +    W AAR+P+F +      K L G
Sbjct: 6   CLLVLFAVAS-IASAKPLDFQALSDDVIDYVN-SLNTTWTAARSPRFPSGNEVDVKDLCG 63

Query: 73  VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
           V      L    P K       +P +FDAR  W  C +IS I DQG CGSCWA GAVEA+
Sbjct: 64  VLDVKHTL----PYKEKVSVGAIPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAM 119

Query: 133 SDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EEC 185
           SDR+C+ F  N+ +S  +L+ CC F CG+GC GG+   AW Y+V  G+VT       E C
Sbjct: 120 SDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGC 178

Query: 186 DPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPED 238
            PY     C+H  PG    C     TP+C R C       +    HY   AY ++ + E 
Sbjct: 179 QPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVHREVEA 237

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           I  EI  NGPVE +FTVY DF  YKSGVY+H+ G  +GGHA++++GWGT ++G  YW++A
Sbjct: 238 IQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGT-ENGVPYWLIA 296

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N WN SWG  GYFK+ RG ++CGIE ++VAG P
Sbjct: 297 NSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTP 329


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 186/321 (57%), Gaps = 33/321 (10%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPVKT 88
           H L D  I  +N   K+ WKA RN    +  +   K LLGV P    TPK     +P K 
Sbjct: 24  HPLSDDFINRINSR-KSTWKAGRNFDI-DTPISHIKQLLGVLPETENTPK-----LPKKI 76

Query: 89  HD-KSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNL 144
           H   + ++P SFDAR AWP C+  I  I DQ  CGSCWAFGAVEA+SDR CIH    + +
Sbjct: 77  HSINAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKV 136

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------- 196
           ++S  D L CC  +CG GC+GG P  AW ++  +G+VT     Y D+ GC          
Sbjct: 137 NISAEDPLDCC-TICGMGCNGGMPAMAWLHWTVNGIVTG--GNYEDTNGCKAYSFAPCEH 193

Query: 197 ------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                 P C P  PTP C ++C   + L   +     S Y I+  P+ I  EI  NGPVE
Sbjct: 194 HVDGDLPPCGPTKPTPDCKKECDSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPVE 253

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            SF+VYEDF  YKSGVY+H+ G+  GGHA+K++GWG  +D   YW++AN WN  WG  GY
Sbjct: 254 ASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVEND-TPYWLVANSWNEDWGDKGY 312

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           FKI RGSNECGIE  +VAG+P
Sbjct: 313 FKILRGSNECGIEGSIVAGIP 333


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 194/343 (56%), Gaps = 40/343 (11%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILCL      A  +     L S ++    I ++N   +AG        F N  +   K L
Sbjct: 7   ILCLLGAFANARSIPYYPPLSSDLVNH--INKLNTTGRAG------HNFHNTDMSYVKKL 58

Query: 71  LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            G         LG P         + + LP +FD R  WP C TIS I DQG CGSCWAF
Sbjct: 59  CGT-------FLGGPKAPERVDFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAF 111

Query: 127 GAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GAVEA+SDR C+H    +S+ V+  DLL+CCGF CG GC+GGYP  AWRY+   G+V+  
Sbjct: 112 GAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGG 171

Query: 185 CDPYFDSTGC---SHPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSIS 228
              Y    GC   + P CE                TP+C R C    +  ++  KHY I+
Sbjct: 172 L--YDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGIT 229

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+H++G+ +GGHA++++GWG  
Sbjct: 230 SYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV- 288

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++G  YW+ AN WN  WG  G+FKI RG + CGIE ++VAG+P
Sbjct: 289 ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVP 331


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 200/345 (57%), Gaps = 31/345 (8%)

Query: 11  ILCLTCFATFAEGVV--SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           I  L     F+ G V  + +++D + L D  I  +N + +  W A RN    +  +   K
Sbjct: 6   IFALVGLLIFSFGRVDGATVRVDLNPLSDEFIDHIN-SIQYYWSAGRNFH-KDTPISYIK 63

Query: 69  HLLGVKPT----PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
            L+GV       PK   L   +  +D S  LP++FDAR  WP C TI  + DQG CGSCW
Sbjct: 64  GLMGVHEKNAEYPK---LEQLLTYNDASTDLPETFDARERWPNCPTIREVRDQGSCGSCW 120

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGAVEA+SDR CIH     N   S  +L++CC + CG GC+GG+P +AW Y+   G+V+
Sbjct: 121 AFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVS 179

Query: 183 EECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
               PY  + GC              +   C+    TP CV+KC +  ++ +    H+  
Sbjct: 180 G--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
           SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG 
Sbjct: 238 SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
            +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 298 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 342


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 156/370 (42%), Positives = 202/370 (54%), Gaps = 57/370 (15%)

Query: 8   MDPILCLTCFATFA--------EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF 59
           M  +L L+C    A        E V+   +LD     D +I  VNEN    W A +  +F
Sbjct: 1   MKTLLFLSCIVVAAYCACNDNLESVLEAAELDG----DDLIDYVNENQNL-WTAKKQRRF 55

Query: 60  SNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSAWPQ 107
           S+        + G     K  L+GV              KT D  L +P+SFD+R  WP+
Sbjct: 56  SS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDNWPK 107

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDG 165
           C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG GC+G
Sbjct: 108 CDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNG 166

Query: 166 GYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCV 209
           G P++AWRY+V  G+VT     Y  + GC     P CE               YPTPKC 
Sbjct: 167 GDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCE 224

Query: 210 RKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY
Sbjct: 225 KKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY 284

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
            H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECGIE  VV
Sbjct: 285 VHTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVV 343

Query: 328 AGLPSSKNLV 337
            G+P   +L 
Sbjct: 344 GGIPKLNSLT 353


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 185/325 (56%), Gaps = 24/325 (7%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           + L++ +L+   + +     +  +KA     FS+Y     K L+G K         V   
Sbjct: 28  IPLEAQMLRGQDLVDYVNKQQTSFKAKLGSYFSSYPDTIKKQLMGAKMIEIPDEYRVFEM 87

Query: 88  THDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           TH + L   +P SFD+R+ WP C +IS+I DQ  CGSCWA  A E +SDR CI  +    
Sbjct: 88  THPEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQ 147

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 200
           LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y + TGC    +P CE
Sbjct: 148 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKTGCKPYPYPPCE 205

Query: 201 -------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 246
                          YPT KC R C     L +    H+  SAY ++    +I  EI  +
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTH 265

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVEV+F+VYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  WG
Sbjct: 266 GPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
            +GYF+I RG NECGIE  VV G+P
Sbjct: 325 ENGYFRIIRGVNECGIESGVVGGIP 349


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 143/352 (40%), Positives = 196/352 (55%), Gaps = 27/352 (7%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           M+ T LI+     L     FA   + + K   +  +   I E   N    WKA  N ++ 
Sbjct: 1   MKHTALILSASFLLIALTGFATYEIFRFKHQKYHDRLKQIAEKVNNSNTTWKAGENIKWI 60

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPV-KTHDKSLKLPKSFDARSAW-PQCSTISRILDQG 118
           N  +   K  +G     K    GV + K + ++  LP  FD+R  W  +CS++  + DQ 
Sbjct: 61  NSDIAGVKAHMGTLLNQKS---GVKLEKVNRQANNLPSEFDSRVQWGDKCSSLWEVRDQS 117

Query: 119 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 178
           +CGSCWAFGA E+LSDR CIH G ++ LS  +L+ CC   CG GCDGG+P +A  Y+V++
Sbjct: 118 NCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYYVNN 176

Query: 179 GVVTEECDPYFDSTGCS---------------HPGCEPAYPTPKCVRKCVKKNQL---WR 220
           G+VT   D Y +++ C                +P C    PTP CV+ C   +     + 
Sbjct: 177 GLVTG--DLYGNNSWCQAYSLAPCAHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPYP 234

Query: 221 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 280
              H    AY I+ + + IM EI  NGP+EV+FTVYEDF  YKSGVY+H+TG  +GGHAV
Sbjct: 235 KDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAV 294

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           K++GWG  ++G  YWI+ N WN SWG  G FKI RG NECGIE + V  LP+
Sbjct: 295 KMVGWGV-ENGTPYWIIVNSWNESWGDKGTFKILRGQNECGIESECVTALPA 345


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 205/376 (54%), Gaps = 59/376 (15%)

Query: 8   MDPILCLTCFATFA--------EGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKA 53
           M  +L L+C    A        E V+ K +   +DS   +   D +I  VNEN    W A
Sbjct: 1   MKTLLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTA 59

Query: 54  ARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDA 101
            +  +FS+        + G     K  L+GV              KT D  L +P+SFD+
Sbjct: 60  KKQRRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDS 111

Query: 102 RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLC 159
           R  WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   C
Sbjct: 112 RDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-C 170

Query: 160 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAY 203
           G GC+GG P++AWRY+V  G+VT     Y  + GC     P CE               Y
Sbjct: 171 GFGCNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLY 228

Query: 204 PTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           PTPKC +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +
Sbjct: 229 PTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLN 288

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           Y  GVY H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECG
Sbjct: 289 YDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECG 347

Query: 322 IEEDVVAGLPSSKNLV 337
           IE  VV G+P   +L 
Sbjct: 348 IESGVVGGIPKLNSLT 363


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 140/300 (46%), Positives = 187/300 (62%), Gaps = 24/300 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA  N  F+N  +   K L G      G  L       D  ++LP SFD+R+AWP C T
Sbjct: 41  WKAGHN--FANADLHYVKRLCGTHLN--GPQLQKRFGFAD-GMELPDSFDSRAAWPNCPT 95

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 168
           I  + DQG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCGF CG GC+GGYP
Sbjct: 96  IREVRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMGCNGGYP 155

Query: 169 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKCVK 214
             AW+++   G+V+         C PY     C H   G  PA       TPKCV++C  
Sbjct: 156 SGAWKFWTETGLVSGGLYDSHLGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKCVKQCED 214

Query: 215 K-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
               ++ + KH+  ++Y + S  ++IMAEIYKNGPVE +F VY DF  YKSGVY+H TG+
Sbjct: 215 GYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGE 274

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P +
Sbjct: 275 ELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGIPKN 333


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 141/297 (47%), Positives = 180/297 (60%), Gaps = 23/297 (7%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA  N  F N      + L G     KG  L + V+ +   LKLP  FDAR  WP+C T
Sbjct: 40  WKAGHN--FHNVDYSYVRRLCGT--MLKGPKLPIMVQ-YAGGLKLPAEFDAREQWPECPT 94

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 168
           +  I DQG CGSCWAFGA EA+SDR CIH G  +S+ ++  DLL CC   CG GC+GGYP
Sbjct: 95  LKEIRDQGSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDS-CGMGCNGGYP 153

Query: 169 ISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VK 214
            SAW ++   G+V+         C PY  S       G   P       TP+C+ +C   
Sbjct: 154 SSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAG 213

Query: 215 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 274
            +  ++  KHY  S+Y +    E I AEI KNGPVE +FTVYEDF  YKSGVY+H++G V
Sbjct: 214 YSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSV 273

Query: 275 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +GGHA+K++GWG  +DG  YW+ AN WN  WG +G+FKI RGSN CGIE ++VAG+P
Sbjct: 274 LGGHAIKVLGWG-EEDGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAGIP 329


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 135/343 (39%), Positives = 197/343 (57%), Gaps = 22/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVYIVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGAVEA++DR CI  G   +  LS  DL++CC   CGDGC GG+P  AW Y+V  G+V
Sbjct: 119 WAFGAVEAMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIV 177

Query: 182 T-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T         C PY        T   +P C    Y TP+C + C K  +  +   KHY  
Sbjct: 178 TGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGD 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
            +Y + ++ + I  +I   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG 
Sbjct: 238 ESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            +    YW++AN WN  WG  G F++ RG +EC IE DVVAGL
Sbjct: 298 -EKRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 25/345 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     ++ + ++    L D +I  +N++P AGW A+R+ +F +    
Sbjct: 1   MMNTVLCIISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLEDA 60

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           +   LLG     + L       V   + SL++P SFD+R  W QC +IS I DQ  CGSC
Sbjct: 61  RI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+V
Sbjct: 119 WAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIV 177

Query: 182 TEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           T         C PY        +TG  +P C E  Y TPKC +KC K  +  ++  K+Y 
Sbjct: 178 TGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 297 V-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 182/313 (58%), Gaps = 26/313 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----HDKSLK 94
           II  +N      W+A +N +F++      K  +G    P G +L  P K+      +   
Sbjct: 28  IIDYINNKANTTWRAGKNKRFTDALSA--KSQMGSLFNPGGSML--PTKSFYLSSTQKAA 83

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLL 152
           LP  FDAR AWP C TI  I DQG CGSCWAFGA EA+SDR CIH      + +S +DLL
Sbjct: 84  LPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLL 143

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGC 199
           +CCG  CG GC+GG P +AWRY+   G+V+         C PY +   C H      P C
Sbjct: 144 SCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPY-EIPPCEHHTSGNRPDC 202

Query: 200 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           +    TPKC R+CV+  +  ++  KH++ + Y + +  EDIM EI   GPVE  F VY D
Sbjct: 203 KGNSKTPKCQRQCVESFDGKYQADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYAD 262

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H+ G  +GGHAVK++GWG  ++G  YW+ AN WN  WG  G+FKI RG N
Sbjct: 263 FLTYKSGVYQHVKGGFLGGHAVKILGWG-EENGVPYWLCANSWNTDWGDGGFFKILRGYN 321

Query: 319 ECGIEEDVVAGLP 331
            C IE D+ AG+P
Sbjct: 322 HCKIEADINAGIP 334


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 146/332 (43%), Positives = 198/332 (59%), Gaps = 29/332 (8%)

Query: 20  FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPK 78
           F   +     +  + +  S I  +N N K  W+A  N  F  +    + ++L G   TP 
Sbjct: 6   FGVLIAMVFTMPKNSMFQSHIHTIN-NMKTTWEAGEN--FGPHITSDYIRNLCGALKTP- 61

Query: 79  GLLLGVPVKTHDKSLK-LPKSFDARSAWPQ-CSTISRILDQGHCGSCWAFGAVEALSDRF 136
            L   +P+K   K +  LP  FDAR  W   C ++  + DQG CGSCWAFGA EA++DR 
Sbjct: 62  -LSKKLPIKDLSKEVHDLPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAEAMTDRI 120

Query: 137 CIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 194
           CI   G N + +S  DLL CC   CG GC+GGYP SAW +F   G+VT    PY    GC
Sbjct: 121 CIATKGKNQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEFFKTKGIVTG--GPYNSHKGC 177

Query: 195 --------------SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDI 239
                         S   C  + PTPKC + C K  N  ++N KHY +++Y IN+D  +I
Sbjct: 178 QPYAIPACDHHVPHSKNPCNGSLPTPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQNEI 237

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
           M EI  NGPVE +FTV+ DF +YKSGVY+H++G+ +GGHA+K++GWG  ++   YW++AN
Sbjct: 238 MREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENN-TPYWLVAN 296

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            WN SWG +G+FKI RGS+ECGIE++VVAGLP
Sbjct: 297 SWNPSWGDNGFFKILRGSDECGIEDEVVAGLP 328


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 25/345 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     ++ + ++    L D +I  +N++P AGW A+R+ +F +    
Sbjct: 1   MMNTVLCIVSFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLEDA 60

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           +   LLG     + L       V   + SL++P SFD+R  W QC +IS I DQ  CGSC
Sbjct: 61  RI--LLGAMREDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+V
Sbjct: 119 WAFTAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIV 177

Query: 182 TEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           T         C PY        +TG  +P C E  Y TPKC +KC K  +  ++  K+Y 
Sbjct: 178 TGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 297 V-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 199/343 (58%), Gaps = 23/343 (6%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  +++ 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINKHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+V
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIV 177

Query: 182 T-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T         C PY        T   +P C    Y TP+C +KC K  +  +   K+Y  
Sbjct: 178 TGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGD 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
             Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA+++IGWG 
Sbjct: 238 QRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 298 -EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 139/299 (46%), Positives = 182/299 (60%), Gaps = 22/299 (7%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA  N  F+N  V   K L G       L            L LP SFD+R+AWP C T
Sbjct: 41  WKAGHN--FANADVHYVKRLCGTHLNGPQLQKRFGFA---DDLDLPDSFDSRAAWPNCPT 95

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 168
           I  I DQG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCGF CG GC+GGYP
Sbjct: 96  IREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYP 155

Query: 169 ISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAY-PTPKCVRKCVKK 215
             AWR++   G+V+         C PY           S P C+     TPKC++ C + 
Sbjct: 156 SGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEG 215

Query: 216 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 274
               + + KH+  ++Y + S  ++IMA+IYKNGPVE +F VY DF  YKSGVY+H TG+ 
Sbjct: 216 YTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEE 275

Query: 275 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 276 LGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAGIPKN 333


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 187/319 (58%), Gaps = 29/319 (9%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH-----DK 91
           D +I  VN N +  WKA +  +FS Y     KH  G+      + L V  K H     D 
Sbjct: 59  DELINYVNNNQQL-WKAKKQRRFSMYKGENDKHKWGLMGVNH-VRLSVKGKQHLSKTKDL 116

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 149
            + +P+SFD+R  WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + +SLS +
Sbjct: 117 DMDIPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 176

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------ 196
           DLL+CC   CG GC+GG P++AWRY+V  G+VT         C PY     C H      
Sbjct: 177 DLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPY-PFPPCEHHSKKTH 234

Query: 197 --PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
             P     YPTPKC ++C  +  ++ +   K Y  SAY +  D E I  E+  +GP+E++
Sbjct: 235 FDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIA 294

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF +Y  GVY H  G + GGHAVKLIGWG  +DG  YW +AN WN  WG DG+F+
Sbjct: 295 FEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIPYWTVANSWNTDWGEDGFFR 353

Query: 313 IKRGSNECGIEEDVVAGLP 331
           I RG +ECGIE  VV G+P
Sbjct: 354 ILRGVDECGIESGVVGGIP 372


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 192/343 (55%), Gaps = 29/343 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           I  L     F+ G    +++D   L D  I  +N + +  W A RN    N  +   K L
Sbjct: 6   IFALVGLLIFSFGCCDDIRVDLDPLSDEFIDHIN-SIQYYWSAGRNFH-KNTPMSYLKGL 63

Query: 71  LGVKPT----PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
           +GV  +    PK   L   V   D    LP++FDAR  WP C TI  + DQG CGSCWAF
Sbjct: 64  MGVHESNAHYPK---LEQLVSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAF 120

Query: 127 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           GAVEA+SDR CIH     N   S  +L++CC   CG GC+GG+P +AW Y+   G+V+  
Sbjct: 121 GAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHYWKTKGIVSG- 178

Query: 185 CDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
             PY    GC              +   C+    TP CV+KC    ++ +    H   SA
Sbjct: 179 -GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSA 237

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +
Sbjct: 238 YSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQN 297

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
               YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 298 GEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLPA 340


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 136/298 (45%), Positives = 181/298 (60%), Gaps = 28/298 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 254

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FK
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFK 311


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 188/323 (58%), Gaps = 24/323 (7%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS  +   H L   ++  +N+     WKA  N  F N      + L G     KG  L V
Sbjct: 15  VSLARPHLHPLSSEMVNHINK-LNTTWKAGHN--FHNVDYSYVRKLCGT--MLKGPKLPV 69

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 142
            V+ +   +KLPK FDAR  WP C T+  I DQG CGSCWAFGA EA+SDR CIH    +
Sbjct: 70  MVQ-YAGDVKLPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNGKV 128

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 189
           N+ +S  DLL CC   CG GC+GGYP +AW ++   G+V+         C PY       
Sbjct: 129 NVEISSEDLLTCCDS-CGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH 187

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
              G   P       TP+CVR+C       +   KHY  ++Y + SD + I  EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGP 247

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE +FTVYEDF  YK+GVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +
Sbjct: 248 VEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDWGDN 306

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           GYFKI RGS+ CGIE ++VAG+P
Sbjct: 307 GYFKILRGSDHCGIESEIVAGIP 329


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/333 (43%), Positives = 189/333 (56%), Gaps = 22/333 (6%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
           T  +    + K     L   +I  +N      WKA    +F   TV   + +LG  P P 
Sbjct: 20  TLNDNDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77

Query: 79  GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           G  L      ++ ++ +LPKSFDAR  W  C +IS I DQ  CGS WAFGAVEA+SDR C
Sbjct: 78  GEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQSSCGSYWAFGAVEAMSDRIC 137

Query: 138 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 188
           I         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196

Query: 189 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
            +   C H      P C+    TP C R C    N  + N K Y    YR+ S+ E IM 
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++AN W
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSW 314

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           N  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 315 NTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 347


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 194/346 (56%), Gaps = 26/346 (7%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           I+  +L     A   +     L+    +    ++  +N+  K  + A  +P+F+N+    
Sbjct: 6   IVAVVLVTAVSAASWQNAKKNLQEAEKLTGRELVDYINKAQKL-FTAKLSPRFANFPNEI 64

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
            + L+G K         V  KTH       +PKSFD+R+ WP+C ++  I DQ  CGSCW
Sbjct: 65  KRRLMGSKYVALPAKYRVNEKTHSDIDDTTIPKSFDSRTNWPECPSLYSIRDQSSCGSCW 124

Query: 125 AFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A GAVEA++DR CI    N  +++S +DLL+CC   CG GCDGG P +AW Y+V +G+VT
Sbjct: 125 AVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGGDPYAAWSYWVSNGIVT 183

Query: 183 EECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQLWRNS-KHY 225
                Y   +GC    +P CE               YPT  C  KC     +  NS KHY
Sbjct: 184 GS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHY 241

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
             S Y +  D   I  EI  NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK++GW
Sbjct: 242 GASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGW 301

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           GT ++G DYWI AN WN  WG +G+F+I RG +EC IE  VVAG P
Sbjct: 302 GT-ENGTDYWICANSWNSDWGENGFFRILRGVDECQIESSVVAGEP 346


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 140/303 (46%), Positives = 188/303 (62%), Gaps = 30/303 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH---DKSLKLPKSFDARSAWPQ 107
           WKA  N  F+N  +   K L G       LL G  ++        L+LP SFD+R+AWP 
Sbjct: 41  WKAGHN--FANADLHYVKRLCGT------LLKGPQLQKRFGFADGLELPDSFDSRAAWPN 92

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 165
           C TI  I DQG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCG  CG GC+G
Sbjct: 93  CPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMGCNG 152

Query: 166 GYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKCVRK 211
           GYP  AW+++   G+V+         C PY     C H   G  PA       TPKCV++
Sbjct: 153 GYPSGAWQFWTETGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKCVKQ 211

Query: 212 CVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
           C +  +  +   KH+  ++Y + +  ++IMAEIYKNGPVE +F VY DF  YKSGVY+H 
Sbjct: 212 CEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHE 271

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+
Sbjct: 272 TGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGV 330

Query: 331 PSS 333
           P +
Sbjct: 331 PKN 333


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 156/373 (41%), Positives = 204/373 (54%), Gaps = 59/373 (15%)

Query: 11  ILCLTCFATFA--------EGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 56
           +L L+C    A        E V+ K +   +DS   +   D +I  VNEN    W A + 
Sbjct: 3   LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 61

Query: 57  PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 104
            +FS+        + G     K  L+GV              KT D  L +P+SFD+R  
Sbjct: 62  RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 113

Query: 105 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 162
           WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG G
Sbjct: 114 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 172

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 206
           C+GG P++AWRY+V  G+VT     Y  + GC     P CE               YPTP
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 230

Query: 207 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           KC +KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  
Sbjct: 231 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 290

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECGIE 
Sbjct: 291 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIES 349

Query: 325 DVVAGLPSSKNLV 337
            VV G+P   +L 
Sbjct: 350 GVVGGIPKLNSLT 362


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 139/289 (48%), Positives = 176/289 (60%), Gaps = 23/289 (7%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           W   +N QF N  +G    LLG K +       +PV   D ++K P SFD+R+AW  C+T
Sbjct: 39  WVEEKNDQFDNIKIGS---LLGFKKSLN--RPSIPVLNADPNIKAPASFDSRTAWSNCTT 93

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 170
           I  I +Q  CGSCWAFGAVE+  DR CIH G+++ LS  DL+ C      DGC+GG  +S
Sbjct: 94  IGYIENQARCGSCWAFGAVESAQDRICIHKGLDVQLSFLDLVTC--DQSDDGCEGGDDVS 151

Query: 171 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNS 222
           AW +    GVVT+EC PY      + P C PA         TP CV++C   + L +   
Sbjct: 152 AWNFLKKQGVVTQECKPY------TIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQD 205

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
           KH     Y INS  E IM EI  NGPVE  F+VYEDF  YKSGVY+H TG  +GGH VK+
Sbjct: 206 KHKMAKIYSINS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKI 264

Query: 283 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            G+GT  +G +YW +AN W  SWG +G F IKRGS+ECGIE++VVAG+P
Sbjct: 265 FGYGTL-NGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIEDEVVAGIP 312


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 136/318 (42%), Positives = 184/318 (57%), Gaps = 24/318 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L  +II  VN      WKA  + +F++ +  Q +  LG  P P G  L V     +    
Sbjct: 39  LSSAIIDYVNRI-NTTWKAEPSRRFTSPS--QVRQQLGALPDPMGRRLPVLYSLSENYKS 95

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH------FGMNLSLSV 148
           LP SFD R  WP C T+  I DQG CGSCWAFGA EA+SDR CI         + + LS 
Sbjct: 96  LPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSA 155

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------------EECDPYFDSTGCSH 196
           +DLL+CC   CG GC+GG+P  AW ++ H G+V+             E  P       + 
Sbjct: 156 DDLLSCCRD-CGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHVNGTR 214

Query: 197 PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P CE   PTPKC   C ++ ++ ++  KHY++  Y ++S+ + I  E+  +GPVE  F V
Sbjct: 215 PPCEGDAPTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEV 274

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF  YKSGVY+H++G ++GGHA+KL+GWG  +DG  YW+ AN WN  WG  G+FKI R
Sbjct: 275 YADFPTYKSGVYQHVSGALLGGHAIKLMGWG-EEDGVPYWLCANSWNTDWGEGGFFKILR 333

Query: 316 GSNECGIEEDVVAGLPSS 333
           G N CGIE D+VAG+P +
Sbjct: 334 GKNHCGIESDIVAGIPQN 351


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 129/262 (49%), Positives = 163/262 (62%), Gaps = 22/262 (8%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            +P  FDAR  WP C +I  I DQ  CGSCWA  A E +SDR CI     +N+ +S  DL
Sbjct: 74  NIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISAEDL 133

Query: 152 LACC--GFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSH 196
           L+CC  G+ CGDGC+GGYPI AWRY+VH+G+VT         C PY  +       G + 
Sbjct: 134 LSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGVTW 193

Query: 197 PGCEP-AYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
           P C      TP+CV++C  K+     +   KHY  SAY I  +   I  EI +NGPVEV 
Sbjct: 194 PKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVG 253

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VY DF  YKSG+YKH+ G  +GGHAVK++GWG  ++G  YW+ AN WN +WG  GYF+
Sbjct: 254 FLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWGV-ENGTPYWLAANSWNVNWGEKGYFR 312

Query: 313 IKRGSNECGIEEDVVAGLPSSK 334
           I+RG+NECGIE  VVAG+P  K
Sbjct: 313 IRRGTNECGIESSVVAGIPDLK 334


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 202/345 (58%), Gaps = 25/345 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     ++ + ++    L D +I  +N++P AGW A+R+ +F +    
Sbjct: 1   MMNTVLCIISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLEDA 60

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           +   LLG     + L       V   + SL++P SFD+R  W QC +IS I DQ  CGSC
Sbjct: 61  RI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+V
Sbjct: 119 WAFAAVEAMSDRICIESKGKKSVELSAVDLLSCC-TECGLGCQGGFPGAAWDYWVEDGIV 177

Query: 182 TEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           T         C PY        +TG  +P C E  Y TPKC +KC K  +  +   K+Y 
Sbjct: 178 TGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYGKDKYYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHAV++IGWG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 297 V-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 141/323 (43%), Positives = 191/323 (59%), Gaps = 25/323 (7%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS  K    +L   +++ +N N    W A +N  F N  +   K L G     KG  L  
Sbjct: 15  VSWAKPRLPLLSPEMVQYIN-NADTTWTAGQN--FHNVDISYVKSLCGT--LLKGPRLPE 69

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 142
            V++ D+ + LP SFDAR  WP C TI  I DQG CGSCWAFGA EA+SDR+CIH    +
Sbjct: 70  LVQS-DEDMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKV 128

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 195
           ++ +S  DLL+CC   CG GC GG+P +AW Y+   G+VT         C PY  +  C 
Sbjct: 129 SVEISAEDLLSCCD-ACGMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAP-CE 186

Query: 196 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
           H      P C     TPKCV +C       ++  K +    Y +    + IM E+YKNGP
Sbjct: 187 HHVNGTRPPCTGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGP 246

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE +F+VYEDF  YK+GVY+H+TG ++GGHA+K++GWG  ++   YW++AN WN  WG +
Sbjct: 247 VEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWG-KENNTPYWLVANSWNTDWGDN 305

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G+FKI RG +ECGIE ++VAG+P
Sbjct: 306 GFFKILRGKDECGIESEIVAGIP 328


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 141/319 (44%), Positives = 187/319 (58%), Gaps = 23/319 (7%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVK 87
           K  + IL +S I  VNE  +  WKA   P F   T   + + L+GV P  +  L      
Sbjct: 19  KTYNSILSESFIASVNEEAQI-WKAG--PNFHPETSSNYIRSLMGVLPNHRDYLPPPLPN 75

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 147
                  +P +FDAR  WP C +I  I DQG CGSCWAFGA EA+SDR CIH   N+++S
Sbjct: 76  LLGTE-SIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNVNIS 134

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTGC 194
             +LL+CC + CG GC+GG+P +AWR++ + G+V+       + C PY          G 
Sbjct: 135 AENLLSCC-YTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGT 193

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVS 252
             P C     TPKC + C  KN      K  S   S+Y I SDP+ I  +I  NGPVE +
Sbjct: 194 RKP-CAEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAA 252

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F+VY DF  YKSGVY+H+ G ++GGHA++++GWG  + G  YW++AN WN  WG +G FK
Sbjct: 253 FSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGM-EKGTPYWLVANSWNTDWGDNGTFK 311

Query: 313 IKRGSNECGIEEDVVAGLP 331
           I RGS+ CGIE+ VVAGLP
Sbjct: 312 ILRGSDHCGIEDSVVAGLP 330


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 142/324 (43%), Positives = 191/324 (58%), Gaps = 30/324 (9%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 90
           L D ++  VN+     WKA  N  F N      K L G         LG P         
Sbjct: 26  LSDELVHYVNKQ-NTTWKAGHN--FHNVDQSYLKKLCGT-------FLGGPKPPQRLWFA 75

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 149
           +++ LP+SFD+R  WP C TI  I DQG CGSCWAFGAVEA+SDR CI    ++S+ V+ 
Sbjct: 76  ENMILPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSA 135

Query: 150 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 196
            D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S 
Sbjct: 136 EDMLTCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEHHVNGSR 195

Query: 197 PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P C     TPKC + C       ++  KHY  S+Y ++S  ++IMAEIYKNGPVE +F+V
Sbjct: 196 PPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSV 255

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF  YKSGVY+H+TG++MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI R
Sbjct: 256 YSDFLMYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILR 314

Query: 316 GSNECGIEEDVVAGLPSSKNLVKE 339
           G + CGIE ++VAG+P +    K+
Sbjct: 315 GQDHCGIESEIVAGIPCTDQYWKK 338


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 25/345 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     ++ + ++    L D +I  +N++P AGW A+R+ +F +    
Sbjct: 1   MMNTVLCIISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLEDA 60

Query: 66  QFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           +   LLG     + L     P   H   SL++P SFD+R  W QC +IS I DQ  CGSC
Sbjct: 61  RI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y+V  G+V
Sbjct: 119 WAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIV 177

Query: 182 TEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           T         C PY        +TG  +P C E  Y TPKC +KC K  +  ++  K+Y 
Sbjct: 178 TGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + ++   I  EI  +GPVEV+FTV+ DF +YKSG+YK++TG  +G HAV++IGWG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +    YW++AN WN  WG  GYF++ RG +ECGIE  V +GLP
Sbjct: 297 V-EKKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 142/326 (43%), Positives = 190/326 (58%), Gaps = 31/326 (9%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 90
           L D ++  VN+     WKA  N  F N  +   K L G         LG P         
Sbjct: 26  LSDELVNYVNKR-NTTWKAGHN--FHNVDLSYVKRLCGT-------FLGGPKLPQRVWFA 75

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 149
           + + LP++FDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CI    ++S+ V+ 
Sbjct: 76  EDVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSA 135

Query: 150 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCS 195
            D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY          G  
Sbjct: 136 EDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSR 195

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P       TPKC + C    +  ++  KHY  S+Y ++S  ++IMAEI+KNGPVE +FT
Sbjct: 196 PPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFT 255

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSGVY+H+ GD+MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI 
Sbjct: 256 VYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKIL 314

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE ++VAG+P +    K I
Sbjct: 315 RGQDHCGIESEIVAGIPCTDQYWKRI 340


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 196/343 (57%), Gaps = 31/343 (9%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + EVN   K  W A+ N  +  +  ++G
Sbjct: 9   LCLVAVFALLLATTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASANNGYLVTGKSLG 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE+
Sbjct: 129 IAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATED 187

Query: 185 CDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQL----WRNSKHYSISAYR 231
           C PY FD   CSH G    YP        TPKC   C ++N++    ++ S  YS+   +
Sbjct: 188 CQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTC-ERNEMDLVKYKGSTSYSVKGEK 244

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
                 ++M E+  NGP+E++  VY DF  YKSGVYKH+ GD +GGHAVKL+GWGT  DG
Sbjct: 245 ------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGT-QDG 297

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
             YW +AN WN  WG  GYF I+RG+NEC IE   VAG+P+ +
Sbjct: 298 VPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 138/325 (42%), Positives = 185/325 (56%), Gaps = 24/325 (7%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           + +++ +L+   + +     +  + A     FS+Y     K L+G K         V   
Sbjct: 28  IPVEAQMLRGQELVDYVNKQQTTFTAKLGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEM 87

Query: 88  THDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           TH + L   +P SFD+R+ WP C +IS+I DQ  CGSCWA  A E +SDR CI  +    
Sbjct: 88  THPEVLDTAVPDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQ 147

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 200
           +S+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y + +GC    +P CE
Sbjct: 148 ISISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKSGCKPYPYPPCE 205

Query: 201 -----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 246
                      P+  YPT KC   C     L +    H+  SAY ++  P +I  EI  +
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTH 265

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVEV+FTVYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  WG
Sbjct: 266 GPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
            +GYF+I RG NECGIE  VV G P
Sbjct: 325 ENGYFRIIRGVNECGIESGVVGGTP 349


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 185/315 (58%), Gaps = 21/315 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           H L D  I+ + +N K  WKA RN    N  +   K L+GV    K  +  V      + 
Sbjct: 19  HPLSDKFIQLL-QNEKTTWKAGRNFN-KNLPMRYLKSLMGVHADSKFHMSPVHKHKIPEG 76

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
            K+PK FD+R+AW  C TIS I DQG CGSCWAFGAVE ++DR CIH     N   S  +
Sbjct: 77  FKIPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAEN 136

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH------P 197
           L++CC  LCG GC+GG+P +A++Y+VH G+V       T+ C PY +   C H      P
Sbjct: 137 LVSCC-HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQGCQPY-EIAPCEHHVSGPRP 194

Query: 198 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C     + + +  H+    Y ++ D   I  +I  NGPVE +FTVY
Sbjct: 195 KCAEGGSTPKCHKNCESNYVVDYESDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVY 254

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF HYKSGVY+H  G  +GGHA++++GWG  +DG  YW+ AN WN  WG +GYFKI RG
Sbjct: 255 VDFLHYKSGVYQHTHGLPLGGHAIRVLGWG-EEDGTPYWLCANSWNTDWGDNGYFKILRG 313

Query: 317 SNECGIEEDVVAGLP 331
           S+ CGIE ++ AGLP
Sbjct: 314 SDHCGIESEISAGLP 328


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 140/313 (44%), Positives = 185/313 (59%), Gaps = 31/313 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH-----DKSLKLPKSFDARSAW 105
           WKA ++ +F +Y       L+GV      + L V  K H     D  + +P++FDAR  W
Sbjct: 76  WKAKKHRRFVHYPDRTKWGLMGVN----NVHLSVKAKQHLSSTKDLDIDIPETFDARQHW 131

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGC 163
             C +I  I DQ  CGSCWAFGAVEA+SDR CI  +  + ++LS +DLL+CC   CG GC
Sbjct: 132 SNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLLSCCR-TCGFGC 190

Query: 164 DGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKC 208
           +GG P+ AW+Y+V HG+VT       + C PY     C H        P     YPTPKC
Sbjct: 191 EGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCEHHSNKTRFDPCRHDLYPTPKC 249

Query: 209 VRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
            +KCV   K + + + + Y  +AY + +D   I  EI  +GPVEV+F VYEDF HY  G+
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGI 309

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
           Y H  G + GGHAVKLIGWG  D G  YW++AN WN  WG +G+F+I RG +ECGIE  V
Sbjct: 310 YVHTGGKLGGGHAVKLIGWGI-DQGTPYWLIANSWNTDWGEEGFFRILRGVDECGIESGV 368

Query: 327 VAGLPSSKNLVKE 339
           V G+P S N+ + 
Sbjct: 369 VGGIPKSTNIQRR 381


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 144/314 (45%), Positives = 187/314 (59%), Gaps = 25/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SL 93
           L D ++  +N      WKA  N    +    + K  LGV        L  P   HD   +
Sbjct: 32  LSDKMVDYIN-FINTTWKAGHNEGHRDLETVRRK--LGVSRDNHKYRL--PELVHDTLEM 86

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDL 151
            +P  FD+R  W  C TI  I DQG CGSCWAFGAVE++SDR CIH G    + L+ +D+
Sbjct: 87  DIPAQFDSRQQWQDCPTIREIRDQGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDV 146

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           L+CC + CG GC+GG+P +AW Y+V  G+VT       E C PY     C H        
Sbjct: 147 LSCC-WGCGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPY-PVPSCDHHVNGTLGP 204

Query: 199 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTPKCVR C K   + +++ KHY  S+Y ++S+   I  EI KNGPVE +FTVY 
Sbjct: 205 CGQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYA 264

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVYK  + D +GGHA++++GWG  ++G  +W++AN WN  WG  GYFKI RGS
Sbjct: 265 DFPLYKSGVYKSHSTDALGGHAIRILGWGV-ENGVPFWLVANSWNTEWGDKGYFKILRGS 323

Query: 318 NECGIEEDVVAGLP 331
           NECGIEED+VAG+P
Sbjct: 324 NECGIEEDIVAGIP 337


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 179/314 (57%), Gaps = 26/314 (8%)

Query: 36  QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSLK 94
            + +  EVN+  +  W A  N +F+  T    K  +GV +  P+     +P K       
Sbjct: 34  HEQVAAEVNQ-AQTSWTAGVNSRFARATDDFIKSQMGVLEGGPQ-----LPEKDIAVLAD 87

Query: 95  LPKSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           LP +FD+R  W   C +   I DQ  CGSCWAFGAVE+++DR CI    +L   +S  DL
Sbjct: 88  LPTAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDL 147

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           + CC F CG GC GGYP +AW +F   G+VT       + C PY     C H      P 
Sbjct: 148 MTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPY-SLPNCDHHVSGQYPA 206

Query: 199 CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTP C + C    N  + N KH+  +AY +  + + I  EI  NGPVE +FTVYE
Sbjct: 207 CSGEGPTPACKKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYE 266

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           D   YKSGVY+H TG V+GGHA+K+IGWG  + G DYW +AN WN  WG +G+FKIK+G 
Sbjct: 267 DLLTYKSGVYQHTTGQVLGGHAIKIIGWGV-ESGVDYWWVANSWNNDWGDNGFFKIKKGV 325

Query: 318 NECGIEEDVVAGLP 331
           +ECGIE  +VAG+P
Sbjct: 326 DECGIESQIVAGMP 339


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 143/347 (41%), Positives = 201/347 (57%), Gaps = 25/347 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     +++  ++    L D II  +N++P AGW A+R+ +F   +V 
Sbjct: 1   MMNTVLCIVSFMSILTAHILTGNEMQFEPLSDEIIAYINQHPDAGWTASRSDRFK--SVE 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLGV    + L       V   + SL++P +FD+R  W QC +IS I DQ  CGS 
Sbjct: 59  DARILLGVMREDEKLRKKRRPTVDHQNVSLEIPSTFDSRKKWSQCKSISSIHDQSRCGSG 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AVE +SDR CI      ++ LS  DLL+CC   CG GC GG+P SAW Y+V  GVV
Sbjct: 119 WAFAAVEVMSDRICIQSKGEKSVELSAVDLLSCC-RECGLGCLGGFPGSAWDYWVEEGVV 177

Query: 182 TEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
           T         C PY       ++TG  +P C +  Y TPKC +KC K  +  ++  KHY 
Sbjct: 178 TGSSGENHTGCQPYPFPKCEHNTTG-KYPACGQKIYETPKCQKKCQKGYKTPYKKDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             AY + ++ + I  EI  +GPV   FTVY DF +YKSG+YKH+ G  +G H V+++GWG
Sbjct: 237 KVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             + G  YW++AN WN  WG  GYF+I RG +EC IE  V+ GLP +
Sbjct: 297 V-EKGTPYWLIANSWNEGWGEKGYFRILRGKDECDIESLVIGGLPRN 342


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 142/326 (43%), Positives = 185/326 (56%), Gaps = 31/326 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH- 89
           D H+L D  I+ V    K  W   RN   S  + G  + L+GV P      L  P K+  
Sbjct: 23  DPHMLSDEFIELVRSKAKT-WTPGRNFDAS-VSEGHIRGLMGVHPDAHKFTL--PEKSQV 78

Query: 90  ------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
                 D    LP+SFDAR+AWP C TI  I DQG CGSCWAFGAVEA+SDR CIH    
Sbjct: 79  LGNLVGDDGDDLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGT 138

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 194
           +N   S  DL++CC   CG GC+GG+P +AW Y+ H G+V+       E C PY +   C
Sbjct: 139 VNFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPY-EIEPC 196

Query: 195 SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            H      P C+    TP C  +C     + +   KH+   +Y I  +P +I  EI  NG
Sbjct: 197 EHHVNGTRPPCKNGR-TPSCKHQCESSYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNG 255

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWG 306
           PVE +FTVYED   YKSGVYKH+ G  +GGHA++++GWG   D +  YW++ N WN  WG
Sbjct: 256 PVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWLIGNSWNTDWG 315

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPS 332
            +G+F+I RG + CGIE  + AGLP+
Sbjct: 316 DNGFFRIVRGEDHCGIESAISAGLPA 341


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 139/341 (40%), Positives = 201/341 (58%), Gaps = 24/341 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +I+     LT +A  A    S+  + + IL    I  +N++ K  W+A  N      +  
Sbjct: 1   MILKFAFLLTVYAGAA---YSRGAVSNGILSKDYIDSINKDSKT-WRAGSNFD-EEISTS 55

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
             + L+GV P  K  L    + T   + ++P++FD+R  WP C TIS I DQG CGSCWA
Sbjct: 56  YIRGLMGVLPNHKDYLPPA-LPTLLGTEQIPENFDSRQKWPHCPTISLIRDQGSCGSCWA 114

Query: 126 FGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           FGAVEA+SDR CIH    +++S  +LL+CC + CG GC+GG+P +AW ++   G+V+   
Sbjct: 115 FGAVEAMSDRLCIHSNKIVNVSAENLLSCC-YSCGFGCNGGFPGAAWSFWKKKGLVSGGL 173

Query: 183 ----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL--WRNSKHYSISAY 230
               + C PY  +  C H      P C     TPKC   C  ++    +   K +  S+Y
Sbjct: 174 YGSHKGCQPYAIAP-CEHHANGTRPPCSGGGRTPKCHTFCENEDYSLPYEKDKSFGRSSY 232

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            + SDP+ I  EI  NGPVE +F+VY DF +YKSGVY+H+ G ++GGHA++++GWG  ++
Sbjct: 233 SVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGV-EN 291

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G  YW++AN WN  WG +G FKI +GS+ CGIE  +VAGLP
Sbjct: 292 GTPYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLP 332


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 141/339 (41%), Positives = 191/339 (56%), Gaps = 37/339 (10%)

Query: 24  VVSKLKLDSHILQDSIIKEVNE-NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL- 81
           V S    D  +    I++EVN  N  + WKA  N +F   +  Q + ++G   TP  ++ 
Sbjct: 12  VASVQAFDFKLFTSEIMEEVNNYNTGSTWKAGYNKRFEGMSFDQIQAMMGTIATPVHMIP 71

Query: 82  --LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
                P +T  ++L LP+SFD R A+P+C ++ ++ DQ +CGSCWAFG VEA+SDR CI 
Sbjct: 72  DERYTPFET-IQNLSLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIA 130

Query: 140 FGM--NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVT------------E 183
            G      +S  +LL+CC   F CG GC+GGY   AW Y+V  G+V+             
Sbjct: 131 SGQKDQTRISSENLLSCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKT 190

Query: 184 ECDPYFDSTGCSH------PGCE--PAYPTPKCVRKCVKKNQLWRNSK----HYSISAYR 231
           EC PY     CSH        C   P + TPKC  +C   +Q  +NS     H  +S+Y 
Sbjct: 191 ECQPY-SFPPCSHHVQGEYQACTDLPQFNTPKCYTEC--NSQYTQNSYEQDLHKGVSSYS 247

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
           +    E I AEIY+ G    SF VY DF  Y SGVY++ +G  MGGHA+K++GWG  ++G
Sbjct: 248 VPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGV-ENG 306

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
             YW+ AN WN SWG +G+FKI RGSNECGIE  +VAG 
Sbjct: 307 TPYWLCANSWNSSWGENGFFKILRGSNECGIESGMVAGF 345


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 129/258 (50%), Positives = 159/258 (61%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P  FDAR  WP C +I+ I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  DLL
Sbjct: 82  IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query: 153 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
           +CC   F CG+GC+GGYPI AW+++V HG+VT         C PY  +       G   P
Sbjct: 142 SCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWP 201

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C E   PTPKCV  C  KN     +   KH+  +AY +    E I  EI  NGP+EV+F
Sbjct: 202 ACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF 261

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRI 320

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE   VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 184/313 (58%), Gaps = 23/313 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  I  +N   +  W+A RN  F+  T  ++ K L GV          +P +     +
Sbjct: 24  LSDEFIDYIN-TLQTTWRAGRN--FAPNTPKKYLKSLAGVHKNANNAFT-LPKRKVSLDV 79

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            +P  FDAR  WP C +I+ I DQG CGSCWAFGAVEA+SDR CIH    + + LS  +L
Sbjct: 80  TIPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENL 139

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           ++CC   CG GCDGG+P SAW Y+ + G+V+       + C PY  +  C H      P 
Sbjct: 140 VSCCD-SCGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAP-CEHHVPGSRPA 197

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           C     TP C  +C + + +  +  HY         + + I AEI KNGPVE +FTVYED
Sbjct: 198 CSGGGDTPDCRNQCDEGSGISYDQDHYYGETVYTLDEAKQIQAEILKNGPVEAAFTVYED 257

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
             +YK GVY+H+ G+ +GGHA+K++GWG  +D   YW++AN WN  WG +G+FKI RGS+
Sbjct: 258 LLNYKEGVYQHVAGEALGGHAIKILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSD 316

Query: 319 ECGIEEDVVAGLP 331
           ECGIE+ +VAGLP
Sbjct: 317 ECGIEDQIVAGLP 329


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 126/257 (49%), Positives = 157/257 (61%), Gaps = 19/257 (7%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +++P++FDAR  W QC +I  I DQ HCGSCWA  A E +SDR CIH    +N+ LS  D
Sbjct: 93  VEIPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATD 152

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY 203
           +L+CCG  CG GC GGYPI AWRYF+ HGV T       + C PY     C H   E  Y
Sbjct: 153 ILSCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CGHHRNEIYY 211

Query: 204 --------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                   PTP+C + C       + + K Y  SAY + ++ + I  EI  NGPV+ +F 
Sbjct: 212 GECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFM 271

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VYEDF+ Y+SG+Y H  G   GGHAVKLIGWG  DDG  YW+ AN WN  WG +GYF+I 
Sbjct: 272 VYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWLAANSWNSDWGENGYFRIV 331

Query: 315 RGSNECGIEEDVVAGLP 331
           RG + CGIE  VVAG+P
Sbjct: 332 RGVDHCGIESAVVAGMP 348


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 129/258 (50%), Positives = 161/258 (62%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P  FDAR  WP C +I+ I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  DLL
Sbjct: 83  IPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLL 142

Query: 153 ACCGFL--CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
           +CC  L  CG+GC+GGYPI AW+++V HG+VT         C PY  +       G + P
Sbjct: 143 SCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 202

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C +   PTPKCV  C   N     +   KH+  +AY +    E I  EI KNGPVEV+F
Sbjct: 203 KCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAF 262

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVYEDF  Y +GVY H +G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I
Sbjct: 263 TVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRI 321

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE   VAG+P
Sbjct: 322 IRGLNECGIEHSAVAGIP 339


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 141/350 (40%), Positives = 196/350 (56%), Gaps = 34/350 (9%)

Query: 8   MDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           M     + C A  A G V++ L  ++ +L D  ++ V    K  W   RN   S      
Sbjct: 1   MRQHFVIICIAFLAFGQVLANLDAENDLLSDEFLEIVRSKAKT-WTPGRNYDKS-VPRSH 58

Query: 67  FKHLLGVKPTP-------KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 119
           F+ L+GV P         K L+LG  V   D  +  P+ FDAR AWP C TI  I DQG 
Sbjct: 59  FRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDV--PEEFDARKAWPNCPTIGEIRDQGS 116

Query: 120 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CGSCWAFGAVEA+SDR CIH    ++   S +DL++CC   CG GC+GG+P +AW Y+  
Sbjct: 117 CGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLVSCC-HTCGFGCNGGFPGAAWAYWTR 175

Query: 178 HGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL-WRN 221
            G+V+    PY  S GC              + P C+  +  TP C  +C K   + ++ 
Sbjct: 176 KGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKT 233

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            KH+   +Y +  + +DI  EI +NGPVE +FTVYED   YK GVY+H+ G  +GGHA++
Sbjct: 234 DKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIR 293

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++GWG  ++   YW++AN WN  WG +G+FK+ RG + CGIE  + AGLP
Sbjct: 294 ILGWGV-ENKTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAGLP 342


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 184/315 (58%), Gaps = 24/315 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           H L   ++  +N+     WKA  N  F N      + L G     KG  L + V+ +   
Sbjct: 23  HPLSSDMVNYINK-LNTTWKAGHN--FKNADYSYVQKLCGT--MLKGPKLPIMVQ-YAGD 76

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--D 150
           +KLP  FDAR+ WP C T+  I DQG CGSCWAFGA EA+SDR CIH    +S+ ++  D
Sbjct: 77  VKLPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSED 136

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHP 197
           LL CC   CG GC+GGYP +AW ++   G+VT         C PY          G   P
Sbjct: 137 LLTCCE-SCGMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPP 195

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                  TP+C+ +C       ++  KHY  ++Y + ++   I  EIYKNGPVE +F VY
Sbjct: 196 CTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVY 255

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF  YKSGVY+H++G ++GGHA+K++GWG  +DG  YW+ AN WN  WG +GYFKI RG
Sbjct: 256 EDFPMYKSGVYQHVSGSLIGGHAIKILGWGV-EDGVPYWLCANSWNTDWGDNGYFKILRG 314

Query: 317 SNECGIEEDVVAGLP 331
           S+ CGIE +VVAG+P
Sbjct: 315 SDHCGIESEVVAGIP 329


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 191/341 (56%), Gaps = 27/341 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFK 68
           +LC       +  V  + K     L D +I  +N+     WKA +N      +  +   K
Sbjct: 5   VLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINK-LNTTWKAGQNFHHIAKDDRLAHVK 63

Query: 69  HLLGVK-PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
            + G    TP  L L  P K  +    LP SFD+R+ WP C T+  + DQG CGSCWAFG
Sbjct: 64  MMCGTYLNTPPELRL--PEKKMEPLKDLPASFDSRTQWPNCPTLKEVRDQGACGSCWAFG 121

Query: 128 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           AVEA+SDR CI      N+ +S  DL +CC   CG+GC+GG+P +AW Y+   G+VT   
Sbjct: 122 AVEAMSDRICIKSQGKENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKRDGLVTGGQ 180

Query: 183 ----EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAY 230
               + C PY     C H       P  +   PTPKC   C    N  +   KHY +SAY
Sbjct: 181 YNSHQGCQPY-TIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAY 239

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            ++   E IM EI  NGPVE +FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT ++
Sbjct: 240 SVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-EN 297

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G+DYW++AN WN  WG  G+FKI RG +ECGIE  + AG P
Sbjct: 298 GDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 187/321 (58%), Gaps = 24/321 (7%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           + +D     D +I+ +N      W+A RN  + +      + LLGV P      L   ++
Sbjct: 26  VPVDMDNFPDKMIEYINY-LNTTWQAGRNLGYEDPRY--VRTLLGVHPNNHKYRL-PEIE 81

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LS 145
               ++++P  FD+R  W  C TI  I DQG CGSCWAFGAVEA+SDR CIH G    + 
Sbjct: 82  IDTSNVQIPDHFDSRHRWHDCPTIREIRDQGSCGSCWAFGAVEAMSDRHCIHSGAKNIVH 141

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 196
           L+ +D+L+CC   CG GC+GG+P +AW Y+VH G+VT       E C PY     C H  
Sbjct: 142 LAADDVLSCC-MSCGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHV 199

Query: 197 -----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                P  +   PTP+CVR C K  N  + + KHY   +Y + S+   I  EI  NGPVE
Sbjct: 200 NGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNVTQIQVEIMTNGPVE 259

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
             FTVY DF  YKSGVY+  T   +GGHA++L+GWG  + G  YW+ AN WN  WG  G+
Sbjct: 260 ADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGV-EKGVPYWLAANSWNTEWGDKGF 318

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           FKI RGS+ECGIE+DVVAG+P
Sbjct: 319 FKILRGSDECGIEDDVVAGIP 339


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 147/336 (43%), Positives = 192/336 (57%), Gaps = 26/336 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLG 72
           L  F+    G+ S   + +  L D  I  +N + +  W+A RN  F+  T  ++ K L G
Sbjct: 4   LIPFSLLICGIFSA-SIPTDPLSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYLKSLAG 59

Query: 73  V--KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
              K T  G  L  P++     + LP  FDAR  WP CSTI  I DQG CGSCWAFGAVE
Sbjct: 60  GVHKNTKNGFTL--PIRDVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGAVE 117

Query: 131 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 182
           A+SDR CIH    + + LS  +LL+CC   CGDGC GG P SAW Y+   G+V+      
Sbjct: 118 AMSDRLCIHSNGKLQVHLSAENLLSCCD-SCGDGCLGGSPESAWEYWHKFGIVSGGNYGS 176

Query: 183 -EECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 235
            + C PY       S   S P C     TPKC ++C K   + +  + +Y    Y I +D
Sbjct: 177 KQGCQPYSIAPCEHSIHGSSPACGGVTDTPKCKKQCEKGYSIPYDKAFYYGQPGYAIPND 236

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I AEI KNGP+  SF VYED   YK GVY+H+ G+ +GGH +K+ GWG  ++G  YW
Sbjct: 237 AQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGI-ENGTPYW 295

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN WN  WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 296 LVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAGLP 331


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 141/334 (42%), Positives = 184/334 (55%), Gaps = 24/334 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           +TC    A  V +     S  L D  I  +N      WKA RN       V   + L+G 
Sbjct: 6   VTCLLLCAFAVTAD---SSEPLSDDFINLINSKQDT-WKAGRNFPVDT-PVKHIQKLMGT 60

Query: 74  KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
               +   L       D    LP++FD R  WP C T++ + DQG CGSCWAFGAVEA++
Sbjct: 61  LKDDRFTTLVTLQHEVDLIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMT 120

Query: 134 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEE 184
           DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V       T+ 
Sbjct: 121 DRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQG 179

Query: 185 CDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPE 237
           C PY +   C H  PG    C     TPKC++KC    N  ++  KHY    Y +    +
Sbjct: 180 CRPY-EIPPCEHHVPGNRLPCSGDTKTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGED 238

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
            I AE+YKNGPVE +FTVY D   YKSGVYKH+ GD +GGHA+K++GWG  ++G  YW++
Sbjct: 239 HIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGV-ENGNKYWLI 297

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 298 ANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 137/316 (43%), Positives = 185/316 (58%), Gaps = 30/316 (9%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG--VPVKTH-DK 91
           L + ++  +N+   + WKA  N  F N      + L G       +L G  +PVK     
Sbjct: 25  LSNEMVNHINK-VNSTWKAGLN--FQNVDYSYLRRLCGT------MLKGPKLPVKLQFTA 75

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            ++LP  FDAR  WPQC T+  + DQG CGSCWAFGA EA+SDR CIH    MN+ +S  
Sbjct: 76  DVQLPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAE 135

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSH 196
           DLL+CC   CG GC+GGYP +AW ++   G+V+         C PY          G   
Sbjct: 136 DLLSCCDS-CGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRP 194

Query: 197 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P       TP+C +KC       +   KHY   +Y ++   ++I  EIYKNGPVE +FTV
Sbjct: 195 PCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTV 254

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           YEDF  YK+GVY+H+TG  +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI R
Sbjct: 255 YEDFLLYKTGVYQHVTGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDWGDNGFFKILR 313

Query: 316 GSNECGIEEDVVAGLP 331
           GS+ CGIE ++VAG+P
Sbjct: 314 GSDHCGIESEIVAGIP 329


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 136/330 (41%), Positives = 187/330 (56%), Gaps = 27/330 (8%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 83
           + +  + D H+L +  ++ V    K  W   RN   S  +    + L+GV P      L 
Sbjct: 12  IAAATEDDPHMLSEEFMELVRGKAKT-WTVGRNFDAS-VSEHHIRGLMGVHPDAHKFTLP 69

Query: 84  VPVKTHDKSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +     ++     LP+ FDAR+AWP C TI  I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 70  EKSQVLGNLMEADGGDLPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCI 129

Query: 139 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF 189
           H    +N   S +DL++CC   CG GC+GG+P +AW Y+ H G+V+       E C PY 
Sbjct: 130 HSNATVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPY- 187

Query: 190 DSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAE 242
           +   C H      P C     TP+C+ KC     + +   KH+   AY +N +P DI  E
Sbjct: 188 EVEPCEHHVNGTRPPCHSG-STPRCMHKCESGYSVDYAKDKHFGAKAYSVNRNPLDIQRE 246

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQW 301
           I  NGPVE +FTVYED   YK+GVY+H+ G  +GGHA++++GWG   D+   YW++ N W
Sbjct: 247 IMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWLIGNSW 306

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 307 NTDWGDNGFFRILRGEDHCGIESAISAGLP 336


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 138/298 (46%), Positives = 180/298 (60%), Gaps = 25/298 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA  N  F N      + L G     KG  L V V+ +   LKLP+ FDAR  WP C T
Sbjct: 40  WKAGHN--FHNVDYSYIQRLCGT--MLKGPKLPVMVQ-YTGDLKLPEEFDAREQWPNCPT 94

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 168
           +  I DQG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG GC+GGYP
Sbjct: 95  LKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLTCC-MSCGMGCNGGYP 153

Query: 169 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-V 213
            +AW ++   G+V+         C PY  +  C H      P C      TP+C+ KC  
Sbjct: 154 SAAWDFWTKEGLVSGGLYDSHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEA 212

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
                ++  KH+  ++Y + SD E I +EI+KNGPVE +F VYEDF  YKSGVY+H++G 
Sbjct: 213 GYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGS 272

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            +GGHA+K++GWG  +DG  YW+ AN WN  WG +G+FK  RGS+ CGIE +VVAG+P
Sbjct: 273 AVGGHAIKILGWGV-EDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIP 329


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 191/336 (56%), Gaps = 36/336 (10%)

Query: 25  VSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTP--- 77
           ++ L L+ H   IL D  ++ V +  K  W   RN  F   T +  ++ L+GV P     
Sbjct: 10  LALLALNVHGDDILSDRFMEIVRQKAKT-WTVGRN--FHKLTPMSHYRQLMGVHPDAHYY 66

Query: 78  ----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
               K ++L         +  +PK FD+R+ WP C TI  I DQG CGSCWAFGAVEA+S
Sbjct: 67  ALPDKRMVLREEELVGLGNDMIPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMS 126

Query: 134 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 191
           DR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y+V  G+V+    PY  S
Sbjct: 127 DRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWGYWVRKGIVSG--GPYGSS 183

Query: 192 TGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 235
            GC              + P CE  Y  TP+C  KC    ++ ++  KH+   AY I+ +
Sbjct: 184 QGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKN 243

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
             DI  EI  NGPVE +FTVYED   YK GVY+H+ G  +GGHA+++IGWG   D   YW
Sbjct: 244 VRDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-TPYW 302

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN WN  WG +G+FKI RG + CGIE  + AGLP
Sbjct: 303 LIANSWNTDWGNNGFFKILRGKDHCGIESSISAGLP 338


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  250 bits (638), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 191/339 (56%), Gaps = 23/339 (6%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +  L CF +   G+    + +  +  + +   +N N +  WKA RNP F        + +
Sbjct: 17  LFSLPCFYSTVFGIPFGSR-NQRLYFNKMATYIN-NLQTTWKAGRNPYFETVPSHVIQGM 74

Query: 71  LGVKPTPKGLLLGVP---VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
           +GV+ + K     +P   +      +++P  FD+R  WP C TI  I DQ +CGSCWAFG
Sbjct: 75  MGVRRSSKLETNSIPLPVISYEHIDMEIPVEFDSRKQWPYCPTIGEIRDQSNCGSCWAFG 134

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           AVEA+SDR CI         +S  DLL+CC  +CG GC GG P  AW ++V +G+VT   
Sbjct: 135 AVEAISDRICIATDGRQKPHISSTDLLSCCK-ICGFGCQGGDPHQAWSFWVKYGLVTGGN 193

Query: 183 ----EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYR 231
               + C PY        S G   P      PTP C + C    ++  N  K+Y + AY 
Sbjct: 194 YTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLKAYS 253

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
           +++   D+  E+  NGP+EV+F VYEDF  YK+GVY+H TG V+GGHAV+L+GWG  ++G
Sbjct: 254 LHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWG-EENG 312

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
             YW+LAN WN  WG  G+FKI RG NECGIE + VAGL
Sbjct: 313 VPYWLLANSWNTEWGDKGFFKIYRGRNECGIESEAVAGL 351


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 142/314 (45%), Positives = 184/314 (58%), Gaps = 25/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   II  VN      WKA  N  F   TV   K L GV   P    L  P+K H+ + +
Sbjct: 24  LTQEIIDYVN-TIDTTWKAGWN--FQGATVSYVKGLCGVIRDPNNHKL--PLKLHELNAQ 78

Query: 95  -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDL 151
            +P +FD+R+ W  C TI  + DQG CGSCWA  AVEA+SDR C+   G  ++ +S  DL
Sbjct: 79  DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAEDL 138

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
            +CC   CG+GC+GG+P +AW Y+   G+VT       + C PY +   C H      P 
Sbjct: 139 NSCCKS-CGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHINGSRPA 196

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTP+C + C    N  +   KHY+ +AY ++S  + I  EI  NGPVE +FTVY 
Sbjct: 197 CGKLEPTPRCKKSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYA 256

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF HYKSGVY+H +G  +GGHAVK+IGWGT +    YW++AN WN  WG  G+FKI RG 
Sbjct: 257 DFPHYKSGVYQHESGAELGGHAVKMIGWGT-EGSTPYWLIANSWNTDWGNMGFFKILRGQ 315

Query: 318 NECGIEEDVVAGLP 331
           +ECGIE D+VAG P
Sbjct: 316 DECGIERDIVAGEP 329


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 143/339 (42%), Positives = 192/339 (56%), Gaps = 23/339 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + EVN   K  W A+ +  +  +  ++G
Sbjct: 9   LCLVAVFALLLATTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASADNGYLVTGKSLG 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE+
Sbjct: 129 IAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATED 187

Query: 185 CDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
           C PY FD   CSH G    YP        TPKC   C +        K+   ++Y +  +
Sbjct: 188 CQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTCERSEM--DLVKYKGSTSYSVKGE 243

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            E +M E+  NGP+E++  VY DF  YKSGVYKH+ G+ +GGHAVKL+GWGT  DG  YW
Sbjct: 244 KE-LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGT-QDGVPYW 301

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
            +AN WN  WG  GYF I+RG+NEC IE   VAG+P+ +
Sbjct: 302 KVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 136/326 (41%), Positives = 181/326 (55%), Gaps = 24/326 (7%)

Query: 27  KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 86
           ++ ++  +L+   + +     +  + A     FS+Y     K L+G K         V  
Sbjct: 27  EIPVEVQMLRGQELVDYINKKQTTFTAKLGAYFSDYPDTIKKQLMGAKMVEIPEEYRVFE 86

Query: 87  KTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
             H + L   +P SFD+R+ WP C +IS+I DQ  CGSCWA  A E +SDR CI      
Sbjct: 87  MEHPEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQT 146

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 199
            +S+S +D+ ACCG  CG+GC+GGYPI AWR++V +G VT     Y + TGC    +P C
Sbjct: 147 QVSISADDINACCGMACGNGCNGGYPIEAWRHYVKNGYVTG--GSYQEKTGCKPYPYPPC 204

Query: 200 E-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 245
           E               YPT KC R C     L ++   H+  SAY ++    +I  EI  
Sbjct: 205 EHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMT 264

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPVEV+FTVY DF  Y  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  W
Sbjct: 265 NGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLP 331
           G +GYF+I RG NECGIE  VV G+P
Sbjct: 324 GENGYFRIIRGVNECGIEHGVVGGIP 349


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 143/316 (45%), Positives = 191/316 (60%), Gaps = 32/316 (10%)

Query: 38  SIIKEVNENPKAGWKAA-RNPQFSNYTVGQFKHLLGVKPTPKGLLLG---VPVKTHDKSL 93
           SI + VN + +  W+A   + +F   T    + L G       LL G   +PVK  +   
Sbjct: 32  SIAERVN-SLQTTWRATPSSKRFEGVTENYVRSLCGT------LLHGGPTLPVKEIEVPA 84

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 153
            +P +FDAR  WP C TI  + DQG CGSCWAFGAVEA+SDR+CI F   +++S  +LL+
Sbjct: 85  VIPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVNISAENLLS 144

Query: 154 CCGFLCGDGCDGGYPISAWRY----FVHHGVVT-------EECDPYFDSTGCSH--PG-- 198
           CC   CG GCDGGYP +AWR+     ++ G+VT         C PY     C H  PG  
Sbjct: 145 CCE-TCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPY-TIPKCDHHEPGPY 202

Query: 199 --CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
             C  +  TP C R C+   ++ +R+ KHY  ++Y I+SD   I  EI  NGPVE +F+V
Sbjct: 203 ENCSGSQSTPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSV 262

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF  Y SGVY+H TG  +GGHA+K++GWGT ++G  YW++AN WN SWG  G+FKI R
Sbjct: 263 YADFPTYTSGVYQHTTGSFLGGHAIKILGWGT-ENGVPYWLVANSWNPSWGDSGFFKIIR 321

Query: 316 GSNECGIEEDVVAGLP 331
           G +ECGIE  +VAG+P
Sbjct: 322 GKDECGIESSIVAGMP 337


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 148/370 (40%), Positives = 201/370 (54%), Gaps = 59/370 (15%)

Query: 8   MDPILCLTCFATFA--------EGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKA 53
           M  +L L+C A           E  + K +   +D    +   D +I  VN N    W+A
Sbjct: 1   MKTLLLLSCLAVAVYCGCNDNVESTLDKFRNREIDDEAAELDGDELINYVNNNQDL-WRA 59

Query: 54  ARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDA 101
            +  +F++        + G     K  L+GV              KT D  + +P++FD+
Sbjct: 60  KKQRRFTS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDMDIPENFDS 111

Query: 102 RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLC 159
           R  WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + +SLS +DLL+CC   C
Sbjct: 112 RENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSC 170

Query: 160 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAY 203
           G GC+GG P++AWRY+V  G+VT     Y  ++GC     P CE               Y
Sbjct: 171 GFGCNGGDPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLY 228

Query: 204 PTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           PTPKC +KC+    ++ +   K Y  SAY +  D E I  E+  +GP+E++F VYEDF +
Sbjct: 229 PTPKCEKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLN 288

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           Y  GVY H  G + GGHAVKL+GWG  ++G  YW  AN WN  WG DG+F+I RG +ECG
Sbjct: 289 YDGGVYVHTGGKLGGGHAVKLVGWGI-ENGIPYWTCANSWNTDWGEDGFFRILRGVDECG 347

Query: 322 IEEDVVAGLP 331
           IE  VV G+P
Sbjct: 348 IESGVVGGVP 357


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/338 (41%), Positives = 192/338 (56%), Gaps = 27/338 (7%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGL 80
           +VSK+  ++  L    +       +  WKA  N +F NY+      L+GV   + + K  
Sbjct: 8   IVSKISHEAEKLTGYALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGVNNVRLSVKAK 67

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-- 138
               P + +D  + +P++FDAR  W QC+++  I DQ  CGSCWAFGAVEA+SDR CI  
Sbjct: 68  KNLSPTRFYD--IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIAS 125

Query: 139 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS 191
           +  + +SLS +DLL+CC   CG GCDGG P++AW+Y+V  G+VT       + C PY   
Sbjct: 126 NGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PF 183

Query: 192 TGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
             C H        P     YPTPKC +KC  +   + +   K +  +AY +  D   I  
Sbjct: 184 PPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQK 243

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           EI  +GPVEV+F VYEDF  Y  G+Y H  G + GGHAVK++GWG  + G  YW++AN W
Sbjct: 244 EILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSW 302

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 339
           N  WG DG+F+I RG +ECGIE  VV GLP      K+
Sbjct: 303 NTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 340


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 182/315 (57%), Gaps = 25/315 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 93
           L +  I  +N  PK  W A RN   +N      K L+G        +L +P  THD  L 
Sbjct: 26  LSEDFINILNSKPKT-WTAGRNFP-ANTPFAHIKMLMGALKDDN--ILKLPKMTHDAELI 81

Query: 94  -KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP++FD R  WP C T++ I DQG CGSCWAFGAVEA++DR C +     +   S  D
Sbjct: 82  ASLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAED 141

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH--PG--- 198
           LL+CC  +CG GC+GG P  AW Y+ H G+V       T+ C PY +   C H  PG   
Sbjct: 142 LLSCCP-ICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPY-EVPPCEHHVPGNRL 199

Query: 199 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    N  ++  KHY    Y ++ + ++I AE++KNGPVE +FTVY
Sbjct: 200 PCNGDTKTPKCQKTCEAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVY 259

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            D   YKSGVY+H  G  +GGHAVK++GWG  ++G  YW++AN WN  WG +G+FKI RG
Sbjct: 260 SDLLSYKSGVYQHTDGSALGGHAVKILGWGV-ENGSKYWLIANSWNSDWGDNGFFKILRG 318

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE  +V G P
Sbjct: 319 EDHCGIESSIVTGEP 333


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 187/323 (57%), Gaps = 24/323 (7%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           VS+ +     L   ++  +N+     WKA  N  F N      + L G     KG  L +
Sbjct: 15  VSQARPRLKPLSSEMVNYINK-VNTTWKAGHN--FHNVDFSYVQRLCGT--MLKGPKLPI 69

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
            V+ +   +KLPK+FD+R  WP C T+  I DQG CGSCWAFGA EA+SDR CIH    +
Sbjct: 70  MVQ-YAGDMKLPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKV 128

Query: 145 S--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 189
           S  +S  DLL CC   CG GC+GGYP +AW ++   G+V+         C PY       
Sbjct: 129 SVEISAEDLLTCCD-SCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH 187

Query: 190 DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
              G   P       TP+C+ +C       +R  KHY  ++Y + SD  +I  EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGP 247

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +
Sbjct: 248 VEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWG-EENGVPYWLCANSWNTDWGDN 306

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G+FK  RGS+ CGIE ++VAG+P
Sbjct: 307 GFFKFLRGSDHCGIESEIVAGIP 329


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 126/259 (48%), Positives = 163/259 (62%), Gaps = 19/259 (7%)

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
           +L  P      ++K+P +FDAR+ WPQC +I+ I DQ  CGSCWAFGAVEA+SDR CI  
Sbjct: 1   MLAGPPDFDYPNVKIPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIAS 60

Query: 141 GMNL--SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-- 196
              +   LS  D+L+CC   CG GC+GG+P  AWR+F  HG+ TE   PY     C H  
Sbjct: 61  NGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVFPP-CEHHI 119

Query: 197 -----PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
                  C P+ PTPKCVR   KK       +++  S Y ++  P  I AEI  NGPVE 
Sbjct: 120 NKTHYKPCGPSQPTPKCVRASEKK------PRYHGKSVYSVS--PAKIQAEIMTNGPVEA 171

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +FTVY+DF  Y+SGVY+H++G  +GGHA+K++GWG  + G  YW++AN WN  WG  G F
Sbjct: 172 AFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGV-EAGNKYWLVANSWNEDWGDKGTF 230

Query: 312 KIKRGSNECGIEEDVVAGL 330
           KI RG +ECGIE  VVAG+
Sbjct: 231 KIARGDDECGIESSVVAGM 249


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 184/318 (57%), Gaps = 27/318 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           H L D+ I+ +N      W+A RN  F   T       L+G        +  +P   HD 
Sbjct: 23  HPLSDAFIRLINSKQNT-WRAGRN--FPTTTPFAHINKLMGALQDDN--VAKMPKVEHDA 77

Query: 92  SL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
            L   LP++FD R  WP C T++ I DQG CGSCWAFGAVEA++DR+C +     +   S
Sbjct: 78  DLIASLPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFS 137

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH--PG 198
             DLL+CC  +CG GC+GG P  AW Y+ H G+V       T+ C PY +   C H  PG
Sbjct: 138 SEDLLSCCP-ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPY-EIPPCEHHVPG 195

Query: 199 ----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
               C     TPKC + C    N +++  K Y    Y +++  + I AE+YKNGPVE +F
Sbjct: 196 NRMPCSGDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAF 255

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVY D   YKSGVYKHI GD +GGHA+K++GWG  +D + YW++AN WN  WG +G+FKI
Sbjct: 256 TVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK-YWLVANSWNTDWGDNGFFKI 314

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG N CGIE  ++AG P
Sbjct: 315 LRGENHCGIEGSIIAGEP 332


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 185/316 (58%), Gaps = 27/316 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARN--PQFS-NYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           IL    I  +NE  +  WKA RN  P+ S NY     + L+GV P  K  L   P+ +  
Sbjct: 25  ILSSEYIHSINEASEI-WKAGRNFHPETSSNY----LRSLMGVLPNHKDHLP-PPLPSLL 78

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 150
            +  LP  FDAR  WP C +I  I DQG CGSCWAFGA EA+SDR CIH   N+++S  +
Sbjct: 79  GTEALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNVNISAEN 138

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 197
           LL+CC + CG GC+GG+P +AW+Y+   G+V+         C PY D   C H       
Sbjct: 139 LLSCC-YSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPCEHHVNGTRQ 196

Query: 198 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTV 255
            C     TPKC R C  +N      K  S   S+Y I SDP+ I  EI  NGPVE +F+V
Sbjct: 197 PCAEGGRTPKCHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSV 256

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           Y DF + KSGVY+H+ G ++GGHA++++GWG  + G  YW++AN WN  WG  G FKI R
Sbjct: 257 YSDFMNDKSGVYRHVKGSLLGGHAIRILGWGV-EKGTPYWLVANSWNTDWGDKGTFKILR 315

Query: 316 GSNECGIEEDVVAGLP 331
           GS+ CGIE  VV GLP
Sbjct: 316 GSDHCGIEGSVVTGLP 331


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 145/334 (43%), Positives = 194/334 (58%), Gaps = 38/334 (11%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNY---TVGQFK-HLLGVKPTPKGLLLGVPVKTH--- 89
           D +I  +N+N    W A +  +F++    T  + K  L+GV      + L V  K H   
Sbjct: 44  DELIDYINDNQNL-WTAKKQKRFTSVYGETDDKAKWGLMGVNH----VRLSVKGKQHLSK 98

Query: 90  --DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 145
             D  L +P+SFD+R  WP+C +I  I DQ  CGSCWAFGAVEA+SDR CI  H  + +S
Sbjct: 99  TKDLDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVS 158

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-- 200
           LS +DLL+CC   CG GC+GG P++AWRY+V  G+VT     Y  ++GC     P CE  
Sbjct: 159 LSADDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHH 215

Query: 201 -----------PAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
                        YPTPKC +KC+    ++ +   K Y  SAY +  D E I  E+  +G
Sbjct: 216 SKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHG 275

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           P+E++F VYEDF +Y  GVY H  G + GGHAVKLIGWG  +DG  YW  AN WN  WG 
Sbjct: 276 PLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIPYWTCANSWNTDWGE 334

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 341
           DG+F+I RG +ECGIE  VV G+P   ++   ++
Sbjct: 335 DGFFRILRGVDECGIESGVVGGIPKLNSVSSRLS 368


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/350 (40%), Positives = 195/350 (55%), Gaps = 25/350 (7%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           M+  K++   ++ +  F   +E ++ K   +  +  D ++  VN+     + A  +P+FS
Sbjct: 1   MKVVKVLCTVLVAVAAFVPQSERILGK---NVELTGDDLVDYVNKAQNL-FTAKLSPRFS 56

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQG 118
            Y     + L+G K         V   THD      +P SFD+R+ WP C +I  I DQ 
Sbjct: 57  EYPTAIKRRLMGSKYVAIPSKYRVNEVTHDDIDDSAIPSSFDSRTQWPNCPSIKSIRDQS 116

Query: 119 HCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 176
            CGSCWAFGA EA++DR CI     +  ++S +DLL+CC   CG GCDGG+P +AW Y+V
Sbjct: 117 SCGSCWAFGAAEAMTDRICIASKGAIQFTVSADDLLSCCD-ECGFGCDGGFPYAAWNYWV 175

Query: 177 HHGVVT-------EECDPY------FDSTGCS-HPGCEPAYPTPKCVRKCVKK-NQLWRN 221
             G+V+         C PY        + G   HP  +  YPT  C  KC       + N
Sbjct: 176 EKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTN 235

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            K Y   AY + +  + I  EI  +GPVEV++ VYEDF HY  G+YKH  G  +GGHAVK
Sbjct: 236 DKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVK 295

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +IGWGT ++G  YWI +N WN  WG +G+F+I RG++ECGIE  VVAGLP
Sbjct: 296 MIGWGT-ENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVVAGLP 344


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 146/356 (41%), Positives = 199/356 (55%), Gaps = 41/356 (11%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWK---AARNPQFSNYTVGQFKHL 70
           L C    A+   ++ +   H L D ++  VN+     W+    A +  F N  V   K L
Sbjct: 8   LCCLLALAD---ARSRPSFHPLSDELVNYVNKR-NTTWQVGCGAASYNFYNVDVSYLKRL 63

Query: 71  LGVKPTPKGLLLGVPVK----THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC--W 124
            G         LG P      T  + L LP+SF AR  WPQC TI     Q   G    W
Sbjct: 64  CGT-------FLGGPKPPQRVTFTEDLNLPESFYAREQWPQCPTIXXXRAQPGRGGLTRW 116

Query: 125 -----AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVH 177
                AFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++  
Sbjct: 117 GSFLQAFGAVEAISDRICIHTNAHISVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 176

Query: 178 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 224
            G+V+         C PY           S P C     TPKC + C    +  ++  KH
Sbjct: 177 KGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKH 236

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+HITG++MGGHA++++G
Sbjct: 237 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILG 296

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           WG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 297 WGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 351


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/306 (44%), Positives = 176/306 (57%), Gaps = 21/306 (6%)

Query: 43  VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 100
           V+    A W  A  P+   +  G  + +      P+      P  +H+      +PK+FD
Sbjct: 28  VDSETGAKWIYAEPPE--TFRQGNLQLMFRAIREPEEQRSKRPTVSHESLGDENIPKTFD 85

Query: 101 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFL 158
           AR  WP C TI +I DQ  CGSCWAFGAVEA+SDR CIH       SLS  DL++CCG+ 
Sbjct: 86  AREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLVSCCGY- 144

Query: 159 CGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPT 205
           CG GC GGYP +AW ++  +G+VT   + DP     +    CSH G +         Y T
Sbjct: 145 CGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSKKYPPCPHRIYDT 204

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           PKCV KC   N  +   K  +   Y +      IM EI  NGPVE +F VYEDF  YK G
Sbjct: 205 PKCVPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQG 264

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY H TG+ +GGHA++++GWG  ++G  YW++AN WN  WG DGYFK+ RG NECGIE++
Sbjct: 265 VYFHSTGEFIGGHAIRILGWG-EENGTPYWLIANSWNEGWGEDGYFKMLRGKNECGIEDE 323

Query: 326 VVAGLP 331
           V AGLP
Sbjct: 324 VTAGLP 329


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 182/313 (58%), Gaps = 24/313 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   ++  +N+     WKA  N  F +      + L G     KG  L + V+ +   LK
Sbjct: 25  LSKEMVNYINKM-NTTWKAGHN--FRDVDYSYVRRLCGT--MLKGPKLPIMVQ-YAGGLK 78

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLL 152
           LP  FD+R  WP+C T+  I DQG CGSCWAFGA EA+SDR CIH G  +S+ ++  DLL
Sbjct: 79  LPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLL 138

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 200
            CC   CG GC+GGYP +AW ++   G+V+         C PY           S P C 
Sbjct: 139 TCCD-ACGMGCNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCS 197

Query: 201 -PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TPKCV  C    +  +   KHY  S+Y + +  E I AEI +NGPVE +F VYED
Sbjct: 198 GEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYED 257

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H TG  +GGHA+K++GWG  +DG  YW+ AN WN  WG +G+FKI RGS+
Sbjct: 258 FVMYKSGVYQHTTGSALGGHAIKVLGWG-EEDGVPYWLCANSWNTDWGENGFFKILRGSD 316

Query: 319 ECGIEEDVVAGLP 331
            CGIE ++VAG+P
Sbjct: 317 HCGIESEIVAGIP 329


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 192/342 (56%), Gaps = 27/342 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +I   + CL     FA   V+   LD   L D  I  +N    + WKA RN   S+    
Sbjct: 1   MIYSSVTCLL-LCAFA---VTADTLDP--LSDDFINLINSKQDS-WKAGRNFP-SDTPFK 52

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
             K L+G     +   L       +    LP++FD R  WP C T++ + DQG CGSCWA
Sbjct: 53  HIKKLMGTLRDDRFTTLVTMQHEVELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWA 112

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 182
           FGAVEA++DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+ 
Sbjct: 113 FGAVEAMTDRICTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSG 171

Query: 183 ------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                 + C PY +   C H  PG    C     TPKCV++C    ++ ++  KHY    
Sbjct: 172 GSYNSSQGCRPY-EIPPCEHHVPGNRLPCSGDTKTPKCVKECESGYKVPYKQDKHYGKHV 230

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +    + I AE+YKNGPVE +FTVY D   YKSGVYKH+TGD +GGHA+K++GWG  +
Sbjct: 231 YSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGV-E 289

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 290 NGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 175/312 (56%), Gaps = 25/312 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLP 96
           + + V+    A W + R P+   +  G   H+ G K   +      P   HD   +++LP
Sbjct: 30  VREHVHSITGARWISGRLPK--RFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLP 87

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 154
           K+FDAR  WP CS+IS I DQ  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+C
Sbjct: 88  KNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSC 147

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----------- 200
           C   CG GC GGYP  AW Y+  HG+VT       D +GC     P CE           
Sbjct: 148 CK-DCGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCP 204

Query: 201 -PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              YPTP+CV++C   +  +   K  +  +Y I +    IM EI   GPVE  FT+YEDF
Sbjct: 205 RELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDF 264

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y SGVY H  G  M GHAV+++GWG   +   YW++AN WN  WG +GY K  RG NE
Sbjct: 265 LRYSSGVYFHALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNE 323

Query: 320 CGIEEDVVAGLP 331
           CGIE+DV AGLP
Sbjct: 324 CGIEDDVTAGLP 335


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 182/314 (57%), Gaps = 25/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   II  VN +    WKA  N  F   TV   K L GV   P    L  P+K H+ + +
Sbjct: 24  LTQEIIDYVN-SIDTTWKAGWN--FQGATVSYVKGLCGVIRDPNNHKL--PLKLHELNAQ 78

Query: 95  -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
            +P +FD+R+ W  C TI  + DQG CGSCWA  A EA+SDR C+  +  + + LS  +L
Sbjct: 79  DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENL 138

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           +ACC   CG GC GG+P +AW Y+   G+VT       + C PY +   C H      P 
Sbjct: 139 MACCE-TCGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPY-EIAPCEHHINGSRPA 196

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C    PTP+C + C    N  +   KHY+ SAY ++S  + I  EI  NGPVE +FTVY 
Sbjct: 197 CGKIEPTPRCKKTCESGYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYA 256

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF HYKSGVY+H +G  +GGHAVK+IGWG  +    YW++AN WN  WG  G+FKI RG 
Sbjct: 257 DFPHYKSGVYQHESGAELGGHAVKMIGWGM-EGSTPYWLIANSWNSDWGDMGFFKILRGQ 315

Query: 318 NECGIEEDVVAGLP 331
           +ECGIE D+VAG P
Sbjct: 316 DECGIERDIVAGEP 329


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/343 (40%), Positives = 196/343 (57%), Gaps = 25/343 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN-YTV 64
           L++  +L  +C    +     +  +    L D  I  +N +    WKA RN  F N   +
Sbjct: 7   LLLTAMLLFSCMQFTSSVPPPEPSVLVDPLSDDFIDHIN-SLNTTWKAHRN--FGNDIPL 63

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
            + K L+GV+ + +   L  P K+  D  +++P+ FD R  WP+C T+  I DQG CGSC
Sbjct: 64  REIKKLMGVRRSLENFRL--PEKSMEDIDIEIPEEFDPREQWPECPTLKEIRDQGSCGSC 121

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGAVEA+SDR CIH     +   S  DLL CC   CG GC+GG P +AW Y+V  G+V
Sbjct: 122 WAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSS-CGFGCNGGEPGAAWDYWVSTGIV 180

Query: 182 T-------EECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSIS 228
           +       + C PY     C H       P     TP+CV++C +   + +   +H+  S
Sbjct: 181 SGGSYNSHQGCQPYAIEP-CEHHVNGTRKPCGEGDTPRCVKRCEEGYDVPYGKDRHFGKS 239

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           AY +    + I  E+  NGP E + TVY+DF HY++GVY+H++G  +GGHAV+L+GWG  
Sbjct: 240 AYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGV- 298

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +DG  YW+LAN WN  WG +GYF+I RG +ECGIE D+  GLP
Sbjct: 299 EDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIESDINGGLP 341


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 139/327 (42%), Positives = 189/327 (57%), Gaps = 27/327 (8%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLL 82
           V++  K   + L D  I  +N    + WKA RN P+ +++     K ++GV         
Sbjct: 14  VLAAAKDLPYPLSDEFINTINLKQNS-WKAGRNFPRDTSFA--HLKKIMGVIEDEH--FA 68

Query: 83  GVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
            +P+KTH   L   LP++FD R  WP C T++ + DQG CGSCWAFGAVEA++DR C + 
Sbjct: 69  TLPIKTHKIDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 128

Query: 141 G--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS 191
               +   S  DLL+CC  +CG GC GG P  AW Y+ H G+V+       + C PY + 
Sbjct: 129 NGTKHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EI 186

Query: 192 TGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 244
             C H  PG    C     TPKC +KC     + ++  K Y    Y ++ D + I AE++
Sbjct: 187 PPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELF 246

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
           KNGPVE +FTVY D   YKSGVYKH  GD +GGHAVK++GWG  +D + YW++AN WN  
Sbjct: 247 KNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK-YWLIANSWNSD 305

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG +G+FKI RG + CGIE  +V G P
Sbjct: 306 WGDNGFFKILRGEDHCGIESSIVTGEP 332


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 189/341 (55%), Gaps = 27/341 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFK 68
           +LC       +  V  + K     L D +I  +N+     WKA +N      +  +   K
Sbjct: 5   VLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINKM-NTTWKAGQNFHHIAKDDRLAHVK 63

Query: 69  HLLGVK-PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
            + G    TP  L L  P K  +    LP +FD+R+ WP C T+  + DQG CGSCWAFG
Sbjct: 64  MMCGTYLNTPPELRL--PEKKMEPLKDLPATFDSRTQWPNCPTLKEVRDQGACGSCWAFG 121

Query: 128 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           AVEA+SDR CI      N  +S  DL +CC   CG+GC+GG+P +AW Y+   G+VT   
Sbjct: 122 AVEAMSDRICIKSQGKENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKKDGLVTGGQ 180

Query: 183 ----EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAY 230
               + C PY     C H       P  +   PTPKC   C    N  +   KHY  SAY
Sbjct: 181 YNSHQGCLPY-TIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAY 239

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            ++   E IM EI  NGPVE +FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT ++
Sbjct: 240 SVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-EN 297

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G+DYW++AN WN  WG  G+FKI RG +ECGIE  + AG P
Sbjct: 298 GDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 127/258 (49%), Positives = 158/258 (61%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P  FDAR  WP C +I  I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  DLL
Sbjct: 82  IPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query: 153 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
           +CC   F CG+GC+GGYPI AW+++  HG+VT         C PY  +       G + P
Sbjct: 142 SCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 201

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C E   PTPKCV  C   +     +   KH+  +AY +    E I  EI KNGP+EV+F
Sbjct: 202 KCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAF 261

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRI 320

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE   VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 127/253 (50%), Positives = 162/253 (64%), Gaps = 19/253 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLL 152
           +P  FD+R  WP C TI  + DQG CGSCWAFGAVEA+SDR+CI     +   +S  DLL
Sbjct: 4   VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGC 199
           +CC   CG GC+GGYP SAW ++   G+VT       + C PY     C H        C
Sbjct: 64  SCC-ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPY-KIAACDHHVVGKLKPC 121

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           +   PTPKC RKC    N  + + KH+  SAY + SDP +I  EI  NGPVE +FTVY D
Sbjct: 122 KGDSPTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYAD 181

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H +G  +GGHA+K++GWG  ++G  YW++AN WN  WG +G+FKIKRG++
Sbjct: 182 FPTYKSGVYQHTSGSALGGHAIKILGWG-EENGTPYWLVANSWNSDWGDEGFFKIKRGND 240

Query: 319 ECGIEEDVVAGLP 331
           ECGIE  +V GLP
Sbjct: 241 ECGIESGIVGGLP 253


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 127/258 (49%), Positives = 158/258 (61%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P  FDAR  WP C +I  I DQ  CGSCWAF A EA+SDR CI  +  +N  LS  DLL
Sbjct: 82  IPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query: 153 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
           +CC   F CG+GC+GGYPI AW+++  HG+VT         C PY  +       G + P
Sbjct: 142 SCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 201

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C E   PTPKCV  C   +     +   KH+  +AY +    E I  EI KNGP+EV+F
Sbjct: 202 KCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAF 261

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVYEDF  Y +GVY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I
Sbjct: 262 TVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRI 320

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE   VAG+P
Sbjct: 321 IRGLNECGIEHSAVAGIP 338


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 137/317 (43%), Positives = 183/317 (57%), Gaps = 24/317 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDK 91
           H L    I ++N      WKA   P FS  T   F + L+GV       +  V +   + 
Sbjct: 28  HPLSQKFIDQINSKATT-WKAG--PNFSPETSMSFIRGLMGVHKDADKFMPPVYLHEMEA 84

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
               P++FD+R+ WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    ++  +S  
Sbjct: 85  DDDFPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSE 144

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 196
           DL++CC   CG GC+GG+P +AW Y+V  G+V+       + C PY  +  C H      
Sbjct: 145 DLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAP-CEHHVNGSR 202

Query: 197 PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
           P CE     TPKCV+KC    N  +   K Y  S+Y I +  + I  EI  NGPVE +FT
Sbjct: 203 PSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFT 262

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VYED  +YK GVY H+ G ++GGHA++++GWG  +DG  YW++AN WN  WG +G+FKI 
Sbjct: 263 VYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGV-EDGTKYWLIANSWNSDWGDNGFFKIL 321

Query: 315 RGSNECGIEEDVVAGLP 331
           RG +  GIE  + AGLP
Sbjct: 322 RGEDHLGIESSIAAGLP 338


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 140/313 (44%), Positives = 182/313 (58%), Gaps = 24/313 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   ++  +N+     WKA  N  F +      K L G     KG  L V V+  D  LK
Sbjct: 25  LSREMVNFINK-ANTTWKAGHN--FHDVDYSYVKRLCGT--LLKGPRLPVMVQYAD-DLK 78

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FDAR  WP C T+  I DQG CGSCWAFGA EA+SDR CIH    +S  +S  DLL
Sbjct: 79  LPTNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLL 138

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGC 199
            CC   CG GC+GGYP +AW ++   G+VT         C PY          G   P  
Sbjct: 139 TCCDG-CGMGCNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCT 197

Query: 200 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TP C   C    +  ++  KH+  ++Y + S+ +DIM E+YKNGPVE +FTVYED
Sbjct: 198 GEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYED 257

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +GYFKI RG +
Sbjct: 258 FLSYKSGVYQHVSGPALGGHAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 319 ECGIEEDVVAGLP 331
            CGIE ++VAG+P
Sbjct: 317 HCGIESEIVAGIP 329


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 138/313 (44%), Positives = 184/313 (58%), Gaps = 24/313 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   ++  +N+     W A  N  F +      K L G     KG  L V V+ + + LK
Sbjct: 25  LSHEMVNFINK-ANTTWTAGHN--FRDVDYSYVKRLCGT--FLKGPKLPVMVQ-YTEGLK 78

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLL 152
           LPK+FDAR  WP C T+  I DQG CGSCWAFGA EA+SDR CI     +S+ ++  DLL
Sbjct: 79  LPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLL 138

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGC 199
            CC   CG GC+GGYP +AW ++   G+VT         C PY          G   P  
Sbjct: 139 TCCD-SCGMGCNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCT 197

Query: 200 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TP C  KC    + L++  KH+  ++Y + S+   IMAE++KNGPVE +FTVYED
Sbjct: 198 GEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYED 257

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +GYFKI RG +
Sbjct: 258 FLLYKSGVYQHMSGSALGGHAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 319 ECGIEEDVVAGLP 331
            CGIE ++VAG+P
Sbjct: 317 HCGIESEIVAGIP 329


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 179/314 (57%), Gaps = 24/314 (7%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           + +++ +L+   + +     +  +KA     FS+Y     K L+G K         V   
Sbjct: 28  IPVEAQMLRGQELVDYVNKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEM 87

Query: 88  THDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN-- 143
           TH +     +P SFD+R+AWP C +IS+I DQ  CGSCWA  A E +SDR CI       
Sbjct: 88  THPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTI 147

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 200
           LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G VT     Y D TGC    +P CE
Sbjct: 148 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCKPYPYPPCE 205

Query: 201 -----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 246
                      P+  YPT KC R C     L ++   H+  SAY ++    +I  EI  +
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTH 265

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVEV+FTVYEDF HY  GVY H  G  +GGHAVK++GWG  D+G  YW+ AN WN  WG
Sbjct: 266 GPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324

Query: 307 ADGYFKIKRGSNEC 320
            +GYF+I RG NEC
Sbjct: 325 ENGYFRIIRGVNEC 338


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 190/336 (56%), Gaps = 36/336 (10%)

Query: 25  VSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTP--- 77
           ++ L L+ H   IL D  ++ V +  K  W   RN  F   T +  ++ L+GV P     
Sbjct: 10  LALLALNVHGDDILSDKFMEIVRQKAKT-WTVGRN--FHKLTPMSHYRQLMGVHPDAHNY 66

Query: 78  ----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
               K ++L         +  +PK FD+R  WP C TI  I DQG CGSCWAFGAVEA+S
Sbjct: 67  ALPDKRMVLREEELVGLGNNMIPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMS 126

Query: 134 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 191
           DR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y+V  G+V+    PY  S
Sbjct: 127 DRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGIVSG--GPYGSS 183

Query: 192 TGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 235
            GC              + P CE  Y  TP+C  KC    ++ ++  KH+   AY I+ +
Sbjct: 184 QGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKN 243

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
             DI  EI  +GPVE +FTVYED   YK GVY+H+ G  +GGHA+++IGWG   D   YW
Sbjct: 244 VHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-IPYW 302

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN WN  WG +G+FKI RG + CGIE  + AGLP
Sbjct: 303 LVANSWNTDWGNNGFFKILRGKDHCGIESSISAGLP 338


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 193/340 (56%), Gaps = 30/340 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ++ L    T A    SK     + L    I+E+N      W+A +N    + ++   + L
Sbjct: 7   VIALAAVGTNAAAGGSK----KYPLSSKFIEEINTKATT-WRAGQNFH-PDTSLTYIRGL 60

Query: 71  LGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +GV P         P   HD S   +LP++FD+R  WP C TI  I DQG CGSCWAFGA
Sbjct: 61  MGVHPDADKFR--EPEILHDLSDGDELPENFDSREQWPNCPTIREIRDQGSCGSCWAFGA 118

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-- 184
           VEA+SDR C+  G  ++   S  DL++CC   CG GC+GG+P +AW Y+V  G+V+    
Sbjct: 119 VEAMSDRVCVASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPF 177

Query: 185 -----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYR 231
                C PY  +  C H      P CE     TPKCV+KC +  N  ++  K +  S+Y 
Sbjct: 178 GSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASSYS 236

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
           I      I  EI  NGPVE +FTVYED  HYK GVY+H+TG ++GGHA++++GWG  ++G
Sbjct: 237 IARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGV-ENG 295

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++AN WN  WG +G+FKI RG +  GIE  + AGLP
Sbjct: 296 TKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLP 335


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 128/257 (49%), Positives = 159/257 (61%), Gaps = 20/257 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           ++  K+P SFDAR  WP C +IS I DQ  CGSCWAFG+ EA+SDR CI  H    + LS
Sbjct: 89  EEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 200
            +D+L+CC + CGDGCDGGYPISAW YFV  GVVT       + C PY +   C H   E
Sbjct: 149 ADDILSCC-YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPY-EIPPCGHHRNE 206

Query: 201 PAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
             Y        TP CV  C     + + + K +   +Y I S    I  EI   GPV  +
Sbjct: 207 TFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAA 266

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HY  G+YKH++G   GGHAV+++GWG  + G  YW++AN WN  WG +GYF+
Sbjct: 267 FIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWG-EEKGTAYWLVANSWNTDWGENGYFR 325

Query: 313 IKRGSNECGIEEDVVAG 329
           I RGSNECGIEE+VVAG
Sbjct: 326 ILRGSNECGIEENVVAG 342


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 189/338 (55%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + E+N   +  W A+ +  +  S  ++ 
Sbjct: 9   LCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE 
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C K        K+   ++Y +  + 
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +M E+  NGP+EV+  VY DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW 
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWK 302

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RGSNECGIE   VAG P+ +
Sbjct: 303 IANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTPAQE 340


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 133/313 (42%), Positives = 182/313 (58%), Gaps = 21/313 (6%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L    I ++N      W+A RN    +  +   + L+GV       +  V +   D+   
Sbjct: 25  LSGKFIDQINAKATT-WRAGRNFH-PDTPMSYIRGLMGVHKDADKFMPPVMLHDLDEGDD 82

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP++FDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    ++  +S  DL+
Sbjct: 83  LPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLV 142

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGC 199
           +CC   CG GC+GG+P +AW Y+V  G+V+       + C PY  S  C H        C
Sbjct: 143 SCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISP-CEHHVNGTRGPC 200

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                TPKCV+KC    N  +   K +  S+Y I S  + I  E++ NGPVE +FTVYED
Sbjct: 201 NGEGKTPKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVYED 260

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
             +YK GVY+H  G ++GGHA++++GWG  +D + +W++AN WN  WG +GYFKI RGS+
Sbjct: 261 LLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTK-FWLIANSWNSDWGDNGYFKILRGSD 319

Query: 319 ECGIEEDVVAGLP 331
             GIE  + AGLP
Sbjct: 320 HLGIESSIAAGLP 332


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 189/338 (55%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + E+N   +  W A+ +  +  S  ++ 
Sbjct: 14  LCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 73

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 74  EVRKLMGVTDMSTEAVPPRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 133

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE 
Sbjct: 134 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 192

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C K        K+   ++Y +  + 
Sbjct: 193 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 249

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +M E+  NGP+EV+  VY DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW 
Sbjct: 250 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWK 307

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RGSNECGIE   VAG P+ +
Sbjct: 308 IANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTPAQE 345


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/333 (42%), Positives = 191/333 (57%), Gaps = 32/333 (9%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVP 85
           KL  + L + + ++ N      WKA  N +F NY+      L+GV   + + K      P
Sbjct: 59  KLTGYALANYVNRKQNL-----WKAKFNNKFRNYSDRVKYGLMGVNNVRLSVKAKKNLSP 113

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
            + +D  + +P++FDAR  W QC+++  I DQ  CGSCWAFGAVEA+SDR CI  +  + 
Sbjct: 114 TRFYD--IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQ 171

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 196
           +SLS +DLL+CC   CG GCDGG P++AW+Y+V  G+VT       + C PY     C H
Sbjct: 172 VSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCEH 229

Query: 197 --------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
                   P     YPTPKC +KC  +   + +   K +  +AY +  D   I  EI  +
Sbjct: 230 HSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH 289

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVEV+F VYEDF  Y  G+Y H  G + GGHAVK++GWG  + G  YW++AN WN  WG
Sbjct: 290 GPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSWNTDWG 348

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 339
            DG+F+I RG +ECGIE  VV GLP      K+
Sbjct: 349 EDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 381


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/309 (44%), Positives = 178/309 (57%), Gaps = 19/309 (6%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL-PK 97
           + + V+    A W + R P+    +  +  H   ++   +       V+  D   KL PK
Sbjct: 30  VREHVHPTAGARWISVRYPK-PFESDNKLHHFGAIREPVEQRAQRSTVRHEDFDSKLIPK 88

Query: 98  SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC 155
           SFDAR+ WP C +IS I DQ  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC
Sbjct: 89  SFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLLSCC 148

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------FDSTGCSHPGCEPA 202
              CGDGCDGG+P  AW ++  HG+VT    EE   C PY        S G   P     
Sbjct: 149 K-DCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYPPCPRRI 207

Query: 203 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
           YPTPKCV+ C      ++  K  + ++Y ++     IM EI  NGPVE +F V+EDF  Y
Sbjct: 208 YPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEY 267

Query: 263 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           KSG+Y H  G  +GGHA++++GWG  ++G  YW++AN WN  WG  GY +  RG NECGI
Sbjct: 268 KSGIYFHAWGGSVGGHAIRILGWG-EENGVPYWLIANSWNEDWGEKGYLRFLRGHNECGI 326

Query: 323 EEDVVAGLP 331
           EE+  AGLP
Sbjct: 327 EEEATAGLP 335


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 142/338 (42%), Positives = 189/338 (55%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + E+N   +  W A+ +  +  +  ++ 
Sbjct: 9   LCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVTGKSLE 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE 
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C K        K+   ++Y +  + 
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +M E+  NGP+EV+  VY DF  YKSGVYKH++GD++GGHAVKL+GWGT   G  YW 
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWK 302

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RGSNECGIE   VAG P+ +
Sbjct: 303 IANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTPAQE 340


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 177/312 (56%), Gaps = 23/312 (7%)

Query: 43  VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 100
           V+    A W  A  P+   +  G F+ + G    P+      P  +H+      +PK+FD
Sbjct: 28  VDSKSGARWIYAEPPE--RFQPGNFQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFD 85

Query: 101 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL 158
           AR  WP C TI  I DQ  CGSCWAFGAVEA+SDR CIH     +  +S  DL++CCG+ 
Sbjct: 86  ARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY- 144

Query: 159 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-------AYP 204
           CG GC GG+P +AW ++   G+VT         C  Y     CSH G +         Y 
Sbjct: 145 CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHGSKKYPPCSHRIYD 203

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TP CV+KC   +  +   K  +   Y + +    IM EI  NGPVE +F VYEDF  YKS
Sbjct: 204 TPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKS 263

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY H  G ++GGHA++++GWG  ++G  YW++AN WN  WG DGYFK+ RG NECGIE+
Sbjct: 264 GVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIED 322

Query: 325 DVVAGLPSSKNL 336
           +V AGLP   ++
Sbjct: 323 EVTAGLPELSSI 334


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 180/314 (57%), Gaps = 23/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  I  +N    + WKA RN  F  +T     K L GV P      L       +   
Sbjct: 26  LSDDFINLINTKQNS-WKAGRN--FPEHTPFAHIKKLAGVLPDYHLSKLSKVEHEDELIA 82

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            LP++FD R  WP C T++ + DQG CGSCWAFGAVEA++DR+C +     +   S  DL
Sbjct: 83  SLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDL 142

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG---- 198
           L+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C H  PG    
Sbjct: 143 LSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMP 200

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TPKC + C    N  +R  K Y    + ++S  + I AE++KNGPVE +FTVY 
Sbjct: 201 CNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYS 260

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           D  +YK+GVYKH  GD +GGHAVK++GWG  ++G  YW++AN WN  WG +G+FKI RG 
Sbjct: 261 DLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGE 319

Query: 318 NECGIEEDVVAGLP 331
           + CGIE  +VAG P
Sbjct: 320 DHCGIESSIVAGEP 333


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 139/330 (42%), Positives = 182/330 (55%), Gaps = 18/330 (5%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           + L  F  +A  V +    D+ IL D  ++ VN      W A R  +  + T  +   LL
Sbjct: 9   IALFLFLLYATAVHALHVDDAPILTDEFLEHVNSLNGGKWTAGRTSRTKHLTRREASRLL 68

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G       +L        +  ++L   FDA  AWP C TI+ I DQ  CGSCWA  A  A
Sbjct: 69  GTFLGNTSILAPRQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASA 128

Query: 132 LSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-F 189
           +SDR+C   G+ +L +S  DL++CC  +CG GC+GG+P  AW ++V HG+V+E C PY F
Sbjct: 129 MSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPF 187

Query: 190 DSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMA 241
            S  C+H         C   Y TPKC   C  KK  L R   ++S     + S  E    
Sbjct: 188 PS--CAHHVNSSDLAPCSGDYKTPKCNSTCTEKKIPLIRYRGNHSY----VLSGEEHFKR 241

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           E+  NGP EV+F VY DF  Y  GVYKH+ GD++GGHAV+L+GWG   +GE YW +AN W
Sbjct: 242 ELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGEL-NGEPYWKIANSW 300

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N  WG +GYF I RG NECGIE + VAG P
Sbjct: 301 NHEWGMNGYFLIARGVNECGIESNGVAGTP 330


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 180/314 (57%), Gaps = 23/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  I  +N    + WKA RN  F  +T     K L GV P      L       +   
Sbjct: 26  LSDDFINLINTKQNS-WKAGRN--FPEHTPFAHIKRLAGVLPDYHLSKLSKVEHEDELIA 82

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            LP++FD R  WP C T++ + DQG CGSCWAFGAVEA++DR+C +     +   S  DL
Sbjct: 83  SLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDL 142

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG---- 198
           L+CC  +CG GC+GG P  AW Y+ H G+V+       + C PY +   C H  PG    
Sbjct: 143 LSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMP 200

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TPKC + C    N  +R  K Y    + ++S  + I AE++KNGPVE +FTVY 
Sbjct: 201 CNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYS 260

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           D  +YK+GVYKH  GD +GGHAVK++GWG  ++G  YW++AN WN  WG +G+FKI RG 
Sbjct: 261 DLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGE 319

Query: 318 NECGIEEDVVAGLP 331
           + CGIE  +VAG P
Sbjct: 320 DHCGIESSIVAGEP 333


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/339 (41%), Positives = 188/339 (55%), Gaps = 28/339 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
            + L C    A   V  L    + L D  I  +N    + WKA RN    N  +   K L
Sbjct: 8   FVALVCTLALASASVEDLL---NPLTDEFINLINTKQNS-WKAGRNFPV-NTPLTHIKKL 62

Query: 71  LGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            GV       L  +P   HD  L   LP++FD R  WP C T++ + DQG CGSCWAFGA
Sbjct: 63  TGVLVDTH--LSKLPKVEHDADLIADLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGA 120

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+    
Sbjct: 121 VEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSY 179

Query: 183 ---EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 232
              + C PY +   C H  PG    C     TPKC + C    N  +   K Y    Y +
Sbjct: 180 NSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCHKTCESSYNVDYHKDKRYGKHVYSV 238

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +S  + I AE+YKNGPVE +FTVY D  +YK+GVYKH  G+ +GGHA+K++GWG  ++G 
Sbjct: 239 SSKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGN 297

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 298 KYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 139/317 (43%), Positives = 183/317 (57%), Gaps = 27/317 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL 93
             D +I+ VNE   A WKAAR+ +F+N  + QFK HL  ++ TP+      P   +  S 
Sbjct: 26  FSDELIRYVNEESGASWKAARSTRFNN--IEQFKKHLGALEETPEERNTRRPTVRYSVSE 83

Query: 94  K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDAR  WP CS+IS I DQ  C SCWA G   A++DR CIH        LS  D
Sbjct: 84  NDLPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVD 143

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGC 199
           L++CC + CG GC+GGYP  AW Y+  HG+V+         C PY     CSH    PG 
Sbjct: 144 LVSCCPY-CGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLEETPGL 201

Query: 200 EPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P     Y TPKC ++C    ++     K    S+Y +     DIM EI  NGPV   + 
Sbjct: 202 APCPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYY 261

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           ++EDF  YKSG+Y++ +G +MGGH +  IGWG  ++G  YW+ AN WN  WG +GYF+I+
Sbjct: 262 IFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGV-ENGVKYWLAANSWNEGWGENGYFRIR 318

Query: 315 RGSNECGIEEDVVAGLP 331
           RG+NECGIE  + AGLP
Sbjct: 319 RGTNECGIESRINAGLP 335


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 184/338 (54%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNP--QFSNYTVG 65
           LCL   F       VS L     D  +L  S + E N   K  W A+ +     +  ++ 
Sbjct: 9   LCLVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLE 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+SFDA   WP C TI  I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S  +LL+CC F+CG GC GG P  AW ++V  GV TE 
Sbjct: 129 IAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTEL 187

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C   N      K+  +S+Y I  + 
Sbjct: 188 CQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGER 244

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +M E+  NGP+EV+  VY DF  YKSGVYKH++GD +GGHAVKL+GWG   DG  YW 
Sbjct: 245 E-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWK 302

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RG++ECGIE   VAG P  +
Sbjct: 303 IANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKPGEE 340


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 133/311 (42%), Positives = 173/311 (55%), Gaps = 15/311 (4%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           D   +    + EVN+  K  W A  + + +  T    K L+G K     +L        +
Sbjct: 27  DGRFITREFVAEVNKLNKGIWTARYDTKMARLTRQGVKRLMGAKLRDAPVLPRRHFTEEE 86

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVN 149
               LP+SFDA +AWP C TI RI DQ  CGSCWA  A  A+SDRFC+  G+ +L +S  
Sbjct: 87  LRAPLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAG 146

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 204
           DLL+CC   CGDGCDGGYP  AW YF   G+V++ C PY     C H G     P     
Sbjct: 147 DLLSCC-TSCGDGCDGGYPDEAWLYFTESGLVSDYCQPY-PFPPCKHSGGRSKNPSCHDM 204

Query: 205 ---TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
              TPKC   C  K       ++++  +Y +  + ED   E+Y  GP EV+FTVYEDF  
Sbjct: 205 HFHTPKCNATCTDKRIP--VVRYFASESYSLQGE-EDYKRELYLRGPFEVAFTVYEDFLA 261

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           Y+SGVYKH++G  +GGHAV+++GWG   +G  YW +AN WN  WG +GY    RG +ECG
Sbjct: 262 YESGVYKHVSGGPVGGHAVRVVGWG-ERNGVPYWKIANSWNTDWGENGYLYFYRGKDECG 320

Query: 322 IEEDVVAGLPS 332
           IE    AG PS
Sbjct: 321 IESQGSAGTPS 331


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 137/344 (39%), Positives = 194/344 (56%), Gaps = 23/344 (6%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +M+ +LC+  F +     ++ + ++    L D +I  +N++P AGW A+R+ +F +    
Sbjct: 1   MMNTVLCIISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLKDA 60

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           +   LLG     + L       V   D SL++P SFD+R  WPQC +IS I DQ  CG+ 
Sbjct: 61  RI--LLGAMREDEELRKKRRPTVDHQDVSLEIPTSFDSRKEWPQCKSISNIRDQSRCGAG 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF AV+A+SDR CI      ++ LS  DLL+CC   CG GC  G+P  AW Y+V  G+V
Sbjct: 119 WAFAAVQAMSDRICIESKGKKSVELSAVDLLSCC-IECGLGCQMGFPGIAWDYWVQEGIV 177

Query: 182 T-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           T         C PY        T   +P C E  Y  PKC +KC K  +  +   K+Y  
Sbjct: 178 TGGSKENHTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYYGK 237

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
            +Y +  + + I  EI  +GPVE SF V+ DF +YKSG+YKH+TG  +G H V++IGWG 
Sbjct: 238 VSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGV 297

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +   YW++AN WN  WG  GYF++ RG +ECGIE  V +GLP
Sbjct: 298 EKE-TPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 139/333 (41%), Positives = 192/333 (57%), Gaps = 27/333 (8%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 85
           +K+  ++  L D  + +   + +  WKA  N +F+ Y+      LLGV    + +     
Sbjct: 54  TKIAPEAENLSDQELIDYVNSHQTLWKAEMN-KFNLYSNTVKYGLLGVNNMKQSVDGKKN 112

Query: 86  VK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 142
           +  T   ++ +P+SFDAR  WP+C+++  + DQ  CGSCWA  AVEA+SDR CI      
Sbjct: 113 LSPTRHSTIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKK 172

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 199
            ++LS +DLL+CC   CG GC GG P++AW+Y+V  G+VT     Y + +GC     P C
Sbjct: 173 QVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPPC 229

Query: 200 E-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           E               YPTPKCV+KC K   + ++  K+Y    Y + S+ E I  EI  
Sbjct: 230 EHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIMT 289

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
            GPVE SF VY DF +Y  G+YKH+ G + GGHAVK++GWG  D G  YW+ AN WN  W
Sbjct: 290 LGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWLAANSWNTDW 348

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 338
           G DGYF+I RG NECGIE  ++AG+P  K L K
Sbjct: 349 GEDGYFRILRGVNECGIESGIIAGIP--KQLAK 379


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 195/345 (56%), Gaps = 26/345 (7%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M     L   A   +G   K K ++  L D ++  VN    A WKAA++ +F   T+ + 
Sbjct: 1   MRATTFLCAIAILLDGSNGKPKHEA--LSDELVDYVNSQVDATWKAAKSERFK--TLEEI 56

Query: 68  KHLLGVKPTPKGLL-LGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + +LG     + +     P  +H D +L+LP  FDAR  WP+C TI +I DQ  CGSCWA
Sbjct: 57  RSVLGTMREDQNVKEFRRPTISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWA 116

Query: 126 FGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           F AV A+SDR CIH     +N+ LS  DLLACC   CG GC GG+   AW Y+  +G+VT
Sbjct: 117 FAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVT 175

Query: 183 -------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
                    C PY         + G  +P C E  Y TP+CV +C K     + + K  +
Sbjct: 176 GGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRA 235

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
            ++Y +      I  EI+  GPVE +  VY DFA+Y  GVYKH TG+++GGHA++L+GWG
Sbjct: 236 STSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWG 295

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +DG  YW+ AN WN SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 296 VEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 195/345 (56%), Gaps = 26/345 (7%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M     L   A   +G   K K ++  L D ++  VN    A WKAA++ +F   T+ + 
Sbjct: 1   MRATTFLCAIAILLDGSNGKPKHEA--LSDELVDYVNSQVDATWKAAKSERFK--TLEEI 56

Query: 68  KHLLGVKPTPKGLL-LGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + +LG     + +     P  +H D +L+LP  FDAR  WP+C TI +I DQ  CGSCWA
Sbjct: 57  RSVLGTMREDQNVKEFRRPTISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWA 116

Query: 126 FGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           F AV A+SDR CIH     +N+ LS  DLLACC   CG GC GG+   AW Y+  +G+VT
Sbjct: 117 FAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVT 175

Query: 183 -------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 226
                    C PY         + G  +P C E  Y TP+CV +C K     + + K  +
Sbjct: 176 GGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRA 235

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
            ++Y +      I  EI+  GPVE +  VY DFA+Y  GVYKH TG+++GGHA++L+GWG
Sbjct: 236 STSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWG 295

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             +DG  YW+ AN WN SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 296 VEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 141/328 (42%), Positives = 185/328 (56%), Gaps = 28/328 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSL 93
             D +I  +NE   A WKAA + +F N  +  FK  LG+ + TP+      P   ++ S 
Sbjct: 16  FSDELIHYINEKSGASWKAAPSSRFIN--IEHFKQHLGLLEETPEERQTRRPTVRYNVSD 73

Query: 94  K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDAR  WP C +I +I DQ  CGSCWA   V A+SDR CIH    M   LS  D
Sbjct: 74  NDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAID 133

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 201
           L++CC + CG+GC GG P +AW Y+  +G+VT         C PY     C HPG     
Sbjct: 134 LVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQL 191

Query: 202 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                  YPTP C   C    ++ +   K Y  ++Y ++     IM EI KNGPVE  F 
Sbjct: 192 NPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFI 251

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DFA YKSG+Y H++G   G HA+++IGWG  ++G  YW+ AN WN  WG +GYF+I 
Sbjct: 252 VYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVKYWLTANSWNVGWGENGYFRIL 310

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEITS 342
           RG++EC IE  VVAG+P    L K IT+
Sbjct: 311 RGTDECRIESIVVAGMP---RLQKNITN 335


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 140/343 (40%), Positives = 194/343 (56%), Gaps = 29/343 (8%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDS--IIKEVNENPKAGWKAARNPQFSNYTVG---Q 66
           +  + F   A  V   ++  S +  ++  II  VN +P   W+A+     +N   G    
Sbjct: 3   MSASIFIVLATMVAVAVRESSAVTNEATFIIDSVNADPGNTWRASD----TNVIPGDGKN 58

Query: 67  FKHLLGVKPTPKGLLLGVPVK--THDKSLK-LPKSFDARSAWPQCSTI-SRILDQGHCGS 122
           F  L+GV P         P+K    D+S + LP++FDAR  WP+CS++   I DQ +CGS
Sbjct: 59  FNQLMGVLPRNFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSNCGS 118

Query: 123 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
           CWA  A    SDR CI  G  +  +LS   L  CC + CG+GCDGG P SAW +F+ HG+
Sbjct: 119 CWAVSAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPESAWYFFMRHGI 177

Query: 181 VT-------EECDPY-FDSTGCSHPGCEPAYP-TPKC-VRKCVKKN--QLWRNSKHYSIS 228
           VT       + C PY     G     C    P TP C ++ C   N  + +R   HY  +
Sbjct: 178 VTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVDT 237

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
            Y ++   EDIM ++YKNGPV+ +F VY DF +YKSGVY +  G + GGHA+K++GWG  
Sbjct: 238 VYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGV- 296

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           DDG  YW+ AN W+RSWG +G F+I RG+NEC IE+ V+AG+P
Sbjct: 297 DDGTKYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMP 339


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 136/297 (45%), Positives = 178/297 (59%), Gaps = 23/297 (7%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           W A  N  F +      K L G     KG  L V V+ + + LKLPK+FDAR  WP C T
Sbjct: 40  WTAGHN--FRDVDYSYVKKLCGT--FLKGPKLPVMVQ-YTEGLKLPKNFDAREQWPNCPT 94

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 168
           +  I DQG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG GC+GGYP
Sbjct: 95  LKEIRDQGSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCD-SCGMGCNGGYP 153

Query: 169 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 215
            +AW ++   G+VT         C PY          G   P       TP C  KC   
Sbjct: 154 SAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPG 213

Query: 216 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 274
            +  ++  KH+  ++Y + S+   IMAE++KNGPVE +FTVYEDF  YKSGVY+H++G  
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSP 273

Query: 275 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +GGHA+K++GWG  ++G  YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 274 VGGHAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 142/338 (42%), Positives = 188/338 (55%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 65
           LCL   FA      VS L     D  +L  S + E+N   +  W A+ +  +  S  ++ 
Sbjct: 9   LCLVAVFAVLLATTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+ FDA   WP C TIS I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++V  G+ TE 
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C K        K+   ++Y +  + 
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +M E+  NGP+EV+  VY DF  YKSG YKH++GD++GGHAVKL+GWGT   G  YW 
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGT-QGGVPYWK 302

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RGSNECGIE   VAG P+ +
Sbjct: 303 IANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTPAQE 340


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  244 bits (622), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 136/319 (42%), Positives = 182/319 (57%), Gaps = 27/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           + L    I E+N      W+A RN    + ++   + L+GV           P   HD S
Sbjct: 22  YALSAKFIDEINSKAST-WRAGRNFH-PDVSLSYIRGLMGVHQ--DAYKFREPEFVHDLS 77

Query: 93  LK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
                LP++FD+R  WP C TI  I DQG CGSCWAFGAVEA+SDR CI  G  ++   S
Sbjct: 78  ADVDDLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFS 137

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH---- 196
             DL++CC   CG GC+GG+P +AW Y+VH G+V+         C PY  +  C H    
Sbjct: 138 AEDLVSCC-HTCGFGCNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAP-CEHHVNG 195

Query: 197 --PGCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
             P CE     TPKCV+KC     + +   K Y   +Y I    + I  EI  NGPVE +
Sbjct: 196 TRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGA 255

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           FTVYED  HYK GVY+H+TG ++GGHA++++GWG  ++ + YW++AN WN  WG +G+FK
Sbjct: 256 FTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENNTK-YWLIANSWNSDWGDNGFFK 314

Query: 313 IKRGSNECGIEEDVVAGLP 331
           I RG +  GIE  + AGLP
Sbjct: 315 ILRGEDHLGIESSIAAGLP 333


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  243 bits (621), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 188/343 (54%), Gaps = 27/343 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +I+   +CL      A   VS++    H L D  I  +N      W A RN      T+ 
Sbjct: 1   MILIRAICLVFLCGIA---VSEI---PHPLSDKFIDLINSKQNT-WIAGRNFDIGR-TLK 52

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
             K L+G         L       D    LP++FD R  WP C T++ I DQG CGSCWA
Sbjct: 53  SIKKLMGALEDKYLHKLYTVEHDDDTINNLPENFDPRDKWPNCPTLNEIRDQGSCGSCWA 112

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 182
           FGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+ 
Sbjct: 113 FGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGIPSFAWEYWKHFGIVSG 171

Query: 183 ------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 229
                 + C PY +   C H  PG    C     TPKC R C K+    +++ K Y    
Sbjct: 172 GNYNSSQGCLPY-EIPPCEHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHV 230

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +    E I AEI+KNGPVE +FTVY D   YKSGVYKH  G+ +GGHA+K++GWG  +
Sbjct: 231 YSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGV-E 289

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           +G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG PS
Sbjct: 290 NGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEPS 332


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  243 bits (620), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 168/301 (55%), Gaps = 25/301 (8%)

Query: 49  AGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWP 106
           A W + R P+   +      H+ G K   +      P   HD   +++LPK+FDAR  WP
Sbjct: 40  ARWISGRRPK--RFESDDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWP 97

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD 164
            CS+IS I DQ  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CG GC 
Sbjct: 98  HCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCR 156

Query: 165 GGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCV 209
           GGYP  AW Y+  HG+VT       D +GC     P CE              YPTP+CV
Sbjct: 157 GGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECV 214

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
           ++C   +  +   K  +  +Y I +    IM EI   GPVE  FT+YEDF  Y SGVY H
Sbjct: 215 QQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFH 274

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
             G  M GHAV+++GWG   +   YW++AN WN  WG +GY K  RG NECGIE+DV A 
Sbjct: 275 ALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAV 333

Query: 330 L 330
           L
Sbjct: 334 L 334


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 136/337 (40%), Positives = 189/337 (56%), Gaps = 24/337 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +L LT  AT    + +  ++  + L +  I ++N      W A RN    +  +  F+ L
Sbjct: 3   LLLLT--ATVIVVLWAMYRVSINPLSEKFIDQINAKATT-WHAGRNFH-PDTPLSYFRGL 58

Query: 71  LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
           +GV       +  V +   D+   LP++FD+R  WP C TI  I DQG CGSCWAFGAVE
Sbjct: 59  MGVHKDADKFMPPVMLHDLDEGDDLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVE 118

Query: 131 ALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
           A+SDR CIH    +   +S  DLL CC   CG GCDGG P + W++++  G+V+    P+
Sbjct: 119 AMSDRVCIHSKGKVLFRVSAEDLLTCCTN-CGHGCDGGAPGAGWKHWIEKGLVSG--GPF 175

Query: 189 FDSTGCSHPGCEPAYP-------------TPKCVRKCVKK-NQLWRNSKHYSISAYRINS 234
               GC     EP                TPKC++KC+   N  +   K +  S Y I +
Sbjct: 176 GSDQGCRPYTIEPCVHVENGAQSPCKDSITPKCIKKCLPGYNVPYAKDKSFGKSTYSIAN 235

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           D   I  EI+ NGPVE +FTV++DFA YK G+Y+H +G++ G HAV+++GWG  ++G  Y
Sbjct: 236 DERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGV-ENGTKY 294

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           W+ AN WN  WG +GYFKI RGSN   IE  +VAGLP
Sbjct: 295 WLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLP 331


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 183/313 (58%), Gaps = 22/313 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  I  +N + +  W+A RN  F+  T  ++ K L GV          +P +     +
Sbjct: 24  LSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYLKSLAGVHKDANNAFT-LPKRQVSVDV 79

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            +P  FDAR  WP CS+I+ I DQG CGSCWAFGAVEA+SDR CIH    + + LS  +L
Sbjct: 80  TVPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENL 139

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHPGC 199
           L+CC   CG GC GG   +AW Y+   G+V+       + C PY       S   S P C
Sbjct: 140 LSCCDS-CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPAC 198

Query: 200 EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           E    TPKC ++C K   + + +   Y    Y I +D + I AEI KNGP+  S  VYED
Sbjct: 199 EGVRDTPKCKKQCEKGYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYED 258

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
              YK+GVY+H+ G+V+GGH +K++GWG  +D   YW++AN WN  WG +G+FKI RGS+
Sbjct: 259 LFSYKAGVYQHVAGEVLGGHVIKILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSD 317

Query: 319 ECGIEEDVVAGLP 331
           ECGIE+ +VAG+P
Sbjct: 318 ECGIEDQIVAGIP 330


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 131/306 (42%), Positives = 175/306 (57%), Gaps = 18/306 (5%)

Query: 36  QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           Q  +++EVN      W A  NP F++ T+  F+ L G + TP    + + V T   +  L
Sbjct: 18  QQKLVREVNSRNDVNWVAGINPHFADATIEDFRRLNGARQTPLSDRVYMDVSTVPVA-NL 76

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 153
           P  FD+R+ WP C  I +I DQGHCGSCWA  + E L DRFCI         LS   L +
Sbjct: 77  PDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLTS 136

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC 212
           C       GC+GG+  +A+ +   +G++ E+C PY     C HPGC   +PTPKC + KC
Sbjct: 137 CTPGC--SGCNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKC 192

Query: 213 ----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
                K  +LW     ++ S+Y + S+  DI  EIY+NGPV  SF VYED + Y+SGVY+
Sbjct: 193 YPNDTKSTELW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQ 247

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           H+TG   G HA+K++GWG   DG  YW + N W   WG DG   I+RG +ECGIE DVVA
Sbjct: 248 HVTGGFEGLHAIKVVGWGIL-DGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVA 306

Query: 329 GLPSSK 334
           G P  K
Sbjct: 307 GQPKLK 312


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 128/317 (40%), Positives = 173/317 (54%), Gaps = 22/317 (6%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           L+  S    D +      + ++ W +  N ++  +     K  +G        +      
Sbjct: 13  LRFQSQTFYDFV-----NSQQSTWVSGHNQRWEQFNEATLKTQMGTFLDEPDFMKLPEST 67

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 147
              ++L++P+SFDAR  WP C +I  + DQ  CGSCWAFGA EA+SDR CI  G    +S
Sbjct: 68  VQFENLEIPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIATGKQTRIS 127

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH---- 196
             DLL CCG  CG GC+GG+P  AW YF + G+VT +       C PY     C H    
Sbjct: 128 TEDLLTCCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHVDD 186

Query: 197 ---PGCEPAYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C  + PTP CV+ C  ++ + + + K  SI +Y ++S  E I  EI   GPVE S
Sbjct: 187 GKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEAS 246

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           FTVYEDF  YKSGVY+++ G  +GGHAVK+IGWG   +   YW++ N WN  WG +G FK
Sbjct: 247 FTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKN-VPYWLVVNSWNEGWGENGLFK 305

Query: 313 IKRGSNECGIEEDVVAG 329
           I RGSN  GIE  + AG
Sbjct: 306 ILRGSNHVGIEGGIYAG 322


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/330 (40%), Positives = 176/330 (53%), Gaps = 14/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR C   G+  L +S   LL+CC   CGDGCDGGYP SAW Y+V HG+ +  C PY  
Sbjct: 127 ISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDGGYPDSAWEYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       K+    +Y +    +D   E
Sbjct: 185 FPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNDSYVLLHGEDDFKRE 242

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V+F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+
Sbjct: 243 LYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWD 301

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+F I RG+NECGIE    AGLP+
Sbjct: 302 TDWGMNGHFLILRGNNECGIESTGYAGLPA 331


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 189/334 (56%), Gaps = 45/334 (13%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L   ++  +N+   + W A  N  F N      K L G     KG  L + ++ +   +K
Sbjct: 25  LSSEMVNYINK-LNSTWTAGHN--FHNVDYSYVKKLCGT--LLKGPKLPLMIR-YAGDIK 78

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LPK FD+R  WP C T+  I DQG CGSCWAFGA EA+SDR CIH    +S  LS  DLL
Sbjct: 79  LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------------------ECDPYFDSTG 193
            CC   CG GC+GGYP SAW ++V  G+V+                      D  F S G
Sbjct: 139 TCCNS-CGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPG 197

Query: 194 C--------------SHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPE 237
           C              S P C      TP+C+ +C    +  ++  KH+  ++Y ++S+ +
Sbjct: 198 CRPYTIPPCEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEED 257

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
           +I  EIYKNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  ++G  YW+ 
Sbjct: 258 EIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWG-EENGVPYWLC 316

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AN WN  WG +G+FKI RG++ CGIE ++VAG P
Sbjct: 317 ANSWNTDWGDNGFFKILRGADHCGIESEIVAGNP 350


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 173/301 (57%), Gaps = 18/301 (5%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           ILQ  +I ++N N   GW A  NP+F+  T    K LLG K  PKG  L           
Sbjct: 21  ILQQEMIDQIN-NANVGWTAGVNPRFAGKTREDIKGLLGTKLLPKGTKLREFPVVDTIVD 79

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
            +P SFDAR+ WP  ++I  I DQ  CGSCWAFGA EALSDR  I  +  +N+ LS  DL
Sbjct: 80  AIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDL 137

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           ++C       GCDGGYPI+AW Y    GVVT+ C PY    G S         TP C   
Sbjct: 138 VSCDS--TDYGCDGGYPINAWHYMQSLGVVTDTCYPYTSGNGDSGTCQITGKKTPACATA 195

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
              K +          +AY++ ++   I +EI  NGPVE +F+VY+DF  Y SGVY H +
Sbjct: 196 TFYKAK----------TAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQS 245

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G + GGHAVK++GWG  D    YWI+AN W  SWG  G+F IKRG++ECGIE+ +VAGL 
Sbjct: 246 GALDGGHAVKIVGWGV-DGTTPYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLA 304

Query: 332 S 332
           +
Sbjct: 305 A 305


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 182/342 (53%), Gaps = 30/342 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L      A  V +    +  +L D  I+ V    K  WK  RN   S  T G  + L+GV
Sbjct: 3   LLLLVAIAASVAALTSGEPSLLSDEFIEVVRSKAKT-WKVGRNFDAS-VTEGHIRRLMGV 60

Query: 74  KPTPKGLLLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            P      L  P K         +   +LP+ FD+R  WP C TI  I DQG CGSCWAF
Sbjct: 61  HPDAHKFAL--PDKREVLGDLYMNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAF 118

Query: 127 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 182
           GAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+  
Sbjct: 119 GAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGG 177

Query: 183 -----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAY 230
                + C PY + + C H      P C     TPKC   C     + +   KH+   +Y
Sbjct: 178 PYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYAKDKHFGSKSY 236

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SD 289
            +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG   D
Sbjct: 237 SVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGD 296

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 297 EKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 136/324 (41%), Positives = 182/324 (56%), Gaps = 31/324 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP-------KGLLLG 83
           + H+L D  I E+ ++    W   RN   +  +    + L+GV P         K  LLG
Sbjct: 23  EPHMLSDEFI-ELVKSKATTWTPGRNFD-AAVSEHHIRALMGVHPDSHKFTLPEKRELLG 80

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
              +  D    LP+ FD+   WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    
Sbjct: 81  ADGEDKD----LPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNAT 136

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 194
           +N   S +DL+ CC   CG GC+GG+P +AW Y+   G+V       TE C PY +   C
Sbjct: 137 VNFHFSADDLVTCC-HTCGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPY-EVEPC 194

Query: 195 SHPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
            H    P  P     TP C  +C     + +   KH+  S+Y IN +P +I  EI  NGP
Sbjct: 195 EHHVDGPRPPCHSGSTPHCKHQCQPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGP 254

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 307
           VE +FTVYED   YK+GVY+H+ G  +GGHA+++IGWG   + +  YW++AN WN  WG 
Sbjct: 255 VEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVPYWLIANSWNTDWGD 314

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
           +G+F+I RG + CGIE  + AGLP
Sbjct: 315 NGFFRILRGKDHCGIESQISAGLP 338


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 128/253 (50%), Positives = 157/253 (62%), Gaps = 19/253 (7%)

Query: 95  LPKSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 151
           LP S+D R  W   C + + I DQG CGSCWAFGAVEA +DR CI      N  +S  DL
Sbjct: 77  LPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDL 136

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPG 198
           L CCGF CG GC+GG    AW +F + G VT       E C PY        ++G   P 
Sbjct: 137 LTCCGFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP- 195

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           CE + PTPKC R C +  N  + + KH   S Y I +D E I  EIY NGPVE +FTVY 
Sbjct: 196 CEGSEPTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYS 255

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF +YKSGVYK+ TG+ +GGHA+K++GWG  ++   YW++AN WN  WG  G+FKI RGS
Sbjct: 256 DFPNYKSGVYKYTTGNALGGHAIKILGWGVENN-VPYWLVANSWNPDWGDKGFFKILRGS 314

Query: 318 NECGIEEDVVAGL 330
           NECGIE  VVAG+
Sbjct: 315 NECGIEASVVAGM 327


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 137/340 (40%), Positives = 183/340 (53%), Gaps = 26/340 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L      A  V +    +  +L D  I+ V    K  WK  RN   S  T G  + L+GV
Sbjct: 3   LLLLVAIAASVAALTSGEPSLLSDEFIEVVRSKAKT-WKVGRNFDAS-VTEGHIRRLMGV 60

Query: 74  KPTPKGLLL----GVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            P      L     V    +  SL +LP+ FD+R  WP C TI  I DQG CGSCWAFGA
Sbjct: 61  HPDAHKFALPDKREVLGDLYMNSLDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGA 120

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+    
Sbjct: 121 VEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPY 179

Query: 183 ---EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
              + C PY + + C H      P C     TPKC   C     + +   KH+   +Y +
Sbjct: 180 GSNQGCRPY-EISPCEHHVNGTRPPCANGSGTPKCSHVCQSSYTVDYAKDKHFGSKSYSV 238

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDG 291
             +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++ 
Sbjct: 239 KRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGNEK 298

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 299 IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 179/311 (57%), Gaps = 42/311 (13%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK--------LPKSFDAR 102
           WKA  N +F+ Y+      LLGV    K +        H K+L         +P+SFDAR
Sbjct: 77  WKAGMN-KFNLYSDTVKYGLLGVNNRKKSV-------EHKKNLSPIRHSNIFIPESFDAR 128

Query: 103 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCG 160
             WP+C+++  I DQ  CGSCWA  AVEA+SDR CI       + LS +DLL+CC   CG
Sbjct: 129 KNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCG 187

Query: 161 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYP 204
            GC GG P++AW+Y+V  G+VT     Y + +GC     P CE               YP
Sbjct: 188 FGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYP 245

Query: 205 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           TPKC ++C K   + ++  K+Y   AY + +D E I  EI   GPVE SF VY DF HY 
Sbjct: 246 TPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYT 305

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNEC 320
           SG+YKH+ G V GGHAVK++GWG  D G  YW+ AN WN  WG D   GYF+I RG++EC
Sbjct: 306 SGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNNDWGEDVFSGYFRILRGADEC 364

Query: 321 GIEEDVVAGLP 331
           GIE  +VAG+P
Sbjct: 365 GIESGIVAGIP 375


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 178/316 (56%), Gaps = 16/316 (5%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 88
           D+ +L  + +  VN   +  WKA  +    N T+ + K L GV  K     +L       
Sbjct: 28  DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 87

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 147
            +    LP SFD+  AWP C TI +I DQ  CGSCWA  A  A+SDRFC   G+ ++ +S
Sbjct: 88  EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 147

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 204
             DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +  YP   
Sbjct: 148 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 206

Query: 205 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                TPKC   C        N +  S ++Y +  + +D M E++  GP EV+F VYEDF
Sbjct: 207 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 263

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+E
Sbjct: 264 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSE 322

Query: 320 CGIEEDVVAGLPSSKN 335
           CGIE+   AG+P + N
Sbjct: 323 CGIEDGGSAGIPLAPN 338


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 180/340 (52%), Gaps = 26/340 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L      A  V +    +  +L D  I+ V    K  W   RN   S  T G  + L+GV
Sbjct: 3   LLLLVATAASVAALTAGEPSLLSDEFIELVRSKAKT-WTVGRNFDAS-VTEGHIRRLMGV 60

Query: 74  KPTPKGLLLGVPVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            P      L    +       +   ++P+ FD+R  WP C TI  I DQG CGSCWAFGA
Sbjct: 61  HPDAHKFALADKREVLGDLYMNSVDEIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGA 120

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+    
Sbjct: 121 VEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPY 179

Query: 183 ---EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
              + C PY + + C H      P C     TPKC   C     + +   KH+   +Y +
Sbjct: 180 GSNQGCRPY-EISPCEHHVNGTRPPCAHGGATPKCSHVCQSSYTVDYAKDKHFGSKSYSV 238

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDG 291
             +  DI  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG   D+ 
Sbjct: 239 RRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEK 298

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 299 IPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLP 338


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 142/338 (42%), Positives = 183/338 (54%), Gaps = 21/338 (6%)

Query: 12  LCLTC-FATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNP--QFSNYTVG 65
           LCL   F       VS L     D  +L  S + E N   K  W A+ +     +  ++ 
Sbjct: 9   LCLVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLE 68

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + + L+GV       +        +    LP+SFDA   WP C TI  I DQ +CGSCWA
Sbjct: 69  EVRKLMGVTSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWA 128

Query: 126 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
             AVEA+SDR+C   G+ +  +S  +LL+CC F+CG GC GG P  AW ++V  GV TE 
Sbjct: 129 IAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTEL 187

Query: 185 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           C PY     CSH G    YP        TPKC   C   N      K+  +S+Y I  + 
Sbjct: 188 CQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGER 244

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
           E +  E+  NGP+EV+  VY DF  YKSGVYKH++GD +GGHAVKL+GWG   DG  YW 
Sbjct: 245 E-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWK 302

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +AN WN  WG  GYF I+RG++ECGIE   VAG P  +
Sbjct: 303 IANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKPGEE 340


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 141/342 (41%), Positives = 191/342 (55%), Gaps = 29/342 (8%)

Query: 14  LTCFATFAEGVVSKLKLDS----HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           ++    FA  VV++ K +         D +I  +NE   A WKAA + +F+N  + Q K 
Sbjct: 1   MSWLLIFAAIVVAQAKPNYKRQFEPFSDELIHYINEESGASWKAAPSTRFNN--IDQVKQ 58

Query: 70  LLGV-KPTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
            LGV + TP+        V+       LP+SFDAR  W  C +IS I DQ  C SCWA  
Sbjct: 59  NLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVS 118

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           +  A++DR CIH        LS  D+++CC + CG GC+GG P  +W Y+   GVVT   
Sbjct: 119 SASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGT 177

Query: 183 ----EECDPYFDSTGCSH----PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISA 229
                 C PY     CSH    PG  P     YPTPKC +KC    N+ +   K    S+
Sbjct: 178 LENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSS 236

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +     DIM EI KNGPV+  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG  +
Sbjct: 237 YNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-E 295

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +G  YW++AN WN  WG  GYF+++RG+NECGIE  + AGLP
Sbjct: 296 NGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 337


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 139/341 (40%), Positives = 186/341 (54%), Gaps = 25/341 (7%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M+ +L L+ F    +  V  +    H L D +I  +N+     WKA RN    N      
Sbjct: 1   MNFLLALSLFVVTPQDRV-MVPPSVHPLSDEMIDFINK-LNTTWKAGRNFD-KNVPFSYI 57

Query: 68  KHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
           K L+GV    +     +P   H      LP+SFDAR  W +C++I  I DQ  CG+CWAF
Sbjct: 58  KGLMGVA---RNKTRRLPTLMHSSIPDNLPESFDARQHWRKCNSIHVIRDQSSCGACWAF 114

Query: 127 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 182
           GAVEA+SDR CIH    + +++S  DLL CC + C  GC GG P  AW ++   G+VT  
Sbjct: 115 GAVEAISDRICIHTKGSVQVNISAQDLLTCCDY-CRTGCKGGVPSYAWMFYKEKGIVTGG 173

Query: 183 -----EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 230
                + C PY      + +TG   P      P P C R+C K   + +   KHY    Y
Sbjct: 174 LYGTEDGCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVY 233

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            ++ D   I  EI+KNGPVE  F VY DF  YKSGVY+  +    G HA++++GWGT ++
Sbjct: 234 TLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGT-EN 292

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G  YW+ AN W   WG  GYFKI+RG+NECGIEED+ AG+P
Sbjct: 293 GVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDINAGIP 333


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 136/348 (39%), Positives = 192/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+   +T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLSTLLEAHVTTRNNQRIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 180/314 (57%), Gaps = 20/314 (6%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           H L D +I  +N+     WKA  N      ++   + LLGV P  +   L   V   +  
Sbjct: 26  HPLSDQMINYINK-INTTWKAGSNFD-KCISMSYIRGLLGVHPKSEEYRLAEFVHE-EIP 82

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDAR+ W  C +I  I DQ  CGSCWAFGA EA+SDR CIH    M +++S  D
Sbjct: 83  DDLPESFDARAKWSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAED 142

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPG 198
           LL CC   CG GC GG+P +AW ++   G+V+       + C PY  +     T C  P 
Sbjct: 143 LLDCCD-TCGHGCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPN 201

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C P   TP+CV  C K  ++ ++  KH+    Y I+ D + I  EI+ NGPVE  F VY 
Sbjct: 202 CIPIVHTPECVHHCRKGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYG 261

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVY+  + D  G HA++++GWGT ++G  YW+ AN WN +WG  GYFKI R +
Sbjct: 262 DFLCYKSGVYQRHSNDGRGMHAIRILGWGT-ENGTPYWLAANSWNENWGDKGYFKILRRT 320

Query: 318 NECGIEEDVVAGLP 331
           NECGIEE + AG+P
Sbjct: 321 NECGIEEHIYAGIP 334


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 178/316 (56%), Gaps = 16/316 (5%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 88
           D+ +L  + +  VN   +  WKA  +    N T+ + K L GV  K     +L       
Sbjct: 5   DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 64

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 147
            +    LP SFD+  AWP C TI +I DQ  CGSCWA  A  A+SDRFC   G+ ++ +S
Sbjct: 65  EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 124

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 204
             DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +  YP   
Sbjct: 125 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 183

Query: 205 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                TPKC   C        N +  S ++Y +  + +D M E++  GP EV+F VYEDF
Sbjct: 184 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 240

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+E
Sbjct: 241 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSE 299

Query: 320 CGIEEDVVAGLPSSKN 335
           CGIE+   AG+P + N
Sbjct: 300 CGIEDGGSAGIPLAPN 315


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 132/333 (39%), Positives = 178/333 (53%), Gaps = 21/333 (6%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G +      L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR+C   G+  L +S   LL+CC   CG GCDGGYP +AW Y+V HG+ +  C PY  
Sbjct: 127 ISDRYCTVGGVQQLRISAAHLLSCCKD-CGYGCDGGYPGTAWEYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDI 239
              C H G +   P        TPKC   C  K      +R +  Y +         +D 
Sbjct: 185 FPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPLIKYRGNHSYGLDG------EDDY 238

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP  V+F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN
Sbjct: 239 KRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIAN 297

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
            W+  WG +G+F I RG +ECGIE +  AGLP+
Sbjct: 298 SWDTDWGMNGHFLILRGKDECGIESEGYAGLPA 330


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 178/316 (56%), Gaps = 16/316 (5%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 88
           D+ +L  + +  VN   +  WKA  +    N T+ + K L GV  K     +L       
Sbjct: 6   DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 65

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 147
            +    LP SFD+  AWP C TI +I DQ  CGSCWA  A  A+SDRFC   G+ ++ +S
Sbjct: 66  EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 125

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 204
             DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +  YP   
Sbjct: 126 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 184

Query: 205 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                TPKC   C        N +  S ++Y +  + +D M E++  GP EV+F VYEDF
Sbjct: 185 QFNFDTPKCDYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 241

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+E
Sbjct: 242 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSE 300

Query: 320 CGIEEDVVAGLPSSKN 335
           CGIE+   AG+P + N
Sbjct: 301 CGIEDGGSAGIPLAPN 316


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 178/316 (56%), Gaps = 16/316 (5%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 88
           D+ +L  + +  VN   +  WKA  +    N T+ + K L GV  K     +L       
Sbjct: 28  DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 87

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 147
            +    LP SFD+  AWP C TI +I DQ  CGSCWA  A  A+SDRFC   G+ ++ +S
Sbjct: 88  EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 147

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 204
             DLLACC   CGDGC+GG P  AW YF   G+V++ C PY       H   +  YP   
Sbjct: 148 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 206

Query: 205 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                TPKC   C        N +  S ++Y +  + +D M E++  GP EV+F VYEDF
Sbjct: 207 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 263

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y SGVY H++G  +GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+E
Sbjct: 264 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSE 322

Query: 320 CGIEEDVVAGLPSSKN 335
           CGIE+   AG+P + N
Sbjct: 323 CGIEDGGSAGIPLAPN 338


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 139/339 (41%), Positives = 188/339 (55%), Gaps = 28/339 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
            + L C    A   V  L+   + L D  I  +N    + WKA RN    N  +   K L
Sbjct: 8   FVALVCALALASANVEDLQ---NPLTDEFINLINSKQNS-WKAGRNFPV-NTPLTHIKKL 62

Query: 71  LGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            GV       L  +P   HD  L   LP++FD R  WP C T++ + DQG CGSCWAFGA
Sbjct: 63  TGVLVDTH--LSKLPKAEHDMDLIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGA 120

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+    
Sbjct: 121 VEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSY 179

Query: 183 ---EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
              + C PY +   C H  PG    C     TPKC + C     + +   K Y    Y +
Sbjct: 180 NSGQGCRPY-EIPPCEHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYHKDKRYGKHVYSV 238

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           +S  + I AE++KNGPVE +FTVY D  +YK+GVYKH  G+ +GGHA+K++GWG  ++G 
Sbjct: 239 SSKEDHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGN 297

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            Y ++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 298 KYRLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 125/258 (48%), Positives = 159/258 (61%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           +P  +D R  + QC +++ I DQ HCGSCWA  A EA+SDR CI     +N  LS  D+L
Sbjct: 81  IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140

Query: 153 ACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 197
            CC   + CGDGC+GGYPI AW+Y+V +G+VT         C PY  +       G + P
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 200

Query: 198 GCEPA-YPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C  +   TPKCV  C   +     +   KHY  +AY ++   + I +EI KNGPVEV F
Sbjct: 201 KCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGF 260

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           TVY DF  YKSGVY H+ G  +GGHAVKL+GWG  D+G  YW+ AN WN +WG +GYF+I
Sbjct: 261 TVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGV-DNGTPYWLAANSWNTNWGENGYFRI 319

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG NECGIE  VVAG+P
Sbjct: 320 LRGVNECGIESQVVAGMP 337


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 195/349 (55%), Gaps = 29/349 (8%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           +++   C+    T  E  V+K +++  I  L D +I  +N++P AGWKA ++ +F  ++V
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTK-RINQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSV 57

Query: 65  GQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 122
              + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS
Sbjct: 58  DDARILLGGRKEDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGS 117

Query: 123 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
            WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+
Sbjct: 118 SWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGI 176

Query: 181 VTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKH 224
           VT       + TGC     P C+              Y TP+C + C K  N  +   KH
Sbjct: 177 VTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKH 234

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIG
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIG 294

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           WG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 295 WGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 137/324 (42%), Positives = 183/324 (56%), Gaps = 34/324 (10%)

Query: 27  KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 86
           +L   SH + D I K         WKA   P F N      K L G       LL G  +
Sbjct: 21  RLPPLSHQMVDYINKA-----NTTWKAG--PNFHNVDYSYVKRLCGT------LLKGPKL 67

Query: 87  KT---HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
            T   +   ++LP +FD R  WP C T+  I DQG CGSCWAFGA EA+SDR CIH    
Sbjct: 68  PTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127

Query: 144 LSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 188
           +S+ ++  DLL+CC   CG GC+GGYP +AW ++   G+VT         C PY      
Sbjct: 128 VSVEISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCE 186

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
               G   P       TP+C  +C       ++  KH+  ++Y + S+ + IMAE+ KNG
Sbjct: 187 HHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNG 246

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PVE +FTVYEDF  YKSGVY+H++G  +GGHA+K++GWG  + G  YW+ AN WN  WG 
Sbjct: 247 PVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWG-EEGGTPYWLAANSWNTDWGE 305

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
           +G+FKI RG + CGIE ++VAG+P
Sbjct: 306 NGFFKILRGKDHCGIESEMVAGVP 329


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  240 bits (612), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 132/312 (42%), Positives = 175/312 (56%), Gaps = 23/312 (7%)

Query: 43  VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 100
           V+    A W  A  P+   +  G F+ + G    P+      P  +H+      +PK+FD
Sbjct: 28  VDSKSGARWIYAEPPE--RFQPGNFQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFD 85

Query: 101 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL 158
           AR  WP C TI  I DQ  CGSCWAFGAVEA+SDR CIH     +  +S  DL++CCG+ 
Sbjct: 86  ARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY- 144

Query: 159 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-------AYP 204
           CG GC GG+P  AW ++   G+VT         C  Y     CSH G +         Y 
Sbjct: 145 CGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHGSKKYPPCSHRIYD 203

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           TP CV+KC   +  +   K  +   Y + +    IM EI  NGPVE +F VYEDF  YKS
Sbjct: 204 TPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKS 263

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY H  G ++GGHA++++GWG  ++G  YW++AN WN  WG DG FK+ RG NECGIE+
Sbjct: 264 GVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGCFKMLRGKNECGIED 322

Query: 325 DVVAGLPSSKNL 336
           +V AGLP   ++
Sbjct: 323 EVTAGLPELSSI 334


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 125/264 (47%), Positives = 162/264 (61%), Gaps = 16/264 (6%)

Query: 75  PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 134
           P P   +    V T   ++  P++FDAR+ WP+C +I  I +Q +CGSCWAFGA E +SD
Sbjct: 69  PPPSDEIRATEVNTVLATI--PETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISD 126

Query: 135 RFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECD 186
           R CI         +S  D++ CCG  CG GCDGGY I A R++V  GVVT      + C 
Sbjct: 127 RICIATKGARQPVISPMDMVDCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCK 186

Query: 187 PYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           PY     C+  GC P   TP+C   C  K N  +   K++  SAY +      I  +I  
Sbjct: 187 PY---QFCNSAGC-PDAVTPECALSCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMT 242

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPVE SF VYEDF  YKSGVYK+I G ++GGHA+K+IGWGT ++G  YW++AN W   W
Sbjct: 243 NGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGT-ENGTAYWLIANSWGTKW 301

Query: 306 GADGYFKIKRGSNECGIEEDVVAG 329
           G +G+FKI+RG NECGIE +VVAG
Sbjct: 302 GENGFFKIRRGVNECGIENNVVAG 325


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 166/314 (52%), Gaps = 25/314 (7%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK---L 95
           I+ EVN NP + WKAAR P F   T  Q    LG    P  + L  P K  D +     +
Sbjct: 31  IVFEVNSNPNSTWKAARYPHFEKMTREQLLGHLGSLDEPDWVKL--PTKEFDPNANADPI 88

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLA 153
           P+ FDAR  WP C +I  I DQ  CGSCWAF A E  SDR CI     L  S+S  DLL 
Sbjct: 89  PEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLE 148

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCE 200
           CC   CG GC GGYP +AW Y    GV T         C PY         TG   P C 
Sbjct: 149 CCADYCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CG 207

Query: 201 PAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           P  PTP+CV++C  +     +    H++   Y I  + + I  EI  +GPV+ SF V  D
Sbjct: 208 PIQPTPQCVKECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAAD 267

Query: 259 FAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           F  YKSGVY ++      GGH+VK+IGWG  +    YW++AN WN  WG  G F++ RG 
Sbjct: 268 FLTYKSGVYIRNPKLKYEGGHSVKIIGWG-KEGNTPYWLIANSWNEDWGEKGLFRMLRGR 326

Query: 318 NECGIEEDVVAGLP 331
           NECGIE  +VAGLP
Sbjct: 327 NECGIEAQIVAGLP 340


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 133/330 (40%), Positives = 178/330 (53%), Gaps = 15/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G +      L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR C   G+  L +S   LL+CC   CG GCDGGYP +AWRY+V HG+ +  C PY  
Sbjct: 127 ISDRHCTVGGVQQLRISAAHLLSCCK-DCGYGCDGGYPDAAWRYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       K+    +Y ++ + ED   E
Sbjct: 185 FPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-EDYKRE 241

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V+F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN W+
Sbjct: 242 LYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWD 300

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+F I RG +ECGIE    AG P+
Sbjct: 301 TDWGMNGHFLILRGKDECGIEHQGYAGSPA 330


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 132/267 (49%), Positives = 169/267 (63%), Gaps = 21/267 (7%)

Query: 84  VPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 140
           V V  HD +   +P +FDAR+ WP C +I+ I DQ  CGSCWAF A EA SDRFCI  + 
Sbjct: 69  VEVVEHDIQEDTIPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNG 128

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 189
            +N  LS  D+L+CC   CG GCDGGYPI+AW+Y V  G  T         C PY     
Sbjct: 129 AVNTLLSAEDVLSCCSN-CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPC 187

Query: 190 -DSTG-CSHPGC-EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
            ++ G  + P C +  Y TP CV KC   K N  +++ KH+  +AY +      I AEI 
Sbjct: 188 GETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEII 247

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            +GPVE +FTVYEDF  YKSGVY H TG  +GGHA++++GWGT D+G  YW++AN WN +
Sbjct: 248 AHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVN 306

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 307 WGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 131/314 (41%), Positives = 178/314 (56%), Gaps = 15/314 (4%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 88
           ++ +L +  + E+N   K  W A+ +     S  +  + + L+GV       L       
Sbjct: 32  NTPLLSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGVLNMSTAALSPRIFSA 91

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 147
            + + +LP SFD+   WP+C TIS I DQ +CGSCWA  AVEA+SDR+C   G+ +L +S
Sbjct: 92  EELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVAGITDLRVS 151

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY------FDSTGCSHPGCEP 201
              LL+CC F+CG GC GG P  AW ++V  G+ +E C PY        + G  +P C  
Sbjct: 152 TGHLLSCC-FVCGMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGGKYPACPS 210

Query: 202 A-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             Y TP C   C   +     +KH    +Y +  + E  M E+   GP EV+F VY DF 
Sbjct: 211 TIYDTPTCNSTCADSHTAL--TKHKGEKSYSLRGERE-YMIELMTYGPFEVAFDVYADFV 267

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            YKSGVY H TG+ +GGHAVKL+GWG   +G  YW +AN WN  WG +GYF I+RG++EC
Sbjct: 268 SYKSGVYSHTTGERLGGHAVKLVGWGV-QNGTPYWKIANSWNSDWGDNGYFLIRRGTDEC 326

Query: 321 GIEEDVVAGLPSSK 334
           GIE   VAGLPS K
Sbjct: 327 GIESTGVAGLPSLK 340


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 132/330 (40%), Positives = 177/330 (53%), Gaps = 14/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR+C   G+  L +S   L++CC   CGDGC GG P SAW Y+V HG+ +  C PY  
Sbjct: 127 ISDRYCTVGGVQQLRISAAHLMSCCED-CGDGCKGGAPDSAWEYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       K+   ++Y + +  +D   E
Sbjct: 185 FPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNNSYMLLNGEDDYKRE 242

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V F VY DF  YK+GVY+H++GDV+GGHAV+++GWG   +G  YW +AN W+
Sbjct: 243 LYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWD 301

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+F I RG+NECGIE    AGLP+
Sbjct: 302 TDWGMNGHFLILRGNNECGIESTGYAGLPA 331


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 190/342 (55%), Gaps = 29/342 (8%)

Query: 14  LTCFATFAEGVVSKLKLDS----HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           ++    FA  VV++ K +         D +I  +NE   A WKAA + +F+N  + Q K 
Sbjct: 1   MSWLLIFAAIVVAQAKPNYKRQFEPFSDELIHYINEESGASWKAAPSTRFNN--IDQVKQ 58

Query: 70  LLGV-KPTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
            LGV + TP+        V+       LP+SFDAR  W  C +IS I DQ  C SCWA  
Sbjct: 59  NLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVS 118

Query: 128 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
           +  A++DR CIH        LS  D+++CC + CG GC+GG P  +W Y+   GVVT   
Sbjct: 119 SASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGT 177

Query: 183 ----EECDPYFDSTGCSH----PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISA 229
                 C PY     CSH    PG  P     YPTPKC +KC    N+ +   K    S+
Sbjct: 178 LENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSS 236

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +     D M EI KNGPV+  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG  +
Sbjct: 237 YNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-E 295

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +G  YW++AN WN  WG  GYF+++RG+NECGIE  + AGLP
Sbjct: 296 NGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 337


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/350 (41%), Positives = 202/350 (57%), Gaps = 36/350 (10%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           P+ CL    +      ++ +   H L D ++  +N+     W+A  N  F N  +   + 
Sbjct: 7   PLCCLLALTS------ARNRPYFHPLSDDLVNYINKQ-NTTWQAGHN--FRNADMSYVRK 57

Query: 70  LLGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           L G         LG P   H     + + LP+SFDAR  W  C TI  I DQG CGSCWA
Sbjct: 58  LCGT-------FLGGPKLPHRIKFAEDMNLPESFDAREQWSSCPTIKEIRDQGSCGSCWA 110

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGAVE++SDR CIH    +N+ +S  D+L CCG  CG+GC+GGYP +AW ++   G+V+ 
Sbjct: 111 FGAVESISDRICIHTNGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNFWTKKGLVSG 170

Query: 184 E-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 230
                   C PY           S P C     TPKC + C    +  ++  KHY  S+Y
Sbjct: 171 GLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSY 230

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            +    ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWGT ++
Sbjct: 231 SVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGT-EN 289

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           G  YW++AN WN  WG +G+FKI RG + CGIE ++VAG+P +     +I
Sbjct: 290 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTDQYWAKI 339


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 125/267 (46%), Positives = 158/267 (59%), Gaps = 22/267 (8%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           +K  + +  +P S+D R  WPQC +++ I DQ HCGSCWA  A EA+SDR CI  +  +N
Sbjct: 64  IKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVN 123

Query: 144 LSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS--- 191
             LS  D+L CC   F CGDGC+GGYPI AWRY+V +G+VT         C PY  +   
Sbjct: 124 TLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCG 183

Query: 192 ---TGCSHPGCEPAYP-TPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 244
               G + P C      TPKC   C   N     +   KH+  SAY I    + I  EI 
Sbjct: 184 ETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEIL 243

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            +GPVEV F VYEDF  YK+G+Y H+ G  +GGHAVK++GWG  D+G  YW+ AN WN  
Sbjct: 244 AHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWLAANSWNTV 302

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG  GYF+I RG +ECGIE   VAG+P
Sbjct: 303 WGEKGYFRILRGVDECGIESAAVAGMP 329


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 132/303 (43%), Positives = 171/303 (56%), Gaps = 28/303 (9%)

Query: 48  KAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQ 107
           K  W A R  +F ++   +   L G   TP+   L  P+K    +  +P +FD+R+ WP 
Sbjct: 36  KTTWVAERPTRFGSFD--EVARLCGALETPEDQRL--PLKVAPIAEAIPDTFDSRTNWPA 91

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 165
           C TI  + DQ  CGSCWAFGAVE++SDR CI       + LS +DLL+CC   CGDGCDG
Sbjct: 92  CPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSCC-TSCGDGCDG 150

Query: 166 GYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYP--------TPKCVR 210
           G    +W Y+ + G+VT         C PY D   C+H    P YP        TPKC +
Sbjct: 151 GQLGPSWDYYKNKGIVTGYLYNTTGYCKPY-DFPACAHHEASPDYPDCPSTDYSTPKCTK 209

Query: 211 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            CV       +    HY  S+Y +      I  EI  +GPVE +FTVY DF  Y+SGVYK
Sbjct: 210 SCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYK 269

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           H +G V+GGHA+ ++GWGT + G  YW++ N WN SWG  G+FKI RG  +CGI  DVV 
Sbjct: 270 HTSGSVLGGHAISIVGWGT-ESGSPYWLVKNSWNPSWGDGGFFKILRG--DCGINNDVVG 326

Query: 329 GLP 331
           GLP
Sbjct: 327 GLP 329


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 191/347 (55%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 T-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           T         C PY     C H        C +  Y TP+C + C K  N  +   KHY 
Sbjct: 178 TGGSKENHTSCRPY-PFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 125/248 (50%), Positives = 159/248 (64%), Gaps = 19/248 (7%)

Query: 100 DARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGF 157
           D+R  WP C +IS I DQG CGSCWAFGAVEA+SDR CIH    + + +S  DLL+CC  
Sbjct: 1   DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCS- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYP 204
            CG GCDGG+P SAW ++V  G+ T         C PY +   C H      P C     
Sbjct: 60  SCGMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPY-EIPACEHHTTGDRPPCSDIVD 118

Query: 205 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           TPKCV  C K  N  +R+ KH+   +Y I S  + I  EI+KNGPVE +F+VY DF +YK
Sbjct: 119 TPKCVHLCEKGYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYK 178

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           SGVY+H +G+ +GGHA++++GWG  +D   YW+ AN WN  WG  GYFKI RGS+ECGIE
Sbjct: 179 SGVYQHHSGESLGGHAIRVLGWGYEND-VPYWLCANSWNTDWGDKGYFKILRGSDECGIE 237

Query: 324 EDVVAGLP 331
             +VAG+P
Sbjct: 238 SSIVAGIP 245


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 193/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+   +T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       +  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 190/347 (54%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F +    
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT
Sbjct: 120 AVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT 178

Query: 183 EECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
                  + TGC     P C+              Y TP+C + C K  N  +   KHY 
Sbjct: 179 GGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 180/340 (52%), Gaps = 26/340 (7%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L      A  V +    +   L D  I+ V    K  W   RN   S+ T G  + L+GV
Sbjct: 3   LLLLVAIAASVAALTSGEPSFLSDEFIELVRSKAKT-WTVGRNFD-SSVTEGYIRRLMGV 60

Query: 74  KPTPKGLLLGVPVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            P      L    +       +   ++P+ FD+R  WP C TI  I DQG CGSCWAFGA
Sbjct: 61  HPDAHKFALADKREVLGDLYMNTVDQIPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGA 120

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+    
Sbjct: 121 VEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPY 179

Query: 183 ---EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
              + C PY +   C H      P C     TPKC   C     + +   KH+   +Y +
Sbjct: 180 GSNQGCRPY-EIAPCEHHVNGTRPPCGHGGGTPKCSHVCESGYTVDYAKDKHFGSKSYSV 238

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDG 291
             +  DI  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG   ++ 
Sbjct: 239 KRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGVWGEEK 298

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++ N WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 299 IPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLP 338


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 123/253 (48%), Positives = 160/253 (63%), Gaps = 18/253 (7%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +++P SFD+R  WP+C +I+ I DQ  CGSCWAFGAVEA+SDR CI  G   N+ LS  D
Sbjct: 1   VEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----FDSTGCSHPG 198
           LL+CC   CG GC+GG    AW Y+V  G+VT         C+PY        T   +P 
Sbjct: 61  LLSCC-ESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPP 119

Query: 199 C-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           C    Y TP+C + C KK +  +   KH   S+Y + +D + I  EI K GPVE  FTVY
Sbjct: 120 CGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVY 179

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF +YKSG+YKHITG+ +GGHA+++IGWG  +    YW++AN WN  WG +GYF+I RG
Sbjct: 180 EDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKA-PYWLIANSWNEDWGENGYFRIVRG 238

Query: 317 SNECGIEEDVVAG 329
            +EC IE +V AG
Sbjct: 239 RDECSIESEVTAG 251


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/344 (38%), Positives = 191/344 (55%), Gaps = 27/344 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+   +T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLL--LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AG
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 193/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             ++LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARNLLGGRREDPNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/290 (46%), Positives = 172/290 (59%), Gaps = 26/290 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 109
           W   +  QF N  VGQ   LLG K +P    L   +K++D   +++P SF+A++ WP C+
Sbjct: 39  WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 169
           TIS+I +Q  CGSCWAFGA E+ +DR CIH   N+ LS  D++ C      +GC+GG   
Sbjct: 94  TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151

Query: 170 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 221
           SAW +    G V+EEC PY      + P C PA         TP C ++C   + L +  
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            KH     Y  +SD E IM EI  NGPVE  FTV+EDF  YKSGVY H TG  +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           L+G+GT  +G DY+   NQW  SWG +G F IKRG  +CGI +DVVAGLP
Sbjct: 265 LVGFGTL-NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 193/339 (56%), Gaps = 28/339 (8%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           L L        G+V  L   +   Q++I + VN   ++ WKA   P+  + T+ Q K  L
Sbjct: 4   LILAALVAVTAGLVIPLVPKT---QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRL 56

Query: 72  GVKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
                       V V  HD     +P +FDAR+ WP C +I+ I DQ  CGSCWAF A E
Sbjct: 57  MRTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAE 116

Query: 131 ALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---- 184
           A SDRFCI  +  +N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T      
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEA 175

Query: 185 ---CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRI 232
              C PY      ++ G  + P C +  Y TP CV KC  KN    +   KH+  +AY +
Sbjct: 176 QFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAV 235

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
                 I AEI  +GPVE +FTVYEDF  YK+GVY H TG  +GGHA++++GWGT D+G 
Sbjct: 236 GKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGT 294

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 295 PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 191/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 190/347 (54%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F +    
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT
Sbjct: 120 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT 178

Query: 183 EECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
                  + TGC     P C+              Y TP+C + C K  N  +   KHY 
Sbjct: 179 GGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 191/348 (54%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S       +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 119/250 (47%), Positives = 163/250 (65%), Gaps = 18/250 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FDAR  WP C ++  I +Q  CGSCWAFGA E +SDR CI         +S  D+L
Sbjct: 95  LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCEPA----YPT 205
           +CCG  CG GC GGY I A +Y+++ GVVT      ++  GC   S P C+ +    + T
Sbjct: 155 SCCGSTCGKGCQGGYTIEAMKYWMNSGVVT---GGDYNGAGCMPYSFPPCKKSPCVEFST 211

Query: 206 PKCVRKCVKKNQL--WRNSKHYSISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFA 260
           P C   C +K     ++N KH++ SAY++++       I  EIY NGPVE S+ V+EDF 
Sbjct: 212 PSCKTTCQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFY 271

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            YKSGVY H++G+++GGHAVK+IGWGT ++G DYW++AN W  S+G  G+FKI+RG+NEC
Sbjct: 272 QYKSGVYHHVSGNLVGGHAVKIIGWGT-ENGVDYWLVANSWGTSFGEKGFFKIRRGTNEC 330

Query: 321 GIEEDVVAGL 330
            IE ++VAGL
Sbjct: 331 QIESNIVAGL 340


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 190/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V  +       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTKRNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  C S 
Sbjct: 59  DARILLGGRKEDSNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  +V A+SDR CI  G   ++ LS  DL++CC   CG GCDGGY + +W Y+V HG+V
Sbjct: 119 WAVSSVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGYFLPSWDYWVSHGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 190/347 (54%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F +    
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT
Sbjct: 120 AVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT 178

Query: 183 EECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
                  + TGC     P C+              Y TP+C + C K  N  +   KHY 
Sbjct: 179 GGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 134/347 (38%), Positives = 190/347 (54%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F +    
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT
Sbjct: 120 AVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT 178

Query: 183 EECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
                  + TGC     P C+              Y TP+C + C K  N  +   KHY 
Sbjct: 179 GGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC I+ ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGLIKS 342


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 123/261 (47%), Positives = 161/261 (61%), Gaps = 18/261 (6%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
           V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGAVEA++DR CI  G   +
Sbjct: 43  VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQS 102

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDS 191
             LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT   EE    C PY        
Sbjct: 103 AELSALDLISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL 161

Query: 192 TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
           T   +P C    Y TP+C + C K  +  +   KHY    Y + S+ + I  EI   GPV
Sbjct: 162 TKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPV 221

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           E +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG  G
Sbjct: 222 EAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGEKG 280

Query: 310 YFKIKRGSNECGIEEDVVAGL 330
            F+I RG +EC IE  VVAGL
Sbjct: 281 LFRIVRGRDECSIESHVVAGL 301


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 194/327 (59%), Gaps = 30/327 (9%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 90
           H L D ++  +N+   + W+A  N  F N  +   K L G         LG P       
Sbjct: 24  HPLSDELVNFINKQ-NSTWQAGHN--FRNVDMSYLKRLCGS-------FLGGPKLPQRVK 73

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 148
             K + LPKSFDAR  W  C TI  I DQG CGSCWAFGAVE++SDR CIH   ++S+ V
Sbjct: 74  FAKDMNLPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVSVEV 133

Query: 149 N--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 194
           +  DLL CCG  CGDGC+GGYP  AW ++   G+V+         C PY           
Sbjct: 134 SAEDLLTCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNG 193

Query: 195 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TPKC + C    +  ++  KH+  ++Y + ++  +IMAEIYKNGPVE +F
Sbjct: 194 SRPACTGEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLPTNEWEIMAEIYKNGPVEGAF 253

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           +VY DF  YKSGVY+H+TGD+MGGHA++++GWG  ++G  YW++AN WN  WG  G+F+I
Sbjct: 254 SVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWG-EENGVPYWLVANSWNTDWGDGGFFRI 312

Query: 314 KRGSNECGIEEDVVAGLPSSKNLVKEI 340
            RG + CGIE +VVAG+P +    ++I
Sbjct: 313 LRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 138/318 (43%), Positives = 179/318 (56%), Gaps = 27/318 (8%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK-THDKSL-KL 95
           +II EVN    AGW A  N      T+   +  LG            P K  HD  +  +
Sbjct: 40  AIIDEVN-TANAGWTAGENFH-EQTTLEDVRSWLGAWSNKD---YDWPQKYPHDDLVGDI 94

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 153
           P +FD+RS W  CS I +I DQG CGSCWAFGA EA+SDR CI      ++  +  D+L+
Sbjct: 95  PATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLS 154

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCE 200
           CC   CG+GC+GGYP++A  YFV  G+VT       + C PY     C H      P C 
Sbjct: 155 CC-LTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPY-TLEACEHHVPGDRPPCT 212

Query: 201 PAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
               TPKC  +C+     + +++ K +   AY + +D   I  EI   GPVE +FTVY D
Sbjct: 213 EGGGTPKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSD 272

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H +G  +GGHA+K+IGWGT + G+DYW++ N WN  WG  G FKI RGSN
Sbjct: 273 FPSYKSGVYRHTSGSELGGHAIKIIGWGT-EGGDDYWLINNSWNSDWGDKGTFKILRGSN 331

Query: 319 ECGIEEDVVAGLPSSKNL 336
           ECGIE +VVA    +  L
Sbjct: 332 ECGIEGEVVAATVDASTL 349


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 130/330 (39%), Positives = 176/330 (53%), Gaps = 14/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  +    D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR C   G+  L +S   LL+CC   CGDGCDGGYP +AWRY+V HG+ +  C PY  
Sbjct: 127 ISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDGGYPDAAWRYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       ++    +Y +    +D   E
Sbjct: 185 FPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IEYRGNDSYVLLHGEDDFKRE 242

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V+F V+ DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+
Sbjct: 243 LYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWD 301

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+F   RG+NECGIE +  AGLP+
Sbjct: 302 TDWGMNGHFLFLRGNNECGIEFEGYAGLPA 331


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 132/318 (41%), Positives = 177/318 (55%), Gaps = 32/318 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           L D+ I  +N      WKA RN  F    + + + LLGV          + +K      +
Sbjct: 27  LSDAEIFYINHVANTTWKAGRN--FHPAEIKRARALLGVNMAENKAYNRIHLKYKQVQPR 84

Query: 95  --LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 152
             LP +FD R+ WP C++++ I DQ +CGSCWAFG+ EA++DR CI    N+ +S  D+ 
Sbjct: 85  NDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDRICIAGKGNIHISAEDIN 144

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGC 199
            CC   CG GC+GGYP +AW ++V  GVV+       E C PY        +TG   P C
Sbjct: 145 DCCK-SCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQP-C 202

Query: 200 EPAYPTPKCVRKCVK------KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
               PTPKC +KC+        N   R  K Y +         + IM E+  NGPV  +F
Sbjct: 203 PAVVPTPKCEKKCLTGYPKSYSNDKTRGKKSYGVRGV------QSIMQELVDNGPVTAAF 256

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
            VY DF  YK+GVY+H TG   GGHAVK+IG+GT + G+DYW++AN WN  WG  G+FKI
Sbjct: 257 DVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGT-ESGQDYWLVANSWNEDWGDKGFFKI 315

Query: 314 KRGSNECGIEEDVVAGLP 331
            +G +ECGIE  +VAG P
Sbjct: 316 AKGKDECGIESSIVAGDP 333


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 192/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+   +T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       +  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y +      I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 191/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 137/343 (39%), Positives = 188/343 (54%), Gaps = 24/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLL--LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+V
Sbjct: 119 WAVSAVGAISDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIV 177

Query: 182 T-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           T         C PY     C H      P C +  Y TP+C RKC K     + + KHY 
Sbjct: 178 TGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             A  +  +   I  EI   GPVE    ++EDF +YKSG+YK+ TG  +G H V++IGWG
Sbjct: 237 GIAINVIKNELAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG
Sbjct: 297 I-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAG 338


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 132/324 (40%), Positives = 178/324 (54%), Gaps = 27/324 (8%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           LD H L D  I  +NE     WKA +N +  ++   + K   GV P    L        H
Sbjct: 21  LDLHPLSDEYIASINEKATT-WKAGKNFEVDDWERVK-KIAAGVLPRKAALRFVTQNNPH 78

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLS 147
           D+S ++P+SFDAR  WP+C ++ +I DQ  CGSCWAFGAVEA+SDR CIH   +  + +S
Sbjct: 79  DESEEVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVS 138

Query: 148 VNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--- 202
             DL +CC   F CG GCDGGY    W Y+   G+VT     Y  S GC     EP    
Sbjct: 139 AEDLNSCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTG--GAYNSSQGCKDYSLEPCEHH 196

Query: 203 -------------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
                        + TP+CVR C + +  +  S  +        ++ + +  EI KNGP+
Sbjct: 197 VEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFTNEKQ-MQLEILKNGPI 255

Query: 250 EVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           E +FTVY DF  YKSGVY+    D  +GGHA+K++GWG  ++G  YW++AN WN  WG +
Sbjct: 256 EAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGV-EEGTKYWLIANSWNTDWGDN 314

Query: 309 GYFKIKRGSNECGIEEDVVAGLPS 332
           GYFK  RG + CGIE +  A LP+
Sbjct: 315 GYFKFLRGVDHCGIESETAASLPA 338


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 192/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 135/344 (39%), Positives = 182/344 (52%), Gaps = 30/344 (8%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           + L      A  V +    +  +L D  I+ V    K  W   RN   S  T G  + L+
Sbjct: 1   MNLLLLVATAASVAALTSGEPSLLSDEFIEVVRSKAKT-WTVGRNFDAS-VTEGHIRRLM 58

Query: 72  GVKPTPKGLLLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           GV P      L  P K         +   +LP+ FD+R  WP C TI  I DQG CGSCW
Sbjct: 59  GVHPDAHKFAL--PDKREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCW 116

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+
Sbjct: 117 AFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVS 175

Query: 183 -------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
                  + C PY + + C H      P C     TPKC   C     + +   KH+   
Sbjct: 176 GGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSK 234

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT- 287
           +Y +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG  
Sbjct: 235 SYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVW 294

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            ++   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 295 GEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 180/334 (53%), Gaps = 26/334 (7%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           + L  F  +A    S    D+ IL D  ++ VN      W A R  +    T      LL
Sbjct: 9   IALFLFLLYATAGHSFHAEDAPILTDEFLELVNRLNGGKWTAGRTSRTKYLTRRGASRLL 68

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           G       +L   P +  ++ L++P    FDA  AWP+C TI+ I DQ  CGSCWA  A 
Sbjct: 69  GTFLRNTSIL--PPRQFSEEELRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAA 126

Query: 130 EALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
            A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY
Sbjct: 127 SAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPY 185

Query: 189 -FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPE 237
            F S  C+H         C   Y TP C   C  K      +R +  Y      I S  E
Sbjct: 186 PFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSY------ILSGEE 237

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
               E+  NGP EVSF+VY DF  Y  GVYKH+TG  +GGHAV+++GWG   +GE YW +
Sbjct: 238 SFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGEL-NGEPYWKI 296

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AN WN  WG +GYF I RG +ECGIE   VAG+P
Sbjct: 297 ANSWNHEWGMNGYFLIARGVDECGIEGSGVAGIP 330


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 130/267 (48%), Positives = 168/267 (62%), Gaps = 21/267 (7%)

Query: 84  VPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 140
           V V  HD +   +P +FDAR+ WP C +I+ I DQ  CGSCWAF A EA SDRFCI  + 
Sbjct: 69  VEVIKHDIQEDTIPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNG 128

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 189
            +N  LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY     
Sbjct: 129 AVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPC 187

Query: 190 -DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIY 244
            ++ G  + P C +  Y TP CV KC   N    +++ KH+  +AY +      I AEI 
Sbjct: 188 GETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEIL 247

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            +GPVE +FTVYEDF  YKSGVY H TG  +GGHA++++GWGT D+G  YW++AN WN +
Sbjct: 248 AHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVN 306

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 307 WGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 191/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 132/301 (43%), Positives = 179/301 (59%), Gaps = 31/301 (10%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT---HDKSLKLPKSFDARSAWPQ 107
           WKA  N  F N      K L G       LL G  + T   + + ++LPK+FD R  WP 
Sbjct: 40  WKAGHN--FHNVDYSYVKRLCGT------LLKGPKLSTMVQYTEDMELPKNFDPRLQWPN 91

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDG 165
           C T+  + DQG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL+CC   CG GC+G
Sbjct: 92  CPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCES-CGMGCNG 150

Query: 166 GYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAY-PTPKCVRK 211
           GYP +A  ++   G+V+         C PY     C H      P C+     TP+C  +
Sbjct: 151 GYPSAACDFWTKEGLVSGGLYDSHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTPQCTNQ 209

Query: 212 CVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
           C       ++  KH+   +Y + SD ++IM E+YKNGPVE +FTVYEDF  YKSGVY+H+
Sbjct: 210 CEPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHV 269

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           +G  +GGHA+K++GWG  + G  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+
Sbjct: 270 SGSAVGGHAIKVLGWG-EEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGI 328

Query: 331 P 331
           P
Sbjct: 329 P 329


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 134/347 (38%), Positives = 189/347 (54%), Gaps = 24/347 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F +    
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+VT
Sbjct: 120 AVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT 178

Query: 183 EECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
                  + TGC     P C+              Y TP+C + C K  N  +   KHY 
Sbjct: 179 GGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y + S    I  +I  +GP E    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG
Sbjct: 237 GFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 297 V-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/330 (40%), Positives = 182/330 (55%), Gaps = 29/330 (8%)

Query: 23  GVVSKLKLDSH--ILQDSIIKEVNENPKAGWKAARNPQFSNY----TVGQFKHLLGVKPT 76
           G     + D H     ++ +   N      WKA     F N      +   K L G  P 
Sbjct: 11  GAAWSYRFDFHDDYFSEAFVNYHNSRDDVSWKATTE-NFKNVPYKGRMDYVKSLCGANPA 69

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P  +    PVK  +    LP +FDAR+ WP C ++  + DQG CGSCWAFG VEA +DR 
Sbjct: 70  PPEMKF--PVKEIEVPKDLPDTFDARTQWPDCPSLKEVRDQGACGSCWAFGCVEAATDRL 127

Query: 137 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDP 187
           CI     +N  LS  DL +CC   CG+GC+GG+   AW Y    G+VT       + C P
Sbjct: 128 CIQSKGIVNAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLP 186

Query: 188 YFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIM 240
           Y +   C H        C+   PTP+C ++C    N  +   +H++ + + +    E IM
Sbjct: 187 Y-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYSKDEHHAKTVHAVEG-VEQIM 244

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
            EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+K +GWG ++DG+DYW++AN 
Sbjct: 245 TEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWG-NEDGKDYWLVANS 303

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           WN  WG +G+FKI RG +ECGIE ++VAG+
Sbjct: 304 WNPDWGDNGFFKILRGRDECGIESNIVAGM 333


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 190/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRKEDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 128/264 (48%), Positives = 166/264 (62%), Gaps = 20/264 (7%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           VK   +   +P +FDAR+ WP C +I+ I DQ  CGSCWAF A EA SDRFCI  +  +N
Sbjct: 72  VKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVN 131

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DS 191
             LS  D+L+CC   CG GC+GGYPI+AW+Y V  G  T         C PY      ++
Sbjct: 132 TLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGET 190

Query: 192 TG-CSHPGCEP-AYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            G  + P C    Y TP CV KC   N    +++ KH+  +AY +      I AEI  +G
Sbjct: 191 VGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHG 250

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PVE +FTVYEDF  YKSGVY H TG+ +GGHA++++GWGT D+G  YW++AN WN +WG 
Sbjct: 251 PVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGT-DNGTPYWLVANSWNVNWGE 309

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
           +GYF+I RG+NECGIE  VV G+P
Sbjct: 310 NGYFRIIRGTNECGIEHAVVGGVP 333


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 191/348 (54%), Gaps = 27/348 (7%)

Query: 7   IMDPILCL-TCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+ + F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLGAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLL--LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 119/244 (48%), Positives = 153/244 (62%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P +FD+R+ W +C +I  I DQ  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C    +  +   KH+ +SAY +  +   I AEIY NGPVE +F+VYEDF  YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI RG ++CGIE  
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESA 321

Query: 326 VVAG 329
           VVAG
Sbjct: 322 VVAG 325


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 137/348 (39%), Positives = 192/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLG-VKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG  K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRKEDPNLRQRRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  A+ A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAIGAMSDRICIQSGGKQSVKLSAVDLISCCEN-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 180/334 (53%), Gaps = 26/334 (7%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           + L  F  +A    S    D+ IL D  ++ VN      W A R  +  + T      +L
Sbjct: 9   IALFLFLLYATAGHSFHAEDAPILTDEFLEHVNRLNGGKWTAGRTSRTKHLTRRGASRML 68

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           G       +L   P +  ++ L++P    FDA  AWP+C T++ I DQ  CGSCWA  A 
Sbjct: 69  GTFLRNTSIL--PPRQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAA 126

Query: 130 EALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
            A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY
Sbjct: 127 SAISDRYCTLGGVRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPY 185

Query: 189 -FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPE 237
            F S  C+H         C   Y TP C   C  K      +R +  Y +S        E
Sbjct: 186 PFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EE 237

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
               E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +
Sbjct: 238 PFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG-ELNGEPYWKI 296

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AN WNR WG +GYF I RG +ECGIE   VAG P
Sbjct: 297 ANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/334 (41%), Positives = 179/334 (53%), Gaps = 26/334 (7%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           + L  F  +A    S    D+ IL D  ++ VN      W A R  +  + T      LL
Sbjct: 9   IALFLFLLYATAGHSFHAEDAPILTDEFLELVNRLNGGKWTAGRTSRTKHLTRRGASRLL 68

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           G       +L   P +  ++ L+ P    FDA  AWP+C TI+ I DQ  CGSCWA  A 
Sbjct: 69  GTFLRNTSIL--PPRQFSEEELREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAA 126

Query: 130 EALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
            A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+  HG+V+E C PY
Sbjct: 127 SAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPY 185

Query: 189 -FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPE 237
            F S  C+H         C   Y TP C   C  K      +R +  Y +S        E
Sbjct: 186 PFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EE 237

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
               E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++GWG   +GE YW +
Sbjct: 238 SFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG-ELNGEPYWKI 296

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AN WNR WG +GYF I RG +ECGIE   VAG P
Sbjct: 297 ANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 141/348 (40%), Positives = 185/348 (53%), Gaps = 33/348 (9%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M  +L  T     A G   + K+   +L D  I+ V    +  W+A RN  F      ++
Sbjct: 1   MKLLLVATVACLLAMGSCEENKIP--LLSDEFIELVKTKTRT-WQAGRN--FDEGVSEEY 55

Query: 68  -KHLLGVKPTPKGLLLGVPVKTH------DKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
            + L+GV P      L  P K         K   +PK FDAR  WP C TI+ I DQG C
Sbjct: 56  IRGLMGVHPDAYKFAL--PDKQEVLGYLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSC 113

Query: 121 GSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 178
           GSCWAFGAVEA+SDR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y+   
Sbjct: 114 GSCWAFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRK 172

Query: 179 GVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKH 224
           G+V+         C PY +   C H        C     TPKC  +C    N  +   KH
Sbjct: 173 GIVSGGRYGSKTGCRPY-EIAPCEHHVNGTRAPCNHDSKTPKCQHQCEAGYNVEYSKDKH 231

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           +   +Y +  +  DI  EI  NGPVE +FTVYED   YKSGVY+H  G  +GGHA++++G
Sbjct: 232 FGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILG 291

Query: 285 WGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG     E  YW++AN WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 292 WGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLP 339


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 154/356 (43%), Positives = 205/356 (57%), Gaps = 42/356 (11%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           ++ P+ CL    +           DSH+  L D ++  +N+     W+A  N  F N  V
Sbjct: 4   LLSPLCCLLALTSAWS--------DSHLHPLSDELVNFINKQ-NTTWQAGHN--FFNVEV 52

Query: 65  GQFKHLLGVKPTPKGLLLGVP-----VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 119
              K L G         LG P     V+  D  +KLP+SFDAR  WP C TI  I DQG 
Sbjct: 53  SYLKKLCGT-------FLGGPKLPRRVEFAD-DIKLPESFDAREQWPNCPTIKEIRDQGS 104

Query: 120 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDGC+GGYP  AW ++  
Sbjct: 105 CGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTK 164

Query: 178 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 224
            G+V+         C PY           S P C     TP+C + C    +  ++  KH
Sbjct: 165 KGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACTGEGDTPRCSKTCEPGYSPSYKEDKH 224

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y  S+Y ++SD  +I AEIYKNGPVE +FTVY DF  YKSGVY+H TGD+MGGHA++++G
Sbjct: 225 YGYSSYSVSSDENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILG 284

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           WG  ++G  YW++AN WN  WG  G+FKI RG + CGIE ++VAG+P +    ++I
Sbjct: 285 WG-EENGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVAGIPRTDQYWRQI 339


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 191/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 190/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 135/303 (44%), Positives = 170/303 (56%), Gaps = 28/303 (9%)

Query: 40  IKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSF 99
           I EVN     GW A R  +F  +T      L GVK +    L  +PV        +P  F
Sbjct: 31  IYEVNRE-NLGWVAGRQKRFEGHTEEYIAGLCGVKGSIPLPLSDLPVLE-----DIPDMF 84

Query: 100 DARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLC 159
           D+R+ WP C TI  I DQ +CGSCWAFGA E++SDR+CIH  M+L +S  +L+ CC   C
Sbjct: 85  DSRTQWPDCKTIGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHLLISAANLMECCRN-C 143

Query: 160 GDGCDGGYPISAWRYFVHHGVVT-----------EECDPYFDSTGCSH--PGCEPAYP-- 204
           G+GC+GG+  +AW Y+   G+VT           + C PY     C H   G +PA P  
Sbjct: 144 GNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPY-PLPSCEHHINGSKPACPSK 202

Query: 205 ---TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
              TP+CV  C       +    HY  SAY +     +I  EI  NGPVE +FTVY DF 
Sbjct: 203 IAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFP 262

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            YKSGVYK  +   +GGHAVK+IGWG  +DG  YW++AN WN  WG  GYFKI RG +EC
Sbjct: 263 AYKSGVYKRHSLRQLGGHAVKMIGWG-EEDGIPYWLIANSWNSDWGDHGYFKIVRGQDEC 321

Query: 321 GIE 323
           GIE
Sbjct: 322 GIE 324


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 131/338 (38%), Positives = 180/338 (53%), Gaps = 13/338 (3%)

Query: 4   TKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 63
           +K ++   L  T +A   +   +   L +H+   +++  +N   +  W A  N    +  
Sbjct: 3   SKFLIQLFLLSTTYAFVVQENYAPPALTTHLTGKALVDHIN-TAQTSWLAEHNVISDSEM 61

Query: 64  VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             +        P P+     + V        +P +FDAR  WP C +I  I +Q  CGSC
Sbjct: 62  KFKVMDERFADPLPEEESGEILVSGEIVPEPIPDTFDARENWPDCKSIKLIRNQATCGSC 121

Query: 124 WAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFGA E +SDR CI         +SV D+L+CCG  CG GC GGY I A R++  +G V
Sbjct: 122 WAFGAAEVISDRICIQSNGTQQPIISVEDILSCCGTTCGKGCQGGYSIEAMRFWKSNGAV 181

Query: 182 T------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI--- 232
           T        C PY  +     P  E   PT K   +       +   KHY  SAYR+   
Sbjct: 182 TGGDYNGNGCMPYSFAPCQKSPCVESTTPTCKTTCQSSYTTANYTTDKHYGTSAYRLATT 241

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
           N+    I  EIY NGPVE S+ VYEDF  YKSGVY +++G ++GGHAVK+IGWGT +D  
Sbjct: 242 NNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTEND-V 300

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           DYW++AN W   +G  G+FKI+RG+NEC IE +VVAG+
Sbjct: 301 DYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGV 338


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 132/329 (40%), Positives = 181/329 (55%), Gaps = 27/329 (8%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           +S  +   H+L D  I+ V       W   RN   S  +    + L+GV P      L  
Sbjct: 15  LSMFEAKDHLLSDEFIELVRGKANT-WTVGRNFHES-VSEKYIRGLMGVHPDADKFALPD 72

Query: 85  PVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
            ++       D    +P  FDAR  W  C TI  I DQG CGSCWAFGAVEA+SDR CIH
Sbjct: 73  KMEVLGKLVEDSDSDIPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 132

Query: 140 F--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
               +N  LS +DL++CC   CG GC+GG+P +AW Y+   G+V+       + C PY +
Sbjct: 133 SQGKVNFHLSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPY-E 190

Query: 191 STGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 243
              C H      P C     TP+C   C    ++ ++  K++   +Y I ++  DI  EI
Sbjct: 191 IEPCEHHVNGTRPPCSSG-STPRCQHVCESSYKVDYKKDKNFGSKSYSIKNNVLDIQKEI 249

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWN 302
             NGPVE +FTVYED   YKSGVY+H+ G  +GGHA++++GWG   D+   YW++AN WN
Sbjct: 250 MNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWLIANSWN 309

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             WG +G+F+I RG + CGIE  + AGLP
Sbjct: 310 TDWGDNGFFRIVRGKDHCGIESSISAGLP 338


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 196/342 (57%), Gaps = 20/342 (5%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVCIVSFFALLKAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           WAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+V  G+VT
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDYWVKRGIVT 178

Query: 183 ---EE----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
              EE    C PY        T   +P C    Y TP+C + C K  +  +   KHY   
Sbjct: 179 GGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQ 238

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
            Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  
Sbjct: 239 RYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV- 297

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 298 EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 187/343 (54%), Gaps = 24/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG       +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGGKEDAEMKWKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIV 177

Query: 182 T-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           T         C PY     C H      P C +  Y TP+C RKC K     + + KHY 
Sbjct: 178 TGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG
Sbjct: 237 GISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG
Sbjct: 297 I-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAG 338


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 190/315 (60%), Gaps = 28/315 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGA------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE ++VAG+P
Sbjct: 316 QDHCGIESEIVAGMP 330


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 192/348 (55%), Gaps = 27/348 (7%)

Query: 7   IMDPILCL-TCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+ + F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLGAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 190/315 (60%), Gaps = 28/315 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 91
           L D ++  VN+     WKA  N  F N  +   K L G       +L G  +   D    
Sbjct: 26  LSDELVNFVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGT------ILGGPKLPQRDAFAA 76

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
            + LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  
Sbjct: 77  DVVLPESFDARKQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHP 197
           D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P
Sbjct: 137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 196

Query: 198 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY
Sbjct: 197 PCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVY 256

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG
Sbjct: 257 SDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query: 317 SNECGIEEDVVAGLP 331
            + CGIE ++VAG+P
Sbjct: 316 QDHCGIESEIVAGMP 330


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 187/343 (54%), Gaps = 24/343 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG       +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGGKEDAEMKWKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIV 177

Query: 182 T-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           T         C PY     C H      P C +  Y TP+C RKC K     + + KHY 
Sbjct: 178 TGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYG 236

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG
Sbjct: 237 GISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWG 296

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
             ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG
Sbjct: 297 I-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAG 338


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 119/257 (46%), Positives = 156/257 (60%), Gaps = 23/257 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+S+D R  W +C ++  I DQ  CGSCWA  A E +SDR CI  +  +N  +S  DLL
Sbjct: 78  IPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLL 137

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPGC 199
           +CC   CGDGCDGGYP+ AWRY+V  G+V+         C PY  +       G + P C
Sbjct: 138 SCCT-SCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKC 196

Query: 200 EPAY--PTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            PA    TP+C   C  K+     +   KHY +SAY +      I  EI ++GPVE  F 
Sbjct: 197 -PAQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFL 255

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF  YKSG+Y H++G  +GGHAVK++GWG  ++G  YW++AN WN +WG  GYF+I 
Sbjct: 256 VYSDFYRYKSGIYTHVSGQELGGHAVKILGWGV-ENGTKYWLVANSWNINWGEKGYFRIL 314

Query: 315 RGSNECGIEEDVVAGLP 331
           RG NECGIE  VVAG+P
Sbjct: 315 RGRNECGIESAVVAGIP 331


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 130/316 (41%), Positives = 175/316 (55%), Gaps = 23/316 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
             +  ++E N+   + W+AAR  +F        +  LG     + L   +P+K  +++  
Sbjct: 27  FSEKFVEEFNKRYNSTWRAARYQKFEEMDPETLQGHLGAL-IDEPLWAKLPIKNVEQTND 85

Query: 95  -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDL 151
            +P+SFD+R  WP C++I  I DQ  CGSCWAF A E  SDR CI     L  S+S  DL
Sbjct: 86  PIPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSISSEDL 145

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           L CC   CG+GC GGYP +AW+Y    GV T         C PY     C H      P 
Sbjct: 146 LECCA-TCGNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPP-CDHHVVGQYPP 203

Query: 199 CEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           C P  PTPKCV++C  +   + ++   H+    Y++ ++ E I  EI  +GPV+ SF V 
Sbjct: 204 CGPIKPTPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHGPVQASFRVA 263

Query: 257 EDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
            DF  YKSGVY +       GGH+VK+IGWG  + G  YW++AN WN  WG +G FK+ R
Sbjct: 264 SDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGV-EQGTPYWLIANSWNEDWGENGLFKMLR 322

Query: 316 GSNECGIEEDVVAGLP 331
           G NECGIE +VVAGLP
Sbjct: 323 GKNECGIEAEVVAGLP 338


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  233 bits (595), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 138/319 (43%), Positives = 176/319 (55%), Gaps = 31/319 (9%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL-----LGVPVKTH 89
           L D +I  +N+     WKA +N     + + Q   L  VK      L     L +PV+  
Sbjct: 54  LSDEMIWFINK-VNTSWKAGQN----FHHIKQEDRLDHVKIMCGTYLDVPPHLQLPVRDI 108

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           +    LP +FDAR+ W  C TI  I DQG CGSCWAFGAVE++SDR CI      N  +S
Sbjct: 109 EPRKDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHIS 168

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH---- 196
             DL +CC   CG+GC+GG+   AW Y+   G+VT       + C PY     C H    
Sbjct: 169 AEDLTSCC-RSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKACDHHVVG 226

Query: 197 ---PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
              P  +    TP C  +C    N  +   KHY  +AY +    + IM EI  NGPVE +
Sbjct: 227 KLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGA 285

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           FTVY DF  YKSGVYKH TG  +GGHA+K++GWGT + G+DYW++AN WN  WG  G FK
Sbjct: 286 FTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGT-EGGDDYWLVANSWNPDWGNQGTFK 344

Query: 313 IKRGSNECGIEEDVVAGLP 331
           I RG +ECGIE  + AG P
Sbjct: 345 ILRGRDECGIESQIAAGEP 363


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 190/348 (54%), Gaps = 26/348 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ F      V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLKIAVYIVSLFNLLEAHVTTRNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y +      I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 131/314 (41%), Positives = 179/314 (57%), Gaps = 24/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 92
           L D +I  +N++P AGWKA ++ +F  ++V   + LLG +     L       V  HD  
Sbjct: 4   LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLRQKRRPTVDHHDLK 61

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +++P  FD+R  WP+C +IS+I DQ  C S WA  AV A+SDR CI  G   ++ LS  D
Sbjct: 62  VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAID 121

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 197
           L++CC   CG GCDGG+P  AW Y+V HG+VT         C PY     C H      P
Sbjct: 122 LISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKCEHHSKGKYP 179

Query: 198 GC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
            C +  Y TP+C RKC K  +  + + KHY   +  +  +   I  EI   GPVE    +
Sbjct: 180 SCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLI 239

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           +EDF +YKSG+Y++ TG  +G H V++IGWG  ++G  YW+ AN WN  WG  GYF+I R
Sbjct: 240 FEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVR 298

Query: 316 GSNECGIEEDVVAG 329
           G NEC +E  VVAG
Sbjct: 299 GRNECSVESVVVAG 312


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 198/342 (57%), Gaps = 20/342 (5%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVCIVSFFAILKAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           WAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+V  G+VT
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKRGIVT 178

Query: 183 ---EE----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
              EE    C PY        T   +P C    Y TP+C + C K  +  ++  KHY   
Sbjct: 179 GGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQDKHYGDE 238

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  
Sbjct: 239 SYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV- 297

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 298 EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 132/322 (40%), Positives = 181/322 (56%), Gaps = 31/322 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----- 88
           +L D  I E+  +  + W+  RN + S  +    + L+GV P      L  P K      
Sbjct: 22  MLSDEFI-ELVRSKASTWQVGRNFKES-VSEEYIRGLMGVHPDAHKFAL--PEKRIVLGD 77

Query: 89  --HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 144
              D  + +P+ FDAR AWP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N 
Sbjct: 78  LYADDGVDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNF 137

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH- 196
            LS +DL++CC  +CG GC+GG+P +AW Y+   G+V       T+ C PY +   C H 
Sbjct: 138 HLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCEHH 195

Query: 197 -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                P C     TP C  KC     + +   K++   +Y +  +  +I  EI  NGPVE
Sbjct: 196 VNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVE 254

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGADG 309
            +FTVYED   YKSGVY+H  G  +GGHA++++GWG   + +  YW++ N WN  WG +G
Sbjct: 255 GAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGDNG 314

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
           +F+I RG + CGIE  + AGLP
Sbjct: 315 FFRILRGQDHCGIESSISAGLP 336


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 133/299 (44%), Positives = 172/299 (57%), Gaps = 39/299 (13%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK--------LPKSFDAR 102
           WKA  N +F+ Y+      LLGV    K +        H K+L         +P+SFDAR
Sbjct: 33  WKAGMN-KFNLYSDTVKYGLLGVNNRKKSV-------EHKKNLSPIRHSNIFIPESFDAR 84

Query: 103 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCG 160
             WP+C+++  I DQ  CGSCWA  AVEA+SDR CI       + LS +DLL+CC   CG
Sbjct: 85  KNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCG 143

Query: 161 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYP 204
            GC GG P++AW+Y+V  G+VT     Y + +GC     P CE               YP
Sbjct: 144 FGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYP 201

Query: 205 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           TPKC ++C K   + ++  K+Y   AY + +D E I  EI   GPVE SF VY DF HY 
Sbjct: 202 TPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYT 261

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           SG+YKH+ G V GGHAVK++GWG  D G  YW+ AN WN  WG DGYF+I RG++ECG+
Sbjct: 262 SGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNNDWGEDGYFRILRGADECGM 319


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 125/266 (46%), Positives = 160/266 (60%), Gaps = 22/266 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FDAR  WP C+TI  I +Q  CGSCWAFGA E +SDR CI         +SV D+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG GC GGY I A R++   G VT        C PY  S       C P   TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208

Query: 207 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 262
            C   C    K + ++  KHY  SAY++ +     +I  EIY  GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268

Query: 263 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           KSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G  G+FKI+RG+NEC I
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 327

Query: 323 EEDVVAGLPSSKNLVKEITSADMFED 348
           E +VVAG      + K  T ++ +ED
Sbjct: 328 EGNVVAG------IAKLGTHSETYED 347


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 118/261 (45%), Positives = 160/261 (61%), Gaps = 18/261 (6%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
           V  H+ ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGAVEA++DR CI  G   S
Sbjct: 18  VDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQS 77

Query: 146 --LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDS 191
             LS  DL++CC   CG GC GG+P  AW Y+V  G+VT         C PY        
Sbjct: 78  AELSALDLISCCE-DCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH 136

Query: 192 TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
           T   +P C    Y TP+C + C K  +  +   KHY   +Y + ++ + I  +I   GPV
Sbjct: 137 TKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPV 196

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           E +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW++AN WN  WG  G
Sbjct: 197 EAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGEKG 255

Query: 310 YFKIKRGSNECGIEEDVVAGL 330
            F+I RG +EC IE +VVAGL
Sbjct: 256 LFRIVRGRDECSIESNVVAGL 276


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 119/244 (48%), Positives = 147/244 (60%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W +C +I  I DQ  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY     C+   C P   TP
Sbjct: 182 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCTSGNC-PESKTP 239

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C       +   KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSG
Sbjct: 240 SCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 299

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+F+I RG ++CGIE  
Sbjct: 300 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESA 358

Query: 326 VVAG 329
           VVAG
Sbjct: 359 VVAG 362


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 117/245 (47%), Positives = 151/245 (61%), Gaps = 13/245 (5%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FD+R  WP+C +I  I +Q  CGSCWAFGA E +SDR CI      +  +SV D+L
Sbjct: 97  LPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDIL 156

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG GC GGY I A R++   G VT        C PY     C    C     TP
Sbjct: 157 SCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAPCKKDSCAQG-TTP 214

Query: 207 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
            C   C    K   +   KH+  +AY+I +    I  EIY NGPVE SF VYEDF  YKS
Sbjct: 215 SCKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKS 274

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY++ +G ++GGHAVK+IGWGT ++G DYW++AN W  ++G  G+FK++RG+NE GIE 
Sbjct: 275 GVYQYTSGKLVGGHAVKIIGWGT-ENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEG 333

Query: 325 DVVAG 329
           +VVAG
Sbjct: 334 NVVAG 338


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 128/324 (39%), Positives = 182/324 (56%), Gaps = 36/324 (11%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L +++ +   + EVN +P         P F        + ++ +K   + L L V  +  
Sbjct: 38  LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C    PTP C RKC     +++R  K Y   AY +    + I +EI KNGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG  GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318

Query: 313 IKRGSNECGIEEDVVAGLPSSKNL 336
           I RGSN+CGIE  + AG+  +++L
Sbjct: 319 IVRGSNDCGIEGTIAAGIVDTESL 342


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 136/311 (43%), Positives = 172/311 (55%), Gaps = 29/311 (9%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 97
            + KEVN   K  W A       +YT       LG     K L    P K       LP+
Sbjct: 22  EVAKEVNAM-KTTWLANEAIPTRDYT-----QYLGALRGGKQL----PEKNIAIRGDLPE 71

Query: 98  SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 155
           SFD    WP+C ++  I DQ  CGSCWAFGA EA +DR CI     +   LS  DLL CC
Sbjct: 72  SFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCC 131

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPA 202
              CG GC+GG+P  AW +F   GV T       + C+ Y +   C H      P C   
Sbjct: 132 E-SCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAY-EFPKCDHHVEGKYPPCGET 189

Query: 203 YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
            PTP+CV KC +   + ++  KH+   AY + S+ E I  E+  NGP+EV F+VYEDF  
Sbjct: 190 QPTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMT 249

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           YKSG+Y+H+ G  +GGHAVKL+GWG  +DG +YW +AN WN  WG +GYF+I  G NECG
Sbjct: 250 YKSGIYQHVAGKYLGGHAVKLVGWGV-EDGVEYWKIANSWNEDWGENGYFRIIAGKNECG 308

Query: 322 IEEDVVAGLPS 332
           IE D VAG+P 
Sbjct: 309 IESDGVAGIPE 319


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 119/244 (48%), Positives = 148/244 (60%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W +C +I  I +Q  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 86  IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTP 203

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C       +   KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSG
Sbjct: 204 ACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 263

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+FKI RG ++CGIE  
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESA 322

Query: 326 VVAG 329
           VVAG
Sbjct: 323 VVAG 326


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 138/337 (40%), Positives = 194/337 (57%), Gaps = 21/337 (6%)

Query: 12  LCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +C+    T  E  V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L
Sbjct: 6   VCIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARIL 63

Query: 71  LGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGA
Sbjct: 64  MGARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGA 123

Query: 129 VEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE 184
           VEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+V  G+VT   EE
Sbjct: 124 VEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEE 183

Query: 185 ----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRIN 233
               C PY        T   +P C    Y TP+C + C K  +  +   KHY    Y + 
Sbjct: 184 NHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVI 243

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
           S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  + G+ 
Sbjct: 244 SNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKP 302

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 303 YWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 175/330 (53%), Gaps = 15/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G +      L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR C   G+  L +S   L++CC   CGDGCDGGYP ++W Y+V HG+ +  C PY  
Sbjct: 127 ISDRHCTVGGVQQLRISAAHLMSCCE-DCGDGCDGGYPGTSWEYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       K+    +Y ++ + +D   E
Sbjct: 185 FPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRE 241

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+
Sbjct: 242 LYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWD 300

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+    RG+NECGIE    AG P+
Sbjct: 301 TDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 190/348 (54%), Gaps = 27/348 (7%)

Query: 7   IMDPILCL-TCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+ + F      V ++       L D +I  +NE+P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLGAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD  +++P  FD+R  WP+C +IS+I DQ  CGS 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y+V  G+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIV 177

Query: 182 TEECDPYFDSTGCS---HPGC------------EPAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C            +  Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIG 
Sbjct: 236 GGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGC 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 121/258 (46%), Positives = 157/258 (60%), Gaps = 19/258 (7%)

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 146
           +DK   +P+SFDAR+ WP+CS++  I DQ +CGSCWA     ALSDR CI  +    + +
Sbjct: 84  NDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHV 143

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG- 198
           S  D+L+CCG  CG GC+GG+PI A+ YF   G VT         C PY     C H G 
Sbjct: 144 SATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPCGHHGK 202

Query: 199 ------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
                 C     TPKCVRKC     + ++  +     AY + +  + I  EI KNGPV  
Sbjct: 203 DTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVG 262

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG +GYF
Sbjct: 263 AFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWGENGYF 321

Query: 312 KIKRGSNECGIEEDVVAG 329
           +I RGSN CGIEE+VVAG
Sbjct: 322 RILRGSNHCGIEENVVAG 339


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 119/244 (48%), Positives = 148/244 (60%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W +C +I  I +Q  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 86  IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTP 203

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C       +   KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSG
Sbjct: 204 ACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 263

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  SWG  G+FKI RG ++CGIE  
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESA 322

Query: 326 VVAG 329
           VVAG
Sbjct: 323 VVAG 326


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 133/344 (38%), Positives = 186/344 (54%), Gaps = 27/344 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  C S 
Sbjct: 59  DARILLGGRKEDPNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y+V HG+V
Sbjct: 119 WAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGVTGYSWDYWVKHGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYSVIGVESAIQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           G  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG
Sbjct: 296 GV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAG 338


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 136/342 (39%), Positives = 196/342 (57%), Gaps = 20/342 (5%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVCIVSFFALLKAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           WAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+V  G+VT
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKRGIVT 178

Query: 183 ---EE----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
              EE    C PY        T   +P C    Y TP+C + C K  +  +   KHY   
Sbjct: 179 GGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQ 238

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
            Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA+++IGWG  
Sbjct: 239 RYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV- 297

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 298 EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 133/344 (38%), Positives = 186/344 (54%), Gaps = 27/344 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMILFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  C S 
Sbjct: 59  DARILLGGRREDPNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y+V HG+V
Sbjct: 119 WAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GGFSYSVIGVESAIQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           G  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG
Sbjct: 296 GV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAG 338


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 130/266 (48%), Positives = 154/266 (57%), Gaps = 23/266 (8%)

Query: 85  PVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-- 140
           P  TH    +++LPK+FDAR+ WP C +IS I DQ  CGSCWAFGAVEA+SDR CIH   
Sbjct: 74  PTVTHVGFDAMRLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNG 133

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HP 197
             N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC     P
Sbjct: 134 AFNKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDYWGAHGIVTGGSKE--DPSGCRSYPFP 190

Query: 198 GCE------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
            CE              YPTP+CV+ C      +   K  +  +Y I S    IM EI  
Sbjct: 191 KCEHHVQGHYPPCPHQYYPTPECVQHCDTPGIDYVKDKTRANMSYNIYSSEILIMKEIML 250

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
            GPVE  FTVYEDF  YK GVY H  G  +  HA++++GWG   D   YW++AN WN  W
Sbjct: 251 RGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGD-VPYWLIANSWNEDW 309

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLP 331
           G  GY K  RG NECGIE+DV AGLP
Sbjct: 310 GEKGYMKFLRGLNECGIEDDVTAGLP 335


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 102/134 (76%), Positives = 120/134 (89%)

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
           +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1   KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           ITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG
Sbjct: 61  ITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAG 120

Query: 330 LPSSKNLVKEITSA 343
           +PS+KN+V+   SA
Sbjct: 121 MPSTKNMVRNYDSA 134


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 118/244 (48%), Positives = 149/244 (61%), Gaps = 12/244 (4%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W +C +I  I +Q  CGSCWAFGA E +SDR CI         +S +DLL
Sbjct: 86  IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG+GC+GGYPI A R++   GVVT        C PY  +  C+   C P   TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 203

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C    +  +   KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSG
Sbjct: 204 ACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSG 263

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH  G  +GGHA+K+IGWGT + G  YW++AN W  +WG  G+FKI RG ++CGIE  
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGA 322

Query: 326 VVAG 329
           VVAG
Sbjct: 323 VVAG 326


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 136/342 (39%), Positives = 196/342 (57%), Gaps = 20/342 (5%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           ++   +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++ 
Sbjct: 1   MLKIAVCIVSFFALLKAHVTTRNNQRIEPLSDEMILFINEHPDAGWKADKSDRF--HSLD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + L+G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSC
Sbjct: 59  DARILMGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSC 118

Query: 124 WAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           WAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+V  G+VT
Sbjct: 119 WAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYWVKRGIVT 178

Query: 183 ---EE----CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
              EE    C PY        T   +P C    Y TP+C + C K  +  +   KHY   
Sbjct: 179 GGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQ 238

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
            Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA+++IGWG  
Sbjct: 239 RYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV- 297

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 298 EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 115/264 (43%), Positives = 159/264 (60%), Gaps = 20/264 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+       + C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C    PTP C RKC     +++R  K Y   AY +    + I +EI +NGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG  GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318

Query: 313 IKRGSNECGIEEDVVAGLPSSKNL 336
           I RG+N+CGIE  + AG+  +++L
Sbjct: 319 IIRGTNDCGIEGTIAAGIVDTESL 342


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 133/347 (38%), Positives = 187/347 (53%), Gaps = 26/347 (7%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++  +L     A F +   + L+    +    ++  +N+  K  + A  +P+F+N    
Sbjct: 4   VVLFAVLGTAASAAFLQHTENVLREAEQLSGSDLVNYINKAQKL-FTAKLSPRFANLPRD 62

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
               L+G K         +  KTH+   +  +PKSFDAR+ WP+C+++  + DQ  CGS 
Sbjct: 63  IKHRLMGSKYVALPAKYRMNEKTHNDIDNSTIPKSFDARTNWPKCASLRTVRDQSACGSG 122

Query: 124 WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+ DR CI       + LS +D+L+CC   CG GC+GG    AW Y+   G+V
Sbjct: 123 WAVAAVGAIMDRICIASEGKQQVILSADDILSCCT-ECGYGCEGGDTYKAWNYWTTDGIV 181

Query: 182 TEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL-WRNSKH 224
           T     Y   +GC    +P CE               YPT  C  KC     + +   KH
Sbjct: 182 TGS--NYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKH 239

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y    Y +  D   I  EI  +GPVEV+F VYEDF HY SG+YKH+ G+ +G HAVK++G
Sbjct: 240 YGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLG 299

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           WGT ++G DYWI AN WN  WG +G+F+I RG NECGIE +VVAG P
Sbjct: 300 WGT-ENGVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAGKP 345


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 187/348 (53%), Gaps = 27/348 (7%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F  ++V 
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMILFINKHPNAGWKADKSDRF--HSVD 58

Query: 66  QFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
             + LLG +     L       V  HD ++++P  FD+R  WP+C +IS+I DQ  C S 
Sbjct: 59  DARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASS 118

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y+V HG+V
Sbjct: 119 WAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIV 177

Query: 182 TEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHY 225
           T       + TGC     P C+              Y TP+C + C K  N  +   KHY
Sbjct: 178 TGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHY 235

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
              +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGW
Sbjct: 236 GEFSYNVIGVESVIQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGW 295

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           G  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG   S
Sbjct: 296 GV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 179/342 (52%), Gaps = 40/342 (11%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L      A  V +    +  +L D  I EV  N  A           + T G  + L+GV
Sbjct: 3   LLLLVATAASVAALTSGEPSLLSDEFI-EVGRNFDA-----------SVTEGHIRRLMGV 50

Query: 74  KPTPKGLLLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            P      L  P K         +   +LP+ FD+R  WP C TI  I DQG CGSCWAF
Sbjct: 51  HPDAHKFAL--PDKREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAF 108

Query: 127 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 182
           GAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y+   G+V+  
Sbjct: 109 GAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGG 167

Query: 183 -----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAY 230
                + C PY + + C H      P C     TPKC   C     + +   KH+   +Y
Sbjct: 168 PYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSY 226

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SD 289
            +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA++++GWG   +
Sbjct: 227 SVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGE 286

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 287 EKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 328


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 129/315 (40%), Positives = 180/315 (57%), Gaps = 26/315 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLP 96
           +I  +N++P AGWKA ++ +F  ++V   + LLG +     L       V  HD ++++P
Sbjct: 1   MISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLNVEIP 58

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 154
             FD+R  WP+C +IS+I DQ  CGS WA  AV A+SDR CI  G   ++ LS  DL++C
Sbjct: 59  SHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISC 118

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----------- 200
           C + CG GCDGG+   +W Y+V  G+VT       + TGC     P C+           
Sbjct: 119 CKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACG 175

Query: 201 -PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  +GPVE    +YED
Sbjct: 176 DKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYED 235

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG N
Sbjct: 236 FLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRN 294

Query: 319 ECGIEEDVVAGLPSS 333
           EC IE ++ AGL  S
Sbjct: 295 ECLIESEIAAGLIKS 309


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 121/253 (47%), Positives = 158/253 (62%), Gaps = 16/253 (6%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 150
           ++LP +FD+R  WP C++I  I DQ +CGSCWAF A E +SDR CI         +S  D
Sbjct: 83  IQLPTAFDSRVQWPNCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPED 142

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYP 204
           +L+CCG  C +GC GGY I A +Y+++ GVVT        C PY     CS   C+    
Sbjct: 143 ILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCST--CKEPKD 199

Query: 205 TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
            P C   C    K    +R     S +A   N+  + I  EIY NGPVEV++ VY+DF H
Sbjct: 200 APSCKTTCQASYKAKSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVAYQVYDDFYH 258

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           YKSGVY H+ GD   GHAVK+IGWGT +   DYW++AN W+ ++G +G+FKI+RG+NECG
Sbjct: 259 YKSGVYYHVYGDKPSGHAVKIIGWGT-EKKVDYWLVANSWSTTFGENGFFKIRRGTNECG 317

Query: 322 IEEDVVAGLPSSK 334
           IEE+VVAGLP SK
Sbjct: 318 IEENVVAGLPKSK 330


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 113/252 (44%), Positives = 156/252 (61%), Gaps = 20/252 (7%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           P SFDAR+ WP+C +I  I DQ  CGSCWA  + EA+SD  C+     + + +S +D+L+
Sbjct: 90  PASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILS 149

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--- 203
           CCG  CG GC GG+PI A+++    GVVT       + C PY     C H   +P Y   
Sbjct: 150 CCGISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPY-AFYPCGHHQNDPYYGPC 208

Query: 204 -----PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
                PTPKC + C +K N+ ++  KH++  AY + ++  +I  EIYKNGPV  +F VY+
Sbjct: 209 PGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQ 268

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF++YK G+Y H  G   G HAVK++GWG  ++  DYW++AN WN  WG  GYF+I RG+
Sbjct: 269 DFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWLIANSWNTDWGESGYFRIVRGT 327

Query: 318 NECGIEEDVVAG 329
           NECGIE  +V G
Sbjct: 328 NECGIEAQMVGG 339


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 118/248 (47%), Positives = 150/248 (60%), Gaps = 21/248 (8%)

Query: 102 RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLC 159
           RS WPQC TIS I DQ  CGSCWA  A  A+SDR CIH    M   L+  D L+CC + C
Sbjct: 1   RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-C 59

Query: 160 GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-------C-EPAYP 204
           G GC GGYP  AW Y++  G+VT         C P+   T C H G       C    YP
Sbjct: 60  GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYP 118

Query: 205 TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
           TP C R C    N+ +   K Y  S+Y +      IM EI KNGPVEV+F +++DF  Y+
Sbjct: 119 TPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYR 178

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           SG+Y H+ G  +G HAV++IGWG  ++G +YW++AN WN  WG +GYF++ RG NECGIE
Sbjct: 179 SGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIE 237

Query: 324 EDVVAGLP 331
            +VVAG+P
Sbjct: 238 SEVVAGMP 245


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 127/330 (38%), Positives = 174/330 (52%), Gaps = 15/330 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + + L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G +      L  V         +LP+SFD+   WP C TI  I DQ  CGSCWA     A
Sbjct: 67  GARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASA 126

Query: 132 LSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR C   G+  L +S   L++CC   CG GCDGGYP ++W Y+V HG+ +  C PY  
Sbjct: 127 ISDRHCTVGGVQQLRISAAHLMSCCE-DCGYGCDGGYPGTSWEYYVSHGLASSYCQPY-P 184

Query: 191 STGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +   P        TPKC   C  K       K+    +Y ++ + +D   E
Sbjct: 185 FPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRE 241

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP  V F VY DF  YK+GVY+H++GD +GGHAV+++GWG   +G  YW +AN W+
Sbjct: 242 LYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWD 300

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG +G+    RG+NECGIE    AG P+
Sbjct: 301 TDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 177/340 (52%), Gaps = 42/340 (12%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  +LC+    T  E  +S        L   II  +N                      
Sbjct: 1   MLISVLCIASLITHLEAHISIKNEKFEPLSHDIISYIN---------------------- 38

Query: 67  FKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            KHL   +          P+  H D ++++P +FD+R  WP C +I+ I DQ  CGS WA
Sbjct: 39  -KHLDARREESDLRRKRRPIVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWA 97

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 182
           FGAVEA+SDR CI  G   N+ LS  DLL+CC   CGDG +GG+P  AW Y+V  G+VT 
Sbjct: 98  FGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEH-CGDGFEGGFPALAWDYWVKEGIVTG 156

Query: 183 ------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 229
                   C PY        T   +P C E  Y TP C   C K  +  +   KH   S 
Sbjct: 157 SSKENHTSCQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSR 216

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y + +D + I  EI K GPVE +F VYEDF +YKSG+YKHITG ++  HA+++IGWG  +
Sbjct: 217 YNVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGV-E 275

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +   YW++ N WN  WG +G F+I RG +EC IE +V AG
Sbjct: 276 NNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEVTAG 315


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 131/327 (40%), Positives = 182/327 (55%), Gaps = 27/327 (8%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVK 74
           F   A   VS+ ++D  I     I  +N+  ++ W A RN     +N  + +    LG+ 
Sbjct: 6   FLLLASISVSRAEID--IQSQDFIDSINQK-QSHWVARRNFPENTTNEYLYKLNGFLGLH 62

Query: 75  PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 134
           P P    +   +K +     +PK+FDAR  WP+C +++RI DQG CGSCWAF AVE +SD
Sbjct: 63  PDPN--YMPEKIKHNFNPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCWAFAAVETMSD 120

Query: 135 RFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EEC 185
           R CIH         S  DLL+CC   CG  C GGY ++A+ +++  GVV+       E C
Sbjct: 121 RICIHSSGAKKFFFSAEDLLSCCT-ACGS-CSGGYMMAAFDFYIKQGVVSGGDLNSNEGC 178

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 244
            PY   T  +H        TP C + C K     + + KHY    Y +++   +I  EI 
Sbjct: 179 RPY---TADAHDKG----VTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVSNIQYEIM 231

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGP+ VSF VY+DF +Y SGVY H++G+  G H VK++GWGT  + +DYW++AN W  S
Sbjct: 232 TNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKE-QDYWLIANSWGSS 290

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG  G+FKI RG NECGIE +  A LP
Sbjct: 291 WGEHGFFKILRGKNECGIENNPYAVLP 317


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  227 bits (579), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 125/290 (43%), Positives = 170/290 (58%), Gaps = 25/290 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQC 108
           W A RN  F  +T   F H+  ++   +   + V   THD  L   LP+ FD R  WP+C
Sbjct: 1   WSAGRN--FPTHT--SFAHIKILREHERRYYMEVAYVTHDVELIATLPEIFDPRDKWPEC 56

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGG 166
            T++ I DQG CGSCWAFGAVEA++DR CI+     +   S  DL++CC  +CG GC+GG
Sbjct: 57  LTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGG 115

Query: 167 YPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCV 213
            P  AW Y+ H G+V+       + C PY +   C H  PG    C     TPKC + C 
Sbjct: 116 MPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 174

Query: 214 KK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
              N  ++  K Y    Y ++   + I AE++KNGPVE +FTVY D   YK+GVYKH  G
Sbjct: 175 SSYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEG 234

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           + +GGHA+K+IGWG  ++ + YW++AN WN  WG +G+FKI RG + CGI
Sbjct: 235 NALGGHAIKIIGWGVENNNK-YWLIANSWNSDWGDNGFFKILRGEDHCGI 283


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 133/336 (39%), Positives = 184/336 (54%), Gaps = 25/336 (7%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDS-HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           M  +L L     FA G+ S L  +  H L D  I ++N + ++ WKA RN     Y +  
Sbjct: 1   MKSVLMLV----FALGLSSALPSNKPHPLSDEYIAQIN-SKQSTWKAGRNFAIDEYEL-- 53

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTI-SRILDQGHCGSCWA 125
           FK L      P+GL     +   + + ++P+SFD+R+AWP+C+ I   I DQ  CGSCWA
Sbjct: 54  FKSLASGVKKPQGLKTAQKL-VREITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWA 112

Query: 126 FGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH------ 177
           F AVEA+SDR CIH      L +S  DLL C       GC+GG+P  AW  + +      
Sbjct: 113 FAAVEAMSDRICIHSNATKKLLVSSQDLLTCG---TAGGCNGGWPAVAWSDWTNGIVTGG 169

Query: 178 -HGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
            +G + + C  YF      HP  C     TP CV +C + +  ++  + Y  + Y I  +
Sbjct: 170 LYGALEQGCKSYFLEGCDDHPNKCRNYVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGE 229

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            E I  EI  NGPVE +  VY DFA Y+SG+Y+  T +  GGHAVK++GWG  +DG  YW
Sbjct: 230 -EQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGV-EDGVKYW 287

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN WN  WG +G F+I RG +E GIE  + A LP
Sbjct: 288 LVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 127/338 (37%), Positives = 174/338 (51%), Gaps = 17/338 (5%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     WKA  + +  N T  + K
Sbjct: 5   VVVLSSFAATLVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYDGKMQNLTFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L G        L  V         +LP+SFDA   WP C TI  I DQ  C + WA   
Sbjct: 65  RLTGAFSRKTSSLPPVRFTEEQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVAT 124

Query: 129 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             A+SDR+C +  G  L +S  DL+ACC   CG GC+GGYP +AW Y+V HG+ + +C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGITSSQCQP 183

Query: 188 YFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           Y     C H G +   P        TP+C   C  K+      K+    +Y +  + ED 
Sbjct: 184 Y-PFPRCEHRGAQGKKPPCSKYKFVTPQCNATCTDKSVPL--IKYRGNHSYEVRGE-EDY 239

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP  V F V+ DF  YKSGVY+H+ G+ +GG AV+++GWG   +G  YW +AN
Sbjct: 240 KRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYWKVAN 298

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            W+  WG +GYF I RG NEC IE    AG P    L 
Sbjct: 299 SWDTDWGMNGYFLILRGDNECNIEHLGFAGTPDPSQLA 336


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/292 (43%), Positives = 172/292 (58%), Gaps = 26/292 (8%)

Query: 51  WKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQ 107
           W+A RN  F  +T     K L+G        +L +P  THD  L   LP++FD R  WP 
Sbjct: 1   WRAGRN--FPIHTPFAHIKKLMGSLKDDN--ILKLPKVTHDADLIASLPENFDPRDKWPD 56

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDG 165
           C T++ I DQG CGSCWAFGAVEA++DR CI+     +   S  DL++CC  +CG GC+G
Sbjct: 57  CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNG 115

Query: 166 GYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKC 212
           G P  AW Y+ H G+V+       + C PY +   C H  PG    C     TPKC + C
Sbjct: 116 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCEKTC 174

Query: 213 VKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
                + ++  K Y    Y ++   ++I AE++KNGPVE +FTVY D   YKSGVY+H  
Sbjct: 175 ESSYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTH 234

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           G+ +GGHA+K++GWG  ++G  YW++AN WN  WG +G+ KI RG + CGIE
Sbjct: 235 GNALGGHAIKILGWGV-ENGSKYWLIANSWNSDWGDNGFLKILRGEDHCGIE 285


>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 166

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 103/126 (81%), Positives = 111/126 (88%)

Query: 74  KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
           K TP+  L  +PV TH KSL LPK FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LS
Sbjct: 41  KQTPRNELSSIPVVTHPKSLNLPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLS 100

Query: 134 DRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG 193
           DRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD  G
Sbjct: 101 DRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIG 160

Query: 194 CSHPGC 199
           CSHPGC
Sbjct: 161 CSHPGC 166


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 181/324 (55%), Gaps = 36/324 (11%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L S++ +   + EVN +P         P F        + ++ +K   + L L V  +  
Sbjct: 38  LVSYLRRSQSLFEVNSDP--------TPNFE-------QKIMDIKYNHQRLNLMVK-EDP 81

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D  + +P S+D R  W  C+T   I DQ +CGSCWA     A+SDR CI       +++S
Sbjct: 82  DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPG-- 198
             D++ CC   CGDGC+GG+PI AW+YF++ GVV+         C PY     C H G  
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPY-PIHPCGHHGND 199

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C    PTP C ++C     +++R  K Y   AY +    + I +EI +NGPV  S
Sbjct: 200 TYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           F VYEDF HYKSG+YKH  G++ G HAVK+IGWG +++  D+W++AN W+  WG  GYF+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWGEKGYFR 318

Query: 313 IKRGSNECGIEEDVVAGLPSSKNL 336
           I RG+N+CGIE  + AG+  +++L
Sbjct: 319 IIRGTNDCGIEGTIAAGIVDTESL 342


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 128/293 (43%), Positives = 165/293 (56%), Gaps = 23/293 (7%)

Query: 51  WKAARNPQFSN-YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W    NP   N Y  G  +  L     P G+L+   VK H   + LP+ FDAR  WP+C+
Sbjct: 50  WTPGANPLPPNLYRTGAKREDLEKHRLPLGILV---VKDH---IVLPERFDARDRWPECT 103

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY 167
           ++ +I +QG CGSCWA  A E  +DR+CIH       S    DLL+CC   CGDGC GG 
Sbjct: 104 SLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLLSCC-HSCGDGCQGGN 162

Query: 168 PISAWRYFVHHGVVTEECDPYFDSTGCSHP-------GCEPAYPTPKCVRKCVKKNQLWR 220
              AW+++V  GV +    PY    GC HP         +    TPKC RKC     +  
Sbjct: 163 LGPAWQFWVQRGVSSG--GPYNSRQGC-HPYPVDVCHSADEDADTPKCTRKCQSMYNVTN 219

Query: 221 NS--KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
            S  + +   AY ++ D E I  EI++NGPV+ SF VY DF  YK+GVY+H+ G + GGH
Sbjct: 220 VSDDRRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGH 279

Query: 279 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           AVK+IGWG  ++G  YW+ +N W   WG  G+FKI RG N CGIE DV AGLP
Sbjct: 280 AVKMIGWGV-ENGTKYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHAGLP 331


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/253 (47%), Positives = 156/253 (61%), Gaps = 21/253 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+SFDAR+ WP+CS++  I DQ +CGSCWA     ALSDR CI  +    + +S  D+L
Sbjct: 2   IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 198
           +CCG  CG GC+GG+PI A+ YF   G VT         C PY F    C H G      
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119

Query: 199 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKCVRKC     + ++  +     AY + +  + I  EI KNGPV  +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVY 179

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF++YK G+YKH  G   GGHA+K+IGWG  ++G  YW++AN W+  WG +GYF+I RG
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KENGVPYWLIANSWHNDWGENGYFRILRG 238

Query: 317 SNECGIEEDVVAG 329
           SN CGIEE+VVAG
Sbjct: 239 SNHCGIEENVVAG 251


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/252 (49%), Positives = 167/252 (66%), Gaps = 16/252 (6%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 200
            CCG  CGDGC+GG P  AW ++   G+V+         C PY           S P C 
Sbjct: 61  TCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 201 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
               TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDF 180

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             YKSGVY+H++G++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + 
Sbjct: 181 LLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDH 239

Query: 320 CGIEEDVVAGLP 331
           CGIE ++VAG+P
Sbjct: 240 CGIESEIVAGMP 251


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 123/252 (48%), Positives = 167/252 (66%), Gaps = 16/252 (6%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP+SFDAR  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 200
            CCG  CGDGC+GG+P  AW ++   G+V+         C PY           S P C 
Sbjct: 61  TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 201 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
               TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDF 180

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + 
Sbjct: 181 LLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDH 239

Query: 320 CGIEEDVVAGLP 331
           CGIE ++VAG+P
Sbjct: 240 CGIESEIVAGMP 251


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 120/253 (47%), Positives = 155/253 (61%), Gaps = 21/253 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+SFDAR+ WP+CS++  I DQ +CGSCWA     ALSDR CI  +    + +S  D+L
Sbjct: 2   IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 198
           +CCG  CG GC+GG+PI A+ YF   G VT         C PY F    C H G      
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119

Query: 199 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKCVRKC     + ++  +     AY + +  + I  EI KNGPV  +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVY 179

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG +GYF+I RG
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILRG 238

Query: 317 SNECGIEEDVVAG 329
           SN CGIEE+VVAG
Sbjct: 239 SNHCGIEENVVAG 251


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 130/288 (45%), Positives = 166/288 (57%), Gaps = 29/288 (10%)

Query: 48  KAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHDKS-LKLPKSFDARSAW 105
           +AGW       F   ++   K L G +   P   LL +PVK HD + +++PKSFDAR  W
Sbjct: 1   QAGWN-----DFGEASMSDLKVLCGTILDDPD--LLNLPVKQHDLTDMEIPKSFDARMEW 53

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGC 163
             C    +I DQGHCGSCWAF + E LSDR CI      N+ LS  DLL+C     G GC
Sbjct: 54  STCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLLSC--DKAGRGC 111

Query: 164 -DGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 221
            DGG    AWRY    GVV   C PY   +TG            P+C+ KC  +   ++ 
Sbjct: 112 SDGGRLSEAWRYMQKKGVVANRCKPYTSGATGF----------IPECMSKCTGEGHAYQ- 160

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            K Y +  Y ++ + + I  EI  NGPVE +FTVY D  HYKSGVY H +G  +GGHAVK
Sbjct: 161 -KFYGLYLYTVSGENQ-IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVK 218

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           ++GWG  D+ E+YW++AN W   WG  G+FKIKRGS+ECGIE  V+ G
Sbjct: 219 VLGWGVEDE-EEYWLVANSWGPDWGDQGFFKIKRGSDECGIESRVLTG 265


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 131/345 (37%), Positives = 181/345 (52%), Gaps = 34/345 (9%)

Query: 7   IMDPILCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +++   C+    T  E  V ++       L D +I  +N++P AGWKA ++ +F +    
Sbjct: 1   MLNIAFCIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINKHPNAGWKADKSDRFHSVDDA 60

Query: 66  QFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           +F  L G K  P       P V  HD ++++P  FD+R  WP+C +IS+I DQ  CGS W
Sbjct: 61  RFL-LGGRKEDPNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSW 119

Query: 125 AFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
           A  AV A+SDR CI  G   S             CG GCDGG+   +W Y+V  G+VT  
Sbjct: 120 AVSAVGAISDRICIQSGGKQSY------------CGSGCDGGFLGPSWDYWVLRGIVTGG 167

Query: 185 CDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSIS 228
                + TGC     P C+              Y TP+C + C K  N  +   KHY   
Sbjct: 168 SKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGF 225

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV+LIGWG  
Sbjct: 226 SYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV- 284

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 285 ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 329


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 177/315 (56%), Gaps = 26/315 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLP 96
           +I  +N++P AGWKA ++ +F  ++V   + LLG +     L       V  HD ++++P
Sbjct: 1   MISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLNVEIP 58

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 154
             FD+R  WP+C +IS+I DQ  C S WA  AV A+SDR CI  G   ++ LS  DL++C
Sbjct: 59  SHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISC 118

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----------- 200
           C   CG GCDGG    +W Y+V HG+VT       + TGC     P C+           
Sbjct: 119 CKN-CGSGCDGGVTGYSWDYWVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACG 175

Query: 201 -PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
              Y TP+C + C K  N  +   KHY   +Y + S    I  +I  +G VE    +YED
Sbjct: 176 DKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYED 235

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F +YKSG+Y++ TG  + GHAV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG N
Sbjct: 236 FLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRN 294

Query: 319 ECGIEEDVVAGLPSS 333
           EC IE ++ AGL  S
Sbjct: 295 ECLIESEIAAGLIKS 309


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 121/283 (42%), Positives = 168/283 (59%), Gaps = 47/283 (16%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           + + +P SFDAR  WP C +I  I +Q +CG+CWAFGA E +SDR CI  G      +SV
Sbjct: 72  QGVYVPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISV 131

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHP---GCEPA 202
            D+L+CCG  CG+GC GGYP+   +++++ GVVT      ++ TGC   + P    CE +
Sbjct: 132 EDILSCCGSSCGEGCKGGYPLEGLKFWMNSGVVT---GGDYNGTGCQPYTFPPCSSCEAS 188

Query: 203 YPTPKCVRKC--------VKKNQLWRNSKH---------YSI--------SAYRINSDPE 237
             TP C +KC         K ++ + N +          Y +        SAYR+++   
Sbjct: 189 KSTPSCQKKCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTS 248

Query: 238 D----------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
                      I  EIY NGPVEVS+ V+EDF  YKSGVY +++G + G HAVK+IGWGT
Sbjct: 249 SNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGT 308

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            ++  DYW++AN W   +G  G+FKI+RG+NECGIEE+VVAGL
Sbjct: 309 -ENKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGL 350


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 117/252 (46%), Positives = 152/252 (60%), Gaps = 18/252 (7%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
           K+P SFDAR  WP C +IS I DQ  CGSCWAF + E +SDR CI  H    + LS +D+
Sbjct: 65  KIPDSFDARVTWPHCPSISYIRDQSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDI 124

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPG 198
           L+CC    G GCDGG+P+SAW+YFV  GVVT       + C PY             +  
Sbjct: 125 LSCC-TDGGYGCDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSN 183

Query: 199 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TP C   C     + + + K Y  +AY +++    I  EI   GPV  +FTVY+
Sbjct: 184 CTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYD 243

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF HYK+G+YKH++G   GGHAV+++GWG    G  YW++AN WN  WG +GYF+I RGS
Sbjct: 244 DFFHYKTGIYKHVSGAEAGGHAVRILGWG-QQGGVPYWLVANSWNTDWGENGYFRILRGS 302

Query: 318 NECGIEEDVVAG 329
           +ECGIE+ VVAG
Sbjct: 303 DECGIEDGVVAG 314


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 113/246 (45%), Positives = 148/246 (60%), Gaps = 14/246 (5%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W  C++I  I DQ  CGSCWAF   E +SDR CI        ++S  D+L
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           ACCG  CGDGC GGYPI A+R++   GVVT        C PY  +   S P       TP
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTP 196

Query: 207 KCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C    +  +   K + +SAY +  +   I  EI  NGPV  +FT+YED   YKSG
Sbjct: 197 TCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSG 256

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY+H  G ++GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++RG NECGIE  
Sbjct: 257 VYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERA 315

Query: 326 VVAGLP 331
           VVAG+P
Sbjct: 316 VVAGMP 321


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 120/268 (44%), Positives = 160/268 (59%), Gaps = 28/268 (10%)

Query: 82  LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
           L   +  + + ++LP+SFDAR  W QC +++ I +QG CGSCWA  A  A++DR+CI   
Sbjct: 74  LAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSK 133

Query: 142 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 199
                S    D+LACC   CGDGC GGY   AW+++V  GV +    PY    GC HP  
Sbjct: 134 GKEQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP-- 187

Query: 200 EPAYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
              YP            TPKC ++C        +W++ + Y   AY I +D + IM EIY
Sbjct: 188 ---YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIY 243

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPV+ +F  Y+D   YKSGVY+H+ G + GGHAVKL+GWG  ++G  YW++AN W   
Sbjct: 244 INGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDD 302

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPS 332
           WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 303 WGDNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 120/268 (44%), Positives = 160/268 (59%), Gaps = 28/268 (10%)

Query: 82  LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
           L   +  + + ++LP+SFDAR  W QC +++ I +QG CGSCWA  A  A++DR+CI   
Sbjct: 74  LAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSK 133

Query: 142 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 199
                S    D+LACC   CGDGC GGY   AW+++V  GV +    PY    GC HP  
Sbjct: 134 GKEQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP-- 187

Query: 200 EPAYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
              YP            TPKC ++C        +W++ + Y   AY I +D + IM EIY
Sbjct: 188 ---YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIY 243

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPV+ +F  Y+D   YKSGVY+H+ G + GGHAVKL+GWG  ++G  YW++AN W   
Sbjct: 244 INGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDD 302

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPS 332
           WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 303 WGDNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 132/325 (40%), Positives = 176/325 (54%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINANAKT-WKAGANFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73

Query: 89  HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD++      ++P +FDAR  W +CST+ ++ DQG+CG+CWAFG   A +DR CI     
Sbjct: 74  HDEAYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSAEELAFCC-HKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
           FD  G +    +PA    +C R C     L ++    Y+  AY +N   + I  ++   G
Sbjct: 193 FDEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYLNY--QIIQNDLMTYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E S+ VY+DF +YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 310 DQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 127/338 (37%), Positives = 168/338 (49%), Gaps = 16/338 (4%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + K
Sbjct: 5   VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYNGKMQNITFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L G +      L            KLP++FDA   WP C TI  I DQ  C + WA   
Sbjct: 65  RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124

Query: 129 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183

Query: 188 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           Y     C H G +          + TPKC   C  K       K+   + Y +    ED 
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLAN 299

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 300 SWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 111/236 (47%), Positives = 154/236 (65%), Gaps = 16/236 (6%)

Query: 120 CGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVH 177
           C   WAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++  
Sbjct: 11  CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70

Query: 178 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 224
            G+V+         C PY           S P C     TPKC + C    +  ++  KH
Sbjct: 71  KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKH 130

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++G
Sbjct: 131 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILG 190

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           WG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 191 WGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 245


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 137/325 (42%), Positives = 171/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG CGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSAEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 310 DQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 127/338 (37%), Positives = 168/338 (49%), Gaps = 16/338 (4%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + K
Sbjct: 5   VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYNGKMQNITFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L G +      L            KLP++FDA   WP C TI  I DQ  C + WA   
Sbjct: 65  RLTGARIQKSSGLQPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124

Query: 129 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183

Query: 188 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           Y     C H G +          + TPKC   C  K       K+   + Y +    ED 
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLAN 299

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 300 SWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 170/338 (50%), Gaps = 16/338 (4%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     W+A  N +  N T  + K
Sbjct: 5   VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYNGKMQNITFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L G +      L            KLP++FDA   WP C TI  I DQ  C + WA   
Sbjct: 65  RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124

Query: 129 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183

Query: 188 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           Y     C H G +          + TPKC   C  K+      K+   + Y +    ED 
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGNATYLLLHGEEDY 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP    F VY D   YKSGVY+++ GD +GG AVK++GWG   +G  YW +AN
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL-NGTPYWKVAN 299

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 300 SWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 337


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 176/343 (51%), Gaps = 27/343 (7%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     WKA  + +  N T  + K
Sbjct: 5   VVVLSSFAAALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYDGKMQNLTFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLK--LPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            L G        L   P +  ++ L+  LP+SFDA   WP C TI  I DQ  C + WA 
Sbjct: 65  RLTGAFSRKTSTL--PPARFTEEQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAV 122

Query: 127 GAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
               A+SDR+C +  G  L +S  DL+ACC   CG GC+GGYP +AW Y+V HG+ + +C
Sbjct: 123 ATASAISDRYCTVGKGKQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGIASSQC 181

Query: 186 DPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINS 234
            PY     C H G +          + TP+C   C  K      +R +  Y +       
Sbjct: 182 QPY-PFPRCEHRGAQGKKTPCSKYKFVTPQCNATCTDKTIPLIKYRGNHSYEVRG----- 235

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
             ED   E+Y NGP  V F V+ DF  YK+GVY+H+ G+ +GG AV+++GWG   +G  Y
Sbjct: 236 -EEDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKL-NGTPY 293

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           W +AN W+  WG +GYF I RG NEC IE    AG P    L 
Sbjct: 294 WKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTPDPSQLT 336


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 126/338 (37%), Positives = 168/338 (49%), Gaps = 16/338 (4%)

Query: 11  ILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 68
           ++ L+ FA    A G  + L  D+ +L  + +  +N+     W+A  N +  N T  + K
Sbjct: 5   VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYNGKMQNITFSEAK 64

Query: 69  HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            L G +      L            KLP++FDA   WP C TI  I DQ  C + WA   
Sbjct: 65  RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124

Query: 129 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+V +G+ +  C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183

Query: 188 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
           Y     C H G +          + TPKC   C  K       K+   + Y +    ED 
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW LAN
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLAN 299

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 300 SWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 134/325 (41%), Positives = 174/325 (53%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINTNAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQTSPDMFKT 73

Query: 89  HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD++      ++P +FDAR  W +CSTI  + DQGHCGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW +F  HG+VT       E C PY      
Sbjct: 134 FNELLSAEELAFCC-HKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C    +L ++   H++  AY +      I  ++   G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTYTT--IQKDVMAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF +YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI RG+NECGI+     G+P
Sbjct: 310 DQGLFKILRGTNECGIDNSTTGGVP 334


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 121/298 (40%), Positives = 164/298 (55%), Gaps = 11/298 (3%)

Query: 40  IKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSF 99
           I+E N+     +   +N  F   ++   K LLG K            K  + S+ LP   
Sbjct: 29  IQEKNDLEGLPYTFGKNAYFEGASIETVKRLLGFKGKLLSHTSISSSKNANLSVDLPFEM 88

Query: 100 DARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGF 157
           DAR  WPQC  I  + DQ +CGSCWA  +   ++DR CI         LS  +L++CC  
Sbjct: 89  DARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEELVSCCK- 147

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HPGCEPAYPTPKCVRKCV 213
           +CG GCDGGYP  A+ Y+   G+ T    PY  + GC         E    TP C R+C+
Sbjct: 148 ICGYGCDGGYPDKAFIYWATRGIPTG--GPYGSTKGCKPYSIGSNSEDEAETPLCTRQCI 205

Query: 214 KKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
            +        +H+    Y +NS+ E IM E+YKNGPV V+F VYEDF +Y  GVY+H  G
Sbjct: 206 NEYPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFG 265

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
             +GGHAVKLIGWG  ++ + YW+++N WN +WG +G+FKI RG N C IE  VVAG+
Sbjct: 266 KFLGGHAVKLIGWGI-ENSKKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGM 322


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 132/329 (40%), Positives = 179/329 (54%), Gaps = 25/329 (7%)

Query: 8   MDPILCLTCFATFAEGV-VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           M  I          +G+ +SK K+ S  L D I          GW+A   PQF N T   
Sbjct: 1   MLAIAAFLVLLVSGDGIPISKEKVISRDLVDKI-----NTLNVGWEATLYPQFENLTFES 55

Query: 67  FKHLLGVKPT-PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
            K +LG +   P+G L   P      +  +P++FDAR  WP   +I  I +QG CGSCWA
Sbjct: 56  AKSMLGSRGAWPEGSL--PPEIEVRVAENIPENFDARKQWP--GSIHPIRNQGQCGSCWA 111

Query: 126 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 183
           FGA E LSDRF I     + ++LS   L+ C   L   GC GG+PI+AW Y V  G++TE
Sbjct: 112 FGASEVLSDRFAIASKNQIYVTLSAQQLVDCD--LDNSGCSGGWPINAWNYMVKTGLLTE 169

Query: 184 EC-DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
           +C  PY+         C     T  C  +   K + +     Y + A  +    E I  +
Sbjct: 170 QCYGPYY----AKQYTCRLTANTTDCPWQPGVKARFYHAKSAYKLPAKNV----EAIQTD 221

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           I  NGPVE  FT+++DF  Y+SG+Y H TG  +GGHA+K++GWGT D+  DYW+ AN W 
Sbjct: 222 IMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGWGTEDN-VDYWLCANSWG 280

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            +WG  GYFKI+RG++ECGIE+ + AGLP
Sbjct: 281 ANWGIQGYFKIRRGTDECGIEDGLAAGLP 309


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 128/337 (37%), Positives = 166/337 (49%), Gaps = 21/337 (6%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILC       A    + +  ++ +L    +  VN      W A  + +  N TV + K L
Sbjct: 6   ILCSVSVVLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTAVYDGRMQNTTVSEAKRL 65

Query: 71  LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
                 P  +L  V     +    LP++FDA   WP C TI+ I DQ  CGSCWA  A  
Sbjct: 66  NRATRKPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAAT 125

Query: 131 ALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF 189
           +++DR+C IH    L +S  DLLACCG  CG GC GG P  AW YF   G+ +  C PY 
Sbjct: 126 SMTDRYCTIHGVRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY- 183

Query: 190 DSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPED 238
               CSH      YP        TP C   C       + +R  K YS+S        ED
Sbjct: 184 PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSLSG------EED 237

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
              E+Y  GP +  F V+ D   YK GVYKH+ G  +G HAV+++GWG +  G  YW +A
Sbjct: 238 FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIA 296

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           N WN  WG  GYF + RG NECGIE+   AG+P+  N
Sbjct: 297 NSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 126/307 (41%), Positives = 169/307 (55%), Gaps = 18/307 (5%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFS-NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
           IL    I  +N+     W A   P F  N      + L G +  P         K     
Sbjct: 21  ILSQQFINAINQK-HPSWLAG--PNFPPNTPHSHLRSLNGARDDP-AFFTDTETKNVTIP 76

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 150
            ++P++FDAR  WPQC +I +I +QG CGSCWAFGAVE +SDR CI  +       S  D
Sbjct: 77  EQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQD 136

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPK 207
           LLACC   CG GC GGY   AW+Y+V  G+V+     +  S GC HP    A+    TP 
Sbjct: 137 LLACCK-ECGHGCGGGYSSRAWQYWVTDGIVSG--GDFNTSQGC-HPYSVQAFRDSTTPN 192

Query: 208 CVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           C   C   K  + +   K Y   +YRI  + E I AEI  +GPV+ S+ VY+DF  Y++G
Sbjct: 193 CSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG 252

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEE 324
           VY+H+ G+V G H+VK++GWG  ++G DYW++AN W R WG   G+FK  RG N C IE 
Sbjct: 253 VYQHVLGNVSGRHSVKILGWG-RENGTDYWLVANSWGRDWGRLGGFFKFLRGENHCDIES 311

Query: 325 DVVAGLP 331
           +++ G P
Sbjct: 312 NILGGDP 318


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 147/246 (59%), Gaps = 14/246 (5%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           +P SFD+R+ W  C++I  I DQ  CGSCWAF   E +SDR CI        ++S  D+L
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           ACCG  CGDGC G YPI A+R++   GVVT        C PY  +   S P       TP
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTP 196

Query: 207 KCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            C   C    +  +   K + +SAY +  +   I  EI  NGPV  +FT+YED   YKSG
Sbjct: 197 TCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSG 256

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY+H  G ++GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++RG NECGIE  
Sbjct: 257 VYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERA 315

Query: 326 VVAGLP 331
           VVAG+P
Sbjct: 316 VVAGMP 321


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/337 (37%), Positives = 165/337 (48%), Gaps = 21/337 (6%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ILC       A    + +  ++ +L    +  VN      W A  + +  N TV + K L
Sbjct: 6   ILCSVSVVLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTAVYDGRMQNTTVSEAKRL 65

Query: 71  LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
                 P  +L  V     +    LP++FDA   WP C TI+ I DQ  CGSCWA  A  
Sbjct: 66  NRATRKPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAAT 125

Query: 131 ALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF 189
           +++DR+C IH    L +S  DLLACCG  CG GC GG P  AW YF   G+ +  C PY 
Sbjct: 126 SMTDRYCTIHGVRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY- 183

Query: 190 DSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPED 238
               CSH      YP        TP C   C       + +R  K YS S        ED
Sbjct: 184 PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSFSG------EED 237

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
              E+Y  GP +  F V+ D   YK GVYKH+ G  +G HAV+++GWG +  G  YW +A
Sbjct: 238 FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIA 296

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           N WN  WG  GYF + RG NECGIE+   AG+P+  N
Sbjct: 297 NSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 137/325 (42%), Positives = 171/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG CGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 310 DQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 137/325 (42%), Positives = 171/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 21  AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 76

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG CGSCWAFG   A +DR CI     
Sbjct: 77  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 136

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 137 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 195

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   G
Sbjct: 196 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 253

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 254 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 312

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 313 DQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 125/282 (44%), Positives = 160/282 (56%), Gaps = 38/282 (13%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP +FDAR  WP C+TI  I +Q  CGSCWAFGA E +SDR CI         +SV D+L
Sbjct: 30  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 89

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG GC GGY I A R++   G VT        C PY  S       C P   TP
Sbjct: 90  SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 146

Query: 207 KCVRKCVK--KNQLWRNSKHYS----------------ISAYRINSDPE--DIMAEIYKN 246
            C   C    K + ++  KHY                  SAY++ +     +I  EIY  
Sbjct: 147 SCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHY 206

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVE S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G
Sbjct: 207 GPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFG 265

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 348
             G+FKI+RG+NEC IE +VVAG      + K  T ++ +ED
Sbjct: 266 EKGFFKIRRGTNECQIEGNVVAG------IAKLGTHSETYED 301


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 123/334 (36%), Positives = 165/334 (49%), Gaps = 12/334 (3%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A  A G  + L  D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LCLLSTALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L            KLP++FDA   WP C TI  I DQ  C + WA     A
Sbjct: 67  GAWIQKSSTLPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSACRASWAVSTASA 126

Query: 132 LSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           +SDR+C +  G  L +S  DLL+CC   CGDGC GG+P  AW Y+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGGKQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYYVEYGIASSGCQPYPF 185

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
                  + G   P  +  + TPKC   C  K+      K+   + Y +    ED   E+
Sbjct: 186 PHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKREL 243

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           Y NGP    F VY D   YKSGVY+++ GD +GG AV+++GWG   +G  YW +AN W+ 
Sbjct: 244 YFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDT 302

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            WG +GY  I RG+NEC IE     G P    L 
Sbjct: 303 DWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 135/325 (41%), Positives = 171/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLG-VPVKT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K          V  KT
Sbjct: 18  AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPVMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG+CGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C + C     L ++   HY+  AY +      I  ++   G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECG +     G+P
Sbjct: 310 DQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 187/340 (55%), Gaps = 30/340 (8%)

Query: 12  LCLTCFATFAEGVVS----KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           L    FA+ A+ + S    K+ L++ +L+   +    +  +  ++AA  PQ  N+     
Sbjct: 9   LSRIAFASEADVLASLKYEKIPLEAQLLRGEELINYLKTNQNFFEAAITPQSYNFKRNLM 68

Query: 68  KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
                +K   K ++  V    +D    +P+SFDAR+ WP CS+++ I DQ  CGSCWA  
Sbjct: 69  DRRF-IKHNRKPIVEDV----NDDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAVS 123

Query: 128 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 182
              ALSDR CI       + +S  D+L+CC   CGDGCDGGY I A+++F   G VT   
Sbjct: 124 TASALSDRICIASKGAKQVYVSATDILSCC-HSCGDGCDGGYVIDAFKFFAEQGAVTGGD 182

Query: 183 ----EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAY 230
               + C PY     C H G E  Y        TP+CVRKC +  +  +   +     AY
Sbjct: 183 YGAKDCCRPY-PFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAY 241

Query: 231 RIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           R+     + I  EI +NGPV  +F V++DF+ Y+ G+Y H+ G   GGHAVK+IGWGT +
Sbjct: 242 RLPIGSVKAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGT-E 300

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
            G  YWI+AN W+  WG DGYF++ RG N+CGIE +VVAG
Sbjct: 301 HGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAG 340


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 128/321 (39%), Positives = 166/321 (51%), Gaps = 24/321 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--S 92
           L D  ++E   +P  G +     +   +  G   HL G         L  P   H+   +
Sbjct: 25  LTDLGVQEY-AHPSMGARWIAGGRLERFETGNSLHLFGAMRETAEQRLQRPTVRHEDFDN 83

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 150
             LP+SFDAR+ WP C +IS I DQ  CGSCWAFGAVEA+SDR CIH     N SLS  D
Sbjct: 84  QHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVD 143

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 200
           L++CC   CG GC GGY   AW  +  HG+VT         TGC     P CE       
Sbjct: 144 LVSCCT-ECGCGCRGGYSPIAWDLWKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQY 200

Query: 201 -----PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
                  YPTP+C+++C  K   +   K  +  +Y +    + +M EI   GPV     V
Sbjct: 201 PPCPHQLYPTPECIKRCDTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHV 260

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           YED   YKSGVY H+ G  +G H ++++GWG  +DG  YW++AN WN  WG  GY ++ R
Sbjct: 261 YEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLR 319

Query: 316 GSNECGIEEDVVAGLPSSKNL 336
             NECGI + V AGLP   N 
Sbjct: 320 WRNECGIVDQVTAGLPDLSNF 340


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 137/325 (42%), Positives = 170/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L+   I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 21  AYFLEKDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 76

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG CGSCWAFG   A +DR CI     
Sbjct: 77  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 136

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 137 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 195

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C     L ++   HY+  AY +      I  +I   G
Sbjct: 196 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 253

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 254 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 312

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 313 DQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 130/304 (42%), Positives = 167/304 (54%), Gaps = 23/304 (7%)

Query: 40  IKEVNENPKAGWKAA--RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 97
           I  +  N    W A   R P  S+Y VG     L  K    G+L+        + + LP+
Sbjct: 48  IAAMVRNRTNSWTAGAPRQP-LSSYRVGVNMEELESKRLKPGILI------LKEDIDLPE 100

Query: 98  SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACC 155
            FDAR  WPQC ++  I +QG CGSCWA  A EA +DR+CIH   + + S    DL++CC
Sbjct: 101 QFDARDKWPQCPSLREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCC 160

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCEPAYP-----TPKCV 209
              CGDGC GG    AW Y+V  GV +    PY    GC S+P      P      PKC 
Sbjct: 161 -HSCGDGCQGGVLGPAWDYWVQKGVSSG--GPYNSKQGCHSYPFDTCHSPDEDDDAPKCS 217

Query: 210 RKCVKKNQLWRNSK--HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           RKC     +   SK   +   AY + +D   IM EI+ NGPV+ +F VY DF  YKSGVY
Sbjct: 218 RKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTYKSGVY 277

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
           +H+TG + GGHA+K++GWG  ++G  YW+ +N W   WG  G+FKI RG N  GIE DV 
Sbjct: 278 RHVTGPLEGGHAIKILGWGV-ENGTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDVH 336

Query: 328 AGLP 331
           AGLP
Sbjct: 337 AGLP 340


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 121/272 (44%), Positives = 157/272 (57%), Gaps = 22/272 (8%)

Query: 79  GLLLG---VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 135
           G+L G   +P KT      LP+SFD    WP+C ++  I DQ  CGSCWAFGA EA +DR
Sbjct: 50  GVLFGDRQLPSKTIVARGDLPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDR 109

Query: 136 FCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECD 186
            CI     +   LS  DLL CC   CG GCDGG+   AWR+F   GV T       + C+
Sbjct: 110 LCIASKGKIQDRLSEQDLLTCCD-SCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCN 168

Query: 187 PYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDI 239
            Y     C H      P C  +  TP+CV++C +   + +   KH+   AY +    + I
Sbjct: 169 AY-SFPKCEHHAEGKYPPCGESQETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAI 227

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             E+  NGP+EVSF VYEDF  YKSG+Y+H+ G  +GGHAVKL+GWG  +DG +YW +AN
Sbjct: 228 KTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGV-EDGIEYWKIAN 286

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            WN  WG +GYF+I  G  ECGIE   + G+P
Sbjct: 287 SWNEDWGENGYFRIVAGKGECGIEVGPIGGIP 318


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 115/257 (44%), Positives = 153/257 (59%), Gaps = 20/257 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D+   +P+SFDAR+ WP C++I  I DQ +CGSCWA     ALSDR CI       + +S
Sbjct: 89  DEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHIS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D ++CC   CG GCDGG+PI A+ ++ + G VT       + C PY     C H G  
Sbjct: 149 SIDFVSCCE-SCGYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGND 206

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C     TPKC R+C +   + +   K Y   AY +    + I  EI KNGPV  +
Sbjct: 207 TYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGA 266

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  +D   YW++AN W+  WG +GYF+
Sbjct: 267 FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGEEGYFR 325

Query: 313 IKRGSNECGIEEDVVAG 329
           + RG NECGIE++VVAG
Sbjct: 326 MIRGINECGIEQEVVAG 342


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 135/325 (41%), Positives = 171/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P +FDAR  W +CSTI  + DQGHCGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GG PI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C R C     L ++   HY+  AY +      I  ++   G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQYDVLAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 310 DQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 171/324 (52%), Gaps = 32/324 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGL-----LLGV 84
           ++ L++  IK++N N K  W+A  N  P+ S   +  F +LLG K           +   
Sbjct: 18  AYFLEEDYIKQINANAKT-WEAGVNFDPKLS---IDSFVNLLGSKGVQAAKKASPDMFKT 73

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
             K ++ + ++P +FDAR  W +C +I  + DQGHCGSCWAFG   A +DR CI      
Sbjct: 74  GDKAYNLAQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEF 133

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 189
           N  LS  +L  CC   CG GC+GGYPI AW  F  HG+VT       E C PY       
Sbjct: 134 NELLSAEELTFCC-HKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPL 192

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
           D  G +    +P     +C R C     L + N  HY+  AY +      I  ++   GP
Sbjct: 193 DEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNNDHHYTRDAYYLTYGT--IQNDVLTYGP 250

Query: 249 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           +E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG 
Sbjct: 251 IEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 309

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
            G FKI+RG+NECGI+     G+P
Sbjct: 310 QGLFKIRRGTNECGIDNSTTGGVP 333


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 167/340 (49%), Gaps = 14/340 (4%)

Query: 8   MDPILCLTCFAT--FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           M   + L+ FA    A G  +    D  +L  + +  +N+     WKA  N +  N T  
Sbjct: 1   MRAFVVLSSFAATLVALGTSALRAKDGPVLTQTFVDRINQLNGGMWKAVYNGKMQNITFS 60

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           + K L G +      L            KLP++FDA   WP C TI  I DQ  C + WA
Sbjct: 61  EAKRLTGARIQKSRTLPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWA 120

Query: 126 FGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
                A+SDR+C +  G  L +S  DL+ACC   CGDGC GG+P  AW Y+V +G+ + +
Sbjct: 121 VSTASAISDRYCTVGGGKQLRISAADLMACCK-QCGDGCKGGFPGFAWLYYVEYGITSSQ 179

Query: 185 CDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 237
           C PY         + G   P  +  + TPKC   C  K+      K+   + Y +    E
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEE 237

Query: 238 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 297
           D   E+Y NGP    F VY D   YKSGVY+++ GD +GG AV+++GWG   +G  YW +
Sbjct: 238 DYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKV 296

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           AN W+  WG +GY  I RG+NEC IE     G P    L 
Sbjct: 297 ANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 126/322 (39%), Positives = 178/322 (55%), Gaps = 25/322 (7%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 83
           +V   K   + L +  I  +N + ++ W A +N    N ++ + K+LLG K   KG L  
Sbjct: 13  IVLSYKGSPNPLSNDFINYIN-SKQSTWVAGKNFD-ENLSIQEIKNLLGAK---KGKLGV 67

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HF 140
               TH + +++P SFDAR  W +CS  IS ++DQ  CGSCWA  A  A+SDR CI    
Sbjct: 68  AKEFTHSEDIQVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQG 127

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 193
            + + +S  +LL+CC   CG GC+GGYP  AW Y++  G+ T       + C PY     
Sbjct: 128 KLKVPVSAENLLSCCDS-CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPY-SLQP 185

Query: 194 CSH------PGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
           C H        C    Y TP C  KC      +++   +   + R      +I  EI  N
Sbjct: 186 CEHHTEGNKVQCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTN 245

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVE +F VY DF +YKSGVY+H+ G+ +GGHAV+++GWG  + G  YW++AN WN  WG
Sbjct: 246 GPVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWG-EESGVPYWLVANSWNEDWG 304

Query: 307 ADGYFKIKRGSNECGIEEDVVA 328
             G FKI+RG+NE G E+ +VA
Sbjct: 305 DKGLFKIRRGNNESGFEDSIVA 326


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 117/265 (44%), Positives = 155/265 (58%), Gaps = 28/265 (10%)

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
           +L +P SFD RS W  CS ++ I DQ  CGSCWA  A E +SDR C+    ++   +S  
Sbjct: 81  ALSIPPSFDVRSLWHVCS-LNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDT 139

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------------FD 190
           D+L+CCG  CG GC+GG+PI AWR+F   G  T         C PY             D
Sbjct: 140 DILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRND 199

Query: 191 STGCSHPG----CEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
              C +      C     TP+C R+C+    + + + ++Y  SAY +    + I  EI K
Sbjct: 200 YAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMK 259

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPV  SF VYEDF HYKSG+YKH  G++ G HAVK+IGWG  ++  D+W++AN W++ W
Sbjct: 260 NGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWG-KENNTDFWLIANSWHQDW 318

Query: 306 GADGYFKIKRGSNECGIEEDVVAGL 330
           G  GYF+I RG NECGIE DVVAG+
Sbjct: 319 GEKGYFRIVRGKNECGIETDVVAGI 343


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 135/325 (41%), Positives = 170/325 (52%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLG-VPVKT 88
           ++ L+   I ++N N K  WKA  N  P+ S   +  F  LLG K          V  KT
Sbjct: 18  AYFLEVDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASLVMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P SFDAR  W +CSTI  + DQG+CGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +PA    +C + C     L ++   HY+  AY +      I  ++   G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECG +     G+P
Sbjct: 310 DQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 142/245 (57%), Gaps = 12/245 (4%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLA 153
           P +FDAR+ WPQC ++  I +Q +CGSCWAF   E +SDR CI         +S  DLL 
Sbjct: 84  PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPK 207
           CCG  CG+GCDGG+P  A++++   GVVT        C PY     C+   C     TP 
Sbjct: 144 CCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCNSDNCV-NLQTPP 201

Query: 208 CVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
           C   C       + N K+Y  SAY +      I A+IY NGPV  +F VYEDF  YKSG+
Sbjct: 202 CRLSCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGI 261

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
           Y+HI G   GGHAVKLIGWGT + G  YW+  N W   WG  G F+I RG +ECGIE  +
Sbjct: 262 YRHIAGRSKGGHAVKLIGWGT-ERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRI 320

Query: 327 VAGLP 331
           VAGLP
Sbjct: 321 VAGLP 325


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 136/341 (39%), Positives = 182/341 (53%), Gaps = 36/341 (10%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQ 66
           M   L +    TF+    S L   + IL D  I  +N   ++ W A RN P+  +  +  
Sbjct: 1   MRSYLVVVFVLTFS----SALSAQNPILSDEFINSINAQ-QSTWTAGRNFPE--DTPIEH 53

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSC 123
            K L G   TP   L+G   +TH  ++    +P++FD R+ W QC ++  I +QG+CGSC
Sbjct: 54  LKRLNGALITPD--LVG-KNQTHVINVIPEAIPETFDGRTHWSQCPSLKNIRNQGNCGSC 110

Query: 124 WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFG+VE ++DR CI          S +DLLACC   CG GCDGG P  A+ Y+V  G+V
Sbjct: 111 WAFGSVEVMTDRLCIASKGKTKFEFSADDLLACCT-ACGKGCDGGAPYRAFEYWVAKGIV 169

Query: 182 T-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSIS-AYR 231
           +       E C PY  S   +         TPKC  KC+  K    +   KHY     Y 
Sbjct: 170 SGGDYNSNEGCQPYEGSAFLNSV-------TPKCSTKCLNSKYTTPYAKDKHYGTDFIYM 222

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 291
            + +  +I  EI  NGPV     VYEDF  YKSGVY+H++G+ MGGHAVK+IGWGT + G
Sbjct: 223 TSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGT-EKG 281

Query: 292 EDYWILANQWNRSWG-ADGYFKIKRGSNECGIEEDVVAGLP 331
             YW++AN W   W   DG++KI RG N C IE  +  G P
Sbjct: 282 VPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTP 322


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 189/333 (56%), Gaps = 32/333 (9%)

Query: 24  VVSKLKLDSHILQ----------DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           +++KL L +H+LQ           S++  VN   +  WKA    + S   + +FK +   
Sbjct: 1   MLAKLFLIAHLLQYTFSQQTLSGKSLVNHVN-TIQTLWKAEY-FEISEEEM-KFKVMDSK 57

Query: 74  KPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
              P+  +   P  +   SL + P SFDAR  WP C +I  I DQ +CGSCWAFGA E +
Sbjct: 58  FAFPEEQISSEPNNSLPGSLSRAPTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVI 117

Query: 133 SDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EE 184
           SDR CI         +S  D+L CC      GC GG+ + A +++   GVVT      + 
Sbjct: 118 SDRICIQSNGTDQPIISPEDILTCC--TNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDG 175

Query: 185 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDP--EDIM 240
           C PY     CS   C  A  TPKC  +C  K     ++  K+Y  SAYR+++      I 
Sbjct: 176 CIPY-SYGSCSD--CHTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQ 232

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
           +EI +NGPVE ++ VYEDF +YKSGVY++I+G  MGGHAVK+IGWG  ++  +YW++AN 
Sbjct: 233 SEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGV-EENVNYWLIANS 291

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           W   +G +G+FK++RG+NECGIE  VVAG+  S
Sbjct: 292 WGTGFGENGFFKMRRGNNECGIENYVVAGMAKS 324


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 114/257 (44%), Positives = 152/257 (59%), Gaps = 20/257 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D+   +P+SFDAR+ WP C++I  I DQ +CGSCWA     ALSDR CI       + +S
Sbjct: 89  DEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHIS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 198
             D ++CC   C  GCDGG+PI A+ ++ + G VT       + C PY     C H G  
Sbjct: 149 SIDFVSCCE-SCSYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGND 206

Query: 199 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
                C     TPKC R+C +   + +   K Y   AY +    + I  EI KNGPV  +
Sbjct: 207 TYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGA 266

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           FTVYEDF++YK G+YKH  G   GGHA+K+IGWG  +D   YW++AN W+  WG +GYF+
Sbjct: 267 FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGEEGYFR 325

Query: 313 IKRGSNECGIEEDVVAG 329
           + RG NECGIE++VVAG
Sbjct: 326 MIRGINECGIEQEVVAG 342


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 118/265 (44%), Positives = 158/265 (59%), Gaps = 22/265 (8%)

Query: 84  VPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 141
           +P++ H++  + LP+SFDAR AW  C +I  I DQ  CGSC AFGA EA+SDR CIH   
Sbjct: 13  LPIRLHEEIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKG 72

Query: 142 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 193
            + +++S  DLL CC   CG GC GGYP +AW Y+   G+VT       + C PY+    
Sbjct: 73  RVQVNISAQDLLTCC-HQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP- 130

Query: 194 CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
           C H      P C    PTPKC++ C K   + +   K+++ + Y ++SD   I  EIYKN
Sbjct: 131 CEHHTKGPLPNCTDTKPTPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKN 190

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVE  F+VY DF  YKSGVY+  + ++       L GW         W++AN WN+ WG
Sbjct: 191 GPVEADFSVYTDFLAYKSGVYQRHSYELWEARHQNL-GWALKR--RSVWLVANSWNQDWG 247

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             GYFKI+RG+NECGIE D+ AG+P
Sbjct: 248 DKGYFKIRRGNNECGIENDINAGIP 272


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 121/286 (42%), Positives = 171/286 (59%), Gaps = 24/286 (8%)

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKT-HDKSLK---LPKSFDARSAWPQCSTISRILDQGHC 120
           G+F+ + G+  +P  L   +P K  H  SL    +P  FDAR  WP C +I  + +QG C
Sbjct: 59  GEFRSIKGIYESP--LDFTLPSKRLHASSLDEVVIPDRFDAREKWPFCQSIHSVRNQGTC 116

Query: 121 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVH 177
           GSCWA   V  +SDR CIH    +NL L+  DL+ CC   CG+GC+GG+   +A++Y+V 
Sbjct: 117 GSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCK-DCGNGCNGGFLDGTAFQYWVD 175

Query: 178 HGVVT-------EECDPY-FDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
            G+V+       E C PY F+   CS+P  GC      PKC+  C+   ++ +R  K + 
Sbjct: 176 AGLVSGAPYNSSEGCKPYPFEP--CSYPFVGCHHEKKNPKCLHHCINGYDRKYRKDKFFG 233

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
            +AY+I +D   I  EI  NGPV   F V+EDF  Y SGVYKH+ G  +G HA++++GWG
Sbjct: 234 ATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWG 293

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           T ++G  YW++AN +  +WG  G+FK+ RGSN  GIE  V+AGLP 
Sbjct: 294 T-ENGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAGLPQ 338


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 127/331 (38%), Positives = 179/331 (54%), Gaps = 34/331 (10%)

Query: 5   KLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           + I   ++ LT FA     V + L L+  +L D  I   N N  A W A RNP+F   ++
Sbjct: 2   RFISTLLIALTVFA-----VCNALDLNKPVLDDKFIHNHNAN-GASWVAGRNPRFEGQSI 55

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           G    LLG K  P+      P +     + +P SFD+R+ WP C  +  +L+QG CGSCW
Sbjct: 56  GDILGLLGTKK-PRN----TPEEVSVSKVAVPNSFDSRTNWPGC--VHAVLNQGQCGSCW 108

Query: 125 AFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AF A E+LSDR CI     +N++LS   L++C       GC+GG P  AW Y   HG+ T
Sbjct: 109 AFAASESLSDRLCIASQGAINVTLSPQALVSC-DIEFNQGCNGGIPQMAWEYLELHGIPT 167

Query: 183 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIM 240
           + C PY    G +          P C ++C    K QL++  K +++   +  S    I 
Sbjct: 168 DSCFPYTSGNGTA----------PDCQKECSDGSKYQLYK-GKTFTL---KTCSSVAAIQ 213

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGT-SDDGEDYWILA 298
           A ++  GP+E +  VY+DF  Y SGVY    G  ++GGHA+K++GWGT S  G DYWI+ 
Sbjct: 214 ANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWIVQ 273

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           N W   WG +G+F I+RG+N CGI+ D  AG
Sbjct: 274 NSWGSDWGMNGFFWIQRGTNMCGIDRDASAG 304


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 134/325 (41%), Positives = 168/325 (51%), Gaps = 33/325 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I  +N N K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINHINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73

Query: 89  HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           HD+     S ++P  FDAR  W +C TI  + DQGHCGSCWAFG   A +DR CI     
Sbjct: 74  HDEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY      
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCP 192

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            D  G +    +P     +C R C     L ++   HY+  AY +      I  ++   G
Sbjct: 193 LDEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW+L N WN  WG
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWG 309

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI+RG+NECGI+     G+P
Sbjct: 310 DQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 137/321 (42%), Positives = 176/321 (54%), Gaps = 38/321 (11%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SL 93
           L D ++  +N      WKA  N    +    + K  LGV        L  P   HD   +
Sbjct: 32  LSDKMVDYIN-FINTTWKAGHNEGHRDLETVRRK--LGVHRDNHKYRL--PELVHDTLEM 86

Query: 94  KLPKSFDARSAWPQC-------STISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--L 144
            +P  FD+R  W           T  R    GH      FGAVE++SDR CIH G    +
Sbjct: 87  DIPAQFDSRQQWQDWPHHPGDPGTKERADPVGH------FGAVESMSDRHCIHSGAKNIV 140

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH- 196
            L+ +D+L+CC + CG GC+GG+P +AW Y+V  G+VT       E C PY     C H 
Sbjct: 141 HLAADDVLSCC-WGCGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPY-PVPSCDHH 198

Query: 197 -----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                  C    PTPKCVR C K  N  +++ KHY  S+Y + S+   I  EI KNGPVE
Sbjct: 199 VNGTLGPCGQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSVPSNETQIQMEIMKNGPVE 258

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            +FTVY DF  YKSGVYK  + D +GGHA++++GWG  +D   YW++AN WN  WG  GY
Sbjct: 259 GAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVEND-VPYWLVANSWNTEWGDKGY 317

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           FKI RGSNECGIEED+VAG+P
Sbjct: 318 FKILRGSNECGIEEDIVAGIP 338


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 126/331 (38%), Positives = 177/331 (53%), Gaps = 38/331 (11%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAA-RNPQFSNYTVGQFKHLLGVKPTPKGLLL 82
            +S   L +++ ++  + EVN  P  G+K    + +F N               P  ++ 
Sbjct: 31  TLSGEPLVAYLRKNQNLFEVNSTPTPGFKQKIMDIKFRN-------------QNPNLIVK 77

Query: 83  GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 142
             P    D    +P+ +D R  W  C++   I DQ +CGSCWA     A+SDR CI    
Sbjct: 78  DDPEPEDD----IPEEYDPRKIWSNCTSF-YIRDQANCGSCWAVSTAAAISDRICIATKA 132

Query: 143 --NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 193
              +++S  DL+ CC   CG GCDGG+ I AW YF + G+V+         C PY     
Sbjct: 133 RKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPY-PIHP 191

Query: 194 CSHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           C H G       C     TP C +KC     +L+R  K Y   A+++    E I  E+ K
Sbjct: 192 CGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLK 251

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPV  SF VYEDF+ YKSG+Y+H  G++ G HAVK+IGWGT ++  DYW++AN W+  W
Sbjct: 252 NGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGT-ENRTDYWLIANSWHDDW 310

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           G +GYF+I RG N+CGIEE+V AGL   ++L
Sbjct: 311 GENGYFRIIRGINDCGIEENVAAGLIDVESL 341


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 148/246 (60%), Gaps = 13/246 (5%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL-SVNDLL 152
           +P  FDAR+ WP C +I  I +Q  CGSCWAFGA E +SDR CI   G    + S  DLL
Sbjct: 75  IPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLL 134

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
           +CCG  CG GC G  P+ A+R++   GVVT        C PY     C+   C  +  TP
Sbjct: 135 SCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPCTKS-ETP 192

Query: 207 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           +C   C    ++ +   K++   AY +  D   I  EI  NGPVE +F VY+DF HY+SG
Sbjct: 193 RCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSG 251

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY+H+ G ++GGHAVK+IGWG   +G  YW++AN W   WG +G+FK+ RG +ECGIE  
Sbjct: 252 VYRHVAGKLVGGHAVKIIGWGI-QNGAPYWLMANSWGPYWGENGFFKMLRGVDECGIEST 310

Query: 326 VVAGLP 331
           +VAG P
Sbjct: 311 IVAGKP 316


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/322 (37%), Positives = 182/322 (56%), Gaps = 27/322 (8%)

Query: 23  GVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGL 80
            V+S    +  +L    I  +N   ++ W A RN     +N  + +    +G+ P P   
Sbjct: 12  AVLSASLAEIDVLSSEFIDSINR-IQSSWVAGRNFPENTTNEYLYKLNGFIGLHPDPN-- 68

Query: 81  LLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
               PV  H  + + +P+SFDAR+ WP C +++RI DQG CGSCWAF ++E++SDR CIH
Sbjct: 69  -YKPPVLVHTFNARDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIH 127

Query: 140 F--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
                    S  DLL+CC   CGD C GGY +SA  ++++ G+V+       E C PY  
Sbjct: 128 SSGSAQFMFSPEDLLSCCT-SCGD-CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY-- 183

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
            T  +H   +    TP C + C    +  +   KHY  + Y ++S  + I  E+  NGP+
Sbjct: 184 -TADAHDQGQ----TPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPI 238

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
            V+F V++DF +Y SGVY+H++G+ +G H VK++GWG  ++G  YW++AN W  SWG  G
Sbjct: 239 IVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGV-ENGVPYWLIANSWGSSWGDHG 297

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
           +FK+ RG NECGIE    A +P
Sbjct: 298 FFKMLRGQNECGIENYPYAVMP 319


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 126/331 (38%), Positives = 172/331 (51%), Gaps = 69/331 (20%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A    ++ +   H + D ++  VN+     W+A  N  F N  +   K L G 
Sbjct: 8   LCCLLVLAN---ARSRPSFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGT 61

Query: 74  ---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
               P P   ++        + LKLP SFDAR  WPQC TI  I DQG CGSCWAFGAVE
Sbjct: 62  FLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 131 ALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           A+SDR C        + VN                                         
Sbjct: 116 AISDRIC--------IHVNG---------------------------------------- 127

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
               S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPV
Sbjct: 128 ----SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 183

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           E +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 184 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNG 242

Query: 310 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           +FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 243 FFKILRGQDHCGIESEVVAGIPRTDQYWEKI 273


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 131/324 (40%), Positives = 173/324 (53%), Gaps = 31/324 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-K 87
           +++ L++  I ++NEN K  WKA  N  P+ S   V  F  LLG K           + K
Sbjct: 17  EAYFLEEDYINQINENAKT-WKAGINFDPKLS---VENFVKLLGSKGVQAAKKASPDMFK 72

Query: 88  THDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GM 142
           T DK+    ++PK FDAR  W +CSTI  + DQG CGSCWAFG   A +DR CI      
Sbjct: 73  TDDKTYENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDF 132

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 189
           N  LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       
Sbjct: 133 NELLSAEELTFCC-HTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPL 191

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
           D  G +    +PA    +C R C   +++ ++    ++  AY +      I  ++   GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYGT--IQKDVMTYGP 249

Query: 249 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           +E S+ VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG 
Sbjct: 250 IEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGD 308

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
            G FKI+RG+NECGI+     G+P
Sbjct: 309 RGLFKIRRGTNECGIDNSTTGGVP 332


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 114/260 (43%), Positives = 155/260 (59%), Gaps = 19/260 (7%)

Query: 87  KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS- 145
           +  + +  +P+SFDAR+ WP C +IS I DQ  CGSCWAF   E++SDR CI    N + 
Sbjct: 85  ENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRVCIATDANKTA 144

Query: 146 -LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHP 197
             SV D+L CC   CG GCDGG+P +AW YFV  GVVT         C PY  S   +HP
Sbjct: 145 EFSVEDILTCCD-ECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHP 203

Query: 198 GCEPAY------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
             E  Y       TP C   C K   + +++ K     +Y + +    I  +I K+GP+ 
Sbjct: 204 N-ETFYRNCTGVSTPSCKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLV 262

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            +F+VYEDF +YK G+Y++  G   GGHAV+++GWG  ++ + YWI+AN WN  WG DG+
Sbjct: 263 ATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVK-YWIIANSWNTDWGEDGF 321

Query: 311 FKIKRGSNECGIEEDVVAGL 330
           F++ RG N+CGIEE V AGL
Sbjct: 322 FRMVRGINDCGIEESVSAGL 341


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 188/349 (53%), Gaps = 43/349 (12%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKE---VNENPKAGWKAARNPQF----SNYT 63
           +L LT F       V+ L  D  ILQD++ KE   +  +  A +       F    S   
Sbjct: 2   LLFLTLF-------VAILAADEKILQDAVKKESKALTGHALAEFLRTLQSLFEVKKSEEV 54

Query: 64  VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL----PKSFDARSAWPQC-STISRILDQG 118
             + K+LL     PK  ++  P +     ++L    P+ FDAR AWP C   I  + DQ 
Sbjct: 55  PVRMKYLL-----PKHFMVK-PKEEDRTKIQLDKEPPEKFDARDAWPYCREIIGHVRDQS 108

Query: 119 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRYFV 176
            CGSCWA  A   +SDR C+     + L V+D  +LACCG  CGDGC GG+P  AW +  
Sbjct: 109 RCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDILACCGEFCGDGCSGGWPFQAWEWVR 168

Query: 177 HHGVVTEE-------CDPYFDSTGCSHP-----GCEP--AYPTPKCVRKCVKKN-QLWRN 221
            +GV T         C PY      +H      G  P  ++PTP+C + C +   + ++ 
Sbjct: 169 KYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKK 228

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            K Y+  +Y + +D ++I  +I KNGPV+ +F VYEDF  YK G+YKH  G   GGHAVK
Sbjct: 229 DKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVK 288

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           +IGWG  D+G DYW++AN W++ WG  G+F++ RG N+C IE+ + AG+
Sbjct: 289 IIGWG-KDNGTDYWLIANSWSKDWGESGFFRMVRGENDCEIEDMITAGI 336


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 125/281 (44%), Positives = 157/281 (55%), Gaps = 25/281 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHDKSL 93
             D +I+ VNE   A WKAAR+ +FSN  V  FK  LG +  TP+      P   HD S 
Sbjct: 3   FSDELIRFVNEESGASWKAARSTRFSN--VDHFKLDLGALSETPEERNALRPTIKHDISK 60

Query: 94  K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDARS WPQC TIS I DQ  CGSCWA  A  A+SDR CIH    M   L+  D
Sbjct: 61  NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAAD 120

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 201
            L+CC + CG GC GGYP  AW Y++  G+VT         C P+   T C H G     
Sbjct: 121 PLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKY 178

Query: 202 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                  YP P C R C    N+ +   K Y  S+Y +      IM EI KNGPVEV+F 
Sbjct: 179 SRCPHYTYPKPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFA 238

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
           +++DF  Y+SG+Y H+ G  +G HAV++IGWG  ++G +YW
Sbjct: 239 IFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYW 278


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 119/310 (38%), Positives = 168/310 (54%), Gaps = 10/310 (3%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           + S I  ++ I+ +NE     W A +N  F   T  Q K L  V    +   + +PV  H
Sbjct: 22  VPSQIDTEAFIQSINEKATT-WTARKN--FEGRTPEQLKALADVIGINRDPNVTLPVVFH 78

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           +    +P SFDAR  WP C +I  I D+G CGSCWAF AVE +SDR C+          S
Sbjct: 79  EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFS 138

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
             ++++CC   CG GC GG+    ++Y+V +G+ +     Y    GC       +  TP+
Sbjct: 139 AEEVVSCC-TACGGGCRGGFLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAVSGETPQ 195

Query: 208 CVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
           C + CV    + W     ++ SAY++N     I  EI  NGPV     VYEDF  Y +G+
Sbjct: 196 CQKACVSGYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGI 255

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
           Y+H +G  +GGHAVK+IGWG+ +D   YWI AN W   +G DG+F+I RGSN  GIE  +
Sbjct: 256 YQHTSGSFVGGHAVKIIGWGSEND-VPYWIAANSWGTGFGEDGFFRILRGSNCAGIESYI 314

Query: 327 VAGLPSSKNL 336
           VAG P++  +
Sbjct: 315 VAGYPNTSEV 324


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/323 (40%), Positives = 170/323 (52%), Gaps = 31/323 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 88
           ++ L++  I ++NEN K  WKA  N  P+ S   +  F  LLG K           + KT
Sbjct: 18  AYFLEEDYINQINENAKT-WKAGINFDPKLS---IENFVKLLGSKGVQAAKKASPDMFKT 73

Query: 89  HDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
            DK+    K+PK FDAR  W +C TI  + DQG CGSCWAFG   A +DR CI      N
Sbjct: 74  IDKAYENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFN 133

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 190
             LS  +L  CC   CG GC GGYPI AW  F  HG+VT       E C PY       D
Sbjct: 134 ELLSAEELTFCC-HKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLD 192

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
             G +    +PA    +C R C     L ++   H++  AY +      I  ++   GP+
Sbjct: 193 EYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGPI 250

Query: 250 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           E S+ VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG  
Sbjct: 251 EASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGDK 309

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G FKI+RG+NECGI+     G+P
Sbjct: 310 GLFKIRRGTNECGIDNSTTGGVP 332


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 128/314 (40%), Positives = 168/314 (53%), Gaps = 24/314 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  I  +N   +  W+A RN  F+  T  ++ K L GV          +P +     +
Sbjct: 24  LSDEFIDYIN-TLQTTWRAGRN--FAPNTPKKYLKSLAGVHKNANNAFT-LPKRKVSLDV 79

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
            +P  FDAR  WP C +I+ I DQG CGSCWA   +      F  H    + + LS  +L
Sbjct: 80  TIPDEFDARKQWPNCPSITDIRDQGSCGSCWALELLRLCLIVFVSHSNGKLQVHLSAENL 139

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
           + CCG  CG GC GG P SAW Y+   G+V+       E C PY     C H      P 
Sbjct: 140 VTCCG-SCGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPY-SIAPCEHHIPGSRPP 197

Query: 199 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     T  C ++C K   + +    HY+   Y    D ++I  EI KNGPVE +F VYE
Sbjct: 198 CRGEGHTADCRKQCEKGYSIPYDKDLHYAEFVYSTERDVKEIQTEILKNGPVEAAFFVYE 257

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           D   YK GVYKH+ G  +GGHA+K++GWG  ++G  YW++AN WN  WG +G+FKI RGS
Sbjct: 258 DLLTYKEGVYKHVAGAPVGGHAIKILGWGV-ENGTPYWLIANSWNTDWGNNGFFKILRGS 316

Query: 318 NECGIEEDVVAGLP 331
           +ECGIE DV AGLP
Sbjct: 317 DECGIEIDVSAGLP 330


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 112/242 (46%), Positives = 140/242 (57%), Gaps = 16/242 (6%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 154
           +P +F++   W  CS IS I +Q  CGSCWAFGAVE++SDRFCIH G ++ LS  DL+ C
Sbjct: 70  VPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLVTC 129

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPK 207
                 +GC GG   +A ++    G+V+ +C PY      + P C PA         TP+
Sbjct: 130 --DQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAPAQQPCLNFVDTPQ 181

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           CV KC   +  +    H+    Y +N     I  EI  NGPVE  F VYEDF  YKSGVY
Sbjct: 182 CVEKCSNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVY 241

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
           +H TG  +GGH VK+IGWGT ++ E YWI  N W   WG  G F IK G NECGIE DVV
Sbjct: 242 QHTTGKDLGGHCVKMIGWGTQNN-ELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVV 300

Query: 328 AG 329
           A 
Sbjct: 301 AA 302


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 109/252 (43%), Positives = 153/252 (60%), Gaps = 19/252 (7%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           P SFDAR+ WP+C +I  I DQ  CGSCWA  + EA+SD+ C+       + +S  D+L+
Sbjct: 88  PDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILS 147

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGC 199
           CCG  CG GC+   PI A+R+     VVT       + C PY      +H       P  
Sbjct: 148 CCGISCGYGCEV-LPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCP 206

Query: 200 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
              +PTPKC + C +K N+ +   K+++  +Y + S+   I  EIYKNGPV  +F VY+D
Sbjct: 207 RGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNERSIREEIYKNGPVVAAFKVYQD 266

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F++Y+ G+Y H  G   G HAVK++GWG  ++G DYW++AN WN  WG +GYF+I RGSN
Sbjct: 267 FSYYRGGIYVHKWGGQTGAHAVKVVGWG-RENGTDYWLIANSWNTDWGENGYFRIARGSN 325

Query: 319 ECGIEEDVVAGL 330
           ECGIE  +V+G+
Sbjct: 326 ECGIEGQMVSGV 337


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 173/325 (53%), Gaps = 29/325 (8%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVK---PTPKGLLLG 83
           L   +H L  S + ++NE  K  WKA +N P++   T  Q   LLG K     PK L+  
Sbjct: 17  LTEQAHFLSKSYVDKINEVAKT-WKAKQNFPEY--MTKEQIVRLLGSKNLTSVPKSLIKE 73

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 141
              +  + S ++P  FDAR  W  C TI  + +QG+CGSCWA G   A +DR CI  +  
Sbjct: 74  NDSEYINDS-EIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATNGD 132

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF----- 189
            N  +S  +L  CC   CG GC+GG P+ AW+YF  HGVVT       + C PY      
Sbjct: 133 FNELISAEELTFCC-HRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCV 191

Query: 190 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRINSDPEDIMAEIYKNG 247
            D  G +    +P  P  KC R C           HY   +AY +N D   +  +    G
Sbjct: 192 KDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDT--MQKDTIAYG 249

Query: 248 PVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF +Y+SGVY+       +GGHAVK+IGWG  +DG  YW++ N W   WG
Sbjct: 250 PIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWG-EEDGTPYWLMVNSWGEQWG 308

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
           A+G FKI RG+NECGIE    AG+P
Sbjct: 309 ANGMFKILRGTNECGIEGSPTAGVP 333


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 128/333 (38%), Positives = 177/333 (53%), Gaps = 43/333 (12%)

Query: 7   IMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 66
           ++  +L +    +  E  +S        L D II  +NE+P AGW+A ++ +F +    +
Sbjct: 1   MLISVLYIASLISHLEAHISIKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR 60

Query: 67  FKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           F+ L   +  P       P   H D ++++P SFD+R  WP+C +I+ I DQ  CGSC A
Sbjct: 61  FQ-LGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCA 119

Query: 126 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD------GCDGGYPISAWRYFVH 177
           FGAVEA+S+R CI  G   N+ LS  DL    G + G       GC+  YP     +F  
Sbjct: 120 FGAVEAMSERSCIQSGGKQNVELSAVDLE---GIVTGSSKENNTGCEP-YPFPKCEHF-- 173

Query: 178 HGVVTEECDPYFDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
                         T   +P C    Y TP+C   C K     R    Y+   +R     
Sbjct: 174 --------------TKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQDKHRA---- 210

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
             I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG  ++   YW+
Sbjct: 211 --IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTPYWL 267

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 268 IANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 115/258 (44%), Positives = 153/258 (59%), Gaps = 22/258 (8%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLS 147
           DK   +P+SFDAR+ WP C++I  I DQ +CGSCWA      LSDR CI       + +S
Sbjct: 89  DKGDDIPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHIS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 200
             D ++CC   CG GC+GG+PI A+ Y+ + GVVT         C PY     C H G E
Sbjct: 149 SIDFVSCCD-SCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPCGHHGNE 206

Query: 201 PAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
             Y        TP+CV++C K  KN  +R  K +    Y + +  + I  EI ++GPV  
Sbjct: 207 TYYGECPKEESTPECVKQCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMRSGPVVS 265

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           SFTVY+DF++Y  G+YKH  G   G HA+K+IGWGT +    YWI+AN W+  WG  G+F
Sbjct: 266 SFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGT-EKNVPYWIIANSWHNDWGEKGFF 324

Query: 312 KIKRGSNECGIEEDVVAG 329
           ++ RG+N CGIEEDVVAG
Sbjct: 325 RMVRGTNHCGIEEDVVAG 342


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 122/269 (45%), Positives = 163/269 (60%), Gaps = 32/269 (11%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           +P +FDAR+ WP+C++I  + DQ +CGSCWAFGA E +SDR CIH        +S  D+L
Sbjct: 70  IPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAEDIL 129

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 206
            CCG  CG+GC GG  + A +++  +G VT      + C PY     CS+  C  +  TP
Sbjct: 130 TCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--CVESKTTP 186

Query: 207 KCVRKCVKKNQL--WRNSKHYS---------------ISAYRINSDPED---IMAEIYKN 246
            C  KC     +  ++  KHY                 SAYR+++       I  EIY+N
Sbjct: 187 SCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQN 246

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPVEV++TVY+DF HYKSGVY H+TG   GGHAVK+IGWGT + G DYW++ N W  S+G
Sbjct: 247 GPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWLVTNSWGTSFG 305

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
             G+FKI+RG+NECGIE +VVAG+    N
Sbjct: 306 DKGFFKIRRGTNECGIESNVVAGMAKVGN 334


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  213 bits (543), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 126/333 (37%), Positives = 178/333 (53%), Gaps = 22/333 (6%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +    IL + C A    G +S   +    +Q++++  +    +  W A    Q   + V 
Sbjct: 1   MAFTKILLVVCLAI---GTISGFSISDQ-MQNALVSAIRSRTRT-WVAQVYDQREKFGVM 55

Query: 66  QFKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSCW 124
                LG++P  + +   VP+  + +S++ LP+SFD+R  WP C ++++I DQG CGSC+
Sbjct: 56  N----LGLRPN-ESVANAVPLLENQRSVRSLPESFDSRQKWPNCPSLNQIRDQGCCGSCY 110

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
                 A++DR+CIH G     +    D LACC       CDGGY    W+Y+V  G+ +
Sbjct: 111 VVSTAAAITDRYCIHSGGQKQFTFGATDYLACCTDCFK--CDGGYVGKTWQYWVDSGLTS 168

Query: 183 EECDPYFDSTGC-SHPGCEPAY--PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPED 238
           E   PY    GC S+P        P P C R C     L +     Y  SAYR+  +   
Sbjct: 169 E--GPYKSGQGCNSYPFGSYCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAYRVMWNENA 226

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           IM EIY+NGPV V F V+ DF  YKSGVY+H+TG   G HAV++IGWG  ++G  YW++A
Sbjct: 227 IMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGV-ENGVKYWLVA 285

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N W   WG  G+FK  RG N  GIE+ V AGLP
Sbjct: 286 NSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLP 318


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 121/295 (41%), Positives = 166/295 (56%), Gaps = 25/295 (8%)

Query: 51  WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W+A  NP+  + Y  G     L     P G++  V      + L LP +FDAR  WP+C 
Sbjct: 86  WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 167
           ++  I DQG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG 
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198

Query: 168 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 217
              AW+++V  G+ +       + C PY     C  PG +    TPKC  KC        
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTD 255

Query: 218 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 277
           +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314

Query: 278 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           HAVKL+GWG  ++G  YW++AN W R WG +G+FKI RG N CGIEE++ AGLP+
Sbjct: 315 HAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAGLPN 368


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/309 (39%), Positives = 171/309 (55%), Gaps = 28/309 (9%)

Query: 51  WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W+A  NP+  + Y  G     L     P G++  V      + L LP +FDAR  WP+C 
Sbjct: 86  WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 167
           ++  I DQG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG 
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198

Query: 168 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 217
              AW+++V  G+ +       + C PY     C  PG +    TPKC  KC        
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDE--DTPKCSNKCRSGYNVTD 255

Query: 218 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 277
           +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314

Query: 278 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           HAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  
Sbjct: 315 HAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFH 370

Query: 338 KEITSADMF 346
           ++  +A  F
Sbjct: 371 RQGEAAKYF 379


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/309 (39%), Positives = 171/309 (55%), Gaps = 28/309 (9%)

Query: 51  WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W+A  NP+  + Y  G     L     P G++  V      + L LP +FDAR  WP+C 
Sbjct: 86  WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 167
           ++  I DQG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG 
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198

Query: 168 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 217
              AW+++V  G+ +       + C PY     C  PG +    TPKC  KC        
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDE--DTPKCSNKCRSGYNVTD 255

Query: 218 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 277
           +W++ +HY   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314

Query: 278 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           HAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  
Sbjct: 315 HAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFH 370

Query: 338 KEITSADMF 346
           ++  +A  F
Sbjct: 371 RQGEAAKYF 379


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 116/258 (44%), Positives = 157/258 (60%), Gaps = 18/258 (6%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D S ++P SFDAR  WP+C++I  I DQ HCGSCWA  + E +SDR C+     + + LS
Sbjct: 85  DFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLS 144

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY--FDSTGCSHPG 198
             D+LACC   CG GC GG+ I AW YF + GV T       + C PY  +     S+  
Sbjct: 145 DTDILACCPN-CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGK 203

Query: 199 C-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           C + ++PTPKC + C  K ++ + + K+Y+ SAYRI  +   I  EI +NGPV  SF +Y
Sbjct: 204 CPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIY 263

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGED--YWILANQWNRSWGA-DGYFK 312
            DF  Y+ GVY    G  +GGHA+K+IGWGT   +G D  YW++AN W   WG  +GYF+
Sbjct: 264 PDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGYFR 323

Query: 313 IKRGSNECGIEEDVVAGL 330
           I RG N C IE+ V+AG+
Sbjct: 324 ILRGQNHCQIEQKVIAGM 341


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/336 (38%), Positives = 183/336 (54%), Gaps = 29/336 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR---NPQFSNYTVGQFKHL 70
           L  FA     +    +L      D  + +V  + K     A      +F N     F+++
Sbjct: 6   LLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFEN-----FQNM 60

Query: 71  LGVKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            G+  +  G  L  P K HD +  + +P+ FDAR  WP C +IS I +QG CG+CWA  A
Sbjct: 61  KGIFESKIGFRL--PTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAA 118

Query: 129 VEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV---- 181
           V  +SDR CIH     ++ L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V    
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAA 177

Query: 182 ---TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              T+ C PY     C +P  GC P   TP C   C +  +  +R  K+Y  +AY++ +D
Sbjct: 178 YNSTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPND 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
              I  EI  NGPVE  F+VY+D   YK+GVY+H+ G  +G HAV+LIGWG  + G  YW
Sbjct: 236 ERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN +   WG  GYFK  RGSN  GIE  V+AGLP
Sbjct: 295 LIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 112/224 (50%), Positives = 142/224 (63%), Gaps = 17/224 (7%)

Query: 84  VPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 141
           +P+KT    + KLP +FD+R+ WP C TI  I DQG CGSCWAFGAVE++SDR C+H G 
Sbjct: 1   LPLKTSFSGNWKLPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGG 60

Query: 142 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 189
             N+ +S  DLL+CCGF CG GC+GGYP  AW+Y+   G+V+         C PY     
Sbjct: 61  KQNVEVSAEDLLSCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPC 120

Query: 190 -DSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
                 S P C      TPKCV+KC       +   K Y  SAY + S PE IM EIYK+
Sbjct: 121 EHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKD 180

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
           GPVE +FTVYEDF  YKSGVY+H TG+ +GGHA+K++GWG  ++
Sbjct: 181 GPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILGWGIENN 224


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 111/251 (44%), Positives = 152/251 (60%), Gaps = 19/251 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+S+D R  W  CS++  I DQ +CGSCWA  +  A+SDR CI       + +S  D++
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 203
           +CC + CGDGC+GG+PISA+R+    GVVT         C PY +   C H G E  Y  
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208

Query: 204 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                 TP+C R+C+        S  Y   AY++ +  + I  +I KNGPV  ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           FAHY+SG+YKH  G   G HAVK+IGWG  + G  YWI+AN W+  WG +G+F++ RGSN
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSN 327

Query: 319 ECGIEEDVVAG 329
           +CG EE + AG
Sbjct: 328 DCGFEERMAAG 338


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/321 (39%), Positives = 172/321 (53%), Gaps = 45/321 (14%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-------VKPTPKGLL---------- 81
           +I E+N +P + WKA  N   +  TV + K LLG       V+ + + +           
Sbjct: 7   MINEINSDPSSTWKAGVNRNLAGKTVAEMKRLLGFAKKEGQVRYSEEQMTTIKHYNEAKA 66

Query: 82  -----LGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 135
                +GV   +   K+L LP +FD+R  W +C  I  I +Q  CGSCWAF A E+LSDR
Sbjct: 67  SAVKSVGVEEASKQFKTLGLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDR 124

Query: 136 FCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG 193
           FCI  +  +++ LS  D+++C       GCDGG   +AW +  + G+V + C PY    G
Sbjct: 125 FCIASNGKVDVILSPQDMVSC--DYNDMGCDGGNLDNAWWWMKNKGIVPDSCMPYVSGGG 182

Query: 194 CSHPGCEPAYPTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                       P C   C   N     QL+       IS +       DI  EIY NGP
Sbjct: 183 ----------NVPACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGP 232

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           V+  F+VY+DF +YKSGVY H TG  +GGHA+K+IGWG  + G DYW++AN W+  WG D
Sbjct: 233 VQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGV-EGGVDYWLVANSWSTDWGID 291

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           G FKI RG NECGIE+DV AG
Sbjct: 292 GTFKILRGHNECGIEDDVYAG 312


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 123/297 (41%), Positives = 164/297 (55%), Gaps = 22/297 (7%)

Query: 19  TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 78
           T  E    + K     L   +I  +N      WKA    +F   TV   + +LG  P P 
Sbjct: 20  TLNENDARRHKHMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77

Query: 79  GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           G  L      ++ +L +LPKSFDAR  W  C +IS I DQ  CGSCWAFGAVEA+SDR C
Sbjct: 78  GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137

Query: 138 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 188
           I         LS  +L++CC   CG GC+GG+P SAW Y+ + G+VT +       C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196

Query: 189 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 241
            +   C H      P C+    TP C R C    N  + N K Y    YR+ S+ E IM 
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG  ++   YW++A
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIA 311


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 117/291 (40%), Positives = 167/291 (57%), Gaps = 22/291 (7%)

Query: 12  LCLTCFATFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +C+    T  E  V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L
Sbjct: 6   VCIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARIL 63

Query: 71  LGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGA
Sbjct: 64  MGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGA 123

Query: 129 VEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT    
Sbjct: 124 VEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSK 182

Query: 183 ---EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
                C PY        T   +P C    Y TP+C +KC K  +  +   KHY   +Y +
Sbjct: 183 ENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNV 242

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
            S+ + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++I
Sbjct: 243 ISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 182/336 (54%), Gaps = 29/336 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR---NPQFSNYTVGQFKHL 70
           L  FA     +    +L      D  + +V  + K     A      +F N     F+++
Sbjct: 6   LLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFEN-----FQNM 60

Query: 71  LGVKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            G+  +  G  L  P K HD +  + +P+ FDAR  WP C +IS I +QG CG+CWA   
Sbjct: 61  KGIFESKIGFRL--PTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAT 118

Query: 129 VEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV---- 181
           V  +SDR CIH     ++ L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V    
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAA 177

Query: 182 ---TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
              T+ C PY     C +P  GC P   TP C   C +  +  +R  K+Y  +AY++ +D
Sbjct: 178 YNNTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPND 235

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
              I  EI  NGPVE  F+VY+D   YK+GVY+H+ G  +G HAV+LIGWG  + G  YW
Sbjct: 236 ERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYW 294

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN +   WG  GYFK  RGSN  GIE  V+AGLP
Sbjct: 295 LIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 118/251 (47%), Positives = 146/251 (58%), Gaps = 13/251 (5%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D     P+SF  R  W  CS+I  I DQ  CGSCWAF A E++SDR CIH    + +++S
Sbjct: 82  DSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNIS 141

Query: 148 VNDLLACCGFLCGDGCDG-----GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHPGCEP 201
             DLLACC   CG GCDG        I   R  V   V TE+ C PY  S     P C  
Sbjct: 142 AEDLLACC-HTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCVPNCTH 198

Query: 202 AYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             PTPKC   C K   + +   KH++ + YR+    + I  +IYKNGPVE +F VY DF 
Sbjct: 199 PEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFP 258

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            YKSGVY+      MG HA+K++GWGT +DG  YW++AN WN  WG  GYFKI RG +EC
Sbjct: 259 SYKSGVYQQHMIKFMGVHAIKILGWGT-EDGVPYWLVANSWNVGWGDKGYFKILRGKDEC 317

Query: 321 GIEEDVVAGLP 331
           GIEE + AG+P
Sbjct: 318 GIEEVIDAGIP 328


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 123/297 (41%), Positives = 158/297 (53%), Gaps = 21/297 (7%)

Query: 49  AGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWP 106
           A W +  +P+   +      H  G            P   H+ S   +LPKSFDAR+ WP
Sbjct: 5   ARWISGGHPR--RFESASLLHTFGALRESAEQRARRPTVKHEVSDEKELPKSFDARTKWP 62

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD 164
            C +IS I DQ  C S WAFGAVE++SDR CIH     N SLS  DLL+CC   CG GC 
Sbjct: 63  HCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLLSCCED-CGLGCG 121

Query: 165 GGYPISAWRYFVHHGVVT----EE---CDPY-FDSTGCSHPGCEPA-----YPTPKCVRK 211
            G+   AW ++  HG+VT    EE   C  + F   G    G  P      YPTP+C+++
Sbjct: 122 AGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQ 181

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C +    +   K  +  +Y +      IM EI  NGPVE SF +Y DF  Y  GVY H  
Sbjct: 182 CDEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCW 241

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           G  +  HA++++GWG  DDG  YW++AN WN  WG  GY +  RG NECGIEE+V A
Sbjct: 242 GGPISRHAIRILGWG-EDDGVPYWLIANSWNEDWGEKGYVRFLRGHNECGIEEEVTA 297



 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 127/355 (35%), Positives = 165/355 (46%), Gaps = 80/355 (22%)

Query: 58  QFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRIL 115
           +   +  G   HL G ++ T +  L    V+  D  +  LP+SFDAR+ WP C +IS I 
Sbjct: 600 RLERFETGNSLHLFGAIRETAEQRLQRPTVRHEDFDNQHLPESFDARANWPHCPSISEIR 659

Query: 116 DQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 173
           DQ  CGSCWAFGAVEA+SDR CIH     N SLS  DL++CC   CG GC GGY   AW 
Sbjct: 660 DQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWD 718

Query: 174 YFVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQL 218
           ++  HG+VT         TGC     P CE              YPTP+C+++C  K   
Sbjct: 719 FWKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEID 776

Query: 219 WRNSK----------------------------------------HYSIS---------- 228
           +   K                                        H+SI           
Sbjct: 777 YEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHSIDLLSSRLEKAV 836

Query: 229 -------AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
                  +Y +    + +M EI   GPV     VYED   YKSGVY H+ G  +G H ++
Sbjct: 837 LRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIR 896

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           ++GWG  +DG  YW++AN WN  WG  GY ++ R  NECGI + V AGLP   N 
Sbjct: 897 ILGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDLSNF 950


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/321 (39%), Positives = 173/321 (53%), Gaps = 40/321 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----- 88
           +L D  I E+  +  + W+  RN + S  +    + L+GV P      L  P K      
Sbjct: 22  MLSDEFI-ELVRSKASTWQVGRNFKES-VSEEYIRGLMGVHPDAHKFAL--PEKRIVLGD 77

Query: 89  --HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 144
              D  + +P+ FDAR AWP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N 
Sbjct: 78  LYADDGIDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNF 137

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH- 196
            LS +DL++CC  +CG GC+GG+P +AW Y+   G+V       T+ C PY +   C H 
Sbjct: 138 HLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCEHH 195

Query: 197 -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
                P C     TP C  KC     + +   K++   +Y +  +  +I  EI  NGPVE
Sbjct: 196 VNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVE 254

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGADG 309
            +FTVYED   YKSGVY+H  G  +GGHA++++GWG   + +  YW++ N WN  WG + 
Sbjct: 255 GAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGDN- 313

Query: 310 YFKIKRGSNECGIEEDVVAGL 330
                   + CGIE  + AGL
Sbjct: 314 --------DHCGIESSISAGL 326


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 115/280 (41%), Positives = 155/280 (55%), Gaps = 28/280 (10%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPT---PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQ 107
           W   +NP FS   + +    +G K +   PK +   +P      ++ LP +FDA   WPQ
Sbjct: 32  WVELKNPIFSGDNLPR----MGFKKSLDRPKKIYKTLP-----HNVNLPTNFDAAQQWPQ 82

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGY 167
           C TI  I +Q  CGSCWAFGA+E++SDRFCIH   ++ LS  DL+ C      +GC+GG 
Sbjct: 83  CPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLITCDN--QDNGCEGGD 140

Query: 168 PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQLWR 220
           P +A++Y   +GVVT  C PY      + P C PA         TP C  KC   +  ++
Sbjct: 141 PYTAYKYVQKNGVVTSNCQPY------TIPTCPPAQQPCMNFVNTPPCSAKCANSSVNFQ 194

Query: 221 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 280
              H+  + Y +  +   I  EI  NGPVE  F VYEDF  YKSGVY H +G  +GGH +
Sbjct: 195 QDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCI 254

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
           K++G+G S +G  YWI  N W  SWG +G F I+ G NEC
Sbjct: 255 KIVGFGVS-NGTPYWICNNSWTTSWGNNGIFWIEAGKNEC 293


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 117/234 (50%), Positives = 143/234 (61%), Gaps = 23/234 (9%)

Query: 117 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 174
           Q  CGSCWA GAVEA++DR CI    N  +++S +DLL+CC   CG GCDG  P +AW Y
Sbjct: 2   QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGRDPYAAWSY 60

Query: 175 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 218
           +V +G+VT     Y   +GC    +P CE               YPT  C  KC     +
Sbjct: 61  WVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSI 118

Query: 219 WRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 277
             NS KHY  S Y +  D   I  EI  NGPVEV+F VYEDF HY SG+YKH TGD +GG
Sbjct: 119 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 178

Query: 278 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           HAVK++GWGT ++G DYWI AN WN  WG +G+F+I RG +EC IE  VVAG P
Sbjct: 179 HAVKMLGWGT-ENGTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEP 231


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 122/309 (39%), Positives = 170/309 (55%), Gaps = 28/309 (9%)

Query: 51  WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W+A  NP+  + Y  G     L     P G++  V      + L LP +FDAR  WP+C 
Sbjct: 86  WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 167
           ++  I DQG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG 
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198

Query: 168 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 217
              AW+++V  G+ +       + C PY     C  PG +    TPKC  KC        
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDE--DTPKCSNKCRSGYNVTD 255

Query: 218 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 277
           +W++ +H    AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314

Query: 278 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
           HAVKL+GWG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  
Sbjct: 315 HAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFH 370

Query: 338 KEITSADMF 346
           ++  +A  F
Sbjct: 371 RQGEAAKYF 379


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 132/330 (40%), Positives = 170/330 (51%), Gaps = 27/330 (8%)

Query: 23  GVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL 82
           G+     +   +L DS+   +N+  K    +++  +F   +V   K L G        L 
Sbjct: 55  GLSGLFSMSRPMLMDSLADALNQGQKTWVASSKQERFKGASVFDVKALCGTILNGPSKLP 114

Query: 83  GVPVKTHDKSLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
             P         LP  FDAR  +  C+T I  + DQ  CGSCWAF   EA SDR CI   
Sbjct: 115 KKPASESTALSNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSS 174

Query: 142 MNLSL---SVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTE---ECDPYFDST 192
               L   S     ACC    G    GCDGG P SAWR+F  HGVV+E    C PY +  
Sbjct: 175 GEFDLVPLSAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPY-NFP 233

Query: 193 GCSH----PGCEPA---YPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMA 241
            CSH     G EP     P+P C   C  +N  ++ S    +H++        + ++I  
Sbjct: 234 ECSHHVETKGMEPCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEIKK 291

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           EI  NGPV  +FTVYEDF +YKSGVYKH+ G  +GGHAVK+IGWGT D  E YW++ N W
Sbjct: 292 EIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGT-DQNEQYWLVMNSW 350

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N +WG  G FKI  G  ECGI+ +V AG+P
Sbjct: 351 NVNWGDQGIFKIAIG--ECGIDSEVTAGIP 378


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 130/330 (39%), Positives = 167/330 (50%), Gaps = 53/330 (16%)

Query: 51  WKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDAR 102
           WK AR         GQ         ++   + P   G     PV        +P +FDAR
Sbjct: 228 WKDARRIAGGTVMRGQVGFEELPRRRYTKEIAPAVPGRRRLTPVAQSSSDEDIPANFDAR 287

Query: 103 SAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN----------------LS 145
            A+P+C S I R+ DQ  CGSCWAF + EA +DR CI  G+                 L 
Sbjct: 288 EAFPECASIIGRVRDQSDCGSCWAFASTEAFNDRRCIA-GIGKEDAAGAEGEATADQLLV 346

Query: 146 LSVNDLLACC-GFLCG--DGCDGGYPISAWRYFVHHGVVT----------EECDPY---- 188
           LS  D  ACC GF CG   GC+GG P SAW++F   GVVT            C PY    
Sbjct: 347 LSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMP 406

Query: 189 ----FDSTGCSHPGC-EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIM 240
                D     +P C +  YPTP+C+ +C + N     +   K  +  AY + +  E+I 
Sbjct: 407 CAHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGIENIQ 465

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILAN 299
            ++ K G V  +F+V+ DF  Y  GVY H +G  MGGHAVK+IGWGT +  GEDYW++AN
Sbjct: 466 RDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIAN 525

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
            WN SWG  G F+I RG NECGIE  +VAG
Sbjct: 526 SWNPSWGEGGLFRILRGVNECGIEGQIVAG 555


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 168/321 (52%), Gaps = 26/321 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 90
           ++ L++S I+ +N+     W A  N   S       K +LG K           + KTHD
Sbjct: 21  AYFLEESYIEMINDVATT-WTAGVNFDPSTPEKDLIK-MLGSKGVEAAKNASAHMFKTHD 78

Query: 91  KSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 144
            +      +P++FDAR  W  C TI  + DQG+CGSCWAFG   A +DR C+      N 
Sbjct: 79  VAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADRLCVATDGDFNE 138

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDS 191
            LS  +L  CC   CG+GC+GGYPI AW+YF  HG+VT       E C+PY       + 
Sbjct: 139 LLSAEELTFCC-HTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNE 197

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
            G S    +P     +C R C     L  N  H     Y   +    I  ++   GP+E 
Sbjct: 198 DGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPIEA 256

Query: 252 SFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
           SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N W+  WG +G 
Sbjct: 257 SFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWLMVNSWSAQWGDNGL 315

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           FKI+RG++ECGI+    AG+P
Sbjct: 316 FKIRRGTDECGIDSATTAGVP 336


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 104/233 (44%), Positives = 142/233 (60%), Gaps = 20/233 (8%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLS 147
           D  + LP++FDAR  WP C TI  + DQG CGSCWAFGAVEA+SDR CIH     N   S
Sbjct: 23  DAPIDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFS 82

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------- 194
             +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC             
Sbjct: 83  AENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVN 139

Query: 195 -SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
            +   C+    TPKCV+KC    ++ +    H   SAY +++D + I  EIY NGPVE +
Sbjct: 140 GTRGPCKEGGKTPKCVKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGA 199

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN  W
Sbjct: 200 FTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDW 252


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 128/326 (39%), Positives = 174/326 (53%), Gaps = 37/326 (11%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV---PVK 87
           +H L    I ++NE  K  WKA +N P+  N    Q   LLG K      LLGV   P+K
Sbjct: 21  AHFLSKDYINKINEVAKT-WKAKQNFPE--NTPKEQIVRLLGSK-----RLLGVSKSPIK 72

Query: 88  THDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
            +D+    + ++P+ FD+R  W  C TI  + +QG+CGSCWA G   A +DR C+     
Sbjct: 73  ENDELYMDNSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGE 132

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF----- 189
            N  +S  +L  CC   CG GC+GGYP+ AW+YF  HGVVT       + C PY      
Sbjct: 133 FNELISAEELTFCC-HRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCV 191

Query: 190 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNG 247
            D  G +    +P     KC +KC   + +     HY    AY + +        +Y  G
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--G 249

Query: 248 PVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF +Y+SGVY+       +GGHAVK+IGWG  ++G  YW++ N W   WG
Sbjct: 250 PIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPS 332
             G FKI RG++ECGIE    AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 171/336 (50%), Gaps = 25/336 (7%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVGQFKH 69
           L     A       ++ +LD   L D  I+++N +    WKA RN +   S Y + +   
Sbjct: 3   LAFIALAAVVSCTFAQPELD--FLSDEYIEQLN-SKNLPWKAGRNFERDTSLYNIQRLLS 59

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           +  + P  +       +   D    LP+ FDAR  W +C +I  I DQ  CGSCWA  + 
Sbjct: 60  VGTINPPSEF----ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSA 115

Query: 130 EALSDRFCIHFGM--NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
             +SDR CI       L +S  D++ CC       DGC GG P   +  +   G V+   
Sbjct: 116 SVMSDRICIQSDQKNQLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSG-- 173

Query: 186 DPYFDSTGCS-------HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE 237
             Y  + GC        +P C+  Y  P C ++C K + L +   KHY+  AYRI S  E
Sbjct: 174 GEYNSTNGCMSYPLPRCNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVE 233

Query: 238 -DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYW 295
             I  EI KNGPV  SFTVY DF HY SGVYK      ++GGHAV++IGWG  +    YW
Sbjct: 234 RQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYW 293

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +++N WN  WG  G FKI RG NECGIEE++ AGLP
Sbjct: 294 LVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLP 329


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 125/322 (38%), Positives = 169/322 (52%), Gaps = 28/322 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 90
           ++ L++S I+ +N+     WKA  N   S      F  +LG K           + KTHD
Sbjct: 21  AYFLEESYIEMINDVATT-WKAGVNFDPSTPET-DFIKMLGSKGVEAAKNASAHMFKTHD 78

Query: 91  ----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 144
               K   +P++FDAR  W  C TI  + DQGHCGSCWAFG   A +DR C+      N 
Sbjct: 79  VAYNKFSYIPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNE 138

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDS 191
            LS  +L  CC   CG GC+GGYPI AW+YF  HG+VT       + C+PY       + 
Sbjct: 139 LLSAEELTFCC-HACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNE 197

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVE 250
            G S    +P     +C R C     L  +  H ++   Y +      I  ++   GP+E
Sbjct: 198 DGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIE 255

Query: 251 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
            SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N WN  WG +G
Sbjct: 256 ASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGDNG 314

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
            FKI+RG++EC I+    AG+P
Sbjct: 315 LFKIRRGTDECRIDSATTAGVP 336


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 104/231 (45%), Positives = 142/231 (61%), Gaps = 20/231 (8%)

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
           +D S  LP++FDAR  WP C TI  + DQG CGSCWAFGAVEA+SDR CIH     N   
Sbjct: 20  NDASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHF 79

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 194
           S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC            
Sbjct: 80  SAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHV 136

Query: 195 --SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
             +   C+    TP CV+KC +  ++ +    H+  SAY I +D + I  EIY NGPVE 
Sbjct: 137 NGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEG 196

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 197 AFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 124/322 (38%), Positives = 166/322 (51%), Gaps = 27/322 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 90
           ++ L++S I+ +N+     W A  N   S      F  +LG K           + KTHD
Sbjct: 21  AYFLEESYIEMINDVATT-WTAGVNFDPST-PEKDFIKMLGSKGVEAAKNASAHMFKTHD 78

Query: 91  -----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
                 +  +P++FDAR  W  C TI  + DQGHCGSCWA     A +DR C+  +   N
Sbjct: 79  VANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFN 138

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------D 190
             LS  ++  CC   CG GC+GGYPI AW+YF  HG+VT       E C+PY       D
Sbjct: 139 ELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQD 197

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
             G S    +P     +C R C     L  N  H     Y   +    I  ++   GP+E
Sbjct: 198 EEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPIE 256

Query: 251 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
            SF VY+DF  YKSGVY+       +GGHAVKLIGWG  ++G  YW++ N WN  WG +G
Sbjct: 257 ASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGDNG 315

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
            FKI+RG++ECGI+    AG+P
Sbjct: 316 LFKIRRGTDECGIDSAATAGVP 337


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 169/324 (52%), Gaps = 29/324 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL-----LLGVPVKTH 89
           + +  I  +N NPK+ WKA  N    +  +   + LLGV      L        +     
Sbjct: 28  IANKWIDAINNNPKSTWKAGHNFH-PDTPMSYLQGLLGVSELESNLADLDKYEEMEENEE 86

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           +K +K+PK FDAR  W +C ++  I DQG+CGSCWA     A +DR CI  +   N  +S
Sbjct: 87  NKKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHIS 146

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH---- 196
             +L++CC + CG GC+GG+P +AW +   HG+VT       + C PY     C H    
Sbjct: 147 SRELMSCCSY-CGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPY-PIAPCEHHMEG 204

Query: 197 --PGC--EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
             P C   P  PTP C   C   + L ++  +    SAY +    +    EI+KNGP+  
Sbjct: 205 SKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVA 264

Query: 252 SFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
           +F VYEDF  YKSGVYK H      G HAVK+IGWG   +G  YW++ N W+  WG  G 
Sbjct: 265 AFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWG-EQNGLPYWLVQNSWDYDWGDKGL 323

Query: 311 FKIKRGSNECGIEEDVVAGLPSSK 334
           FKI RG NEC  E+ + AGLP  K
Sbjct: 324 FKIARG-NECDFEKSMTAGLPKYK 346


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 127/323 (39%), Positives = 164/323 (50%), Gaps = 29/323 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV-PVKTHD 90
           ++ L+ S I  +NE     W A  N   S      F  +LG K             KT+D
Sbjct: 21  AYFLEKSYIDMINEVATT-WTAGVNFDPS-IPEDHFIKMLGSKGVESAKQASAHEFKTND 78

Query: 91  KSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMN 143
            +       +P++FDAR  W  C TI  + DQGHCGSCWAFG   A +DR C+      N
Sbjct: 79  VAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFN 138

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 190
             LS  ++  CC   CG GC GGYPI AW+YF  HG+VT       E C+PY       D
Sbjct: 139 ELLSAEEITFCC-HTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRD 197

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPV 249
             G +    +P     +C R C     L  N  H ++   Y +      I  ++   GP+
Sbjct: 198 DKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG--SIQKDVMTYGPI 255

Query: 250 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  ++G  YW++ N WN  WG  
Sbjct: 256 EASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGDK 314

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G FKI+RG+NECGI+    AG+P
Sbjct: 315 GLFKIRRGTNECGIDNSTTAGVP 337


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 115/253 (45%), Positives = 150/253 (59%), Gaps = 21/253 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           +P+S  +R+ WP+CS++  I DQ +CGSCWA     ALSDR CI  +    + +S  D+L
Sbjct: 2   IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 198
           +CCG  CG GC+GG+PI A+ YF   G VT         C PY F    C H G      
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119

Query: 199 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            C     TPKCVRKC     + ++  +     AY   +  +    EI KNGPV  +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAFTVY 179

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF++YK G+YKH  G   GGHA+K+IGWG  + G  YW++AN W+  WG +GYF+I  G
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILCG 238

Query: 317 SNECGIEEDVVAG 329
           SN CGIEE+VVAG
Sbjct: 239 SNHCGIEENVVAG 251


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 105/210 (50%), Positives = 136/210 (64%), Gaps = 16/210 (7%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +KLP++FD+R+ WP+C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  D
Sbjct: 11  VKLPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAED 70

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPG 198
           LL+CCG  CG GC+GGYP  AW ++   G+V+         C PY           S P 
Sbjct: 71  LLSCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNGSRPS 130

Query: 199 CEPAY-PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           C      TPKCV +C       +   KH+  ++Y ++S+  DI  EIYKNGPVE +FTVY
Sbjct: 131 CTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVY 190

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
           EDF  YKSGVYKH+TGD +GGHA++++GWG
Sbjct: 191 EDFLQYKSGVYKHVTGDAVGGHAIRILGWG 220


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 121/303 (39%), Positives = 163/303 (53%), Gaps = 26/303 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL-----PKSFDARSAW 105
           WK  RN  F N ++G+ K LLG +  PK +     +   +  L L     P  FD+R  W
Sbjct: 240 WKFGRNAYFKNKSIGEIKKLLGYRMLPKTVKERNEMPMPEDLLNLENFNYPVEFDSRKHW 299

Query: 106 PQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 162
           PQC   IS I DQ +CGSCWA  +   +SDR CI      +  LS  +LL+CC   CG G
Sbjct: 300 PQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELLSCCT-SCGYG 358

Query: 163 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE--PAYPTPKCVRKCV 213
           C+GGYP   ++Y+V+ G+ T       + C PY        P C       TPKC + C+
Sbjct: 359 CNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPY------PIPPCSNCSETRTPKCSKSCI 412

Query: 214 KKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
               L  N  +HY  + Y+     + +M +I   GP+    +VYEDF HYK GVY   +G
Sbjct: 413 STYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESG 472

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             +GGHAV++IGWG  D+   YW++AN WN ++G DG FKI+RG +ECGIE  V AG   
Sbjct: 473 IFLGGHAVRIIGWGEQDN-IPYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAK 531

Query: 333 SKN 335
            K 
Sbjct: 532 CKQ 534


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 114/237 (48%), Positives = 144/237 (60%), Gaps = 25/237 (10%)

Query: 121 GSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 178
           GSCWA  AVEA+SDR CI       ++LS +DLL+CC   CG GC GG P++AW+Y+V  
Sbjct: 15  GSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLR 73

Query: 179 GVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRN 221
           G+VT     Y + +GC     P CE               YPTPKCV+KC K   + ++ 
Sbjct: 74  GIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKA 131

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            K+Y  S Y + S+ E I  EI   GPVE SF VY DF +Y  G+YKH+ G + GGHAVK
Sbjct: 132 DKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVK 191

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 338
           ++GWG  D G  YW+ AN WN  WG DGYF+I RG NECGIE  ++AG+P  K L K
Sbjct: 192 VLGWGI-DQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIP--KQLAK 245


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 122/310 (39%), Positives = 176/310 (56%), Gaps = 32/310 (10%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +L L   +TFA+       LD  +   ++I+++N +   GW AA  PQF+  T+   + L
Sbjct: 5   LLALAAVSTFAQ----LSTLDRPVHDHTLIQKINADSSIGWTAAAYPQFAGMTLRDARKL 60

Query: 71  LG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
           LG   V P     +  +P KT   +LK   SFDAR+ W +C  +  I DQ  CGSCWAF 
Sbjct: 61  LGTVLVHP-----INNLPKKTMPANLKAASSFDARTKWGKC--VHPIRDQQQCGSCWAFS 113

Query: 128 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
           A E LSDRFCI  +  +++ LS   +L C       GCDGGY  +AW +    G+ +++C
Sbjct: 114 ASEVLSDRFCIASNGSVDVVLSPEYMLQCDS--TDYGCDGGYLNNAWAFLAGTGIPSDKC 171

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           DPY  ++G    G  P   T     K  K       +K  S++     S  +DI  +I  
Sbjct: 172 DPY--TSGNGDVGSCPTSCTDGSAIKLYK-------AKSSSVAQL---SSIDDIQKDIQA 219

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED--YWILANQWNR 303
           NGPV+ +F+VY+DF  YKSGVY+H++G + GGHA+K++GWG + DG+D  YWI+AN WN 
Sbjct: 220 NGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWIVANSWNT 279

Query: 304 SWGADGYFKI 313
           +WG +G+F I
Sbjct: 280 NWGQEGFFWI 289


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 127/312 (40%), Positives = 162/312 (51%), Gaps = 36/312 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYT--VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 96
           I K VN+  +  W A  N    +Y+  +G  K+    KP P   +  +P+K      +LP
Sbjct: 23  IAKRVNKQ-QNSWVANENTPLRDYSSFIGTLKNK---KPLP---IRSIPIKR-----ELP 70

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLAC 154
           K FD+   WP+C +I  + DQ  C SCWAFG VE  +DR CI   G N + LS  D+L C
Sbjct: 71  KEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLEC 130

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP--- 204
           C   CG  C GGY   AW Y    GVVT       E C  Y     CSH G E  YP   
Sbjct: 131 CK-DCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSY-PFPPCSH-GIEGQYPQCS 187

Query: 205 -----TPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                 PKC   C +   +      Y  S  Y++ ++ + I  EI +NGPV+ SF VYED
Sbjct: 188 TKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYED 247

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSG+Y H+ G  M  H VK+IGWG  ++GE YW   N WN  WG +G F+I+ G+N
Sbjct: 248 FMTYKSGIYHHVEGKFMNLHTVKIIGWG-EENGEAYWKAVNSWNSEWGENGLFRIRLGTN 306

Query: 319 ECGIEEDVVAGL 330
           EC IE  V  GL
Sbjct: 307 ECTIESQVEGGL 318


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 120/315 (38%), Positives = 167/315 (53%), Gaps = 25/315 (7%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           + +   + +  VN++ ++ +KA  +P    Y     +     KP    +     VK  D 
Sbjct: 32  TKLTGQAYVDYVNQH-QSFYKAEYSPLVEQYAKAVMRSEFMTKPNQNYV-----VKDVDL 85

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
           ++ LP++FDAR  WP C++I  I DQ +CGSCWA  A   +SDR CI     +    S  
Sbjct: 86  NINLPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDT 145

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 196
           D+L+CC + CG GCDGG P +A+ + + +GV T         C PY       H      
Sbjct: 146 DILSCC-WNCGMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYF 204

Query: 197 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P  +  +PTPKC + C +K N  +++ K Y   AY + ++   IM EI+ NGPV  SF+
Sbjct: 205 GPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFS 264

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           V+ DFA YK GVY        G HAVK+IGWG   DG  YW++AN WN  WG +GY +  
Sbjct: 265 VFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQ-DGLKYWLIANSWNNDWGDEGYVRFL 323

Query: 315 RGSNECGIEEDVVAG 329
           RG N CGIE  VV G
Sbjct: 324 RGDNHCGIESRVVTG 338


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 124/339 (36%), Positives = 182/339 (53%), Gaps = 20/339 (5%)

Query: 6   LIMDPILCLTCFATFAEGVVSKL-KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 64
           + +  ++ LT     A G+VS + +       D  ++ V    +  WK   N Q SN   
Sbjct: 1   MRLQVLILLT--VVLANGLVSSVDRHGQDPFNDDFLRRVLARART-WKPDTNFQ-SNVHF 56

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
             F+ L G+  +  G  + +    +   + +P+SFDAR+ WP C ++  I +QG CGSCW
Sbjct: 57  HAFRSLKGIGESRTGFKVPIRRYEYVYDVDIPESFDARNHWPNCESLRAIRNQGTCGSCW 116

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV 181
           A  A   +SDR CIH    +N++L+  DL+ CC   CG+GC+GG+   ++++Y+V  G+V
Sbjct: 117 AVAAASVMSDRVCIHSNGTINVALAAEDLMGCC-VDCGNGCNGGFLDGTSFQYWVDAGLV 175

Query: 182 -------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 232
                  T+ C PY     C +P  +     +PKC   C    ++ +   K +   AY +
Sbjct: 176 SGGAYNSTDGCKPY-PFKPCEYPFNDCHVEISPKCTHHCRDGVDRHYSKDKLFGKVAYSV 234

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
             D   I  EI  NGPVE  F VYED   YKSGVY+H+ G+ +G HAV++IGWG  D G 
Sbjct: 235 PRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG-RDGGI 293

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            YW++AN +   WG  GYFK  RGSN  GIE  ++ GLP
Sbjct: 294 PYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLP 332


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 123/326 (37%), Positives = 170/326 (52%), Gaps = 35/326 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVK 87
           ++ LQ   I  +N N    WKA  N    N     F  +LG K    P    + +    K
Sbjct: 21  AYFLQKDFIDNIN-NHATTWKAGVNFD-PNTPKEYFLKMLGSKGVQIPDKHNIHM---YK 75

Query: 88  THDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 140
           THD +      ++PK FDAR  W +C TI ++ DQG+CGSCWA     A +DR C+  + 
Sbjct: 76  THDAAYDNLFGRIPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATSSAFADRLCVATNA 135

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY----- 188
             N  LS  ++  CC   CG GC+GGYPI AW  F + G+VT       E C+PY     
Sbjct: 136 DFNELLSAEEITFCCS-SCGYGCNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPC 194

Query: 189 -FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKN 246
            +D+ G +    +P     +C R C     L  N  H ++  +Y +      I  ++ + 
Sbjct: 195 PYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRDSYYLTY--SSIQKDVMRY 252

Query: 247 GPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           GP+E SF +Y+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  W
Sbjct: 253 GPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWG-EEHGVLYWLMVNSWNEGW 311

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLP 331
           G +G FKI+RG+NECGI+     G+P
Sbjct: 312 GDNGLFKIRRGTNECGIDNSTTGGVP 337


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 127/326 (38%), Positives = 173/326 (53%), Gaps = 37/326 (11%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV---PVK 87
           +H L    I ++NE  K  WKA +N P+  N    Q   LLG K      LLGV   P+K
Sbjct: 21  AHFLSKDYINKINEVAKT-WKAKQNFPE--NTPKEQIVRLLGSK-----RLLGVSKSPIK 72

Query: 88  THDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
            +D+    + ++P+ FD+R  W  C TI  + +QG+CGSCWA G   A +DR C+     
Sbjct: 73  ENDELYMDNSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGE 132

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF----- 189
            N  +S  +L  CC   C  GC+GGYP+ AW+YF  HGVVT       + C PY      
Sbjct: 133 FNELISAEELTFCC-HRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCV 191

Query: 190 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNG 247
            D  G +    +P     KC +KC   + +     HY    AY + +        +Y  G
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--G 249

Query: 248 PVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF +Y+SGVY+       +GGHAVK+IGWG  ++G  YW++ N W   WG
Sbjct: 250 PIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPS 332
             G FKI RG++ECGIE    AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 168/319 (52%), Gaps = 26/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K     V +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79

Query: 92  --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
                 PK FD+R  W  C  I  I DQG+CGSCW+F    A +DR C+  G   N  LS
Sbjct: 80  YVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 195
             +L  CC   CG GC GGYPI AW+YF   GV T       E C PY     +D  G +
Sbjct: 140 PEELAFCC-MDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKN 198

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
             G +P     +C + C  K  +    ++ + + Y INS  E I  ++   GPVE SF V
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTTV--QDRYKTKNEYVINS-IETIEQDLMTYGPVEASFDV 255

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           Y+DF+ YKSG+Y+        GGH++K+IGWG  ++G  YW+  N W++ WG  G FKI 
Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWLAVNSWSKFWGDHGTFKII 314

Query: 315 RGSNECGIEEDVVAGLPSS 333
           +G NECGIE  V AG+PS+
Sbjct: 315 KGRNECGIERAVTAGIPST 333


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 110/257 (42%), Positives = 142/257 (55%), Gaps = 24/257 (9%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLL 152
           +P  FDAR  WP C TI  I +QG C SCWA    + +SDR CIH G    + LS  +LL
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLL 172

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY-PTPK---- 207
           +CC  LCG GC GG+P  AW ++  HG+VT     Y    GC      P Y P  K    
Sbjct: 173 SCCK-LCGKGCKGGFPGGAWMHWSKHGIVTG--GSYSSDYGCQKYQFFPCYQPRTKGSIK 229

Query: 208 ------------CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                       C   C    N+ ++   +Y  S YRI +D   I  EI +NGPV+ +  
Sbjct: 230 NKCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLR 289

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           +YEDF HYK GVY+H+ G  +  HAVK+ GWGT + G  YW+ AN W++ WG  G+FKI 
Sbjct: 290 IYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGT-EGGTPYWLAANPWSKRWGNGGFFKIL 348

Query: 315 RGSNECGIEEDVVAGLP 331
           RGSN   IE+ V+AG+P
Sbjct: 349 RGSNHAEIEDHVMAGIP 365


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 125/313 (39%), Positives = 163/313 (52%), Gaps = 36/313 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS------ 92
           +I  +N  P A W+A   PQF   ++    +LLG     +  L G  V   D S      
Sbjct: 54  MISNINSQPSASWQAVEYPQFKGKSLADMTNLLGALNVNENDLKG-EVMDKDNSTNTPLS 112

Query: 93  -------LKL---PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 141
                  L+L   P  FDAR  WPQC  I  I +Q +CGSCWAF A   L+DRFCI  G 
Sbjct: 113 DSRYLTILRLRDFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGG 170

Query: 142 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 200
            +N+ LS   +++C G    +GC+GG+  + WR+ V  G V+E C PY  S G + P C 
Sbjct: 171 KVNVDLSPQFMVSCSG--QNNGCNGGFFDATWRFLVSVGTVSEACVPYV-SFGGAVPACN 227

Query: 201 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
                   V+ C    Q    S  Y   + R      DIMA++  NGP++V+  VY DF 
Sbjct: 228 --------VKSCGVPGQ---KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFY 276

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWG-TSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
            YKSGVY H++G  +GGHAVK++GWG  S     YWI AN W   WG  GYF I RG  E
Sbjct: 277 SYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWGEDWGIKGYFWILRGRGE 336

Query: 320 CGIEEDVVAGLPS 332
           CGI + V +G P+
Sbjct: 337 CGIGKMVWSGKPA 349


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 143/224 (63%), Gaps = 18/224 (8%)

Query: 125 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGA EA+SDR CIH    +S  LS  DLL+CC   CG GC+GGYP +AW ++   G+V+
Sbjct: 25  AFGASEAMSDRICIHSNAKISVELSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVS 83

Query: 183 EE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSIS 228
                    C PY           S P C      TP+CV +C       ++  KHY  +
Sbjct: 84  GGLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKT 143

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +Y ++SD +DI  EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+K++GWG  
Sbjct: 144 SYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-E 202

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           ++G  YW+ AN WN  WG +G+FKI RGSN CGIE ++VAG+P+
Sbjct: 203 ENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 125/324 (38%), Positives = 170/324 (52%), Gaps = 28/324 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           ++ LQ+  I  +NE     WKA  N  P   +  + +     GV+   K  +     KTH
Sbjct: 21  AYFLQEDFINNINEQATT-WKAGMNFDPNTPHDDIIKLLGSRGVQNPDK--VNHKLYKTH 77

Query: 90  DKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GM 142
           D++      ++P+ FDAR+ W  C TI R+ DQG+CGSCWA     A +DR C+      
Sbjct: 78  DEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVATSSAFADRLCVATTGDF 137

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF---DST 192
           N  LS  ++  CC   CG GC GGYPI AW+ F  HG+VT       E C+PY     + 
Sbjct: 138 NELLSAEEITFCC-HTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSND 196

Query: 193 GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEV 251
           G S    +P      C R C     +  N  H Y+   Y +      I  ++   GP+E 
Sbjct: 197 GNSSSSDQPLAINHICRRHCYGNQSIDFNDDHRYTRDYYYLTYGS--IQKDVLTYGPIEA 254

Query: 252 SFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
           SF VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N WN  WG +G+
Sbjct: 255 SFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWG-EEDGTPYWLMVNSWNTQWGDNGF 313

Query: 311 FKIKRGSNECGIEEDVVAGLPSSK 334
           FKI+RG+NECG++    AG+P + 
Sbjct: 314 FKIRRGTNECGVDNSTTAGVPVTN 337


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 123/326 (37%), Positives = 162/326 (49%), Gaps = 71/326 (21%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 89
           H L D ++  VN+     W+A  N  F N  V   K L G     P P   ++       
Sbjct: 24  HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYLKRLCGTFLGGPKPPQRVM------F 74

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            + LKLP+SFDAR  WPQC TI  I DQG CGSCWAFGAVEA+SDR CIH   ++S+ V+
Sbjct: 75  TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134

Query: 150 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DSTGCS 195
             DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY           S
Sbjct: 135 AEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194

Query: 196 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKN        
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKN-------- 246

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
                                               G  YW++AN WN  WG +G+FKI 
Sbjct: 247 ------------------------------------GTPYWLVANSWNTDWGDNGFFKIL 270

Query: 315 RGSNECGIEEDVVAGLPSSKNLVKEI 340
           RG + CGIE +VVAG+P +    ++I
Sbjct: 271 RGQDHCGIESEVVAGIPRTDQYWEKI 296


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  203 bits (517), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 125/330 (37%), Positives = 169/330 (51%), Gaps = 35/330 (10%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLG 83
           L   ++ LQ   I  +NE     WKA  N    +     F  +LG K    P    + + 
Sbjct: 17  LTEQAYFLQKDFIDNINERATT-WKAGVNFD-PDTPKEHFLKMLGSKGVQIPNKHNIHM- 73

Query: 84  VPVKTHDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              KTHD +      ++P+ FDAR  W +C TI  + DQG+CGSCWA     A +DR C+
Sbjct: 74  --YKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCV 131

Query: 139 --HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY- 188
             +   N  LS  ++  CC   CG GC+GGYPI AW  F   G+VT       E C+PY 
Sbjct: 132 ATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYR 190

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAE 242
                +D+ G +    +P     +C R C     L  +  H Y+  +Y +      I  +
Sbjct: 191 VPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKD 248

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           +   GP+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N W
Sbjct: 249 VMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSW 307

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           N  WG +G FKI+RG+NECGI+    AG+P
Sbjct: 308 NADWGDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  203 bits (517), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 170/324 (52%), Gaps = 31/324 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVK--PTPKGLLLGVPVKT 88
           ++ L++  I  +NE  K  WKA  N  F   T  ++   LLG K    P  L L +  KT
Sbjct: 23  AYFLEEDFIDSINEKAKT-WKAGIN--FDPNTPKEYIVKLLGSKGVQVPHKLNLKM-YKT 78

Query: 89  HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 141
            D++      ++PK FDAR  W +C TI ++ DQG+CGSCWA     A +DR CI  ++ 
Sbjct: 79  DDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYE 138

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF----- 189
            N  LS  +L  CC  LCG  C GGYPI AW YF  HG+VT       E C PY      
Sbjct: 139 FNELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCF 197

Query: 190 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
            +  G +    +P     +C R C    ++  +  H     Y   +    I  ++   GP
Sbjct: 198 SEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLT-YASIQKDVMTYGP 256

Query: 249 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           +E S  VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N W+  WG 
Sbjct: 257 IEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGD 315

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
            G FKI+RG+NEC ++  + AG+P
Sbjct: 316 KGLFKIRRGTNECSVDNSMTAGVP 339


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 124/295 (42%), Positives = 154/295 (52%), Gaps = 26/295 (8%)

Query: 35  LQDSIIKE-VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--K 91
           L+D  ++E V+    A W + R+ +   +      H  G K          P   H    
Sbjct: 25  LEDVGLREHVHSVTGARWISGRHSK--GFESDHLIHTFGAKMETAEQKAQRPTVKHVGFD 82

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
             +LPK+FDARS WP CS++S I DQ  CGSCWAFGAVEA+SDR CIH     N SLS  
Sbjct: 83  DTRLPKNFDARSKWPHCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAV 142

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-------------- 195
           DLL+CC   CG GC GGYP  AW Y+  HG+VT       D +GC               
Sbjct: 143 DLLSCCK-DCGFGCRGGYPAVAWDYWRTHGIVTGGSKE--DPSGCRSYPFPKCDHHVQGH 199

Query: 196 HPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
           +P C    YPTP+CV+ C      +   K  +  +Y I +    IM EI   GPVE  FT
Sbjct: 200 YPPCPRQIYPTPECVQDCDTPELGYLEDKTRANISYNIYASEISIMKEIMLRGPVEAVFT 259

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
           VYEDF  YKS VY H  G  M GHA++++GWG   D   YW++AN WN  WG  G
Sbjct: 260 VYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEEGD-VPYWLIANSWNEDWGEKG 313


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 170/324 (52%), Gaps = 31/324 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVK--PTPKGLLLGVPVKT 88
           ++ L++  I  +NE  K  WKA  N  F   T  ++   LLG K    P  L L +  KT
Sbjct: 23  AYFLEEDFIDSINEKAKT-WKAGIN--FDPNTPKEYIVKLLGSKGVQVPHKLNLKM-YKT 78

Query: 89  HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 141
            D++      ++PK FDAR  W +C TI ++ DQG+CGSCWA     A +DR CI  ++ 
Sbjct: 79  DDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYE 138

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF----- 189
            N  LS  +L  CC  LCG  C GGYPI AW YF  HG+VT       E C PY      
Sbjct: 139 FNELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCF 197

Query: 190 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
            +  G +    +P     +C R C    ++  +  H     Y   +    I  ++   GP
Sbjct: 198 SEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYA-SIQKDVMTYGP 256

Query: 249 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           +E S  VY+DF  YKSGVY K      +GGHAVKLIGWG  +DG  YW++ N W+  WG 
Sbjct: 257 IEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGD 315

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
            G FKI+RG+NEC ++  + AG+P
Sbjct: 316 KGLFKIRRGTNECSVDNSMTAGVP 339


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 170/320 (53%), Gaps = 29/320 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKT 88
            ++ L    I  +N   K  WKA  N  F   T    K +LG+  + KG+ +    P K+
Sbjct: 20  QTYFLNKDYISTINSVAKT-WKAGIN--FHPET--PLKFILGLLGS-KGVEVSSAGPFKS 73

Query: 89  HDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
           HD     +  +P  FDAR  W  C+TI  I DQG+CGSCWAF    A +DR CI  +   
Sbjct: 74  HDPLYSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSF 133

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 195
           N  LS   + +CC + CG GC GGYPI AWRY+  HG+VT       E C PY       
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192

Query: 196 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 252
           +  C   +    KC +KC     + +R  + Y   S Y +  D  ++  +I   GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250

Query: 253 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           F VY+DF  YKSGVY K      +GGH+VK IGWG   +   YW++ N WN +WG  GYF
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWLMMNSWNSTWGDGGYF 309

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI+RG+NEC +E+   AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGVP 329


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 134/347 (38%), Positives = 174/347 (50%), Gaps = 43/347 (12%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKL---DSHILQDSIIKEVNENPKAGWKAARNPQFSNY 62
           L +  ++C+ C    A G+     +   D  +L   +I+++N +  + W A     F   
Sbjct: 2   LFLRSLICI-CLLAVATGIPVAGAVSHGDDPVLDKDMIEQINSDKDSLWTAGETEIFKGM 60

Query: 63  TVGQFKH-LLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGH 119
           T+ +F+  +LG++         VPVK H  +    LP+SF+    WP  + +  I DQ  
Sbjct: 61  TMKEFRSSMLGLRLDRD--YSEVPVKVHSSTALKDLPESFNCYENWP--NYMHPIRDQAR 116

Query: 120 CGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFV 176
           CGSCWAF A E LSDRF I  +  +N  LS  DL++C     GD GC GGY   AW Y  
Sbjct: 117 CGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLVSCDK---GDMGCQGGYLDKAWDYLK 173

Query: 177 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
            +G+VTE C PY    G +          P C   CV         K Y  S Y   +  
Sbjct: 174 TNGIVTESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQLTTE 219

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS------D 289
           EDIM EIY NGPVE  F VY  F  YKSGVY H   D+M GGHA+K++GWG         
Sbjct: 220 EDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQ 279

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSN-----ECGIEEDVVAGLP 331
               YWI AN W   WG +G+FKI+RG N     ECGIE+ V AG P
Sbjct: 280 KPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHP 326


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 122/322 (37%), Positives = 176/322 (54%), Gaps = 45/322 (13%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSN----------YTVGQFKHLLGVKPTPKGLLLGVPV 86
           D  I+ +N +P +G KA+++ +F+           Y   QF+H +            +P+
Sbjct: 27  DEQIRFLNNHPSSGLKASKHNRFTAISDVYSALEYYGEKQFRHHI------------LPI 74

Query: 87  KTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
            +HD  ++ LP  FD+R  W  C +I RI DQ  C S WA  +V A+SDR CI     + 
Sbjct: 75  ISHDDDNILLPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVK 134

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------- 194
           + LS  +L++CC   C  GC+ GY  SAW Y+V +G+VT E +   +++GC         
Sbjct: 135 VELSAIELVSCCS-KCAVGCNFGYSESAWYYWVENGLVTGESNG--NNSGCLPYPFPKCD 191

Query: 195 -----SHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 247
                S+P C    Y  P C   C     + + + KH+  SAY++  +  DI  EI   G
Sbjct: 192 HGSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYG 251

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PVE S  +Y+DF  YKSGVYKH+TG ++   +V++IGWG  ++G  YW+ AN WN  WG 
Sbjct: 252 PVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGI-ENGIPYWLCANSWNEEWGL 310

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +G+FKI RGSNEC IE  V AG
Sbjct: 311 NGFFKILRGSNECEIEAFVNAG 332


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 124/326 (38%), Positives = 168/326 (51%), Gaps = 35/326 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVK 87
           ++ LQ   I  +N N    WKA  N    +     F  +LG K    P    + +    K
Sbjct: 21  TYFLQKDFIDNIN-NQATTWKAGVNFD-PDTPKEHFLKMLGSKGVQIPNKHNIHM---YK 75

Query: 88  THDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 140
           THD +      ++P+ FDAR  W +C TI  + DQG+CGSCWA     A +DR C+  + 
Sbjct: 76  THDAAYDKLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNA 135

Query: 141 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY----- 188
             N  LS  ++  CC   CG GC+GGYPI AW  F   G+VT       E C+PY     
Sbjct: 136 DFNELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPC 194

Query: 189 -FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKN 246
            +D+ G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   
Sbjct: 195 PYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTY 252

Query: 247 GPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           GP+E SF VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N WN  W
Sbjct: 253 GPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNADW 311

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLP 331
           G +G FKI+RG+NECGI+    AG+P
Sbjct: 312 GDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 167/319 (52%), Gaps = 26/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K       +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEAEIKKYDPL 79

Query: 92  --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
                 P+ FD+R  W  C  I  I DQG+CGSCW+F    A +DR C+  G   N  LS
Sbjct: 80  YVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNELLS 139

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 195
             +L  CC   CG+GC+GGYPI AWRYF   GV T       E C PY     ++  G +
Sbjct: 140 PEELAFCCK-DCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKN 198

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
             G +P     +C + C  K       ++ + S Y INS  + I  +I   GPVE SF V
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTT--DQKRYKTKSEYVINS-IKTIEQDIKTYGPVEASFDV 255

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           Y+DF+ YKSG+Y+         GH+VK+IGWG  ++G  YW+  N W++ WG  G FKI 
Sbjct: 256 YDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG-QENGTPYWLAVNSWSKFWGDHGTFKII 314

Query: 315 RGSNECGIEEDVVAGLPSS 333
           +G NECGIE  V AG+PSS
Sbjct: 315 KGKNECGIERAVTAGIPSS 333


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 113/258 (43%), Positives = 153/258 (59%), Gaps = 15/258 (5%)

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMN 143
           P K  + +++LP+ FDAR  WP C++I  I D   CGSCWA  A   +SDR CI   G N
Sbjct: 78  PRKGINLNVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTN 137

Query: 144 LS-LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGC 194
              LS  D+LACCG  CG GC+GGYPI A+ Y  + GV +         C PY F     
Sbjct: 138 QKILSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDG 197

Query: 195 SHPGC--EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVE 250
           ++  C  E A+ TPKC + C  +  + +   K +  +++ +  D E  I  EI+ NGPV 
Sbjct: 198 NYGPCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVG 257

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            +F V+EDF HYK G+YK   G  +G HA+KLIGWGT ++G DYW++AN +N  WG +G 
Sbjct: 258 ANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGT-ENGTDYWLVANSYNYDWGENGT 316

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG+N C IE  V+A
Sbjct: 317 FRILRGTNHCLIESQVIA 334


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 105/250 (42%), Positives = 145/250 (58%), Gaps = 21/250 (8%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           P SFDAR+ WP+C +I  I DQ  CGSCWA  + EA+SD  C+     + + +S  D+L+
Sbjct: 89  PDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDILS 148

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--- 203
           CCG  CG GC GG+PI A+R+    GVVT       + C PY     C      P Y   
Sbjct: 149 CCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPY-SFYPCGQHKDVPYYGPC 207

Query: 204 -----PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
                PTPKC +   +K N+ ++  KH++  +Y + ++   I  EIYKNGPV  +F VYE
Sbjct: 208 PGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYE 267

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           D++    G+Y H  G   G HA K+IGWG  ++G DYW++AN WN  WG DGY++I R +
Sbjct: 268 DYSS-TGGIYVHKWGIQTGAHADKVIGWG-RENGTDYWLIANSWNTDWGEDGYYRIVRET 325

Query: 318 NECGIEEDVV 327
           + C IE  +V
Sbjct: 326 DNCEIERQMV 335


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 164/320 (51%), Gaps = 27/320 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 90
           S  + D  I+ +N+  K  WKA R    +N +      LLG +   K  L  V +K  D 
Sbjct: 22  SQFISDERIEYINKIAKT-WKAERYFP-ANMSKEYIMGLLGSRGY-KNYLNEVEIKKDDP 78

Query: 91  ---KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLS 145
              K+    K FDAR  W  C  I  + DQG+CGSCWAFG   A +DR C+  G   N  
Sbjct: 79  LYTKNNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQ 138

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTG 193
           LS   L  CC + CG GC GG PI AW+YF  HG+ T       E C PY     +D  G
Sbjct: 139 LSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPCYDDQG 197

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                 +P     KC R C   + +      Y + +  +    + I  +I K GPVE SF
Sbjct: 198 EFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVKSIYVLDSSKTIEQDIRKYGPVEASF 254

Query: 254 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
            VY+DF  YKSG+Y+       +GGH+VKLIGWG  +DG  YW+L N W++ WG  G F+
Sbjct: 255 DVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQGTFR 313

Query: 313 IKRGSNECGIEEDVVAGLPS 332
           I +G NECGIE    AG+PS
Sbjct: 314 IIKGRNECGIERSATAGVPS 333


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 103/234 (44%), Positives = 138/234 (58%), Gaps = 20/234 (8%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMN 143
           V   D    LP++FDAR  WP C TI  + DQG CGSCWAFGAVEA+SDR CIH     N
Sbjct: 15  VSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKN 74

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------- 194
              S  +L++CC + CG GC+GG+P +AW Y+   G+V+    PY    GC         
Sbjct: 75  FHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAPCE 131

Query: 195 -----SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                +   C+    TP CV+KC    ++ +    H   SAY + +D + I  EIY NGP
Sbjct: 132 HHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGP 191

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           VE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 192 VEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 245


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 113/269 (42%), Positives = 148/269 (55%), Gaps = 24/269 (8%)

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
           +D    LP+++D R  W  CS+   I DQ +CGSCWA     A+SDR CI       +  
Sbjct: 83  NDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRICIATKGKKQVYA 142

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP----- 197
           S  D+L CCG  CG GC GG+PI AW++F + GVV+    PY     CS    HP     
Sbjct: 143 SDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHPCGRHG 200

Query: 198 ------GCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEIYKNGP 248
                  C    PTP C RKC    + ++R  K Y      Y +      I  +I + G 
Sbjct: 201 NDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGS 260

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           V   F VYEDF+HY+SG+YKH  G   GG HAVK+IGWG  D+G DYW++AN W+  WG 
Sbjct: 261 VVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWLIANSWHDDWGE 319

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           +G+F++ RG N CGIEE V AG+   ++L
Sbjct: 320 NGFFRMIRGINNCGIEEQVDAGIVDVESL 348


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 186/345 (53%), Gaps = 30/345 (8%)

Query: 1   MEPTKLIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 60
           +E T LI+   L L CF      V +  +   +   D+ ++ V    ++ WK   N + S
Sbjct: 9   LERTVLIL---LGLACF------VQATDRQGQNPFNDAFLRRVLARARS-WKPDTNFR-S 57

Query: 61  NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQG 118
           N     F+ L G+  +  G    VP+K +D    + +P+SFD+R  WP C ++  I +QG
Sbjct: 58  NIHYHTFRSLKGIGESRTGF--KVPIKHYDYVYDIDIPESFDSRDRWPNCDSLREIRNQG 115

Query: 119 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYF 175
            CGSCWA  A   +SDR CIH     N++++  DL+ CC   CG+GC+GG+   ++++Y+
Sbjct: 116 TCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDLMGCCA-DCGNGCEGGFLDGTSFQYW 174

Query: 176 VHHGVV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           V  G+V       TE C PY     C +P  +     +PKC   C    ++ +   K + 
Sbjct: 175 VDAGLVSGGAYNSTEGCKPY-PFKPCLYPFTDCHREESPKCKHHCQHGVDKRYARDKVFG 233

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             AY +  D   I  EI  NGPVE  F VYED   YKSGVY+H+ G+ +G HAV++IGWG
Sbjct: 234 SVAYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWG 293

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             + G  YW+++N +   WG  GYFKI RG N  GIE  V+ GLP
Sbjct: 294 -REGGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLP 337


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 120/320 (37%), Positives = 163/320 (50%), Gaps = 25/320 (7%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVK-PTPKGLLLGVPVKT 88
           ++ L+   I  +N      WKA  N  P+ S   + +     GV+ P    + L      
Sbjct: 21  AYFLEKDFIDNINAQATT-WKAGVNFDPKTSKEHIMKLLGSRGVQIPNKNNMNLYKSEDA 79

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 146
              +  +P+ FDAR  W  CSTI R+ DQG+CGSCWA     A +DR C+  +   N  L
Sbjct: 80  EYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELL 139

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTG 193
           S  ++  CC   CG GC+GGYPI AW+ F   G+VT       E C+PY       D  G
Sbjct: 140 SAEEITFCC-HTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQG 198

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVS 252
            +    +P     +C R C     L  +  H Y+   Y +      I  ++   GP+E S
Sbjct: 199 NNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYGS--IQKDVMTYGPIEAS 256

Query: 253 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           F VY+DF  YKSGVY K      +GGHAVKLIGWG  + G  YW++ N WN  WG  G+F
Sbjct: 257 FDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWG-EEYGVPYWLMVNSWNEDWGDHGFF 315

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI+RG+NECG++    AG+P
Sbjct: 316 KIQRGTNECGVDNSTTAGVP 335


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 169/324 (52%), Gaps = 31/324 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           ++ L++  I ++NE     WKA  N  P+     + +     GV+   K  L     K+ 
Sbjct: 21  AYFLEEDYINKINEQATT-WKAGVNFDPKTPKEHILKLLGSKGVQIPSK--LNHKMYKSE 77

Query: 90  DKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
           D++      ++P+ FDAR  W  C TI  I DQG+CGSCWA     A +DR C+  +   
Sbjct: 78  DENYDNLFGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDF 137

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 189
           N  LS  +L  CC   CG GC+GGYPI AW +F  HG+VT       E C+PY      +
Sbjct: 138 NQLLSAEELTFCC-HKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY 196

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 248
           D +G +    +P     +C R C     L  +  H Y+  +Y +      I  ++   GP
Sbjct: 197 DESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGP 254

Query: 249 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           VE SF VY+DF  YKSGVY +      +GGHA KLIGWG  + G  YW++ N WN  WG 
Sbjct: 255 VEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWG-EEYGVPYWLMVNSWNADWGD 313

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
           +G FKI+RG+NECGI+     G+P
Sbjct: 314 NGLFKIQRGTNECGIDNSTTGGVP 337


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 173/315 (54%), Gaps = 18/315 (5%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           +IL D +I+ +N  P AGWKA++  +F + + V       G++   KG+L    +   D+
Sbjct: 23  NILSDELIQYINNYPSAGWKASKQNRFKSISDVYNTFGYYGIRHFRKGIL--STISHEDE 80

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +++LP  FD+R  W  C +I+ I DQ  C S WA  +  ++SDR CI     M + LS  
Sbjct: 81  NIQLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAI 140

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPY-----FDSTGCSHPGC-E 200
           +L++C     G  C  G+   +W Y++ +G+VT +   C PY        +  S+P C  
Sbjct: 141 ELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGY 198

Query: 201 PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
             Y  P C + C     + ++  KHY    Y +  +  DI  EI  NGPVE    V+ DF
Sbjct: 199 ITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDF 258

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
            +YKSGVY+HITG ++  H+V++IGWG  +D   YW+ AN WN  WG +GYFKI RGSNE
Sbjct: 259 LNYKSGVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYFKILRGSNE 317

Query: 320 CGIEEDVVAGLPSSK 334
           C IE  V AG   +K
Sbjct: 318 CEIESFVNAGKVDNK 332


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 125/330 (37%), Positives = 171/330 (51%), Gaps = 37/330 (11%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV-- 84
           L   +H L    + ++NE  K  WKA +N P+  N        LLG K      LLG+  
Sbjct: 17  LTEQAHFLSKEYVNKINEVAKT-WKAKQNFPE--NTPREDIVRLLGSK-----RLLGLNK 68

Query: 85  -PVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
            P+K +D     + ++P+ FD+R  W  C TI  + +QG+CGSCWA G   A +DR CI 
Sbjct: 69  SPIKENDILYVDNGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIA 128

Query: 140 FG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF- 189
                N  +S  +L  CC   CG GC+GG P+ AW+YF  HGVVT       + C PY  
Sbjct: 129 TDGEFNELISAEELTFCC-HTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRV 187

Query: 190 -----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEI 243
                D  G +    +P     KC +KC     +     HY    AY +++        +
Sbjct: 188 PPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMV 247

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           Y  GP+E SF VY+DF  Y+SGVY+       +GGHAVK+IGWG  ++G  YW++ N W 
Sbjct: 248 Y--GPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGV-EEGTPYWLMVNSWG 304

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             WG  G FKI RG++ECG+E    AG+PS
Sbjct: 305 EQWGDKGMFKILRGTDECGVESSCTAGVPS 334


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 146/263 (55%), Gaps = 18/263 (6%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           +PV     +  +P+SFD+R  W  C ++  I DQ +CGSCWA  A + +SDR CIH    
Sbjct: 85  LPVANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGR 144

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 194
             + LS  D+LACCG  CG GCDGGY   AW++    GVVT         C PY      
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204

Query: 195 SHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
           +H G      P++P  TP C   C     + + N K  + + Y + +D   I  EI K G
Sbjct: 205 AHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKG 264

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PV  +F +YEDF HY  GVY H  G + GGH++K+IGWG  D G  YW++AN W+  WG 
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323

Query: 308 D-GYFKIKRGSNECGIEEDVVAG 329
           D GYF++ RG N C IE  V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 127/304 (41%), Positives = 169/304 (55%), Gaps = 31/304 (10%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA  N  F+N      + L G     KG  L V V+ +   +KLPK+FD+R  WP C T
Sbjct: 40  WKAGHN--FNNVDYSYVQKLCGT--MLKGPKLPVLVQ-YSGDMKLPKNFDSREQWPNCPT 94

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 168
           +  I DQG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG GC+GGYP
Sbjct: 95  LKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLTCCDS-CGMGCNGGYP 153

Query: 169 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 215
            +AW ++   G+V+         C PY          G   P       TP+C+ +C   
Sbjct: 154 SAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESG 213

Query: 216 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 274
               ++  KHY  S+Y + SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  
Sbjct: 214 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 273

Query: 275 MGGHAVKLIGWGTSDDGEDYWILAN--QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           +GGHA+K      S  GE+   L      +  WG D       GS+ CGIE ++VAG+P 
Sbjct: 274 VGGHAIK------SWLGEEVCSLLALCHSDTDWG-DMVSLSSAGSDHCGIESEIVAGIPI 326

Query: 333 SKNL 336
           +++ 
Sbjct: 327 TQSF 330


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 121/334 (36%), Positives = 172/334 (51%), Gaps = 27/334 (8%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 76
           F    E +   +  ++  L    + E   N ++ +KA  +P+     V + +  L +KP 
Sbjct: 19  FTRLEEFLAQPITKEAEQLTGEALVEYVNNRQSFFKAKYSPE----VVKKRRQFL-LKPQ 73

Query: 77  PKGLLLG----VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
                      +P+     +  +P+SFD+R  W  C ++  I DQ +CGSCWA  A + +
Sbjct: 74  FIERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCM 133

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE------- 183
           SDR CIH      + LS  D+LACCG  CG GCDGGY   AW++    GVVT        
Sbjct: 134 SDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKG 193

Query: 184 ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDP 236
            C PY      +H G      P++P  TP C   C     + + N K  + + Y + +D 
Sbjct: 194 NCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDE 253

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
             I  EI + GPV  +F +YEDF HY+ GVY H  G + GGH++K+IGWG  D G  YW+
Sbjct: 254 RTIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWL 312

Query: 297 LANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 329
           +AN W+  WG D GYF++ RG N C IE  V+AG
Sbjct: 313 IANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 117/312 (37%), Positives = 168/312 (53%), Gaps = 33/312 (10%)

Query: 36  QDSIIKEVNENPKAGWKAARNPQFSN-YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           Q + ++ +N N    WKA  NPQ ++ Y  G    +L  +     L LG  +K  ++   
Sbjct: 78  QAAFVEAIN-NRSTTWKAGVNPQRNDQYRTG----VLSDESMKFQLPLGFVLKKDEQ--P 130

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 152
           LP SFDAR  W  C +++ + +QG C S +A  AV  ++DR+C+H       +    D+L
Sbjct: 131 LPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVL 190

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------- 204
           +CC   CG GCDGG P + W Y+V +G+ +            SH GC+ +YP        
Sbjct: 191 SCC-HRCGFGCDGGVPSAVWHYWVENGITS-------GGAFGSHEGCQ-SYPFDVCKKSG 241

Query: 205 ----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
               TP+C+R C    N  +   KHY   AY +  D E IM E++  GP + +FT+Y DF
Sbjct: 242 DSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDF 301

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             YKSGVY+H  G  +G H+VK++GWG  +D + YW+ AN W   WG  G+FKI RG + 
Sbjct: 302 VQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVK-YWLCANSWGAQWGDGGFFKIVRGEDH 360

Query: 320 CGIEEDVVAGLP 331
              E +VVAGLP
Sbjct: 361 LSFETNVVAGLP 372


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 126/320 (39%), Positives = 169/320 (52%), Gaps = 29/320 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKT 88
            ++ L    I  +N   K  WKA  N  F   T    K +LG+  + KG+ +    P K+
Sbjct: 20  QTYFLNKDYISTINSVAKT-WKAGIN--FHPET--PLKFILGLLGS-KGVDVSSAGPFKS 73

Query: 89  HDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
           HD     +  +P  FDAR  W  C+TI  I DQG+CGSCWAF    A +DR CI  +   
Sbjct: 74  HDPLYSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSF 133

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 195
           N  LS   + +CC + CG GC GGYPI AWRY+  HG+VT       E C PY       
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192

Query: 196 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 252
           +  C   +    KC +KC     + +R  + Y   S Y +  D  ++  +I   GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250

Query: 253 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           F VY+DF  YKSGVY K      +GGH+VK IGWG   +   YW++ N WN +WG  G F
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWLMMNSWNNTWGDGGNF 309

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI+RG+NEC +E+   AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGMP 329


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 168/319 (52%), Gaps = 26/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K       +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEFEIKKYDPL 79

Query: 93  L---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
                 P+ FD+R+ W  C  I  I DQG+CGSCW+F    A +DR C+  G   N  LS
Sbjct: 80  YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 195
             +L  CC   CG GC GGYPI AW+YF   GV T       E C PY     ++  G +
Sbjct: 140 PEELAFCCK-DCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKN 198

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
             G +P     +C + C  K  +   +++ + S Y INS  + I  ++   GPVE SF V
Sbjct: 199 TCGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYSINS-IKTIEQDLKTYGPVEASFDV 255

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           Y+DF+ YKSG+Y+        G H++K+IGWG  ++G  YW+  N W++ WG  G FKI 
Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWG-QENGTTYWLAVNSWSKFWGEHGTFKII 314

Query: 315 RGSNECGIEEDVVAGLPSS 333
           +G NECGIE  V AG+PSS
Sbjct: 315 KGRNECGIERAVTAGIPSS 333


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 101/258 (39%), Positives = 147/258 (56%), Gaps = 22/258 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           L  SFDAR  WP+C +I +I D   C + WAF A E++SDR CI+ G   N  LS  +LL
Sbjct: 76  LSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELL 135

Query: 153 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF------DSTGCSHP 197
           +CC   F CG+GC+GG P  AW+Y   HG+ T         C PY            ++P
Sbjct: 136 SCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYP 195

Query: 198 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
            C     PTP C +KC  +          +HY +S  ++ +   +I +++  NGP++ +F
Sbjct: 196 ACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATF 255

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
            VY+DF  Y +G+Y H+TG+  G  +V++IGWG    G  YW+ AN W R WG +G F++
Sbjct: 256 EVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVW-QGVPYWLCANSWGRQWGENGTFRV 314

Query: 314 KRGSNECGIEEDVVAGLP 331
            RG+NECG+E + V+G+P
Sbjct: 315 LRGTNECGLESNCVSGMP 332


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 117/283 (41%), Positives = 161/283 (56%), Gaps = 23/283 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 92
           L D +I  +N+ P   WKA R  +F+  ++   K ++GV      +  L    +  +D +
Sbjct: 32  LSDELITFINKQPNIEWKADRTTRFT--SIHHAKSMMGVLLNRVDQHKLHHPIIHHNDIN 89

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           +KLPK FD+R  W  CS+I  I DQ  CGSCWAFGAVE++SDR CIH    +++ LS  +
Sbjct: 90  IKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVN 149

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 197
           LL+CC   CG GC+GG P  AW Y+   G+VT         C PY        ST  +H 
Sbjct: 150 LLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHS 208

Query: 198 GCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
            CE   Y TP+C + C     + + N K+Y  S+Y + SD   IM EI  NGPVE +F V
Sbjct: 209 SCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYV 268

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           Y+DF +YK+GVYK++TG ++GGHA++ I W      E Y IL 
Sbjct: 269 YDDFLNYKTGVYKYVTGSLLGGHAIR-ITWLGCIHIESYTILV 310


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 172/313 (54%), Gaps = 37/313 (11%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKG------LLLGVPVKTHD 90
           + ++K+VNE  K  W A   P+ S+ ++   K L+G+K    G       LLG   K+  
Sbjct: 43  EDMVKKVNE-AKTTWTAEELPRISSMSLNAKKGLMGLKAFHDGGFQKHKQLLGARPKSAS 101

Query: 91  K--SLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLS 145
           K  + KLP+ FD+R  + +C+  I  I DQ +CGSCWA  +   + DR CI       + 
Sbjct: 102 KLDATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVH 161

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP---- 201
           +S  D+L+C       GC+GGYP  A+ ++   GVVT        S   ++ GC+P    
Sbjct: 162 ISAQDILSCATDR-SQGCNGGYPDEAFEHYAQSGVVT-------GSGNSANQGCKPYPFL 213

Query: 202 -----AYPTPKCVRKC--VKKNQLWRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSF 253
                 Y TP+C +KC   +  + ++  KH+ +S Y +  SDP DI  EI  NGPVE + 
Sbjct: 214 PHTTVEYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPVEANM 273

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGY 310
            VY DF  YKSGVY+ +    +GGHAV+++GWG   DG     YW++AN WN  WG DGY
Sbjct: 274 IVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGV--DGPTKVPYWLVANSWNTDWGEDGY 331

Query: 311 FKIKRGSNECGIE 323
           F+I+RG++E  IE
Sbjct: 332 FRIRRGTDESYIE 344


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 128/317 (40%), Positives = 175/317 (55%), Gaps = 31/317 (9%)

Query: 37  DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPVKTHDKS 92
           + II+ VN  PK  WKA  N  F    +    HL+GV P    + K +LL   V    +S
Sbjct: 28  NQIIQLVNNIPKHTWKAGIN--FHPSLLTNVSHLMGVVPWNKLSEKDILLTYDVSIDLES 85

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 150
           L  P+S+D    W +C ++  I DQ +CGSCWA     A SDR CI  + G+N  LS   
Sbjct: 86  L--PESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLCITSNMGVNKVLSGEY 143

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 197
           + +CC   CG+GC+GG+P  AW+Y   +G+ T       E C PY       ++  CS  
Sbjct: 144 INSCCNGKCGNGCNGGHPEKAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKE 203

Query: 198 GCEPAYPTPKCVR-KCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
             +    TP+C + +C   N      +  +Y+   Y +   PE IM+E++KNGPV  +  
Sbjct: 204 NED----TPQCYKDQCTNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMK 259

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY+DF  YK G+Y++ TG + G HAVK++GWG  DDG DYW+ AN W  SWG  G FKI+
Sbjct: 260 VYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWG-EDDGIDYWLCANTWGNSWGMGGMFKIR 318

Query: 315 RGSNECGIEEDVVAGLP 331
           RG NECGIE  +  GLP
Sbjct: 319 RGRNECGIENRITGGLP 335


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 117/298 (39%), Positives = 165/298 (55%), Gaps = 31/298 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK--THDK 91
           I  + ++  +N NP A W A    ++S   + + +  L + P   G     PV+  T + 
Sbjct: 10  ISGEPLVNIINRNPAATWSAH---EYSRDIITRARLTL-LAPLAIG-----PVEKFTIED 60

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 151
           S  +P+SFDAR  WP  + I  + DQ  CGSCWAF   E+L DRF I       LS  DL
Sbjct: 61  SFYVPESFDARDEWP--NAILPVRDQEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDL 118

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           ++C     G  C+GGY  ++W + +  G+ TE C PY   +G            P C  +
Sbjct: 119 ISCDSNDLG--CNGGYQENSWTWVLTTGITTESCWPYRSGSG----------RIPSCPHR 166

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           CV  + L RN+    I+ YR   D  ++  E+Y NGP++V++ VYEDF +Y  G+YKH++
Sbjct: 167 CVNGSVLQRNT----INNYR-RLDSSELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLS 221

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           G+ +GGHAV L+GWG  +DG  YW++ N W   WG  GYF+I RGSNECGIE    AG
Sbjct: 222 GNKVGGHAVVLMGWGI-EDGVKYWLVQNSWGYEWGEQGYFRILRGSNECGIESSAYAG 278


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/363 (37%), Positives = 181/363 (49%), Gaps = 55/363 (15%)

Query: 8   MDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 67
           M   L  TC       V S + L  H L D  I+ +N N    W A RN  F   T  ++
Sbjct: 1   MFRTLLFTCAICVVCVVASNVHL--HPLSDEFIESINFNQNT-WIAGRN--FPKKTPLKY 55

Query: 68  KH-LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            + L+G     +   L     T  +  K P  FDAR  W  C T+  I DQG CGSCWA 
Sbjct: 56  IYNLMGTLSDSRMDNLPQRNYTFSRKTKYPNQFDAREHWKNCPTLKDIRDQGGCGSCWAV 115

Query: 127 GAVEALSDRFCI------HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
            AV A++DR CI      HF      S+ D+L+CCG+ CG+GC+GG    AW Y+   G+
Sbjct: 116 AAVSAMTDRMCILSKGKEHF----YFSIKDVLSCCGY-CGNGCEGGVLTRAWIYYKKIGI 170

Query: 181 VT-------EECDPYFDSTGCSH---------------PGCE--PAYP--------TPKC 208
           V+       + C PY     C+H               P C+  P  P        TP+C
Sbjct: 171 VSGGGYKSKQGCQPY-TIPPCNHLVWGEIEQCKNIPMTPKCKNIPVIPEQCKYIPITPEC 229

Query: 209 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
            +KC K  ++ +   KH   S YR+     +I  EIY+ GPV   FTVYEDF +YK G+Y
Sbjct: 230 EKKCNKNYKVCYSKDKHRGKSVYRVKKS--EIFKEIYEYGPVTSYFTVYEDFLNYKEGIY 287

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR-GSNECGIEEDV 326
            + +G  +G H+VK+IGWG  + G  YW+ AN +N  WG  G+FKI R G   CGI ++V
Sbjct: 288 NYTSGQKLGLHSVKIIGWG-EERGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGISDNV 346

Query: 327 VAG 329
           VAG
Sbjct: 347 VAG 349


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 111/225 (49%), Positives = 137/225 (60%), Gaps = 18/225 (8%)

Query: 125 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGAVE++SDR CIH    +S  LS  +LL+CC   CG GC GG P  AW Y+ + G+VT
Sbjct: 45  AFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVT 103

Query: 183 -------EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSI 227
                    C PY        S+  S+P CE  Y PTP+C   C     + ++  K Y  
Sbjct: 104 GGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGK 163

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
           S+Y + S+   IM EI  NGPVE  F VYEDF +YKSGVYKHITG  +GGHA+++IGWG 
Sbjct: 164 SSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGI 223

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             +   YW+ AN WN  WG  GYFKI RG+NECGIE  V AGLP+
Sbjct: 224 QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPN 268


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/320 (39%), Positives = 163/320 (50%), Gaps = 27/320 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 90
           S  L D  I+ +N+  K  WKA R    +N +      LLG +   K  L  V +K  D 
Sbjct: 22  SQFLSDERIEYINKIAKT-WKAERYFP-ANMSKEYITGLLGSRGY-KNYLNEVEIKKDDP 78

Query: 91  ---KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLS 145
              K+    K FDAR  W  C  I  + DQG+CGSCWAFG   A +DR C+  G   N  
Sbjct: 79  LYTKNNNKIKHFDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQ 138

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTG 193
           LS   L  CC + CG GC GG PI AW+YF   G+ T       E C PY     +D  G
Sbjct: 139 LSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYDDQG 197

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                 +P     KC R C   + +      Y + +  +    + I  +I   GPVE SF
Sbjct: 198 EFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVESIYVLDSFKTIEQDIRTYGPVEASF 254

Query: 254 TVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
            VY+DF  YKSG+Y+     + +GGH+VKLIGWG  +DG  YW+L N W++ WG  G F+
Sbjct: 255 DVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQGTFR 313

Query: 313 IKRGSNECGIEEDVVAGLPS 332
           I +G NECGIE    AG+PS
Sbjct: 314 IIKGRNECGIERSATAGIPS 333


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 117/306 (38%), Positives = 161/306 (52%), Gaps = 27/306 (8%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 85
           ++L L + +L +SI + +N NP + W A   P  S  +  + +  LG + TP        
Sbjct: 1   TRLLLIAAVLAESIPETINRNPNSTWVAIDYPA-SVISHEKLRSKLGARFTPHR------ 53

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
           V+ +  S K+P +FDAR  WP    I  + DQG CGSCWAF   E + DR  +       
Sbjct: 54  VRPYRDSNKVPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVLGCSRGD 111

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           ++  DL++C  F   DGCDGG+   AW +   +G+ TEEC PY    G   P        
Sbjct: 112 IAPEDLVSCDIF--DDGCDGGFIDMAWDWCQENGLTTEECIPYKAGEGVPSP-------- 161

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
             C   C   + ++R      I +YR   D +DI  EIY+ GPV + F VY DF  YKSG
Sbjct: 162 --CPETCEDGSAIYRTP----IESYRY-IDADDIQGEIYEYGPVSMGFIVYSDFMSYKSG 214

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY H  G + GGHAV ++GWG  D+   YW++ N W   WG +G+FKI RGS+ C  E +
Sbjct: 215 VYVHQAGYIEGGHAVLIVGWGVEDE-VPYWLVQNSWGTDWGENGFFKILRGSDHCECESN 273

Query: 326 VVAGLP 331
           V AG P
Sbjct: 274 VTAGYP 279


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 120/276 (43%), Positives = 145/276 (52%), Gaps = 33/276 (11%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           V  K H+K   LP SF A+  WP C +I  I DQG+CGSCWA  A   +SDR CI  G  
Sbjct: 60  VEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQT 119

Query: 144 --LSLSVNDLLACCGFLC----GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
               +S  DLL+CCG  C      GCDGGYP  AW+Y    G+VT         C PY  
Sbjct: 120 DKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-S 178

Query: 191 STGCSH-------PGCEPAY-----PTPKCVRKCVKKNQLWRNSKHYSISA----YRINS 234
              CSH         CE  +      TP C +KC    Q  R      I +    Y++  
Sbjct: 179 FPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKC--HPQFSRTYDVDKIRSRENPYKLIK 236

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 294
           D E I  EIY NGPV+  FTV++DF +YKSGVY+  TG   G HAVK+IGWGT ++G  Y
Sbjct: 237 DQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGT-ENGVPY 295

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           W   N WN  WG +G FKI RG N   IE +V A +
Sbjct: 296 WEAINSWNDGWGINGKFKILRGFNHLDIEGEVYASI 331


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 120/342 (35%), Positives = 181/342 (52%), Gaps = 30/342 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L
Sbjct: 6   VCIVSFFALLKAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARIL 63

Query: 71  LGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G +     +       V  H+ ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGA
Sbjct: 64  MGARKEDAEMKRKRRPTVDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGA 123

Query: 129 VEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD---------GGYPISAWRYFV- 176
           VEA++DR CI    G +  LS  DL++CC    G             G    S WR+   
Sbjct: 124 VEAMTDRICIQSGGGQSAELSALDLISCCEDCGGGCKGGFPGQAWDMGKTRDSHWRFRKK 183

Query: 177 -HHGVVTEECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 228
            H G     C PY        T   +P C    Y TP+C + C K  +  +   K +   
Sbjct: 184 NHTG-----CQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEG 238

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           +  + ++ +    +I   GPVE +F VYEDF + KSG+ +H+TG ++GGH +++IGWG  
Sbjct: 239 SSNVQNNEKVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGV- 297

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           + G  YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 298 EKGNPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/326 (38%), Positives = 171/326 (52%), Gaps = 35/326 (10%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK--PTPKGLLLGVPVKTH 89
           ++ L+   I ++NE     W A  N   S       + LLG K   TP  +   +  K+ 
Sbjct: 21  AYFLEKDYINKINEKAST-WTAGFNFDPSTPKEDILR-LLGSKGVQTPSKINHKM-YKSE 77

Query: 90  DKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 142
           DK       ++PK FDAR  W  C+TI  + DQG+CGSCWA     A +DR C+  +   
Sbjct: 78  DKEYDNLFGRIPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIATSSAFADRLCVATNADF 137

Query: 143 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 189
           N  LS  ++  CC   CG GC+GGYPI AW  F  HG+VT       E C+PY      +
Sbjct: 138 NQLLSAEEITFCC-HKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPY 196

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAY--RINSDPEDIMAEIYKN 246
           D +G +    +P     +C R C     L  +  H ++  +Y   I S  +D+M      
Sbjct: 197 DESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIGSIQKDVMTY---- 252

Query: 247 GPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           GP+E SF VY+DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  W
Sbjct: 253 GPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWG-EEYGTPYWLMMNSWNADW 311

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLP 331
           G +G FKI+RG+NECG++    AG+P
Sbjct: 312 GDEGLFKIRRGTNECGVDNSTTAGVP 337


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 91/120 (75%), Positives = 105/120 (87%)

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
           R +SDP  IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2   RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 350
           GEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAG+PS KNL  E+  +D F DAS
Sbjct: 62  GEDYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDAS 121


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 167/317 (52%), Gaps = 25/317 (7%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 93
           L D  + E+ ++ +  WKA RN  F+      F K L  V+  P   +  +P+K    + 
Sbjct: 20  LSDEFL-ELLQSKQMTWKAGRN--FAKDISKDFLKSLNCVRKNPD--IPKLPLKNVTPTK 74

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
           ++P  FDAR  WP C  I  I DQG+CGSCWA  A   ++DR CI     ++   S  ++
Sbjct: 75  EIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENV 134

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 198
            ACC   CG+ C GG   +A+ ++V  G V+       E C PY     C H      P 
Sbjct: 135 AACCT-ECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPY-SVEECEHHIEGPRPP 192

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           CE   P   C   C ++  + +     Y + AY +  D   I  EI  NGPV  +F VY+
Sbjct: 193 CEGDMPELVCSETCHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYD 252

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           DF  YKSGVY+H TG + G HAV++IGWG  ++G  YW++AN WN  WG +G FKI RGS
Sbjct: 253 DFLSYKSGVYQHETGLLDGYHAVRVIGWG-EEEGTPYWLVANSWNTDWGDNGLFKILRGS 311

Query: 318 NECGIEEDVVAGLPSSK 334
           +EC  E D+ A   SSK
Sbjct: 312 DECEFEGDMAAATYSSK 328


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 106/269 (39%), Positives = 154/269 (57%), Gaps = 26/269 (9%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           VP +  D    L + FDAR  WP+C +I +I D   C S WAF A E++SDR CI+ G  
Sbjct: 21  VPTENSD----LSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGT 76

Query: 142 MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 192
           +N  LS  +LL+CC G L CG+GC GG    AW+Y+  HG+ T         C PY  + 
Sbjct: 77  INTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAP 136

Query: 193 ------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAE 242
                   ++P C     PTP C +KC  KN         +HY  S  ++ +   +I ++
Sbjct: 137 CGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSD 196

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +  NGP+E +F VY+DF  Y +G+Y H+TG+  G  +V+++GWG   +G  YW+LAN W 
Sbjct: 197 VMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWG 255

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           + WG +G F+  RG+NECG+E + V+G+P
Sbjct: 256 KEWGENGTFRALRGTNECGLEANCVSGMP 284


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 107/263 (40%), Positives = 145/263 (55%), Gaps = 18/263 (6%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           +P+     +  +P+SFD+R  W  C ++  I DQ +CGSCWA  A + +SDR CIH    
Sbjct: 85  LPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGR 144

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 194
             + LS  D+LACCG  CG GCDGGY   AW++    GVVT         C PY      
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204

Query: 195 SHPGCE----PAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
           +H G      P++P     RK   +    + + N K  + + Y + +D   I  EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PV  +F +YEDF HY  GVY H  G + GGH++K+IGWG  D G  YW++AN W+  WG 
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323

Query: 308 D-GYFKIKRGSNECGIEEDVVAG 329
           D GYF++ RG N C IE  V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 117/299 (39%), Positives = 158/299 (52%), Gaps = 27/299 (9%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + +L +SI++ VN +P + W A   P  S  T  +F   LG   T           + 
Sbjct: 5   LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTYVTK------YEETSF 57

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
           D    LP++FD+R  WP    I  + DQ  CGSCWAF   E + DR  I       +S  
Sbjct: 58  DLDNALPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQ 115

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           DL++C       GC+GGY   AW +   HG+ TE+C PY   +G            P C 
Sbjct: 116 DLVSC--DTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSG----------RVPACP 163

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            KCV  + + RN    S+S  ++N+  + +M E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVH 218

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            TG + GGHAV  +GWG  +D   YW+  N W  +WG  G+FKI RGSN CGIE    A
Sbjct: 219 KTGGIAGGHAVLCVGWGV-EDNTPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 103/222 (46%), Positives = 134/222 (60%), Gaps = 12/222 (5%)

Query: 119 HCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 176
            CGSCWAF   E +SDR CI        ++S  D+LACCG  CGDGC+GGYPI A+R++ 
Sbjct: 60  QCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWN 119

Query: 177 HHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISA 229
             GVVT        C PY  +  C+   C P   TP C   C    +  +   K + +SA
Sbjct: 120 SRGVVTGGDFRGSGCRPYPFAP-CNSYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSA 177

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y +  +   I  EI  NGPV  +FT+YED   YKSGVY+H  G ++GGHA+K+IGWGT  
Sbjct: 178 YAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-Q 236

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +G  YW++AN W   WG +G+ K++RG NECGIE  VVAG+P
Sbjct: 237 NGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGMP 278



 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/62 (58%), Positives = 46/62 (74%), Gaps = 1/62 (1%)

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           NGPVE SFTVYEDF  YK GVY++  G V+G HA+K++GWGT + G DYW++AN W    
Sbjct: 3   NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWLIANSWGAQC 61

Query: 306 GA 307
           G+
Sbjct: 62  GS 63


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 117/299 (39%), Positives = 158/299 (52%), Gaps = 27/299 (9%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + +L +SI++ VN +P + W A   P  S  T  +F   LG   T           + 
Sbjct: 5   LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTYVTK------YEETSF 57

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
           D    LP++FD+R  WP    I  + DQ  CGSCWAF   E + DR  I       ++  
Sbjct: 58  DLDNALPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQ 115

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           DL++C       GC+GGY   AW +   HGV TE+C PY   +G            P C 
Sbjct: 116 DLVSC--DTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSG----------RVPACP 163

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            KCV  + + RN    S+S  ++N+  + +M E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVH 218

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            TG + GGHAV  +GWG  D+   YW+  N W  +WG  G+FKI RGSN CGIE    A
Sbjct: 219 KTGGIAGGHAVLCVGWGVEDN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 108/239 (45%), Positives = 141/239 (58%), Gaps = 22/239 (9%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWA-----FGAVEALSDRFCIHFG--MNLSLS 147
           LP+SFD+R  WP C  I  I +Q  CGSCWA       + E LSDRFCI  G  +N+ LS
Sbjct: 2   LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
             DL++C  +    GCDGG   +AW Y  H G+VT++C PY    G +          P 
Sbjct: 60  PQDLVSCNWY--NAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVA----------PS 107

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           C + C   +    + K+ +   Y + S  E IM EI  NGPV+  F+VY+DF  YKSGVY
Sbjct: 108 CPKYCNGTSTPIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVY 167

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
            H TG  +GGHA+K++GWG  ++ + YW++AN W   WG +G FKIKRG NECGIE DV
Sbjct: 168 THQTGSFLGGHAIKIVGWGVENNVK-YWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 111/235 (47%), Positives = 139/235 (59%), Gaps = 26/235 (11%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWA  AVEA+SDR CI       + LS +DLL+CC   CG GC GG P++AW+Y+V  G
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSG 221

Query: 180 VVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRNS 222
           +VT     Y + +GC     P CE               YPTPKC R+C K   + ++  
Sbjct: 222 IVTG--SDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKAD 279

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
           K+Y   AY + +D E I  EI   GPVE SF VY DF HY  G+YKH+ G V GGHAVK+
Sbjct: 280 KYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKI 339

Query: 283 IGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLPSSK 334
           +GWG  D G  YW+ AN WN  WG D   GYF+I RG +ECGIE  +VAG+P  +
Sbjct: 340 LGWGI-DQGVSYWLAANSWNTDWGEDVFSGYFRILRGVDECGIESGIVAGIPRKE 393


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 121/315 (38%), Positives = 162/315 (51%), Gaps = 34/315 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           +  S++ E+N        +    +F N ++   K L G       L+ G    ++DK++K
Sbjct: 82  IMQSLVDEINSKQNTWTASTGQKRFKNLSLRDAKMLCGT------LMRG----SNDKAVK 131

Query: 95  ----------LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
                     LP  FDAR+A+P CS  I  I DQ  CGSCWAFG  EA +DR CI     
Sbjct: 132 KGYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGA 191

Query: 144 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHP--G 198
            +  LS  ++ AC  F    GC GG P SAW +    G+ T E   P   S   + P   
Sbjct: 192 FTELLSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIA 248

Query: 199 CEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            +  YPTP CV +C   K     R+ +H+ + +   +    D    I  +GPV  SFTVY
Sbjct: 249 YQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVY 308

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           EDF  YKSGVYKH +G  +GGHAVK+IGWG    G+ YW+  N WN  WG  G FKI  G
Sbjct: 309 EDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EKSGQAYWLAVNSWNEDWGDKGLFKIALG 367

Query: 317 SNECGIEEDVVAGLP 331
           +  CGI++D++ G P
Sbjct: 368 N--CGIDDDLLGGTP 380


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 129/353 (36%), Positives = 181/353 (51%), Gaps = 43/353 (12%)

Query: 21  AEGVVSKLKLDSHILQDSIIKE-VNENPKAGWKAARNPQFSNYTVGQFKHLLGV------ 73
           + G +  L++    L+    ++ ++      W+A  +P+F  +++   K  +G       
Sbjct: 152 SNGALQHLRVKMQRLKLQAAEQGLDPEQAVTWEAEVSPRFKYHSIKDAKRHMGTYLSFYS 211

Query: 74  ---KP-TPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGA 128
              KP  P G  L V V    + +     FDAR A+PQC+  I  + DQG CGSCWAF +
Sbjct: 212 DPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFAS 271

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAWRYFVHHGVVT-- 182
            EAL+DRFCI  G     +LS     +CC  L     GC GG P  AWR+F + GVVT  
Sbjct: 272 TEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDGVVTGG 331

Query: 183 --------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC-----VKKNQLWRNS 222
                   + C PY +   C H      P CE   P  PKC + C       K + +++ 
Sbjct: 332 DYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDD 390

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
            H++ SAY +    + I  E+ +NG +  +F VYEDF  YK GVY H+TG  MGGHAVK+
Sbjct: 391 LHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKV 449

Query: 283 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           IG+G ++DG DYW+  N WN  WG  G FKI+ G  E GI+++   G P   N
Sbjct: 450 IGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPN 499


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/334 (36%), Positives = 163/334 (48%), Gaps = 12/334 (3%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A    GV + L  D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LCLLSTALVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  C + WA      
Sbjct: 67  GAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASV 126

Query: 132 LSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           +SDR+C   G+  L +S   LL+CC    G    GG+P  AWRY+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGVQQLRISAAHLLSCCKQCGGGC-KGGFPGFAWRYYVEYGIASSYCQPYPF 185

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
                  + G   P  +  + TPKC   C  K+      K+   + Y +    ED   E+
Sbjct: 186 PHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKREL 243

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG   +G  YW +AN W+ 
Sbjct: 244 YFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKVANTWDT 302

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 120/319 (37%), Positives = 163/319 (51%), Gaps = 21/319 (6%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           D+ +L    +  +N+     WKA  + +  N T  + K L G        L  V      
Sbjct: 27  DAPVLTQKFVDRINQLNGGMWKAVYDGKMQNLTFSEAKRLTGAFSRKTSTLPPVRFTEEQ 86

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVN 149
              +LP+SFDA   WP C TI  I DQ  C + WA     A+SDR+C +  G  L +S  
Sbjct: 87  LRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLRISAA 146

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 204
           DL+ACC   CG GC+GGYP +AW Y+V +G+ + +C PY     C H G +   P     
Sbjct: 147 DLMACCT-GCGGGCEGGYPDAAWEYYVSNGITSSQCQPY-PFPRCEHRGAQGKKPPCSKY 204

Query: 205 ---TPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
              TP C   C  K+     +R +  Y +         ED   E+Y NGP  V F V+ D
Sbjct: 205 NFDTPTCNATCTDKSVPLIKYRGNHSYEVRG------EEDYKRELYFNGPFVVRFQVHSD 258

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVY+H+ G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I RG+N
Sbjct: 259 FLAYKSGVYQHVAGNFLGGKAVRIVGWGKM-NGTPYWKVANSWDTDWGMNGYFLILRGNN 317

Query: 319 ECGIEEDVVAGLPSSKNLV 337
           EC IE    AG P +  L 
Sbjct: 318 ECNIEHLGFAGTPDTSQLT 336


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 129/353 (36%), Positives = 181/353 (51%), Gaps = 43/353 (12%)

Query: 21  AEGVVSKLKLDSHILQDSIIKE-VNENPKAGWKAARNPQFSNYTVGQFKHLLGV------ 73
           + G +  L++    L+    ++ ++      W+A  +P+F  +++   K  +G       
Sbjct: 152 SNGALQHLRVKMQRLKLQAAEQGLDPEQAVTWEAEVSPRFKYHSIKDAKRHMGTYLSFYS 211

Query: 74  ---KP-TPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGA 128
              KP  P G  L V V    + +     FDAR A+PQC+  I  + DQG CGSCWAF +
Sbjct: 212 DPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFAS 271

Query: 129 VEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAWRYFVHHGVVT-- 182
            EAL+DRFCI  G     +LS     +CC  L     GC GG P  AWR+F + GVVT  
Sbjct: 272 TEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDGVVTGG 331

Query: 183 --------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC-----VKKNQLWRNS 222
                   + C PY +   C H      P CE   P  PKC + C       K + +++ 
Sbjct: 332 DYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDD 390

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
            H++ SAY +    + I  E+ +NG +  +F VYEDF  YK GVY H+TG  MGGHAVK+
Sbjct: 391 LHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKV 449

Query: 283 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           IG+G ++DG DYW+  N WN  WG  G FKI+ G  E GI+++   G P   N
Sbjct: 450 IGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPN 499


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 162/317 (51%), Gaps = 26/317 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
             L D  IK +NE  K  WKA R    +N +      LLG +         V +KT+D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERFFP-ANTSKEYIMGLLGSRGYTN-YSSEVEIKTYDPL 79

Query: 93  LKLPKS---FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
            +   S   FD+R  W  C  I RI DQG+CGSCWAFG   A +DR C+  G   N  LS
Sbjct: 80  YEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGTTGAFADRLCVSTGGKFNELLS 139

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 195
             D+  CC   CG GC+GGYPI AW+YF   GV T       E C PY     FD  G +
Sbjct: 140 PEDVAFCCQ-NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKN 198

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
               +P     +C + C     +    K Y +    + + P  +  ++ K GP+E SF +
Sbjct: 199 TCAGKPLERNHQCPKTCYGSTTV---QKRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNL 255

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           ++D + YKSG+Y+       + GH++K+IGWG  ++G  YW+  N W++ WG  G F+I 
Sbjct: 256 FDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWG-KENGVPYWLAVNSWSKFWGEQGTFRII 314

Query: 315 RGSNECGIEEDVVAGLP 331
           +G NECGIE    AG+P
Sbjct: 315 KGRNECGIERSATAGIP 331


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/331 (37%), Positives = 173/331 (52%), Gaps = 41/331 (12%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG--VKPTPKGLLLGVPVKTHDKS 92
           +  S++ EVN        +    +F   ++G  K L G  +  T +   L   V   ++ 
Sbjct: 41  IMQSLVDEVNSKQNLWTASTEQGRFYGRSLGDAKKLCGTFLNGTEE---LEEKVYPAEEL 97

Query: 93  LKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           + +P SFDAR A+ +C   I  + DQ  CGSCWAFG VEA + R CI  G  +N  LS  
Sbjct: 98  VDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAA 157

Query: 150 DLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTG 193
           D+LACC    F    GC GG PI++W +   +G+V+             + C PY +   
Sbjct: 158 DMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPY-NFPK 216

Query: 194 CSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMA 241
           C+H        P  +  Y TP C   C   K    +   +HY+ S +  R  S    I  
Sbjct: 217 CAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKK 275

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           EI  NGP   +F+VYEDF  YKSGVYKH +G  +GGHAV++IGWGT + G DYW++ N W
Sbjct: 276 EIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSW 334

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           N  WG  G FKI +G  +CGI++ ++AG P+
Sbjct: 335 NEEWGDHGTFKIVQG--DCGIDDMILAGTPA 363


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 164/334 (49%), Gaps = 12/334 (3%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           LCL   A    GV + L  D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LCLLSTALVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  C + WA      
Sbjct: 67  GAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASV 126

Query: 132 LSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           +SDR+C   G+  L +S   LL+CC    G    GG+P  AWRY+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGVQQLRISAAHLLSCCKQCGGGC-KGGFPGFAWRYYVEYGIASSYCQPYPF 185

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
                  + G   P  +  + TPKC   C  K+      K+   + Y +    ED   E+
Sbjct: 186 PHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKREL 243

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           Y NGP    F VY D   YKSGVY+++ GD++GG AV+++GWG   +G  YW +AN W+ 
Sbjct: 244 YFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQAVRIVGWGKL-NGTPYWKVANTWDT 302

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 116/301 (38%), Positives = 164/301 (54%), Gaps = 21/301 (6%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
           +I ++N   ++ W A  NP F +  +      LG+ P P   L    ++  +    +P +
Sbjct: 23  LINQINSQ-QSSWTARINP-FDD--IESRLGFLGIHPDPNFQL--EVLEWEEPRTVIPAT 76

Query: 99  FDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACC 155
           FDAR  WPQC   I  I +QG CGSCWAF A E +SDR C+  +  +    S  DL+ CC
Sbjct: 77  FDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLINCC 136

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC 212
              CG  C GGY   AW+Y+   G+V+     Y  S GC  P  +  +    +P+C + C
Sbjct: 137 E-TCGKKCKGGYSYYAWKYYTSTGLVSG--GDYNTSRGC-QPYSKSNFNDGVSPECSKTC 192

Query: 213 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY-KNGPVEVSFTVYEDFAHYKSGVYKH 269
              K    + N +H+    Y I  +   I  EI  + GPV   F VYEDF  Y+ GVY H
Sbjct: 193 QNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVH 252

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVA 328
            +G ++G HAVK+IGWGT ++G  YW++AN W + WGA  G FKI+RG+NEC IE+ ++ 
Sbjct: 253 TSGALLGSHAVKIIGWGT-ENGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIEQSIIT 311

Query: 329 G 329
           G
Sbjct: 312 G 312


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 129/196 (65%), Gaps = 14/196 (7%)

Query: 120 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           CGSCWAFGAVEA+SDR CIH  +++ +S  DLL CCG +CGDGC+GGYP  AW ++   G
Sbjct: 1   CGSCWAFGAVEAISDRICIHTNVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG 60

Query: 180 VVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYS 226
           +V+         C PY           S P C     TPKC + C    +  ++  KHY 
Sbjct: 61  LVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG 120

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             +Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG
Sbjct: 121 YDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWG 180

Query: 287 TSDDGEDYWILANQWN 302
             ++G  YW++AN WN
Sbjct: 181 V-ENGTPYWLVANSWN 195


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 95/210 (45%), Positives = 135/210 (64%), Gaps = 14/210 (6%)

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DS 191
           + +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY        
Sbjct: 1   VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH 60

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
              S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE
Sbjct: 61  VNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVE 120

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
            +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+
Sbjct: 121 GAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGF 179

Query: 311 FKIKRGSNECGIEEDVVAGLPSSKNLVKEI 340
           FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 180 FKILRGQDHCGIESEVVAGIPRTDQYWEKI 209


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 94/205 (45%), Positives = 134/205 (65%), Gaps = 14/205 (6%)

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF----- 189
           +++ +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY      
Sbjct: 1   VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 60

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGP
Sbjct: 61  HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 120

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +
Sbjct: 121 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 179

Query: 309 GYFKIKRGSNECGIEEDVVAGLPSS 333
           G+FKI RG + CGIE +VVAG+P +
Sbjct: 180 GFFKILRGQDHCGIESEVVAGIPRT 204


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 118/301 (39%), Positives = 163/301 (54%), Gaps = 29/301 (9%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + ++ +SI++ VN +P + W A   P+    T+ + + +LG +  P      +    +
Sbjct: 5   LFASVIAESIVETVNNDPSSTWVAIEYPR-EVITLAKMRAMLGEEVLP------LEDVEY 57

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
            +   +P++FDAR  WP    I  + DQ  CGSCWA  A EA+ +RF I       LSV 
Sbjct: 58  VEPNNVPENFDAREQWP--GKIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQ 115

Query: 150 DLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
           DL++C     GD GC+GG    + ++ V +GV TEEC PY    G            P C
Sbjct: 116 DLVSCDK---GDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNG----------RVPAC 162

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
             KC   +Q+ R  K+     Y +    ++I  E+ KNGPV   FTVY DF +YKSGVY+
Sbjct: 163 AAKCSNGSQIIR-YKYEKAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQ 217

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           H +G   GGHAV LIGWG  +DG  YW+L N W  +WG  G+FKI RG NECG E+   A
Sbjct: 218 HKSGYQEGGHAVLLIGWGV-EDGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQGFYA 276

Query: 329 G 329
           G
Sbjct: 277 G 277


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 117/297 (39%), Positives = 155/297 (52%), Gaps = 27/297 (9%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + +L +SI++ VN +P + W A   P  S  T  +F   LG        +     +T+
Sbjct: 5   LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTH------VEEYEERTY 57

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
           +    LP++FDAR  WP+   I  + DQ  CGSCWAF   E + DR  I       +S  
Sbjct: 58  ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQ 115

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           DL++C       GC+GGY   AW +   HGV  EEC PY    G            P C 
Sbjct: 116 DLVSC--DTTDMGCNGGYMDKAWAWTKSHGVTNEECMPYQSGGG----------RVPACP 163

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            KCV  + + R +K  S + +  +     +  E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSTIVR-TKSQSFTHFTAS----QMQQELYENGPLSVAFTVYYDFMNYKSGVYVH 218

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
            TG V GGHAV  IGWG  D+   YW+  N W  +WG  G+FKI RGSN CGIE  V
Sbjct: 219 KTGGVAGGHAVLCIGWGVEDN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQV 274


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 105/270 (38%), Positives = 155/270 (57%), Gaps = 27/270 (10%)

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 141
           VP +  D    L + FDAR  WP+C++I +I D   C S WAF A E++SDR CI+ G  
Sbjct: 65  VPTENSD----LSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGM 120

Query: 142 MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 192
           +N  LS  +LL+CC G L CG+GC GG    AW+Y+  HG+ T         C PY  + 
Sbjct: 121 INTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAP 180

Query: 193 ------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAY-RINSDPEDIMA 241
                   ++P C     PTP C +KC  KN         +HY  S+  ++ +   +I +
Sbjct: 181 CGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQS 240

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           ++  NGP+E +F VY+DF  Y +G+Y H+TG+  G  +V+++GWG   +G  YW+LAN W
Sbjct: 241 DVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSW 299

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            + WG +G F+  RG+NECG+E + V+ +P
Sbjct: 300 GKEWGENGTFRALRGTNECGLEANCVSAMP 329


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 112/300 (37%), Positives = 153/300 (51%), Gaps = 27/300 (9%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + +  +SI++ VN +P A W A   P     T  + +  LG     +G    VP    
Sbjct: 5   LIASVFAESIVETVNNHPGATWVAVEYPP-EVITTAKLRARLGAIDLNEGPSNYVP---- 59

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
                LP +FDAR  WP    I  + +Q  CGSCWAF   E   +R  I       +S  
Sbjct: 60  --DTSLPDNFDAREQWP--GKILPVRNQEQCGSCWAFAVAETTGNRLNILGCGRGDMSPQ 115

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           DL++C       GC+GG P+ +W +  H G+ TEEC PY    G            P C 
Sbjct: 116 DLVSC--DKVDHGCNGGSPLFSWEWVKHSGITTEECIPYVSGGG----------RVPSCP 163

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
           +KC   + + R +K  S+   +     + +  E+Y  GP E +F+VYEDF  YKSGVY H
Sbjct: 164 KKCTNGSAIVR-TKAKSVGLVK----GDKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHH 218

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           ITG ++GGHAV ++GWG  +DG  YW++ N W  +WG  G+FKI RG NECGIE     G
Sbjct: 219 ITGKMLGGHAVMVVGWGV-EDGTPYWLIQNSWGTTWGEQGFFKILRGKNECGIETTCFQG 277


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 17/205 (8%)

Query: 101 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFL 158
           +R  WP C TI  I DQG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL+CC   
Sbjct: 1   SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLE 60

Query: 159 CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPT 205
           CG+GC+GGYP  AW ++ + G+V+         C PY  S  C H      P C     T
Sbjct: 61  CGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISP-CEHHVNGSRPKCSGEIET 119

Query: 206 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           P+C R+C    +  +   KHY +++Y I SD  +IM EIYKNGPVE +  V++DF  YKS
Sbjct: 120 PRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKS 179

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSD 289
           GVY+H TG  +GGHA+K++GWG  +
Sbjct: 180 GVYQHKTGGSIGGHAIKILGWGEEN 204


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  194 bits (492), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 109/258 (42%), Positives = 146/258 (56%), Gaps = 23/258 (8%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D+   +P+SFDAR+ W  C+++  I DQ +CGSCWA     ALSDR CI       L +S
Sbjct: 89  DEDDDIPESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALSDRICIASKGETQLHIS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------CDPY---------FDST 192
             D+++CC  LCG GCDGG+PI A+ YF   G VT E      C PY          D+ 
Sbjct: 149 SIDIVSCCK-LCGYGCDGGWPIEAFDYFSRQGAVTGETTSKDGCRPYPFHPLWTYGNDTV 207

Query: 193 GCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           G    G C+ +    + V++ V +N   R     +    RI    +      + NGPV  
Sbjct: 208 GRRMSGRCKHSKTVGEGVKR-VTRNHTRRTG--LTARRLRITEFCQSHSEGDHGNGPVVA 264

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
            FTVYEDF++YK G+Y HI G   G HA+K+IGWG  ++G  YW++AN W+  WG  G F
Sbjct: 265 VFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGV-ENGLPYWLIANSWHDDWGEQGLF 323

Query: 312 KIKRGSNECGIEEDVVAG 329
           +I RG NECGIE++VVAG
Sbjct: 324 RIVRGINECGIEQEVVAG 341


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/281 (41%), Positives = 155/281 (55%), Gaps = 25/281 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGL-LLGVPVKTHDKS 92
             D +I  +NE   A WKAA + +F+N  + Q K  LGV + TP+        V+     
Sbjct: 3   FSDELIHYINEESGASWKAAPSTRFNN--IDQVKQNLGVLEETPEDRNTQRQTVRYSVSE 60

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDAR  WP C +IS I DQ  C SCWA  +  A++DR CIH        LS  D
Sbjct: 61  NDLPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAID 120

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGC 199
           +++CC + CG GC+GG P  +W Y+   GVVT         C PY     CSH    PG 
Sbjct: 121 IVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGL 178

Query: 200 EPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
            P     YPTPKC +KC    N+ +   K    S+Y +     DIM EI KNGPV+  F 
Sbjct: 179 PPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFY 238

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
           ++EDF  YKSG+Y + TG ++GGHA+++IGWG  ++G +YW
Sbjct: 239 MFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 116/327 (35%), Positives = 163/327 (49%), Gaps = 27/327 (8%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLG 83
           L   ++ L+   I  +N+     WKA  N    N        LLG +    P      + 
Sbjct: 17  LTEQAYFLEKDFIDNINKQATT-WKAGVNSA-PNTPKEHILRLLGSRGVQIPDKVNYNMY 74

Query: 84  VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 141
                 D   ++P  FDAR  W +C TI  + DQG+CGS WA     A +DR C+  +  
Sbjct: 75  KNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGD 134

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 188
            N  LS  ++  CC   CG+GC+GGYPI AW+ F +HG+VT       E C+PY      
Sbjct: 135 FNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCP 193

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNG 247
           +D  G +    +P     KC +KC     +  N  H Y+   Y +      I  ++   G
Sbjct: 194 YDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYG 251

Query: 248 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           P+E SF VY+DF +YKSG+Y K      +GGH+VKLIGWG  + G  YW++ N WN  WG
Sbjct: 252 PIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWG 310

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLPSS 333
             G FKI+RG+NEC ++     G+P +
Sbjct: 311 DKGLFKIRRGTNECRVDNSTTGGVPDT 337


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 114/276 (41%), Positives = 153/276 (55%), Gaps = 25/276 (9%)

Query: 58  QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 117
           +F+NYT  Q K LLG   + +    G+   T   +  LP SFD+R+ W  C  +  I DQ
Sbjct: 45  KFANYTEAQLKGLLGTVLSHQS---GISAFTQINA-ALPDSFDSRTQWKDC--VHPIRDQ 98

Query: 118 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 175
             CGSCWAF A E+LSDRFCI     +NL LS  D+++C       GC GGY   AW+Y 
Sbjct: 99  AQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSC--DTSNFGCFGGYLDQAWQYL 156

Query: 176 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
              GV ++ C+PY      S  G +P+ PT     + +KK +    S   +  A      
Sbjct: 157 EQQGVSSDSCEPYK-----SGNGDQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA------ 205

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            E   + I ++GPVE  FTVY+DF +Y SGVY H+TGD  GGHAVK++GWG     E+YW
Sbjct: 206 -EATKSLIQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGL-ENYW 263

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           I+AN W   WG  GYF I++G  + GI+E     +P
Sbjct: 264 IVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 105/255 (41%), Positives = 140/255 (54%), Gaps = 21/255 (8%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           ++PK FDAR  W +C TI  + DQG+CGSCWA     A +DR C+    + +  LS  +L
Sbjct: 87  RIPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSSAFADRLCVATDADFNEFLSPEEL 146

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPG 198
             CC   CG GC+GGYPI AW  F  HG+VT       E C+PY        + G +   
Sbjct: 147 TFCC-HTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGNNSCS 205

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
            +P     +C R C     L  +  H Y+  +Y +      I  ++   GP+E SF VY+
Sbjct: 206 DKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG--SIQKDVMNYGPIEASFDVYD 263

Query: 258 DFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           DF  YKSGVY +      +GGHAVKLIGWG  + G  YW++ N WN  WG  G FKI+RG
Sbjct: 264 DFPSYKSGVYIRSDNASYLGGHAVKLIGWG-EESGVPYWLMVNSWNTDWGDKGLFKIQRG 322

Query: 317 SNECGIEEDVVAGLP 331
           +NECG++    AG+P
Sbjct: 323 TNECGVDNSTTAGVP 337


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 118/322 (36%), Positives = 162/322 (50%), Gaps = 26/322 (8%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK------PTPKGLLLGVP 85
           +  L D+ +++V    K  W   RN   S  +    + L+GV       P P    +   
Sbjct: 22  ADFLSDAFMEKVRRKAKT-WNLGRNFHES-ISEKYLRGLMGVHEESYKYPLPDKQEVLGE 79

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
                    LP  FDAR  W  C TIS I +QG CGSCWA      +SDR CI     MN
Sbjct: 80  SDDEISLADLPVDFDARLRWTSCPTISEIREQGSCGSCWAIATTSVMSDRLCIGSNGVMN 139

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DS 191
             LS  D+L+CC  +CG  C GGYP +AW Y+   G+V+       + C PY       S
Sbjct: 140 FRLSGLDMLSCCA-ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHS 198

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
              S P C       +C   C    ++ ++  K+++   Y I++D  +I  EI  NGPV+
Sbjct: 199 GNGSRPVCTVGGGV-RCQHLCEPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPVQ 257

Query: 251 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADG 309
              TVYEDF  YK+GVY H+ G+ +G HAV+++GWG        YW++AN W   WG +G
Sbjct: 258 AILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWGDNG 317

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
           +F I RG N C IE  ++AGLP
Sbjct: 318 FFHIFRGENHCDIEGYIMAGLP 339


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 103/228 (45%), Positives = 137/228 (60%), Gaps = 19/228 (8%)

Query: 125 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AFGAVEA+SDR CIH     +  +S  DL++CCG+ CG GC GG+P +AW ++   G+VT
Sbjct: 1   AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVT 59

Query: 183 -------EECDPYFDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRNSKHYSIS 228
                    C  Y     CSH G +         Y TP CV+KC   +  +   K  +  
Sbjct: 60  GGSKENPTGCRSY-PFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANI 118

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
            Y + +    IM EI  NGPVE +F VYEDF  YKSGVY H  G ++GGHA++++GWG  
Sbjct: 119 TYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-E 177

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           ++G  YW++AN WN  WG DGYFK+ RG NECGIE++V AGLP   ++
Sbjct: 178 ENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPELSSI 225


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 114/279 (40%), Positives = 150/279 (53%), Gaps = 31/279 (11%)

Query: 58  QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 117
           +F+NYT  Q K LLG   + +    G+   T   +  LP SFD+R+ W  C  +  I DQ
Sbjct: 45  KFANYTEAQLKGLLGTVLSHQS---GISAFTQINA-ALPDSFDSRTQWKDC--VHPIRDQ 98

Query: 118 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC-CGFLCGDGCDGGYPISAWRY 174
             CGSCWAF AVE+LSDRFCI     +NL LS  D+L+C     C   C GGY  +AW+Y
Sbjct: 99  AKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFC---CFGGYLDTAWQY 155

Query: 175 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA--YRI 232
               GV ++ C+PY    G            P C  KC     +    K Y   A   + 
Sbjct: 156 LEQQGVGSDSCEPYKSGNG----------DQPSCPSKCSNGQAI----KKYKCKAGSTKQ 201

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 292
               E   + I ++GPVE  FT+YEDF +Y SG+Y H+TG  MGGHAVK++GWG     E
Sbjct: 202 AKGAEATKSLIQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGL-E 260

Query: 293 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +YWI+AN W   WG  GYF I++G  + GI+E     +P
Sbjct: 261 NYWIVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 20/217 (9%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLS 147
           D    LP++FDAR  WP C TI  + DQG CGSCWAFGAVEA+SDR CIH     N   S
Sbjct: 23  DAPTDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFS 82

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------- 194
             +L++CC + CG GC+GG+P +AW Y+   G+V+    PY  + GC             
Sbjct: 83  AENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAPCEHHVN 139

Query: 195 -SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
            +   C+    TPKCV+KC    ++ +    H+  SAY +++D + I  EIY NGPVE +
Sbjct: 140 GTRGPCKEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGA 199

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG  +
Sbjct: 200 FTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 174/351 (49%), Gaps = 40/351 (11%)

Query: 17  FATFAEGVVSKLKL----DSHILQD--------SIIKEVNENPKAGWKAARNPQFSNYTV 64
           FA F E +  + K     D  +L D        S++ E+N        +A   +F   ++
Sbjct: 50  FARFEEELSIQSKFISTEDMEVLYDETRPAIMQSLVDEINAKQNTWTASAEQEKFKTSSL 109

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSC 123
              K L G         +   V   ++   LP  FDAR+A+P+CS  I  + DQ  CG C
Sbjct: 110 RDAKMLCGTLTRDSNDKVVEKVYAIEELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDC 169

Query: 124 WAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAFG  EA +DR CI      +  LS  ++ AC   L   GC GG+P SAW +    G+ 
Sbjct: 170 WAFGVTEAFNDRLCIKSNGTFTKLLSAGEMNACAPSLKDPGCRGGFPYSAWSWVHDEGIA 229

Query: 182 T-------------EECDPYFDSTGCSHPGCEPAYPT-PKCVR---KCVKKNQ----LWR 220
           T             + C PY D   C+H   +P YP  PK  R   +CV K +    ++ 
Sbjct: 230 TGGDYVPRDNMTEDDGCWPY-DFPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVVYF 288

Query: 221 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 280
           + +++ + +   +   +D    I  +GPV  +F VYEDF  YKSGVYKH +G ++G HAV
Sbjct: 289 SDRYFMVESVPYHFSADDAKNAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAV 348

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           K+IGWG  D GE YW++ N WN  WG  G FKI  G  +CGI+ +++ G P
Sbjct: 349 KIIGWG-EDGGEAYWLVVNSWNEGWGDHGLFKIALG--DCGIDNELLGGTP 396


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 105/261 (40%), Positives = 140/261 (53%), Gaps = 21/261 (8%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D   ++P  FDAR  W +C TI  + DQGHCGS WA     A SDR C+  +   N  LS
Sbjct: 20  DNYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLS 79

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTGC 194
             ++  CC   CGDGC GGYPI AW+ +  HG+VT       E C+PY       D  G 
Sbjct: 80  AEEITFCC-HTCGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGN 138

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           +    +P     +C R C     L  +  H Y+   Y +      I  ++   GP+E SF
Sbjct: 139 NTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY--RGIQKDVINYGPIEASF 196

Query: 254 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
            VY+DF  YKSG+Y K      +GGH+VKLIGWG  + G  YW++ N WN  WG  G FK
Sbjct: 197 DVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGDKGLFK 255

Query: 313 IKRGSNECGIEEDVVAGLPSS 333
           I+RG+NECG++     G+P++
Sbjct: 256 IRRGTNECGVDNSTTGGVPAT 276


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 98/252 (38%), Positives = 148/252 (58%), Gaps = 20/252 (7%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC- 155
           FDAR  WP+CS+I  I D   C S WAF A E++SDR CI+ G  ++  LS  +LL+CC 
Sbjct: 89  FDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCT 148

Query: 156 GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST------GCSHPGC-E 200
           G L CG+GC GG P+ AW+Y+  HG+ T         C PY  +         ++P C  
Sbjct: 149 GVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTN 208

Query: 201 PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              PTP C +KC     +     +HY +S  ++ +   +I +++  NGPVE +  +Y+DF
Sbjct: 209 TTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDF 268

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
             Y +G+Y H+ G+  G  +V+++GWG   +G  YW+LAN W + WG +G F++ RG NE
Sbjct: 269 LQYTTGIYVHLAGNKQGHLSVRILGWGMF-EGVPYWLLANSWGKEWGENGTFRVLRGVNE 327

Query: 320 CGIEEDVVAGLP 331
           CG+E + ++G+P
Sbjct: 328 CGLEANCISGMP 339


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 170/318 (53%), Gaps = 46/318 (14%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 100
           W+   + +F   ++   K L+G           PTPKG+ L  P K  + + + +P  FD
Sbjct: 222 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 279

Query: 101 ARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 157
           AR+A+P C   +  + DQG CGSCWAF + EA +DR CI       + LS     +CC  
Sbjct: 280 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNA 339

Query: 158 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 199
           + C   GC+GG P  AWR+F   GVVT            C PY +   C+H      P C
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 398

Query: 200 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           +       TPKC + C ++        +    H + SAY + S  +D+  ++  +GPV  
Sbjct: 399 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 457

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G F
Sbjct: 458 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQF 516

Query: 312 KIKRGSNECGIEEDVVAG 329
           KI  G  +CGI+ ++VAG
Sbjct: 517 KIAMG--QCGIDGEMVAG 532


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 170/318 (53%), Gaps = 46/318 (14%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 100
           W+   + +F   ++   K L+G           PTPKG+ L  P K  + + + +P  FD
Sbjct: 222 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 279

Query: 101 ARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 157
           AR+A+P C   +  + DQG CGSCWAF + EA +DR CI       + LS     +CC  
Sbjct: 280 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNA 339

Query: 158 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 199
           + C   GC+GG P  AWR+F   GVVT            C PY +   C+H      P C
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 398

Query: 200 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           +       TPKC + C ++        +    H + SAY + S  +D+  ++  +GPV  
Sbjct: 399 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 457

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G F
Sbjct: 458 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQF 516

Query: 312 KIKRGSNECGIEEDVVAG 329
           KI  G  +CGI+ ++VAG
Sbjct: 517 KIAMG--QCGIDGEMVAG 532


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 170/318 (53%), Gaps = 46/318 (14%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 100
           W+   + +F   ++   K L+G           PTPKG+ L  P K  + + + +P  FD
Sbjct: 225 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 282

Query: 101 ARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 157
           AR+A+P C   +  + DQG CGSCWAF + EA +DR CI       + LS     +CC  
Sbjct: 283 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHTTSCCNA 342

Query: 158 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 199
           + C   GC+GG P  AWR+F   GVVT            C PY +   C+H      P C
Sbjct: 343 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 401

Query: 200 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           +       TPKC + C ++        +    H + SAY + S  +D+  ++  +GPV  
Sbjct: 402 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 460

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F VYEDF  YKSGVYKH++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G F
Sbjct: 461 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQF 519

Query: 312 KIKRGSNECGIEEDVVAG 329
           KI  G  +CGI+ ++VAG
Sbjct: 520 KIAMG--QCGIDGEMVAG 535


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 119/323 (36%), Positives = 165/323 (51%), Gaps = 39/323 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK---PTPKGLLLGVPVKTHDKSLK 94
           S++ E+N        +    +F N ++   K L G +      K +  G  +   ++   
Sbjct: 3   SLVDEINSKQTTWTASTGQKRFKNLSLRDAKMLCGTRMRGSNDKVIRKGYAI---EELQD 59

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           LP  FDAR+A+P CS  I  I DQ  CGSCWAFG  EA +DR C+      +  LS  ++
Sbjct: 60  LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 119

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-- 196
            AC       GCDGGYP SAW +    G+ T             + C PY D   C+H  
Sbjct: 120 NACAPSY---GCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPY-DFPPCAHHI 175

Query: 197 -----PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                P C + +Y TP CV +C   K +   +N +HY + +        +    I  +GP
Sbjct: 176 NDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQYSVNNAKNAIRTDGP 235

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           V  S+ VYEDF  YKSGVYKH +G  +GGHAVK+IGWG  ++GE YW++ N WN  WG  
Sbjct: 236 VSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EENGEAYWLVVNSWNEDWGDH 294

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G FKI  G+  C I++D++ G P
Sbjct: 295 GLFKIALGN--CQIDDDLLGGTP 315


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/259 (40%), Positives = 142/259 (54%), Gaps = 20/259 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC--IHFGMNLSLS 147
           +  + +P +FDAR  WP C+++  I DQ  CGSCWA  A  A+SDR C   +  +N  LS
Sbjct: 89  EMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 200
             ++L+CC   CG GC GGYP  A+ Y   +G+ T       + C PY     C +   E
Sbjct: 149 DTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPY-AFYPCGNHAHE 207

Query: 201 PAY--------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           P Y        PTP C R C     + +   K ++   Y I  +  +I  EI   GPV  
Sbjct: 208 PYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVA 267

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           ++ VY DF +YK GVY H  G+V G HAVK+IGWG  +D   YW++AN WN  WG +GYF
Sbjct: 268 TYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGND-VPYWLVANSWNTDWGDNGYF 326

Query: 312 KIKRGSNECGIEEDVVAGL 330
           +I RG++ C IE  +V G+
Sbjct: 327 RIVRGTDNCEIERQMVGGI 345


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/286 (41%), Positives = 152/286 (53%), Gaps = 18/286 (6%)

Query: 63  TVGQFKHLLGVKPTP----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 118
           T   F  +L +   P    K   L   +    + L LPKSFDAR  WPQCS+++ I  QG
Sbjct: 26  TTSPFAWILDLPGVPLEKLKETRLHPAINVFAEDLVLPKSFDARQQWPQCSSLNEIRTQG 85

Query: 119 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 176
            CGSC       A++DR+CIH       +    DLL+CC    G    GG P   W Y+V
Sbjct: 86  CCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLLSCCYECGGGCTGGGIPGPIWSYWV 145

Query: 177 HHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN--SKHYS 226
             GV +       + C PY     C  P  E  YP  P C  +C     +  +   + + 
Sbjct: 146 KQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLRDRRFG 204

Query: 227 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 286
             AY I +D   IM +I+ NGPV+  F  YED  +Y  GVY+H +G + GGHAVKLIGWG
Sbjct: 205 RVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWG 264

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
             +DG  YW++AN W R WG DG+FK+ RG N CGIEE+V AGLPS
Sbjct: 265 V-EDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPS 309


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 107/254 (42%), Positives = 141/254 (55%), Gaps = 21/254 (8%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           +P+ +D R  + +CST   I DQ +CGSCWA     A+SDR CI       +++S  D+L
Sbjct: 86  IPEEYDPREKF-KCSTF-YIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDIL 143

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPG------- 198
            CC   CG GC GG+ I AW YFV+ GVV+         C PY     C H G       
Sbjct: 144 TCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGE 202

Query: 199 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           C     TP C +KC     +++R  K     AY +    E I  EI ++GPV  SF VYE
Sbjct: 203 CPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYE 262

Query: 258 DFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRG 316
           DF+ YK+GVYKH  G + G HAVK++GWG  S     YW++AN W+  WG +GYF+  RG
Sbjct: 263 DFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRG 322

Query: 317 SNECGIEEDVVAGL 330
            N+C IE+ V AG+
Sbjct: 323 INDCEIEDTVAAGI 336


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 112/281 (39%), Positives = 150/281 (53%), Gaps = 25/281 (8%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 92
             D +I  +NE   A WKA  + +F N  +  FK  LG+  +   +       V+ +   
Sbjct: 3   FSDELIHYINEKSGASWKAGPSSRFIN--IEHFKQHLGLLEETPEERETRRPTVRYNVSE 60

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+SFDAR  WP C +I +I DQ  CGSCWA   V A+SDR CIH    M   LS  D
Sbjct: 61  NDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAID 120

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA- 202
           L++CC + CG+GC GG P +AW Y+  +G+VT         C PY     C HPG     
Sbjct: 121 LVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQL 178

Query: 203 -------YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                  YPTP C   C    ++ +   K Y  ++Y ++     IM EI KNGPVE  F 
Sbjct: 179 NPCPGYIYPTPSCYPYCQAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFI 238

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
           VY DFA YKSG+Y H++G   G HA+++IGWG  ++G +YW
Sbjct: 239 VYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVNYW 278


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 97/215 (45%), Positives = 133/215 (61%), Gaps = 19/215 (8%)

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 183
           SDR CIH    + +++S  DLL CC   CG GC+GGYP +AW+++   G+VT       +
Sbjct: 1   SDRICIHTKGKVQVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTED 59

Query: 184 ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 236
            C PY+    C H      P C    PTP+C + C +   + +   KH+    Y I+SD 
Sbjct: 60  GCQPYYFPP-CEHHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 296
             I  EI KNGPVE  F VY DF  YKSGVY+  + +++GGHA++++GWGT +DG  YW+
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWL 177

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +AN WN  WG  GYFKI+RG++ECGIE D+ AG+P
Sbjct: 178 VANSWNEDWGDKGYFKIRRGNDECGIENDINAGIP 212


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 163/319 (51%), Gaps = 26/319 (8%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 91
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K       +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEFEIKKYDPL 79

Query: 92  --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
                 P+ FD+R+ W  C  I  I DQG+CGSCW+F    A +DR C+  G   N  LS
Sbjct: 80  YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 195
             +L  CC   CG GC GG P+ AW YF   GV T       E C PY      +  G +
Sbjct: 140 PEELTFCCK-DCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGEN 198

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
               +P     +C + C  K  +   +++ + S Y INS  + I  +I   GPVE SF  
Sbjct: 199 ICDEQPMERNHQCPKTCYGKTTV--QNRYKTKSEYYINS-IKTIEQDIKTYGPVEASFDC 255

Query: 256 YEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           Y+D + YKSG+Y K       GGH++K+IGWG  +DG  YW+  N W++ WG  G FKI 
Sbjct: 256 YDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWG-QEDGTPYWLAVNSWSKFWGDHGTFKII 314

Query: 315 RGSNECGIEEDVVAGLPSS 333
           +G NECGIE  V AG+PSS
Sbjct: 315 KGRNECGIERAVTAGIPSS 333


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 136/222 (61%), Gaps = 17/222 (7%)

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV- 181
           AFGAVEA+SDR CIH    + + +S  DL+ CC   CG GC GG   +AW+Y+   G+V 
Sbjct: 1   AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVS 59

Query: 182 ------TEECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 229
                 T+ C PY       S+  S P C    PTPKC R+C +   + + + K+++ + 
Sbjct: 60  GGLYNTTDGCKPYSLAPCEHSSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNV 119

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           Y IN   + I  EI++NGPVE  FT Y DF  YKSGVY+H + D++G HA++++GWG S+
Sbjct: 120 YSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SE 178

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           D   YW+LAN WN  WG  GYFK+ RG NEC IE  V AG+P
Sbjct: 179 DNNPYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIP 220


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/302 (40%), Positives = 165/302 (54%), Gaps = 22/302 (7%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKTHDKSLKLP 96
           S+I ++N    A W A  NP F +  +      LG+ P P     +  P  T +    +P
Sbjct: 21  SLINQINSQQSA-WTAGINP-FDD--IESRLGFLGIHPDPNFKPEIKEPQATQNV---IP 73

Query: 97  KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           ++FDAR  WP+C+  I  I +QG C S WAF A E +SDR CI     + + LS  DL+ 
Sbjct: 74  ETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLID 133

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRK 211
           CC + CG+ C GGY   AW YF+  G+V+     Y  STGC  P  E  Y   TP C   
Sbjct: 134 CCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTT 189

Query: 212 CV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYKSGVYK 268
           C   K    + + KH+  S Y I  +   I  EI   G PV  +F VY DF  Y+ GVY 
Sbjct: 190 CQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYI 249

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVV 327
           + +G + G  AVK+IGWGT ++G  YW+ AN W + WGA  G+FKI+RG+NECG EE ++
Sbjct: 250 YTSGALFGRTAVKIIGWGT-ENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESII 308

Query: 328 AG 329
           AG
Sbjct: 309 AG 310


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 109/276 (39%), Positives = 152/276 (55%), Gaps = 25/276 (9%)

Query: 58  QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 117
           +F+NYT  Q K LLG   +       +P  T   +  +P SFD+R+ W  C  +  I DQ
Sbjct: 45  KFANYTEAQIKGLLGTVLSHSS---DIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQ 98

Query: 118 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 175
             CGSCWAF A E+LSDRFCI     +N+ LS  D+++C       GCDGGY   AW+Y 
Sbjct: 99  AQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYL 156

Query: 176 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
              GV ++ C+PY  ++G +          P C  KC    Q  +  K  + S  + N  
Sbjct: 157 EKKGVASDSCEPYKSASGTA----------PSCPSKCAN-GQAIKKYKCQAGSTKQANGA 205

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
                + I ++GPVE  FTVY DF +YKSG+Y H++G   GGHAVK++GWG     E+YW
Sbjct: 206 AA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYW 263

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           I+AN W  SWG  G+F I++G  + GI++     +P
Sbjct: 264 IVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 115/309 (37%), Positives = 168/309 (54%), Gaps = 24/309 (7%)

Query: 36  QDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKPT-PKGLLLGVPVKTHDKSL 93
           Q + +  +N N   GWKA  NP +   Y  G   +    +   P+G++L +      +  
Sbjct: 81  QAAFVAAIN-NRTRGWKAGVNPLRHDQYRTGALLYEEAARAKLPQGIVLKL------QEE 133

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 151
             P+SFDAR  W  C ++  I +QG C S +A  AV  ++DR+CIH       S    D+
Sbjct: 134 PFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDV 193

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPG--CEPA-----Y 203
           L+CC   CG GCDGG P + W Y+V +G+ +     Y    GC S+P   C+P      +
Sbjct: 194 LSCC-HRCGFGCDGGVPSAVWHYWVENGITSG--GAYESHEGCQSYPFGVCKPQEIFAPH 250

Query: 204 PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
               C+R+C    N  +   KH+   AY +  D + I+ E++  GPV+ SFTVY DF  Y
Sbjct: 251 VDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQY 310

Query: 263 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           KSGVY+H  G  +G H+VK++GWG  ++G  +W+ AN W   WG +G+FKI RG +   +
Sbjct: 311 KSGVYRHTYGVRVGDHSVKIVGWGV-ENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSV 369

Query: 323 EEDVVAGLP 331
           E +VVAGLP
Sbjct: 370 ESNVVAGLP 378


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 104/260 (40%), Positives = 143/260 (55%), Gaps = 30/260 (11%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVND 150
           L LPKSFDAR+ W  C +I  + DQG+C S +A     A+SDR CIH    +   LS   
Sbjct: 51  LNLPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQ 110

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 197
           +L+CC +LCGDGC GG    +W ++  HG+V+       E C PY         T   + 
Sbjct: 111 ILSCC-YLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETAVENA 169

Query: 198 GCEPAYPTPKCVRKCVKKNQLWRNSK------HYSISAYRINSDPEDIMAEIYKNGPVEV 251
                  TP+C  +C   +   R  K      HY + AY         M EIY+NGP+  
Sbjct: 170 CSNKTLFTPECKVQCYNPDYGTRYVKDNHQGTHYRVPAYT-------AMKEIYENGPITA 222

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           SF +Y+DF +Y+SGVY + +G  +   AVK++GWG  ++G  YW+ AN +N  WG +G+ 
Sbjct: 223 SFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWGDNGFV 281

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           KI RG+NEC IEE + AGLP
Sbjct: 282 KILRGANECYIEEFMYAGLP 301


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 109/276 (39%), Positives = 152/276 (55%), Gaps = 25/276 (9%)

Query: 58  QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 117
           +F+NYT  Q K LLG   +       +P  T   +  +P SFD+R+ W  C  +  I DQ
Sbjct: 45  KFANYTEAQIKGLLGTVLSHSS---DIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQ 98

Query: 118 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 175
             CGSCWAF A E+LSDRFCI     +N+ LS  D+++C       GCDGGY   AW+Y 
Sbjct: 99  AQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYL 156

Query: 176 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 235
              GV ++ C+PY  ++G +          P C  KC    Q  +  K  + S  + N  
Sbjct: 157 EKKGVASDSCEPYKSASGTA----------PSCPSKC-SNGQAIKKYKCKAGSTKQANGA 205

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
                + I ++GPVE  FTVY DF +YKSG+Y H++G   GGHAVK++GWG     E+YW
Sbjct: 206 AA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYW 263

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           I+AN W  SWG  G+F I++G  + GI++     +P
Sbjct: 264 IVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 119/325 (36%), Positives = 170/325 (52%), Gaps = 28/325 (8%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPK-AGWKAARNPQFSNYTVGQFKH 69
           I  +T        V   + + + +L D  I+  N N K A W A RN +F  +T+GQ   
Sbjct: 15  IFAITITLAILLNVAFAINMGAPVLNDKFIQ--NHNSKNAPWVAKRNARFEGHTIGQVMA 72

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           ++G K           +K  D S+  P +FDAR  WP C  +  +L+Q  CGSCWAF + 
Sbjct: 73  MMGTKKVINNNA-APSIKIVDASI--PSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSS 127

Query: 130 EALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
           EALSDR CI     +N++LS   L+A C  +   GC+GG P  AW Y    G+ T EC P
Sbjct: 128 EALSDRLCIASKGQVNVTLSPQALVA-CDDIGNQGCNGGVPQLAWEYMEWKGLPTFECYP 186

Query: 188 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 246
           Y    G              C R+C   + + +  +K +S++     +    I  EI   
Sbjct: 187 YTAGNGTDG----------TCQRQCADGSAMTYYRAKPFSMTTC---NSVACIQNEIITY 233

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRS 304
           GPV  +  VY+DF  Y SGVY +  T +++GGHA++++GWGT    + DYWI+ N W+ +
Sbjct: 234 GPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGHAIEIVGWGTDATSKLDYWIVKNSWSAA 293

Query: 305 WGA-DGYFKIKRGSNECGIEEDVVA 328
           WG  DGYF I+RG+N CGI+ D  A
Sbjct: 294 WGGLDGYFWIQRGTNMCGIDHDASA 318


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 99/238 (41%), Positives = 135/238 (56%), Gaps = 17/238 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LL 152
           +P+SFD+R  W  CS+I+ I DQ +CGSCWA  A E +SDR C+     +   ++D  +L
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG----C-- 199
           ACCG  CG GC+GG    AW Y    GVVT    +E   C PY      +H G    C  
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPR 214

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           + ++ TP C + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +F  YED
Sbjct: 215 DHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED 274

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           F+ Y  G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  WG DGYF+I RG
Sbjct: 275 FSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 99/257 (38%), Positives = 136/257 (52%), Gaps = 19/257 (7%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           D ++ LP+SFDAR  WP+C +I  I DQ   G CWA  + E ++DR CI       + +S
Sbjct: 89  DLAVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVYVS 148

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCE 200
             D+L+CCG  CG GC  G P  A+ Y +  GV +         C PY     C +    
Sbjct: 149 ETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPY-PFYPCGYHAHL 207

Query: 201 PAY--------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 252
           P Y        PTP C + C     +  N      S   + +  E I  EI+ NGP+  +
Sbjct: 208 PYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVAT 267

Query: 253 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           +TVYEDFA+YK+G+Y    G   G HAVK+IGWG  ++G  YW++AN WN  WG +G+F+
Sbjct: 268 YTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWG-EENGVKYWLIANSWNTDWGENGFFR 326

Query: 313 IKRGSNECGIEEDVVAG 329
           + RG+N C IE     G
Sbjct: 327 MLRGTNLCDIELSATGG 343


>gi|19880041|gb|AAM00234.1|AF359422_1 cathepsin B-like cysteine proteinase [Nicotiana tabacum]
          Length = 110

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 86/110 (78%), Positives = 96/110 (87%)

Query: 53  AARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTIS 112
           AA NP+FSN+TV QFK LLGVKPT KG L G+P+ TH K L+LP+ FDAR AWP CSTI 
Sbjct: 1   AALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIG 60

Query: 113 RILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 162
           RILDQGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDLLACCGFLCGDG
Sbjct: 61  RILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDG 110


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 109/305 (35%), Positives = 164/305 (53%), Gaps = 26/305 (8%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           D+ +  ++++K VNE  +  ++A  +P+       +  HL+  +       L   +   +
Sbjct: 34  DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87

Query: 91  KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 147
           K++    +P+SFD+R  W  CS+I+ I DQ +CGSCWA  A E +SDR C+     +   
Sbjct: 88  KAISNDDIPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKM 147

Query: 148 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 198
           ++D  +LACCG  CG GC+GG    AW Y    GVVT    +E   C PY      +H G
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGG 207

Query: 199 ----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
               C  + ++ TP C + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ 
Sbjct: 208 KFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQA 267

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F  YEDF+ Y  G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  WG +GYF
Sbjct: 268 AFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGENGYF 326

Query: 312 KIKRG 316
           +I RG
Sbjct: 327 RILRG 331


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 117/299 (39%), Positives = 155/299 (51%), Gaps = 28/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K     VP  T   + + 
Sbjct: 18  VSELNHIKSLNPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQA 74

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P SFD R  +P C  I  ++DQG CGSCWAF +V ++ DR C   G++   +  S   ++
Sbjct: 75  PDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVV 131

Query: 153 ACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C     GD  CDGG+  S WR+    G  T+EC PY         G   A  T  C  K
Sbjct: 132 SCDR---GDMACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTK 179

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   + L    K      Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  
Sbjct: 180 CADGSDLPHLYKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTY 237

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G V GGHAV ++G+GT DDG DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 238 GRVEGGHAVDMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 109/305 (35%), Positives = 164/305 (53%), Gaps = 26/305 (8%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           D+ +  ++++K VNE  +  ++A  +P+       +  HL+  +       L   +   +
Sbjct: 34  DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87

Query: 91  KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 147
           K++    +P+SFD+R  W  CS+I+ I DQ +CGSCWA  A E +SDR C+     +   
Sbjct: 88  KAISNDDIPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKM 147

Query: 148 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 198
           ++D  +LACCG  CG GC+GG    AW Y    GVVT    +E   C PY      +H G
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGG 207

Query: 199 ----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
               C  + ++ TP C + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ 
Sbjct: 208 KFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQA 267

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +   YEDF+ Y+ G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  WG DGYF
Sbjct: 268 ASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYF 326

Query: 312 KIKRG 316
           +I RG
Sbjct: 327 RILRG 331


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 98/238 (41%), Positives = 135/238 (56%), Gaps = 17/238 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LL 152
           +P+SFD+R  W  CS+I+ I DQ +CGSCWA  A E +SDR C+     +   ++D  +L
Sbjct: 95  IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG----C-- 199
           ACCG  CG GC+GG    AW Y    GVVT    +E   C PY      +H G    C  
Sbjct: 155 ACCGSECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPR 214

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           + ++ TP C + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +F  YED
Sbjct: 215 DHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED 274

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           F+ Y  G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  WG +GYF+I RG
Sbjct: 275 FSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 117/326 (35%), Positives = 163/326 (50%), Gaps = 22/326 (6%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKG 79
           +G +     D  +  D++++ VN   + GW A +  ++    Y  G  K L   +PT + 
Sbjct: 70  DGGIVDCDRDLCLTDDNLVRNVNSIHRLGWSARKYDEWWGHKYAEGLTKRLGTKEPTYR- 128

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
             +    + H+    LP+SF++   W   S IS +LDQG CGS W        SDRF I 
Sbjct: 129 --VKAMSRLHNIVDHLPRSFNSIDKWA--SYISDVLDQGWCGSSWVISTASVASDRFAIQ 184

Query: 140 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSH 196
                 + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY      C  
Sbjct: 185 SRGKEVIQLSPQNILSCTRRQ--QGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACKI 242

Query: 197 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
           P    +     C     V +++L+     YS++      +  DIMAEI+ +GPV+ + TV
Sbjct: 243 PHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLN------NETDIMAEIFMSGPVQATLTV 296

Query: 256 YEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           Y DF  Y  G+Y+H     G  +G H+VKLIGWG   DG  YWI  N W   WG  G F+
Sbjct: 297 YRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGEHGNFR 356

Query: 313 IKRGSNECGIEEDVVAGLPSSKNLVK 338
           I RGSNECGIEE V+A  P+  N  K
Sbjct: 357 ILRGSNECGIEEYVLAAWPNVYNYFK 382


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  183 bits (465), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 127/310 (40%), Positives = 162/310 (52%), Gaps = 32/310 (10%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGV-----KP--TPKGLLLGVPVKTHDKSLKLPKSFDARS 103
           WKA  N    +Y   +F  ++G+     KP  TP    L  P      S  LP  FD+R 
Sbjct: 5   WKADYN--IDSYIDNRFLGMMGINYSELKPNVTPD---LEPPFVVSKISENLPDEFDSRV 59

Query: 104 AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGD 161
            WP C TI  I DQG CG+CWAF A EA+SDR CIH     +   S  +LL+CC   C  
Sbjct: 60  RWPNCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCD-SCEK 118

Query: 162 GCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKC 208
           GC G     AW ++V HG+V+       E C PY     C H        C    PTP C
Sbjct: 119 GCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYH-LPPCEHHRAGPRRNCTKYGPTPSC 177

Query: 209 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGV 266
            R C    ++ + +  H+    Y +    E I+  EI+ NGPVE +   YEDF  Y+SG+
Sbjct: 178 ARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGI 237

Query: 267 YKHITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           Y HI G  +  HAVK+IGWGT       YW++AN +N  WG  G+FKIKRG NECGIE  
Sbjct: 238 YHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENK 297

Query: 326 VVAGLPSSKN 335
           + AG+P+ KN
Sbjct: 298 ITAGIPAYKN 307


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  183 bits (465), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 109/263 (41%), Positives = 142/263 (53%), Gaps = 33/263 (12%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
           LP  FDAR+A+P CS  I  I DQ  CGSCWAFG  EA +DR CI  H      LS  ++
Sbjct: 21  LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEM 80

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-- 196
            AC       GC+GG+P SAW +    G+ T             + C PY D   C+H  
Sbjct: 81  NACAP---SHGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPY-DFPPCAHHV 136

Query: 197 -----PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                P C + +Y TP C  +C   K     R+ +H+ + +        D    I  +GP
Sbjct: 137 NDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTDGP 196

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           V  SFTVYEDF  YKSGVYKH +G+ +GGHAVK+IGWG  + G+ YW++ N WN  WG  
Sbjct: 197 VSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWG-EESGQAYWLVVNSWNEDWGDH 255

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G FKI  G+  CGI++ ++ G P
Sbjct: 256 GLFKIALGN--CGIDDYLLGGTP 276


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 100/246 (40%), Positives = 143/246 (58%), Gaps = 21/246 (8%)

Query: 87  KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNL 144
           KT   ++    +FD+R+ WP C  +  I +Q  CGSCWAF A E LSDRFCI  G  +++
Sbjct: 5   KTATGAVAAVPAFDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDV 62

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
            LS   +++C       GCDGGY  +AW +    G+ +++C PY    G           
Sbjct: 63  VLSPQYMVSCDS--TDYGCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGD---------- 110

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
               V  C  K Q   + K Y     +  +D   IM ++ +NGPV+ +F+VY DF  YKS
Sbjct: 111 ----VAACPSKCQDGSSVKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKS 166

Query: 265 GVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           GVY H++G ++GGHA+K++GWG  S   + YWI+AN W  SWG +G+F I RGS+ECGIE
Sbjct: 167 GVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWIIANSWGPSWGLNGFFWILRGSDECGIE 226

Query: 324 EDVVAG 329
           ++V +G
Sbjct: 227 DNVWSG 232


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 121/335 (36%), Positives = 161/335 (48%), Gaps = 14/335 (4%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           L L   A  A G  +    D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LGLLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  C + WA     A
Sbjct: 67  GAWIQKNSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASA 126

Query: 132 LSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +SDR+C +  G  L +S   LL+CC   CG GC GG+P  AWRY+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGIASSYCQPY-P 184

Query: 191 STGCSHPGCEP--------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
              C H G +          + TP+C   C  K       K+    AY +    E+   E
Sbjct: 185 FPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL--IKYRGKDAYMLLPGEEEFKRE 242

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           +Y NGP      VY D   YKSGVY+++ G  MG  AVK++GWG   +G  YW +AN W+
Sbjct: 243 LYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWGKL-NGTPYWKVANTWD 301

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
             WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 302 TDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 115/312 (36%), Positives = 162/312 (51%), Gaps = 30/312 (9%)

Query: 23  GVVSKLKLDSHILQDSIIKEVNENPKAGWK---AARNPQFSNYTVGQFKHLLGVKPTPKG 79
           G +  + + +H + + ++  +       W+      NP F+N T  Q     G    P  
Sbjct: 8   GTIVAVAVATHPINEEMVAHIKAKTSL-WQPHETTTNP-FNNMTKEQLLAKCGTYIVPAN 65

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
                      K + +P++FDAR  W   S I  I DQ  CGSCWAFGA EA SDRF I+
Sbjct: 66  KEY-----PGSKIMTVPENFDARQQWG--SKIHAIRDQQQCGSCWAFGATEAFSDRFAIN 118

Query: 140 FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 199
            G ++ LS  DL++C       GC+GGY   AW Y   HG  T+ C PY   +G +    
Sbjct: 119 -GKDVILSPEDLVSC--DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYSAGSGFA---- 171

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                 P C  KC   + + R     + ++ R +     I +EI  +GPVE +FTVY DF
Sbjct: 172 ------PACSDKCADGSAMQRFK--CAPNSVRQSKGVAQIQSEIVSHGPVEGAFTVYTDF 223

Query: 260 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
            +Y+SGVY   T DV GGHA+K++G+G  ++G  YW+ AN W  +WG  G+FKIK+G  E
Sbjct: 224 FNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWLCANSWGPAWGMSGFFKIKQG--E 280

Query: 320 CGIEEDVVAGLP 331
           CGIE+ V +  P
Sbjct: 281 CGIEDQVFSCDP 292


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 111/313 (35%), Positives = 157/313 (50%), Gaps = 30/313 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 90
           +H  +D +I  +N++P   W+AA   QF+  +  + + LLG K   +         T D 
Sbjct: 24  THFTKD-MIDSLNQDPSVKWEAANYDQFAGKSFAELRKLLGGKRGEESSSEEARYNTRDV 82

Query: 91  -KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
             ++ +P +FD+R+ WPQC  I  I +QG CGSCWAF      SDR CI      N+ +S
Sbjct: 83  KSTVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVVIS 140

Query: 148 VNDLLAC--CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY-- 203
              L+ C    F C     GGY   +W++F++ G+  E C PY   +          Y  
Sbjct: 141 PEFLIECDKTSFAC----QGGYGYYSWKFFMNTGIPLESCVPYTKDS--------LVYGN 188

Query: 204 -PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
               +C   C   + L     + + SAY I S   +   EI  NGPVE  F VY DF  Y
Sbjct: 189 TTNAQCRSTCTDGSPL---KLYKAASAYYIYSPITNYQTEIMTNGPVEADFDVYSDFYSY 245

Query: 263 KSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN--E 319
           KSG+Y+   G   +GGHAVK++GW +  +G  YWI  NQW  SWG  GYF I RG++   
Sbjct: 246 KSGIYQKTAGSTYVGGHAVKVLGWASDSNGTPYWIAQNQWGTSWGMGGYFYIYRGNSTLN 305

Query: 320 CGIEEDVVAGLPS 332
           C  +  ++AG  S
Sbjct: 306 CKFDNYMIAGTVS 318


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 102/215 (47%), Positives = 128/215 (59%), Gaps = 18/215 (8%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 157
           FDA  AWP+C TI+ I DQ  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 209
           +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H         C   Y TP C 
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117

Query: 210 RKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
             C  KK  L +   + S     I S  E    E+  NGP EVSF+VY DF  Y  GVYK
Sbjct: 118 STCTDKKIPLIKYRGNTSC----ILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYK 173

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           H+TG  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 174 HVTGVFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 112/289 (38%), Positives = 154/289 (53%), Gaps = 25/289 (8%)

Query: 46  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 105
           NP+  WKA    +F   T  +   LL            VP  T   + K+P SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPISFLNRDRAAVPRGTIADT-KVPDSFDFREEY 84

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 161
           P C  I  ++DQG CGSCWAF +V +L DR C   G++   ++ S   +++C     GD 
Sbjct: 85  PHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFA-GLDKKAVTYSPQYVVSCDH---GDM 138

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 221
            CDGG+  S WR+    G  T EC PY   T  +   C    PT     KC    +L   
Sbjct: 139 ACDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL--- 186

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
           S   +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H++G V GGHAV+
Sbjct: 187 STVKAKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVE 246

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           ++G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G+
Sbjct: 247 MVGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 125/318 (39%), Positives = 167/318 (52%), Gaps = 24/318 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K     V +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79

Query: 93  L---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-LSV 148
                 P+ FD+R+ W  C  I  I DQG+CGSCW+F    A +DR C+  G   + L  
Sbjct: 80  YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSH 196
            + LA C   CG GC GGYPI AW+YF   GV T       E C PY     ++  G + 
Sbjct: 140 PEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNT 199

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            G +P     +C + C  K  +   +++ + S Y INS  + I  +I   GPVE SF VY
Sbjct: 200 CGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYVINSI-KTIERDIMTYGPVEASFDVY 256

Query: 257 EDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           +D + YKSG+Y+        GGH++K+IGWG   +G  YW+  N W++ WG  G FKI +
Sbjct: 257 DDLSAYKSGIYRKTPKAKYQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIK 315

Query: 316 GSNECGIEEDVVAGLPSS 333
           G NECGIE  V AG+PSS
Sbjct: 316 GRNECGIERAVTAGIPSS 333


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 123/311 (39%), Positives = 165/311 (53%), Gaps = 31/311 (9%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKTHDKSLKLP 96
           S+I ++N    A W A  NP F +  +      LG+ P P     +  P  T +    +P
Sbjct: 21  SLINQINSQQSA-WTAGINP-FDD--IESRLGFLGIHPDPNFKPEIKEPQATQNV---IP 73

Query: 97  KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           ++FDAR  WP+C+  I  I +QG C S WAF A E +SDR CI     + + LS  DL+ 
Sbjct: 74  ETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLID 133

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRK 211
           CC + CG+ C GGY   AW YF+  G+V+     Y  STGC  P  E  Y   TP C   
Sbjct: 134 CCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTT 189

Query: 212 CV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYK----- 263
           C   K    + + KH+  S Y I  +   I  EI   G PV  +F VY DF  Y+     
Sbjct: 190 CQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQH 249

Query: 264 ----SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSN 318
                GVY + +G + G  AVK+IGWGT ++G  YW+ AN W + WGA  G+FKI+RG+N
Sbjct: 250 DTILEGVYIYTSGALFGRTAVKIIGWGT-ENGWAYWLAANSWGKDWGALGGFFKIRRGTN 308

Query: 319 ECGIEEDVVAG 329
           ECG EE ++AG
Sbjct: 309 ECGFEESIIAG 319


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 109/302 (36%), Positives = 156/302 (51%), Gaps = 27/302 (8%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           L + ++ +SI++ +N +P + W AA  P+ S   V +F+ +LG +  P      +P    
Sbjct: 5   LFASVVAESIVETINNDPTSTWVAAEYPR-SVINVAKFRAMLGAELGPH-----MPY-VQ 57

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 149
             SL  P  FDAR  WP    I  + DQ  CGSCWA    EA+ D   I      ++SV 
Sbjct: 58  PLSLSEPTEFDAREQWP--GKILPVRDQASCGSCWAHSVAEAMGDAQNIAGCPRGAMSVQ 115

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           DL++C        C+GG    A  Y V  G+ TE C  Y   +G            P C 
Sbjct: 116 DLVSC--DKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSG----------RVPACP 163

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            KC   +Q+ R    Y + +++ + +P +IM  + + GP+   F VY DF +Y+SGVY+H
Sbjct: 164 SKCDNGSQIIR----YKLQSWK-SVEPSEIMQALMEYGPLSCGFMVYSDFMNYRSGVYQH 218

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
            +G   GGHAV L GWG  ++G  YW++ N W  +WG  G+FKI RGSN C IE  V  G
Sbjct: 219 KSGYFEGGHAVLLCGWGV-ENGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLG 277

Query: 330 LP 331
           +P
Sbjct: 278 VP 279


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 124/318 (38%), Positives = 168/318 (52%), Gaps = 24/318 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
             L D  IK +NE  K  WKA R    +N +   F  LLG +   K     V +K +D  
Sbjct: 23  QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79

Query: 93  L---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-LSV 148
                 P+ FD+R+ W  C  I  I DQG+CGSCW+F    A +DR C+  G   + L  
Sbjct: 80  YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSH 196
            + LA C   CG GC GGYPI AW+YF   GV T       E C PY     ++  G + 
Sbjct: 140 PEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNT 199

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            G +P     +C + C  K  +   +++ + S Y +NS  + I  ++   GPVE SF VY
Sbjct: 200 CGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYVMNSI-KTIEQDLKTYGPVEASFDVY 256

Query: 257 EDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           +DF+ YKSG+Y+        GGH++K+IGWG   +G  YW+  N W++ WG  G FKI +
Sbjct: 257 DDFSVYKSGIYRKTPKAKYQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIK 315

Query: 316 GSNECGIEEDVVAGLPSS 333
           G NECGIE  V AG+PSS
Sbjct: 316 GRNECGIERAVTAGIPSS 333


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 102/259 (39%), Positives = 142/259 (54%), Gaps = 12/259 (4%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL--GVPVKTHDKSLKLP 96
           ++ E+N     GW A  NP F ++   +F+ L   +  P   L      VK  D+   +P
Sbjct: 15  MVHEINNRNDVGWTARVNPHFKSFNQKKFRSLNSAQHNPSFSLQFKNEFVKIEDE---IP 71

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLAC 154
           +SFDAR+ WP C TI  I DQGHCGSCWA  + E L DRFCIH   +    LS  D+ +C
Sbjct: 72  ESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDITSC 131

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-V 213
                  GC+GG+  +A+ Y    GV TEEC PY     C HPGC  ++ TP C ++C  
Sbjct: 132 DSR--SHGCNGGWTETAFEYAKKAGVPTEECVPYLMGK-CHHPGCS-SWQTPTCKKECSS 187

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
             N  + ++++Y+  +Y I  + E I  E+ +NGPV   FT Y+D A Y  GVY H+ G 
Sbjct: 188 LSNYNYSSNRYYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGS 247

Query: 274 VMGGHAVKLIGWGTSDDGE 292
             G HA+K++GWG   + E
Sbjct: 248 EQGLHAIKIVGWGVWRESE 266



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/43 (53%), Positives = 28/43 (65%)

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++G  YWI+ N W   +G DG   IKRG NECGIE DV  G+P
Sbjct: 321 EEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIP 363


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 119/334 (35%), Positives = 159/334 (47%), Gaps = 12/334 (3%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           L L   A    G  +    D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LGLLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  C + WA     A
Sbjct: 67  GAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASA 126

Query: 132 LSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           +SDR+C +  G  L +S   LL+CC   CG GC GG+P  AWRY+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGIASSYCQPYPF 185

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
                  + G   P     + TP+C   C  K       K+    AY +    E+   E+
Sbjct: 186 PQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL--IKYRGKDAYMLLPGEEEFKREL 243

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           Y NGP      VY D   YKSGVY+++ G  MG  AVK++GWG   +G  YW +AN W+ 
Sbjct: 244 YFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWG-KLNGTPYWKVANTWDT 302

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 126/217 (58%), Gaps = 22/217 (10%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 157
           FDA  AWP+C TI+ I DQ  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 209
           +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H         C   Y TP C 
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117

Query: 210 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
             C  K      +R +  Y +S        E    E+  NGP EVSF+VY DF  Y  GV
Sbjct: 118 STCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGV 171

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           YKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 172 YKHVAGTFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 107/276 (38%), Positives = 150/276 (54%), Gaps = 17/276 (6%)

Query: 65  GQFKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSC 123
           G  K  LG+  +    L  +P   + +S++ LP SFDAR  WP C ++++I  QG CGSC
Sbjct: 19  GVMKMSLGLNESE---LNNLPRLQNQRSVRALPASFDARQKWPYCPSLNQIRSQGSCGSC 75

Query: 124 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           +A      ++DR+CIH G            L+CC       CDGGY    + Y+V +G+ 
Sbjct: 76  YAVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCYK--CDGGYVHKTFDYWVKYGLT 133

Query: 182 TEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 236
           +    PY    GC  +P     +      KC R+C     L +     +  S+Y +    
Sbjct: 134 SG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASSYILPWGD 191

Query: 237 EDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
           E+ M AEIY+NGP+  SF VY DF  Y+SGVY+H+TG   G HAV++IGWG  ++G  YW
Sbjct: 192 ENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGV-ENGVKYW 250

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           + AN WN  WG +G+FKI RG N  G+E+   AGLP
Sbjct: 251 LCANSWNERWGENGFFKIVRGENHVGVEDISYAGLP 286


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  180 bits (456), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 105/274 (38%), Positives = 149/274 (54%), Gaps = 21/274 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +  ++ FA     V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L
Sbjct: 6   VCIVSFFALLKAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARIL 63

Query: 71  LGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGA
Sbjct: 64  MGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGA 123

Query: 129 VEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 182
           VEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT    
Sbjct: 124 VEAMTDRICIQSGGQQSAELSALDLISCCE-DCGDGCQGGFPGVAWDYWVKRGIVTGGSK 182

Query: 183 ---EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 232
                C PY        T   +P C    Y TP+C + C K  +  +   KHY   +Y +
Sbjct: 183 ENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNV 242

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
            S+ + I  EI   GPVE +F VYEDF +YKSG+
Sbjct: 243 ISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGI 276


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 110/289 (38%), Positives = 154/289 (53%), Gaps = 25/289 (8%)

Query: 46  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 105
           NP+  WKA    +F   T  +   LL      K     VP  T   + ++P SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTVSAT-QVPDSFDFREEY 84

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 161
           P C  I  ++DQG CGSCWAF +V ++ DR C+  G++   +  S   +++C     GD 
Sbjct: 85  PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVA-GLDKKAVRYSPQYVVSCDR---GDM 138

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 221
            CDGG+  S WR+ V  G  T+EC PY         G   A  T  C  KC   ++L   
Sbjct: 139 ACDGGWLPSVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL--- 186

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
             + +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H+ G   GGHAV+
Sbjct: 187 PIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVE 246

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           ++G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 247 MVGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 103/287 (35%), Positives = 150/287 (52%), Gaps = 25/287 (8%)

Query: 56  NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISR 113
           NP FS  +  +    +G K           +  ++++L  KLPK FD+R  WP+C  I  
Sbjct: 239 NPYFSGMSKEEILIRMGTKLMNSSTEFDSKLSNNNEALIKKLPKHFDSREKWPECEWIRF 298

Query: 114 ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISA 171
           I DQ +CGSCWA  A   ++DR CI      +  ++D  +LAC           G   S 
Sbjct: 299 IRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQILAC-----------GMIPSP 347

Query: 172 WRYFVHHGVVTEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKH 224
           + Y+   G+ T    PY D + C          C     TP C   C     +   + K 
Sbjct: 348 FNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPISDDKF 405

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y+   Y ++S+  +IM EIY +GPV   F VYEDF +Y SG+Y+  T   MGGHA+++IG
Sbjct: 406 YASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIG 465

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           WG  ++G  YW++AN WN ++G  G+F+I+RG+NEC IE +V  G+P
Sbjct: 466 WG-EENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIP 511



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 48/132 (36%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 162 GCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 214
           GC  G   +A+ Y+   G+VT         C PY  S  C+   C P    PKC R C  
Sbjct: 69  GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISP-CTM--CRPYMLAPKCQRTCQA 125

Query: 215 KNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
              L  +  K+Y  S Y +N D  DIM EIY+ GPV   F VY DF +Y SG +  I G+
Sbjct: 126 SYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGN 183

Query: 274 VMGGHAVKLIGW 285
                   L  W
Sbjct: 184 KRCEEEENLTSW 195


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 126/217 (58%), Gaps = 22/217 (10%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 157
           FDA  AWP+C TI+ I DQ  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 209
           +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H         C   Y TP C 
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117

Query: 210 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
             C  K      +R +  Y +S        E    E+  NGP EVSF+VY DF  Y  GV
Sbjct: 118 STCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGV 171

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           YKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 172 YKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 116/308 (37%), Positives = 164/308 (53%), Gaps = 22/308 (7%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH 89
           D  ++ +S+++ VN    + W+A   P+F N  + +   + LG  P         P++ +
Sbjct: 127 DPCLMSNSVVEGVNRG-GSSWRAYNYPEFRNKKLKEGLIYKLGTFPLNAETRRMGPLR-Y 184

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLS 147
           DK +  P  FDAR+ WP    IS I+DQG CGS WA       SDRF I      N+ LS
Sbjct: 185 DKDVPYPTQFDARTRWP--GFISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLS 242

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTP 206
              LL+C       GC GG+   AW +   HG+V E+C PY  S T C      P  P  
Sbjct: 243 PQTLLSC-NVRAQQGCHGGHIDVAWNFARGHGLVDEKCFPYKASVTRC------PFRPRG 295

Query: 207 KCVRK-CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
             ++  C+   +  R +  Y +      S  +DIM +I ++GPV+   TVY+DF HY+ G
Sbjct: 296 NLIQDGCMPLVK--RRTSRYKLGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDFFHYRDG 353

Query: 266 VYK---HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 322
           VY+   H   ++ G H+V++IGWG  D G+ YW++AN W R WG +GYF+I RGSNE  I
Sbjct: 354 VYRRSYHGNNELKGFHSVRIIGWG-EDRGDRYWVVANSWGRQWGENGYFRIARGSNEADI 412

Query: 323 EEDVVAGL 330
           E  VV GL
Sbjct: 413 ESFVVTGL 420


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 119/336 (35%), Positives = 177/336 (52%), Gaps = 44/336 (13%)

Query: 33  HILQDSIIKEVNE-NPKAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLL--GVPV 86
            ++++ + K     + K  W+   + +F   ++   K L+G   V    +GL L  GVP+
Sbjct: 96  QLIKEKMAKRAETGDAKHMWEPEVSLRFKFLSLKDAKKLMGTFLVNTRVEGLRLPSGVPL 155

Query: 87  KT----HDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
                  + +  +P +FDAR+A+P C   +  + DQG CGSCWAF + EA +DR CI   
Sbjct: 156 PAKTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQ 215

Query: 142 MN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDP 187
               + LS     +CC  + C   GC+GG P  AWR+F   GVVT            C P
Sbjct: 216 GKGVMPLSTQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWP 275

Query: 188 YFDSTGCSH------PGCEP---AYPTPKCVRKCVKKNQL-----WRNSKHYSISAYRIN 233
           Y +   C+H      P C+       TPKC + C +         +    H + S+Y + 
Sbjct: 276 Y-EIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLR 334

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 293
           S  + +  ++  +G V  +F VYEDF +YKSGVYKH+ G  +GGHA+K+IGWGT +DGE+
Sbjct: 335 SR-DAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGT-EDGEE 392

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           YW   N WN  WG  G+FKI+ G  +CG++ ++VAG
Sbjct: 393 YWHAVNSWNTYWGDSGHFKIEMG--QCGVDNEMVAG 426


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 174/364 (47%), Gaps = 38/364 (10%)

Query: 1   MEPTKLIMDP------ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA 54
           M   + I DP      +L   C         S+  L  H L   ++  +N+ P    +A 
Sbjct: 1   MNSERWIQDPSSDLRRLLASFCCLLVLASAGSRTYL--HPLSKXLVNYINK-PNTMQQAG 57

Query: 55  RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 114
            N  F    +   +   G  P    L   V        + LP+SFD    WP       I
Sbjct: 58  HN--FHKMXISYLRRPCGTFPGRSKLPQRVKFAX---DINLPESFDPXEQWPD-XPXREI 111

Query: 115 LDQGHCGSCWAFGAVEALSDRFCIH-------FGMNLSLSVNDLLACCGFLCGDGCDGGY 167
            DQG  G CWA GA+EA+SD  CIH        G ++ +S  D L C   LCGDGC+GG 
Sbjct: 112 RDQGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVSAEDKLTC---LCGDGCNGGX 168

Query: 168 PISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY----PTPKCVRKCVKKN 216
           P   W ++   G+V+         C  +     C H      Y     +PKC   C +  
Sbjct: 169 PNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMTC-EPG 227

Query: 217 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 276
           Q ++  KHY  S+Y I+   +DIM  IYKN  VE +F+VY DF  YK   Y+ +TG++ G
Sbjct: 228 QTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXG 287

Query: 277 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           GHA+ ++G    ++   YW++AN WNR WG +G+FKI RG +  GIE +VVA +P ++  
Sbjct: 288 GHAICILGCKV-ENSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIPHTEQY 346

Query: 337 VKEI 340
            ++I
Sbjct: 347 WEKI 350


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 117/334 (35%), Positives = 160/334 (47%), Gaps = 12/334 (3%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           L L   A    G  +    D+ +L  + +  +N+     WKA  N +  N T  + K L 
Sbjct: 7   LGLLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKRLT 66

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           G        L  V         +LP+SFD+   WP C TI  I DQ  C + WA     A
Sbjct: 67  GAWIQKNSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASA 126

Query: 132 LSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           +SDR+C +  G  L +S   LL+CC   CG GC GG+P  AW Y+V +G+ +  C PY  
Sbjct: 127 ISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKGGFPGFAWLYYVEYGIASSGCQPYPF 185

Query: 189 -----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
                  + G   P  +  + TPKC   C  K+      K+   + Y +    ED   E+
Sbjct: 186 PHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKREL 243

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           Y NGP    F VY D   YKSGVY+++ GD +GG AV+++GWG   +G  YW +AN W+ 
Sbjct: 244 YFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDT 302

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 337
            WG +GY  I  G+NEC IE     G P    L 
Sbjct: 303 DWGMNGYMLILGGNNECNIEHLGFTGFPDPSQLT 336


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 126/217 (58%), Gaps = 22/217 (10%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 157
           FDA  AWP+C T++ I DQ  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  
Sbjct: 1   FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 209
           +CG GC+GGYP  AW Y+  HG+V+E C PY F S  C+H         C   Y TP C 
Sbjct: 60  VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117

Query: 210 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
             C  K      +R +  Y +S        E    E+  NGP EVSF+VY DF  Y  GV
Sbjct: 118 STCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGV 171

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           YKH+ G  +GGHAV+++GWG   +GE YW +AN WN 
Sbjct: 172 YKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/204 (46%), Positives = 127/204 (62%), Gaps = 18/204 (8%)

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 194
           ++  +S N+LLACC   CGDGC+GGYP +AW  F H GVVT       + C PY  +  C
Sbjct: 8   VHAHVSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-C 65

Query: 195 SH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            H        C+    TP+C +KC    N  +++ KHY   +Y ++S   DIM E+   G
Sbjct: 66  DHHVVGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRG 124

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PVE +FTVY DF  Y SGVY+H TG  +GGHAVK++G+G  ++G+ YW++AN WN  WG 
Sbjct: 125 PVEAAFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWLVANSWNPDWGD 183

Query: 308 DGYFKIKRGSNECGIEEDVVAGLP 331
            G+FKI RG +ECGIE  +VAG P
Sbjct: 184 QGFFKILRGVDECGIEGQIVAGEP 207


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/215 (44%), Positives = 128/215 (59%), Gaps = 18/215 (8%)

Query: 132 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 182
           ++DR CI  G   S  LS  DL++CC   CG GC GG+P  AW Y+V  G+VT       
Sbjct: 1   MTDRICIQSGGGQSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENH 59

Query: 183 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 235
             C PY        T   +P C    Y TP+C +KC K  +  ++  KHY   +Y + S+
Sbjct: 60  TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISN 119

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG       YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYW 178

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           ++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 213


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 163/317 (51%), Gaps = 37/317 (11%)

Query: 23  GVVSKLKLDSHILQDSIIKEVNENPKAGWK---AARNPQFSNYTVGQFKHLLGVKPTPKG 79
           G ++ +   +H + + ++  +       W+      NP FS+ T  Q     G    P  
Sbjct: 8   GTIAAMVAATHPVNEEMVAHIKAKTSL-WQPHETTTNP-FSDLTKEQLLAKCGTYIVPSN 65

Query: 80  LLL-GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
               G P+      +  P +FDAR  W   S I  I DQ  CG+CWAFGA EALSDRF I
Sbjct: 66  KQYPGSPL------ISTPDNFDARQQWG--SKIHAIRDQQQCGACWAFGATEALSDRFTI 117

Query: 139 --HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
             +  +++  S  DL++C       GC+GGY   AW +   HGVV + C PY   +G + 
Sbjct: 118 ASNGSVDVVFSPEDLVSC--DTNDYGCNGGYMDMAWEFLDQHGVVADSCFPYSAGSGFA- 174

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFT 254
                    P C  KC   +      K YS    + R +   E I +EI  +GPVE +FT
Sbjct: 175 ---------PACASKCADGSA----EKKYSCVHGSIRQSQGVEQIKSEIVAHGPVEGAFT 221

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY DF +Y+SGVY   T DV GGHA+K++G+G  ++G  YW+ AN W  SWG  G+FKIK
Sbjct: 222 VYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGV-ENGTPYWLCANSWGPSWGMQGFFKIK 280

Query: 315 RGSNECGIEEDVVAGLP 331
           +G  ECGIE+ V +  P
Sbjct: 281 QG--ECGIEDQVFSCDP 295


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 121/335 (36%), Positives = 168/335 (50%), Gaps = 51/335 (15%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHDK 91
           +  S++ E+N    A   +    +F   ++   K L G    KP      +   + T D+
Sbjct: 80  IMQSLVDEINSKQNAWMASIEQERFKGASMSDAKRLCGTWLEKPEN----IREKLYTADE 135

Query: 92  SLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
              LP SF+A   + +CS+ I  I DQ  CGSCWAF   EA +DR CI    N +  LS 
Sbjct: 136 LKDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSP 195

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCS 195
            ++ AC       GC GG  + AW++    GVVT             + C PY D   C+
Sbjct: 196 GNVAACSK---TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPY-DIPPCA 251

Query: 196 H-------PGC-EPAYPTPKCVRKCVKK--NQLWRNSKHY----SISAYRINSDPEDIMA 241
           H       P C +  Y  P C   C  K  +      +H+    S+SA R     + I  
Sbjct: 252 HYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALR---SIDAIKK 308

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 301
           EI  NGPV  S+ VY+DF  YKSGVYK  + + +GGHAVK+IGW     GEDYW++ N W
Sbjct: 309 EIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGW-----GEDYWLVVNSW 363

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
           N++WG +G FKI  G  +CGIE++V+AG P + +L
Sbjct: 364 NKNWGDNGMFKI--GCGQCGIEDNVLAGTPMTSSL 396


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 98/259 (37%), Positives = 136/259 (52%), Gaps = 19/259 (7%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           +P+SFD+R  W  CS+I+ + DQ  CGSCWA  A   +SDR C+     L   LS  D+L
Sbjct: 94  IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPY-FDSTGCSHPG-----C 199
           +CCG +CGDGC+GGY   AW +    GVVT         C PY F   G  H        
Sbjct: 154 SCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPW 213

Query: 200 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           + ++ TP C   C     + +   K +  S Y +++D + I  E+ KNGPV+ +F  YED
Sbjct: 214 DHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYED 273

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F+ YK G+Y H+ G   G HAVKLIGWG  ++G  YW +AN W+  WG   +      S 
Sbjct: 274 FSPYKGGIYVHVKGRERGAHAVKLIGWGV-ENGTKYWTVANSWHDDWGGKRFLPYSTWSE 332

Query: 319 ECGIEEDVVAGLPSSKNLV 337
              +   +V      +NL+
Sbjct: 333 SLRVR--IVCRFRRIQNLI 349


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 96/215 (44%), Positives = 127/215 (59%), Gaps = 18/215 (8%)

Query: 132 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 182
           ++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT       
Sbjct: 1   MTDRICIQSGGQQSAELSALDLISCC-EDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENH 59

Query: 183 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 235
             C PY        T   +P C    Y TP+C + C K  +  +   KHY   +Y + S+
Sbjct: 60  TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISN 119

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYW 178

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           ++AN WN  WG  G F+I RG +EC IE  VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 213


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 109/315 (34%), Positives = 147/315 (46%), Gaps = 33/315 (10%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK------HLLGVKPTPK 78
           V++    + +    +I ++N N   GWKA   P+F+N ++ + +       LL   P   
Sbjct: 63  VNETSASTPVNDKELIDKINANETLGWKATEYPRFANLSISEARDSLFGLSLLSTDPDTP 122

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
            L +       +  + LP +FDAR+ W  C  I  + DQ  CG+CWAF A   L+ R CI
Sbjct: 123 RLDI-------EPRVDLPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLAHRLCI 173

Query: 139 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                 N+ LS    + C        C GGY   AW +    G   + C PY        
Sbjct: 174 ATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYAWSFLERTGTTVDSCIPYASGRATFS 231

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            G  PA        KC    Q   +   Y     R  S   +I A I   G V+  FT+Y
Sbjct: 232 SGTCPA--------KCKVSTQ---SMTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIY 280

Query: 257 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
            DF  Y+SGVYKH++   +GGHAV LIGWG  + G +YW+  N W  +WG  GYFKI +G
Sbjct: 281 RDFMSYRSGVYKHVSTTTLGGHAVALIGWGV-ESGTNYWLAVNSWGSNWGMSGYFKIAQG 339

Query: 317 SNECGIEEDVVAGLP 331
             ECGIE  V AG P
Sbjct: 340 --ECGIENQVYAGEP 352


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 126/216 (58%), Gaps = 21/216 (9%)

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+     Y  
Sbjct: 1   TDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSG--GNYNS 57

Query: 191 STGCSH---PGCEPAYP-----------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 235
           S GCS    P CE   P           TPKC + C    N L++  K Y    Y +   
Sbjct: 58  SQGCSPYVIPPCEHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGG 117

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I AE++KNGPVE +FTVY D   YKSGVYKH+ GD +GGHA+K+IGWG  ++G  YW
Sbjct: 118 EDHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYW 176

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           ++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 177 LIANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 159/318 (50%), Gaps = 21/318 (6%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 78
           EG   +   D  +  D+I+  VN   + GW A +  Q+    Y+ G  K  LG K PT +
Sbjct: 115 EGGSVQCDEDLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +    +  + +  LP SF+A   W   S IS + DQG CG+ W        SDRF I
Sbjct: 174 ---VKAMTRLKNPTDGLPNSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228

Query: 139 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                 N+ LS  ++L+C       GC+GG+  +AWRY    GVV E C PY        
Sbjct: 229 QSKGKENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYTQH----R 282

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
             C+  +        C K   + R+S +    AY +N +  DIMAEI+ +GPV+ +  V 
Sbjct: 283 DTCKIRHSRSLKANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVN 341

Query: 257 EDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
            DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  GYF+I
Sbjct: 342 RDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRI 401

Query: 314 KRGSNECGIEEDVVAGLP 331
            RGSNECGIEE V+A  P
Sbjct: 402 LRGSNECGIEEYVLASWP 419


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 99/234 (42%), Positives = 127/234 (54%), Gaps = 20/234 (8%)

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 146
           +D+   +P+SFDAR+ WP CS+++ I DQ +CGSCWA     ALSDR CI       +++
Sbjct: 88  NDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWAVSTAAALSDRICISTNGTKQVNI 147

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCE 200
           S  D+L CC + CG GC GG+PI AW Y    G VT      + C        C H G E
Sbjct: 148 SATDILTCC-YKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNE 206

Query: 201 PAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
             Y        TPKC   C    KN  + + K     AY + +  + I  EI KNGPV  
Sbjct: 207 TYYGECGGRARTPKCRTSCTPGYKNS-YSDDKIRGKDAYELPNSVKAIQREIMKNGPVVA 265

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           +FTVY DF++YK G+YKH  G   G HAVK+IGWG   D   YWI+ N W+  W
Sbjct: 266 AFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGD-VPYWIVKNSWHNDW 318


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 117/324 (36%), Positives = 164/324 (50%), Gaps = 35/324 (10%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGV--PVKTHDKSLK 94
           S++ E+N        +    +F   ++G  K L G +    +GL   V  P +  D    
Sbjct: 3   SLVDEINSKQNLWTASTDQERFYGRSLGDAKKLCGTLLEETEGLEKRVYPPGELAD---- 58

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
           +P SFDAR A+ +C   I  + DQ  C SCWA   VEA + R CI  G   N  LS  ++
Sbjct: 59  IPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118

Query: 152 LACCGFLCG---DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 196
           +ACC         GC GG  ++AW +   HG+ TE        C PY +   C+H     
Sbjct: 119 IACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPY-NFPKCAHHQKKS 177

Query: 197 ---PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
              P  +  Y TP C+ +C   K        +H++  +  +    ++I  EI  NGP   
Sbjct: 178 KYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSA 237

Query: 252 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           +F+VYEDF  YKSGVYKH  G +MG H+V++IGWGT + G DYW++ N WN  WG  G F
Sbjct: 238 TFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDHGTF 296

Query: 312 KIKRGSNECGIEEDVVAGLPSSKN 335
           KI +G  +CGI +D V G P + N
Sbjct: 297 KIAQG--DCGI-DDAVLGSPPAMN 317


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 113/299 (37%), Positives = 155/299 (51%), Gaps = 25/299 (8%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKT- 88
           DS ++ D          +  WKA  N +F+  T    K LLG   +P     LG      
Sbjct: 273 DSALINDEQHVNYLNQEEMSWKAGVNERFAGMTYADVKGLLGADTSPHIAEYLGETRSQD 332

Query: 89  -HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL 146
            +D    +P  F+A + W     +  I DQ  CGSCWAF A E LSDR  I H      L
Sbjct: 333 FYDNITDVPSEFNAVTQWK--GLVQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKAEPVL 390

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
           S  DL++C       GC+GG   +AW Y  + G+VT+ C PY    G +          P
Sbjct: 391 SPEDLVSCD--RVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDA----------P 438

Query: 207 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
           KC   C K    W  +K+ + SAY +N   E++  EI  +GP++V+F VY+ F  YKSGV
Sbjct: 439 KCETSC-KDGSSW--TKYKAASAYAVNG-VENMQKEIMTHGPIQVAFNVYKSFMSYKSGV 494

Query: 267 YKHITGDVM--GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           Y     ++M  GGHAVK++GWGT + G+DYW++AN WN SWG +GYFKI  G+    ++
Sbjct: 495 YAKKWYELMPEGGHAVKIVGWGT-EGGKDYWLVANSWNTSWGDEGYFKIAVGAESISLD 552


>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
          Length = 247

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 106/256 (41%), Positives = 137/256 (53%), Gaps = 25/256 (9%)

Query: 78  KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 137
           +G + G+P       + +PK+FD+R  W  C  +  I DQ  CGSCWAFGA E LSDR C
Sbjct: 13  QGPVEGIPEPAQHNDI-VPKTFDSREQWGNC--VHPIRDQAQCGSCWAFGASETLSDRIC 69

Query: 138 IHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS 195
           I      ++ LS  DL+AC G+    GC+GG    AW Y  + G V + C PY    G  
Sbjct: 70  IASDKKTDVILSPEDLVACDGW--NMGCNGGILPWAWSYLTNTGAVEDSCFPYSSDKG-- 125

Query: 196 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
                     P C +KC      +   K    S  +  S  + I AEI KNGP+E  FTV
Sbjct: 126 --------AVPTCAKKCQNDKDSFTKYKCKKNSVVQA-SGVDKIKAEISKNGPMETGFTV 176

Query: 256 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           YEDF +Y+SGVY H TG+ +GGHAVK++G+     G+ YWI AN W+  WG  G+F I  
Sbjct: 177 YEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGYWICANSWSEKWGEKGFFNI-- 229

Query: 316 GSNECGIEEDVVAGLP 331
           G  ECGI+    A  P
Sbjct: 230 GFGECGIDSAAYACTP 245


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 103/306 (33%), Positives = 157/306 (51%), Gaps = 26/306 (8%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVP 85
             L+  I    ++KE+       W A  N +F   T      + G   K  P  + L  P
Sbjct: 2   FNLEEKIQGSKLLKELKGEKDLPWVAGENERFKGMTFKDASVISGNAHKLRPDTIPLARP 61

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
            K +   + +P S++    +PQC     +LDQG CGSCW+F   ++ S R+C  +   + 
Sbjct: 62  PKIN---ISIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYNKPVL 116

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
            S + L+AC       GC GG  ++AWRY    G+  + C PY           +     
Sbjct: 117 FSQSHLVACDRR--NSGCGGGIEVNAWRYIDLRGLPLDSCQPY-----------DGNITK 163

Query: 206 PKCVRKCVKKNQLW--RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
             C +KC  +++ +  + ++++S++ Y   +  E++   I   GPV  S  VY D  +YK
Sbjct: 164 YNCSKKCTNESETYEAQFTEYWSVARY---ASIEEMQIGIMTEGPVTTSLKVYSDLMYYK 220

Query: 264 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           SG+Y H  G+ +G HAV++IGWGT  +G DYWI++N WN +WG +G F IKRG NEC IE
Sbjct: 221 SGIYTHTKGEFLGHHAVEIIGWGTK-NGIDYWIISNSWNTTWGMNGLFLIKRGVNECHIE 279

Query: 324 EDVVAG 329
           + V AG
Sbjct: 280 DYVCAG 285


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 115/320 (35%), Positives = 155/320 (48%), Gaps = 39/320 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           +  D++I  VN   + GW A +  ++    Y+ G    L   +PT     +    +  + 
Sbjct: 127 LTDDALIHSVNSIHQLGWSARKYDEWWSHKYSEGLRLRLGTKEPT---FRVKSMTRLTNP 183

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 149
           S  LP+SF+A   W   + IS + DQG CG+ W        SDRF I       + LS  
Sbjct: 184 SNDLPRSFNAVEKWS--TFISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST-----------GCSHPG 198
           ++L+C       GCDGG+  +AWRY   +GV+   C PY                    G
Sbjct: 242 NILSCTRRQ--QGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKVQRHRGRSLKAYG 299

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           C+PA+         V ++  +     YS+S         DIMAEIY +GPV+ + TVY D
Sbjct: 300 CQPAHG--------VNRDNFYTVGPAYSLSR------EADIMAEIYHSGPVQATMTVYRD 345

Query: 259 FAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           F  Y SGVY+H     G   G H+VKL+GWG   +G  YWI AN W   WG  GYF+I R
Sbjct: 346 FFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGPWWGERGYFRILR 405

Query: 316 GSNECGIEEDVVAGLPSSKN 335
           GSNECGIEE V+A  P   N
Sbjct: 406 GSNECGIEEYVLASWPHVYN 425


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/382 (32%), Positives = 177/382 (46%), Gaps = 66/382 (17%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKL----DSHILQD--------SIIKEVNENPKAGWKA 53
           L+  P      FA F E +  + +L    D  +L D        S++ E+N        +
Sbjct: 41  LVYTPAEEAQHFARFEEELRIQSELISTEDLAVLYDETRPAIMQSLVDEINSKQNTWTAS 100

Query: 54  ARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
               +F N ++   K L G        K +  G  +   ++   LP  FDAR+A+P CS 
Sbjct: 101 TGQKRFKNLSLRDAKMLCGTLKRGSNDKVIRKGYAI---EELQDLPTDFDARTAFPNCSK 157

Query: 111 ISR-ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 167
           + R I DQ  CGSCWAFG  EA +DR CI      +  LS  ++ AC       GCDGG 
Sbjct: 158 VIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAPSF---GCDGGI 214

Query: 168 PISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTP 206
           P  AW +  + G+ T             + C PY D   C+H       P C + +Y TP
Sbjct: 215 PSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETP 273

Query: 207 KCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV--------------- 249
            C  +C   K     R+ +H+ + +        D    I  +GPV               
Sbjct: 274 NCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPVGPIYFCDPSVNFDQV 333

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
             SF VYEDF  Y+SGVYKH +G  +GGHAVK+IGWG  + G+ YW++ N WN  WG +G
Sbjct: 334 SASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWG-EETGQAYWLVVNSWNEDWGDNG 392

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
            FKI  G+  C I++D++ G P
Sbjct: 393 LFKIALGN--CEIDDDLLGGTP 412


>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
 gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
          Length = 314

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 116/328 (35%), Positives = 155/328 (47%), Gaps = 32/328 (9%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           I+CL   + +   V     LD  +L D++I  +N N K+ W A RN  F   T G    +
Sbjct: 6   IICLIFVSFYFASVCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGM 65

Query: 71  LGVKPTPKGLLLGVPVKTHDKSLK--LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G K T     L      + + LK  +P SFD+R  WP C  I  IL+Q  CGSCWAF +
Sbjct: 66  MGTKKTAAPFKL----TENGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSS 119

Query: 129 VEALSDRFCIHFGMNL---SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
            E LSDR CI         +LS   L+A C     DGC GG P  AW Y    G+ T+ C
Sbjct: 120 SEVLSDRLCIASNNKTNPGALSPQTLVA-CDVYGNDGCSGGIPQLAWEYMELKGLPTDSC 178

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEI 243
            PY    G  +           C R C       L+R +K +++   +  S  + I   I
Sbjct: 179 VPYTAGNGTVY----------SCQRSCSDSEDYSLYR-AKPFTL---KTCSSVQCIQENI 224

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGE-DYWILANQW 301
              GP+  +  VYEDF  Y SGVY    G  ++GGHA+K++GWG     + +YWI+AN W
Sbjct: 225 LAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHAIKIVGWGFDQTSQLNYWIVANSW 284

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAG 329
              WG  G+F I      C I  D  A 
Sbjct: 285 GADWGQQGFFFISM--ETCSISSDASAA 310


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/333 (36%), Positives = 162/333 (48%), Gaps = 40/333 (12%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 78
           EG   +   D  +  D+II  VN   + GW A +  Q+    Y+ G  K  LG K PT +
Sbjct: 115 EGGRVQCDQDLCLTDDAIIHSVNSISRLGWSAHKYDQWWGRKYSEG-LKLRLGTKEPTYR 173

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +    +  + +  LP+SF+A   W   S IS + DQG CG+ W        SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPRSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228

Query: 139 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-------- 188
                  + LS  ++L+C       GCDGG+  +AWRY    GVV E C PY        
Sbjct: 229 QSKGKETVQLSAQNILSCTRRQ--QGCDGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCK 286

Query: 189 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
              +S      GCE    TP  V          R++ +    AY +N +  DIMAEI+ +
Sbjct: 287 IRHNSRSLRANGCE----TPVNVD---------RDTFYTVGPAYSLNREA-DIMAEIFNS 332

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           GPV+ +  V  DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W  
Sbjct: 333 GPVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGS 392

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 336
            WG  GYF+I RGSNECGIEE V+A  P   N 
Sbjct: 393 WWGEKGYFRILRGSNECGIEEYVLASWPYVYNF 425


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 87/196 (44%), Positives = 125/196 (63%), Gaps = 14/196 (7%)

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPT 205
           +CGDGC+GGYP  AW ++   G+V+         C PY           S P C     T
Sbjct: 1   MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDT 60

Query: 206 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           PKC + C    +  ++  KHY  ++Y +++  + IMAEIYKNGPVE +F+VY DF  YKS
Sbjct: 61  PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKS 120

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE 
Sbjct: 121 GVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIES 179

Query: 325 DVVAGLPSSKNLVKEI 340
           +VVAG+P +    ++I
Sbjct: 180 EVVAGIPRTDQYWEKI 195


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 116/332 (34%), Positives = 159/332 (47%), Gaps = 39/332 (11%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKG 79
           +G   +   D  +  D ++  VN   + GW A +  ++    Y+ G    L   +PT + 
Sbjct: 115 DGGRVQCDTDLCLTDDELVHSVNSIHRLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR- 173

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
             +    +  + S  LP+ F+A   W   S IS + DQG CGS W        SDRF I 
Sbjct: 174 --VKAMTRLTNPSDDLPRKFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQ 229

Query: 140 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY--------- 188
                 + LS  ++L+C       GC+GG+  +AWRY    GV+ E+C PY         
Sbjct: 230 SQGKEVVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKI 287

Query: 189 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
              +S      GC+PAY         V ++ L+     YS+S         DIMAEIY +
Sbjct: 288 QRHNSRSLKANGCQPAYG--------VNRDSLYTVGPAYSLSR------EADIMAEIYHS 333

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           GPV+ +  +Y DF  Y  G+Y+      G   G H+VKL+GWG   DG  YWI AN W  
Sbjct: 334 GPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGP 393

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
            WG  GYF+I RGSNECGIEE V+A  P   N
Sbjct: 394 WWGEHGYFRILRGSNECGIEEYVLASWPYVYN 425


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 110/289 (38%), Positives = 150/289 (51%), Gaps = 25/289 (8%)

Query: 46  NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 105
           NP+  WKA    +F   T  +   LL      K     VP  T   + + P SFD R  +
Sbjct: 28  NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 161
           P C  I  ++DQG CGSCWAF +V ++ DR C   G++   +  S   +++C     GD 
Sbjct: 85  PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 221
            CDGG+  S WR+    G  T+EC PY         G   A  T  C  KC   + L   
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL--- 186

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
             + +  A     D + IM  +   GP++ +FTVY DF +Y+ GVY+H  G V GGHAV+
Sbjct: 187 PIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVE 246

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           ++G+GT +   DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 247 MVGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 125/212 (58%), Gaps = 16/212 (7%)

Query: 120 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CGSCWA  A    SDR CI  G  +  +LS   L  CC + CG+GCDGG P +AW +F+ 
Sbjct: 1   CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMR 59

Query: 178 HGVVT-------EECDPY-FDSTGCSHPGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHY 225
           HG+VT       + C PY     G     C +    TP C +R C   N  + +R   HY
Sbjct: 60  HGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHY 119

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
             + Y ++   EDIM +IYKNGPV+ +F VY DF +YKSGVY +  G + GGHA+K++GW
Sbjct: 120 VDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGW 179

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           G  DD   YW+ AN W+RSWG +G F+I RG+
Sbjct: 180 GV-DDNTKYWLCANSWSRSWGENGLFRILRGN 210


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 120/331 (36%), Positives = 160/331 (48%), Gaps = 38/331 (11%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK-PTPKG 79
           +G   +   D  +  D +I  VN   + GW A +  ++ ++   +  +  LG K PT + 
Sbjct: 115 DGGRVQCDTDLCLTDDELINSVNSIHQLGWSARKYDEWWSHKYSEGLRLRLGTKEPTYR- 173

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
             +    +  + S  LP+ F+A   W   S IS + DQG CGS W        SDRF I 
Sbjct: 174 --VKAMTRLSNPSSGLPRKFNAVERWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQ 229

Query: 140 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF---DSTGC 194
                 + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY    DS   
Sbjct: 230 SQGKEVVQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKI 287

Query: 195 SHP-------GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
            H        GC PAY         V ++ L+     YS+          DIMAEIY +G
Sbjct: 288 RHNSRSLKANGCRPAYG--------VNRDSLYTVGPAYSLKG------ETDIMAEIYHSG 333

Query: 248 PVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
           PV+ +  VY DF  Y  GVY+      G   G H+VK++GWG   DG  YWI AN W   
Sbjct: 334 PVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPW 393

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
           WG  GYF+I RGSNECGIEE V+A  P+  N
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWPNVYN 424


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 131/410 (31%), Positives = 183/410 (44%), Gaps = 101/410 (24%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKL----DSHILQD--------SIIKEVNENPKAGWKA 53
           L+  P      FA F E +  + +L    D  +L D        S++ E+N        +
Sbjct: 436 LVYTPAEEAQHFARFEEELRIQSELISTEDLTVLYDETRPAIMQSLVDEINSKQNTWTAS 495

Query: 54  ARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK----------LPKSFDARS 103
               +F N ++   K L G       L+ G    ++DK++K          LP  FDAR+
Sbjct: 496 TGQKRFKNLSLRDAKMLCGT------LMRG----SNDKAIKKGYAIEELQDLPTDFDART 545

Query: 104 AWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG 160
           A+P CS  I  I DQ  CGSCWAFG  EA +DR CI      +  LS  ++ AC      
Sbjct: 546 AFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAP---S 602

Query: 161 DGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-------PGC- 199
            GC+GG+P SAW +    G+ T             + C PY D   C+H       P C 
Sbjct: 603 HGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYPECP 661

Query: 200 ---------------------EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDP 236
                                + +Y TP C  +C   K     R+ +H+ + +       
Sbjct: 662 KVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSV 721

Query: 237 EDIMAEIYKNGPV---------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 281
            D    I  +GPV                 SF+VYEDF  YKSGVYKH +G+ +GGHAVK
Sbjct: 722 NDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVK 781

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +IGWG  + G+ YWI+ N WN  WG  G FKI  G+  CGI+++++ G P
Sbjct: 782 IIGWG-EESGQAYWIVVNSWNEDWGDHGLFKIALGN--CGIDDNLLGGTP 828


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 119/320 (37%), Positives = 160/320 (50%), Gaps = 24/320 (7%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 78
           EG   +   D  +  D+I+  VN   + GW A +  Q+    Y+ G  K  LG K PT +
Sbjct: 115 EGGSVQCDEDLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +    +  + +  LP SF+A   W   S IS + DQG CG+ W        SDRF I
Sbjct: 174 ---VKAMTRLKNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228

Query: 139 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                 N+ LS  ++L+C       GC+GG+  +AWRY    GVV E C PY       H
Sbjct: 229 QSKGKENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QH 281

Query: 197 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                     + +R   C K   + R+S +    AY +N +  DIMAEI+ +GPV+ +  
Sbjct: 282 RDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340

Query: 255 VYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           V  DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  GYF
Sbjct: 341 VNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYF 400

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           +I RGSNECGIEE V+A  P
Sbjct: 401 RILRGSNECGIEEYVLASWP 420


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 112/304 (36%), Positives = 158/304 (51%), Gaps = 31/304 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 94
           LQ  +I+E+N   +  WKA  N       +G     LG+ P P      +  K H  +  
Sbjct: 24  LQPQLIQEINSR-QTSWKAGTNSLDIKSRLG----FLGLHPDPD---YKIQTKHHKIAKS 75

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
           +P+SFDAR  WP+C   I +I DQG CGSCWAF + E ++DR CI          S  +L
Sbjct: 76  IPESFDAREKWPECKDVIGKIRDQGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENL 135

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKC 208
           L CC   C   C GGY   AW Y+++ G+V+     Y  S GC  P  + ++      KC
Sbjct: 136 LTCCE-DCRLECVGGYTAKAWDYYINEGIVSG--GDYNSSEGC-QPYSKASFQYAVASKC 191

Query: 209 VRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
           V+ C   K +  + + KHY  S Y + ++   I  EI  NGPV  +F V+ED  +YKSG+
Sbjct: 192 VKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGI 251

Query: 267 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG-ADGYFKIKRGSNECGIEED 325
                        V ++ WGT ++G  YW++AN W   WG   G+ KIKRG+NEC IE++
Sbjct: 252 QL---------SNVSILRWGT-EEGVPYWLIANSWGTWWGDLGGFIKIKRGTNECAIEQE 301

Query: 326 VVAG 329
           + AG
Sbjct: 302 MAAG 305


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 123/195 (63%), Gaps = 16/195 (8%)

Query: 120 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++  
Sbjct: 1   CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60

Query: 178 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKH 224
            G+V+         C PY           S P       TP+C + C    +  ++  KH
Sbjct: 61  KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKH 120

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           +  ++Y +++  ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GD+MGGHA++++G
Sbjct: 121 FGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILG 180

Query: 285 WGTSDDGEDYWILAN 299
           WG  ++G  YW+ AN
Sbjct: 181 WGV-ENGVPYWLAAN 194


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/299 (36%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K      P  T      +
Sbjct: 18  VSELNHIKSLNPR--WKAGIPRRFEGLTKDEISSLLMPVSFLKSAKGAAPRGTFADKDDV 75

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P+SFD R  +P C  I  ++DQG CGSCWAF +V    DR CI  G++   +  S   ++
Sbjct: 76  PESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIA-GLDKKPVKYSPQYVV 132

Query: 153 AC-CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C  G +    C+GG+  +AW++    G  T+EC PY   +      C    PT     K
Sbjct: 133 SCDHGNM---ACNGGWLPNAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----K 180

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   +     +   S   Y +  D   +M  +   GP++V+F VY DF +Y+SGVY+H  
Sbjct: 181 CADGSSKVHLTTATSYKDYGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTY 238

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G + GGHAV+++G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 239 GYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 86/187 (45%), Positives = 119/187 (63%), Gaps = 16/187 (8%)

Query: 159 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 205
           CG GC+GGYP +AW+++    +VT       + C PY+    C H      P C    PT
Sbjct: 3   CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61

Query: 206 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           P+C + C +  Q  +   KH+    Y I+SD   I  EIYKNGPVE  F+VY DF  YKS
Sbjct: 62  PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY+  + +++GGHA++++GWGT +DG  YW++AN WN  WG  GYFKI+RG++ECGIE+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIED 180

Query: 325 DVVAGLP 331
           D+ AG+P
Sbjct: 181 DINAGIP 187


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 122/310 (39%), Positives = 157/310 (50%), Gaps = 31/310 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSL 93
           ++ S+I+ +N     GW+AA    F    +    KH LG     + +     +    K  
Sbjct: 120 VRPSLIQAINHG-GFGWRAANYTTFWGMKLTDAVKHKLGTLKVERDVHTMTEIDIKMKK- 177

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
           K+PKSFDAR  W   S I+ ILDQG+C S WAF  V   SDR  I       ++LS   L
Sbjct: 178 KIPKSFDARDKWG--SMITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQHL 235

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTG-CSHPGCEPAYPTP 206
           L+C       GC GG+   AW +    GVV+ +C PY     D  G C  PG  P+    
Sbjct: 236 LSC-NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPS---- 290

Query: 207 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
            C     + N+L     H+S   YRI ++  +I  EI +NGPV+ SF V EDF  Y SGV
Sbjct: 291 DCPTGRERNNEL-----HHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGV 345

Query: 267 YKHI---TGDVMGGHA-----VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           Y+H    + D    HA     VKL+GWG  ++G  YW+ AN W   WG DGYFKI RG N
Sbjct: 346 YRHTPIASNDAEQYHASEWHSVKLLGWGV-ENGIKYWLGANSWGTKWGEDGYFKILRGEN 404

Query: 319 ECGIEEDVVA 328
           EC IE  VVA
Sbjct: 405 ECNIESYVVA 414


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 107/299 (35%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K      P  T      +
Sbjct: 18  VSELNHIKSLNPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDV 75

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P+SFD R  +P C  I  ++DQG CGSCWAF +V    DR C+  G++   +  S   ++
Sbjct: 76  PESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVV 132

Query: 153 ACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C     GD  C+GG+  + W++    G  T+EC PY   +      C    PT     K
Sbjct: 133 SCDH---GDMACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----K 180

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   +     +   S   Y +  D   +M  +  +GP++V+F VY DF +Y+SGVY+H  
Sbjct: 181 CADGSSKVHLATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTY 238

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G + GGHAV+++G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 239 GYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 169/345 (48%), Gaps = 48/345 (13%)

Query: 28  LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL----- 82
           L  D++    +I+  VN N    W+A  N   +N      K L+G    P+ + L     
Sbjct: 18  LTCDANDKLHNIVTHVN-NANVTWQAGINSFHTN----DHKKLVGTFYHPEWIGLEHETF 72

Query: 83  -GVPVKTHDKSL----------KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
            GV VK  D             + P+SFDAR  W  C++IS I +QG+C + WA     A
Sbjct: 73  DGVLVKGGDCDNDDEDDGGDANETPESFDARYHWFNCTSISHIWNQGNCAADWAISVTSA 132

Query: 132 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 182
           ++DR CI    N++   S   L++CC   CG+GC GGY  +AWRY +  G+VT       
Sbjct: 133 MNDRICIASQGNITALYSPQKLVSCCE-DCGNGCSGGYTAAAWRYILKKGIVTGGDYGSN 191

Query: 183 EECDPYF-----DSTGCSHP----------GCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 227
           E C P+       ST  + P          G +PA  TPKC   C       +       
Sbjct: 192 EGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA-TTPKCDLSCYNARHEGKYLDDIIK 250

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
           +      D       + K+GP  V+  VYEDF  YKSGVY H+TGD +G  +V++IGWG 
Sbjct: 251 AKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGL 310

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
            + G+ +W+LAN W  SWG  G+FKI+R  NEC IE    AG+P+
Sbjct: 311 -EGGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPN 354


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 88/187 (47%), Positives = 115/187 (61%), Gaps = 16/187 (8%)

Query: 159 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPT 205
           C   C+GG+P SAW Y+   G+VT       + C PY          G   P C+   PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEGPT 227

Query: 206 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           P+C  KC    +  +   KHY++S   I+++PE    EI  NGPVE  FTVYEDF  YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY+H TG V+GGHA+K++GWG  ++G  YW++AN WN  WG +G+FKI RGSNECGIE 
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIES 346

Query: 325 DVVAGLP 331
           D+  G+P
Sbjct: 347 DINFGIP 353


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 89/186 (47%), Positives = 112/186 (60%), Gaps = 18/186 (9%)

Query: 163 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 208
           C+GGYPI AW+++V HG+VT         C PY  +       G + P C E   PTPKC
Sbjct: 14  CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73

Query: 209 VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           V  C   N     +   KH+  +AY +    E I  EI  +GP+EV+FTVYEDF  Y +G
Sbjct: 74  VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE  
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHS 192

Query: 326 VVAGLP 331
            VAGLP
Sbjct: 193 AVAGLP 198


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 98/246 (39%), Positives = 135/246 (54%), Gaps = 24/246 (9%)

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSL 146
            D +  +P+SFD+R  WP C  I  I DQ  CGSCWAF +   LSDRFCIH    +N  L
Sbjct: 119 QDLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDL 176

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPT 205
           S  DL++C       GC GG    +  + ++ G+V+E+C PY +  T C         P 
Sbjct: 177 SPQDLVSCS--YENFGCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQNDKQPY 234

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
            K    C +K+ L             I SD E+I  E+  NGP+ V  +VYED  +YK G
Sbjct: 235 TKYF--CEQKSML-------------ILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEG 279

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VY++ TG+ +GGHA+K+IGWG ++ GE +W   NQW + WG  GY  IK G  E G++  
Sbjct: 280 VYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQNQWGKDWGMGGYINIKAG--ELGMDTM 337

Query: 326 VVAGLP 331
           V+  +P
Sbjct: 338 VLGCMP 343


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/250 (39%), Positives = 139/250 (55%), Gaps = 31/250 (12%)

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD-GCD 164
           C ++  I DQ +CGSCWAFG+ EA++DR CI     ++  LS  D+ +C     GD GC+
Sbjct: 1   CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL--GDMGCN 58

Query: 165 GGYPISAWRYFVHHGVVTEECDPYFDSTGC---------------SHPGCEPAYPTPKCV 209
           GG P S + Y+   G+V  +   Y D +GC                +P C      PKC 
Sbjct: 59  GGIPSSVYSYWALSGIV--DGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCA 116

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHY 262
           RKC  +++ W  +K      Y +    E        + A+IY+NGP+   F V +DF  Y
Sbjct: 117 RKCESEDKDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAY 176

Query: 263 KSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           KSGVY+  +    +GGHA+K++G+GT +DG+DYW++AN WN  WG DGYFKI RG N C 
Sbjct: 177 KSGVYEPKLLSPPLGGHAIKIMGFGT-EDGKDYWLVANSWNEDWGDDGYFKIIRGKNACQ 235

Query: 322 IEEDVVAGLP 331
           IE+ V+ G P
Sbjct: 236 IEDPVINGGP 245


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/265 (39%), Positives = 133/265 (50%), Gaps = 30/265 (11%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           + +P SFD+R  WP+C+ I  + DQ  CGS     AVE  SDR CI      N  LS  D
Sbjct: 89  INIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQD 148

Query: 151 LLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT---------------EECDPYFD 190
            L+CC  L   CGDG  CDG +P    +++  HG+ T                 CD  + 
Sbjct: 149 PLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYP 208

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKN 246
           +   S P   P Y TP C   C   N  W    +  KH+  + Y +     DI  EI  N
Sbjct: 209 NGTTSVPC--PGYHTPPCEDHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTN 265

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           GPV  SF +YEDF  YKSG+Y H  GD  GG   K+IGWG  D+G  YW+  +QW   +G
Sbjct: 266 GPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFG 324

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
            +G+ +I RG NE  IE  V+A LP
Sbjct: 325 ENGFVRILRGVNEVNIEHQVLAALP 349


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/332 (32%), Positives = 173/332 (52%), Gaps = 47/332 (14%)

Query: 28  LKLDSH----ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK-PTPK---- 78
           L LDS     +  ++ I+ +N+  K  W+A ++  F    +   + L+G+  PTP+    
Sbjct: 37  LNLDSSSDPLVHDEAFIQLINKYAKT-WQAGKSKFFEGKRLSHARRLIGLGLPTPEQRAS 95

Query: 79  -----GLLLGVPVKTHDKSL----KLPKSFDAR--SAWPQCSTISRILDQGHCGSCWAFG 127
                 L++G    + +K L     LP S++A   S +  C  + RI +Q  CGSCWAF 
Sbjct: 96  YPKKNSLMMGEEANSLEKYLVKMDALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFS 155

Query: 128 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
             E ++DRFCI     +N  +S   +++C      +GC+GG   +A+++    G+V++ C
Sbjct: 156 ISEMVADRFCIGTRGKINTIMSPQWMVSCD--TADNGCNGGEFPTAFQFVETTGLVSDGC 213

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIM 240
            PY    G            P C   C     +      +NS+++ ++      D + + 
Sbjct: 214 VPYQSGNGF----------VPPCPNSCANGEDINVRYRTKNSRNFDVN------DMKSVQ 257

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 300
           A I  NGPV   F VY DF +Y+SG YKH+ G ++GGHA+K++GWG +     YWI+AN 
Sbjct: 258 ASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWIVANS 316

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           W+  WG +GYF I RG+NEC IEE++   +P+
Sbjct: 317 WSDEWGMNGYFWILRGTNECSIEENMWETIPA 348


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 118/327 (36%), Positives = 160/327 (48%), Gaps = 24/327 (7%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 78
           EG   +   D  +  D+I+  VN   + GW A +  Q+    Y+ G  K  LG K PT +
Sbjct: 115 EGGSVQCDQDLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +    +  + +  LP SF+A   W   S IS + DQG CG+ W        SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228

Query: 139 HFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                  + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY       H
Sbjct: 229 QSKGKEAVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QH 281

Query: 197 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                     + +R   C     + R++ +    AY +N +  DIMAEI+ +GPV+ +  
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340

Query: 255 VYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           V  DF  Y  GVY+    +     G H+VKL+GWG   +GE YWI AN W   WG  GYF
Sbjct: 341 VNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYF 400

Query: 312 KIKRGSNECGIEEDVVAGLPSSKNLVK 338
           +I RGSNECGIEE V+A  P   N  K
Sbjct: 401 RILRGSNECGIEEYVLASWPYVYNYYK 427


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 116/320 (36%), Positives = 156/320 (48%), Gaps = 40/320 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPKGLLLGVPVKTHD 90
           +  D++I  VN   + GW A +  Q+    Y+ G  K  LG K PT +   +    +  +
Sbjct: 127 LTDDALIHSVNSIQRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR---VKAMTRLKN 182

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSV 148
            +  LP SF+A   W   S IS + DQG CG+ W        SDRF I       + LS 
Sbjct: 183 PTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSA 240

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPG 198
            ++L+C       GC+GG+  +AWRY    GVV E C PY           +S      G
Sbjct: 241 QNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQQRDTCKIRHNSRSLRANG 298

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           C+  Y                R++ +    AY +N +  DIMAEI+ +GPV+ +  V  D
Sbjct: 299 CQTPYNVD-------------RDTFYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRD 344

Query: 259 FAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 315
           F  Y  GVY+    + M   G H+VKL+GWG   +GE YWI AN W   WG  GYF+I R
Sbjct: 345 FFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWIAANSWGPWWGERGYFRILR 404

Query: 316 GSNECGIEEDVVAGLPSSKN 335
           GSNECGIEE V+A  P   N
Sbjct: 405 GSNECGIEEYVLASWPYVYN 424


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 118/345 (34%), Positives = 163/345 (47%), Gaps = 55/345 (15%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP-KGLLLGVPVKTHDKSLKLP 96
           S++ EVN        +    +F   ++G  K L G  P   KGL     V   ++   +P
Sbjct: 3   SLVDEVNSKQNLWTASTDQERFYGRSLGDAKKLCGTLPEETKGLE--KKVYPTEELADIP 60

Query: 97  KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
            SFDAR A+ +C   I  + DQ  CGSCWA   VEA + R CI  G   N  LS  ++LA
Sbjct: 61  SSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMLA 120

Query: 154 CCGFL--CGD-GCDGGYPISAWRYFVHHGVVT-------------EECDPY------FDS 191
           CC  +  C   GC GG   +AW +   HG+VT             + C PY       D 
Sbjct: 121 CCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQ 180

Query: 192 TGCSHPGC---------------------EPAYPTPKCVRKC--VKKNQLWRNSKHYSIS 228
               +  C                     +  Y TP C+ +C   K        +H++  
Sbjct: 181 EDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTAR 240

Query: 229 AY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
           A   +    ++I  EI  NGP   SF+ YEDF+ YKSGVYKH +G  +G H+V++IGWGT
Sbjct: 241 ALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGT 300

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
            + G DYW++ N WN  WG  G FKI +G  +CGI++ V   LP+
Sbjct: 301 -EKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPA 342


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 106/299 (35%), Positives = 153/299 (51%), Gaps = 27/299 (9%)

Query: 40  IKEVNE----NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 95
           + E+N     NP+  WKA    +F   T  +   LL      K      P  T      +
Sbjct: 18  VSELNHIKSLNPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDV 75

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P+SFD R  +P C  I  ++DQG CGSCWAF +V    DR C+  G++   +  S   ++
Sbjct: 76  PESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVV 132

Query: 153 ACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C     GD  C+GG+  + W++    G  T+EC PY   +      C    PT     K
Sbjct: 133 SCDH---GDMACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----K 180

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   +     +   S   Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  
Sbjct: 181 CADGSSKVHLATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTY 238

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G + GGHAV+++G+GT DDG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 239 GYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/288 (34%), Positives = 144/288 (50%), Gaps = 56/288 (19%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG 156
           FDAR  WP+CS+I  I D   C S WAF A E++SDR CI+ G  +N  LS  +LL+CC 
Sbjct: 85  FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCT 144

Query: 157 --FLCGDG------------------------------------CDGGYPISAWRYFVHH 178
             F CG+G                                    C GG    AW+Y+  H
Sbjct: 145 GVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKH 204

Query: 179 GVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSK 223
           G+ T         C PY  S         + PGC      TP C +KC     +     +
Sbjct: 205 GLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDR 264

Query: 224 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
           HY +S  ++ +   +I +++  NGP+  +  VY+DF  Y +G+Y H+TG+  G  +V+++
Sbjct: 265 HYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 324

Query: 284 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           GWG   +G  YW+LAN W + WG +G F++ RG NECG+E + V+G+P
Sbjct: 325 GWGMY-EGVPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSGMP 371


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 101/262 (38%), Positives = 142/262 (54%), Gaps = 24/262 (9%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 147
           D S  +P++FDAR+ W +C +I+ I +QG+C + WA     A++DR CI    N++   S
Sbjct: 82  DGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYS 141

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 200
              +L+CC   CGDGC+GGY  +AW+Y++  G+VT       E C P+     C+H   +
Sbjct: 142 PQKMLSCCD-DCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVMD 199

Query: 201 PAYP----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED-IMAEIYKNGPV 249
              P          TP+C   C   N      K  S    RI+      I  E+ K+GP 
Sbjct: 200 ERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDIS-KGIRIDWHCSGMIRNELKKHGPA 258

Query: 250 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
                VYEDF  YKSG+Y+H+TG ++G   VK+IGWG    G  YW+ AN W  SWG  G
Sbjct: 259 TAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVY-RGVQYWLAANSWGTSWGDKG 317

Query: 310 YFKIKRGSNECGIEEDVVAGLP 331
           +FKI+RG NEC  E+  ++G P
Sbjct: 318 FFKIRRGYNECLFEDYFISGRP 339


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 104/267 (38%), Positives = 135/267 (50%), Gaps = 28/267 (10%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 150
           + +P SFD+R  WP CS I  + DQ  CGS     AVE  SDR CI  +   N  LS  D
Sbjct: 90  VDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQD 149

Query: 151 LLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTEE-------CDPYFD-------S 191
            L+CC  L   CGDG  CDG +P    +++  HG+ T         C PY         +
Sbjct: 150 PLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYA 209

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNG 247
            G +   C P Y TP C   C   N  W    +  KH+  + Y +     DI  EI  NG
Sbjct: 210 NGTTSVPC-PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNG 267

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           PV  SF +Y+DF  YK+G+Y H  GD  GG   K+IGWG  D+G  YW+  +QW   +G 
Sbjct: 268 PVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGE 326

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSK 334
           +G+ +  RG NE  IE  V+A LP S+
Sbjct: 327 NGFVRFLRGVNEVNIEHQVLAALPDSE 353


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 121/191 (63%), Gaps = 14/191 (7%)

Query: 163 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 210
           C+GGYP  AW ++   G+V+         C PY           S P C     TPKC +
Sbjct: 87  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSK 146

Query: 211 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H
Sbjct: 147 ICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH 206

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG
Sbjct: 207 VTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 265

Query: 330 LPSSKNLVKEI 340
           +P +    ++I
Sbjct: 266 IPRTDQYWEKI 276


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 152/316 (48%), Gaps = 25/316 (7%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTV--GQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           ++   +I  VN     GW+AA   QF   T+  G    L   +P P  + +       D 
Sbjct: 140 LMDGDLIDAVNRG-NYGWRAANYSQFWGMTLEDGMRYRLGTFRPPPTVMNMNEMHMAMDS 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
           +  LP+ FDA + WP    I   LDQG+C   WAF      SDR  IH    M  SLS  
Sbjct: 199 NEVLPRHFDAATKWP--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPK 207
           +LL+C       GC GG    AW Y    GVVT+EC P+   DS   + P    +  T +
Sbjct: 257 NLLSC-DTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGR 315

Query: 208 CVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
             R+   +    Q   N  + S  AYR+    ++IM E+ +NGPV+    V+EDF  YKS
Sbjct: 316 GKRQATARCPNPQTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKS 375

Query: 265 GVYKHIT--------GDVMGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFK 312
           G+Y+H              G H+VK+ GWG     DG+   YW  AN W R+WG DG+F+
Sbjct: 376 GIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAANSWGRAWGEDGHFR 435

Query: 313 IKRGSNECGIEEDVVA 328
           I RG NEC +E  VV 
Sbjct: 436 IARGVNECEVESFVVG 451


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 120/213 (56%), Gaps = 15/213 (7%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 157
           FDA  AWP C TI+ I DQ  CGSCWA  A  A+SDR+C   G+ +L +S  DLL+CC  
Sbjct: 1   FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRGGVRDLRISAGDLLSCCN- 59

Query: 158 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVR 210
            CG GC+GG P  AW Y+V  G+V+E C PY     C+H         C   Y TP C  
Sbjct: 60  ACGLGCNGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNSTHYTPCSVEYDTPFCNI 118

Query: 211 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
            C       +     S S     S  ED   E++  GP EV+FTVYEDF  Y  GVYKH 
Sbjct: 119 TCTNTIPPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAFTVYEDFVAYSDGVYKHF 174

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           +G+ +GGHAV+L+GWG   +G  YW +AN WN 
Sbjct: 175 SGNALGGHAVRLVGWGNL-NGTPYWKIANSWNH 206


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 116/324 (35%), Positives = 160/324 (49%), Gaps = 24/324 (7%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 78
           EG   +   D  +  D+I+  VN   + GW A +  Q+    Y+ G  K  LG K PT +
Sbjct: 115 EGGSVQCDQDLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173

Query: 79  GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 138
              +    +  + +  LP SF+A   W   S IS + DQG CG+ W        SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228

Query: 139 HFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                  + LS  ++L+C       GC+GG+  +AWRY    GVV E C PY       H
Sbjct: 229 QSKGKEAVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QH 281

Query: 197 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                     + +R   C     + R++ +    AY +N +  DIMAEI+ +GPV+ +  
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340

Query: 255 VYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 311
           V  DF  Y  GVY+    +   + G H+VKL+GWG   +GE YWI AN W   WG  GYF
Sbjct: 341 VNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYF 400

Query: 312 KIKRGSNECGIEEDVVAGLPSSKN 335
           +I RGSNECGIE+ V+A  P   N
Sbjct: 401 RILRGSNECGIEDYVLASWPYVYN 424


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 114/319 (35%), Positives = 155/319 (48%), Gaps = 38/319 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK-PTPKGLLLGVPVKTHDK 91
           +  D +I  VN     GW A +  ++  +   +  +  LG K PT +   +    +  + 
Sbjct: 126 LTDDELIYSVNSIHNLGWSARKYNEWWGHKYAEGLRLRLGTKEPTYR---VKAMTRLTNP 182

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 149
           +  LP SF+A   WP  S IS + DQG CGS W        SDRF I       + LS  
Sbjct: 183 TDGLPSSFNAVERWP--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQ 240

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 199
           ++L+C       GCDGG+  +AWR+    GVV + C PY           +S      GC
Sbjct: 241 NILSCTRRQ--QGCDGGHLDAAWRFLHKKGVVDDSCYPYTQQRDTCKIRHNSRSLKANGC 298

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
            P+               + R+S +    AY +N +  DIMAEIY +GPV+ +  VY DF
Sbjct: 299 RPS-------------PNVDRDSFYTVGPAYTLNREG-DIMAEIYHSGPVQATMRVYRDF 344

Query: 260 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
             Y  G+Y+      G   G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RG
Sbjct: 345 FSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRG 404

Query: 317 SNECGIEEDVVAGLPSSKN 335
           SNECGIEE V+A  P   N
Sbjct: 405 SNECGIEEYVLASWPYVYN 423


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 105/281 (37%), Positives = 139/281 (49%), Gaps = 39/281 (13%)

Query: 86  VKTHDKSLK---------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           +++H++S +         +P SFDAR  WP CS I  + DQ  CGS     A E  SDR 
Sbjct: 76  IRSHEQSTENDNSQVFEEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIASDRT 135

Query: 137 CIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT------- 182
           CI      N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T       
Sbjct: 136 CIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQ 195

Query: 183 --------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAY 230
                     CD  + +   S P   P Y TP C  +C   N  W    +  KH+  + Y
Sbjct: 196 FGCKPYTIYPCDKKYPNGTTSVPC--PGYHTPVCEERCTS-NITWPISYKQDKHFGKAHY 252

Query: 231 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 290
            +     DI  EI +NGPV  SF +Y+DF  YKSG+Y H  GD  GG   K+IGWG  D+
Sbjct: 253 NVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DN 311

Query: 291 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G  YW+  +QW   +G +G+ +I RG NE  IE  V+A  P
Sbjct: 312 GVPYWLCVHQWGTDFGENGFVRILRGVNEVNIEHQVLAAQP 352


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 114/319 (35%), Positives = 154/319 (48%), Gaps = 38/319 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           +  +SII  +N     GW A +  ++    Y+ G    L   +PT +   +    +  + 
Sbjct: 129 LTDESIIHSINTIYHLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR---VKAMSRLTNP 185

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 149
           +  LP +F+A   W   S IS + DQG CGS W        SDRF I       + LS  
Sbjct: 186 TAGLPAAFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 199
           ++L+C       GC+GG+  +AWRY    GVV E C PY           +S      GC
Sbjct: 244 NILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGC 301

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
            P+                 R+S +    AY +N +  DIMAEIY +GPV+ +  VY DF
Sbjct: 302 RPSANVD-------------RDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDF 347

Query: 260 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
             Y SGVY+      G   G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RG
Sbjct: 348 FSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRG 407

Query: 317 SNECGIEEDVVAGLPSSKN 335
           SNECGIE+ V+A  P   N
Sbjct: 408 SNECGIEDYVLASWPYVYN 426


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 114/319 (35%), Positives = 154/319 (48%), Gaps = 38/319 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           +  +SII  +N     GW A +  ++    Y+ G    L   +PT +   +    +  + 
Sbjct: 129 LTDESIIHSINTIYHLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR---VKAMSRLTNP 185

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 149
           +  LP +F+A   W   S IS + DQG CGS W        SDRF I       + LS  
Sbjct: 186 TAGLPAAFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 199
           ++L+C       GC+GG+  +AWRY    GVV E C PY           +S      GC
Sbjct: 244 NILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGC 301

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
            P+                 R+S +    AY +N +  DIMAEIY +GPV+ +  VY DF
Sbjct: 302 RPSANVD-------------RDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDF 347

Query: 260 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
             Y SGVY+      G   G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RG
Sbjct: 348 FSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRG 407

Query: 317 SNECGIEEDVVAGLPSSKN 335
           SNECGIE+ V+A  P   N
Sbjct: 408 SNECGIEDYVLASWPYVYN 426


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 135/247 (54%), Gaps = 35/247 (14%)

Query: 116 DQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPIS 170
           DQ  CGSCWAFG VEA + R CI  G  +N  LS  ++LACC    F    GC GG PI+
Sbjct: 1   DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60

Query: 171 AWRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCV 209
           +W +   +G+V+             + C PY     C+H        P  +  Y TP C 
Sbjct: 61  SWTFLHTNGIVSGGGFVPEKNMKAADGCWPY-SFPKCAHHQDGSDYKPCAKEIYDTPSCS 119

Query: 210 RKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
             C   K    +   +HY+ S +  R  S    I  EI  NGP   +F+VYEDF  YKSG
Sbjct: 120 SSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKKEIMTNGPTSAAFSVYEDFLSYKSG 178

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           VYKH +G  +GGHAV++IGWGT + G DYW++ N WN  WG  G FKI +G  +CGI++ 
Sbjct: 179 VYKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDT 235

Query: 326 VVAGLPS 332
           ++AG P+
Sbjct: 236 ILAGTPA 242


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 118/340 (34%), Positives = 156/340 (45%), Gaps = 35/340 (10%)

Query: 13  CLTCFATFAEGVVSKLKLDS-HILQDSI-IKEVNENPKAGWKAARNPQFSNYTVGQ-FKH 69
           C TC  T A    +    D    L DSI I +VNE+   GW+A+        T  +   +
Sbjct: 112 CNTCVCTLAPDGNADFVCDGIPCLVDSITISDVNEDYYLGWRASNYSFLWGLTQAEGVLY 171

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
            LG  P  + L     V    +  +LP++FDAR  WP    I  ++DQG CGS WA    
Sbjct: 172 RLGTFPPGRALSEMAEVNIDTEGARLPETFDARENWP--GLIDEVIDQGKCGSSWAISTA 229

Query: 130 EALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
              SDR  I     +N  LS   LL+C       GC GGY   AW +    G V+  C P
Sbjct: 230 SVASDRLAIQSMGEINPRLSEQHLLSC-NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYP 288

Query: 188 YF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
           Y     + T      C  AY + +C  + V  +       + S   YRI +   DIM EI
Sbjct: 289 YHSGLDEDTIMQKLRCRVAYGSSQCPERGVTSD------LYLSTPPYRIAAREVDIMTEI 342

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHIT---------GDVMGGHAVKLIGWGTSDDGED- 293
           Y+NGPV+ +F V  DF  Y  GVY+++           D  G H+VK++GWG   D  D 
Sbjct: 343 YQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGI--DRSDW 400

Query: 294 -----YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
                YW+  N W R+WG  G F+I RG NEC IE  V+ 
Sbjct: 401 YNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFVLG 440


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 159/317 (50%), Gaps = 34/317 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLL-LGVPVKTHDK 91
           +++  +++ +N     GWKA    QF   TV + FK  LG  P    LL +         
Sbjct: 160 LVRQDLLQRINSG-DYGWKADNYSQFWGMTVEEAFKKRLGTFPPSHSLLNMRESPGNSLP 218

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
             K P  F A  AWP+   I   LDQ +CG+ WAF      +DR  IH    ++  LSV 
Sbjct: 219 EEKFPVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQ 276

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-----DSTGCSHPGCEPAY- 203
           +L++C       GC+GG   SAWRY   HGVV+  C P F     + +G +H      Y 
Sbjct: 277 NLISC-DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYG 335

Query: 204 ------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
                 P P  + K    N+L+R + HY     R++S   +IM EI   GPV+    VYE
Sbjct: 336 KNYTNGPCPNALEK---SNRLYRCASHY-----RVSSKETNIMKEIMDKGPVQAIMKVYE 387

Query: 258 DFAHYKSGVYKHI--TGDVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWGADGYF 311
           DF  YK G+Y+H    G     H+VKL+GWG   D     + +WI AN W +SWG +GYF
Sbjct: 388 DFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSWGENGYF 447

Query: 312 KIKRGSNECGIEEDVVA 328
           +I RG NEC IE+ ++A
Sbjct: 448 RILRGQNECDIEKLILA 464


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 166/345 (48%), Gaps = 35/345 (10%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           +I D      CF +     + K   D  +++  +I+ +N     GWKA    QF   TV 
Sbjct: 137 IIKDNCNSCKCFNS-----LWKCSTDVCLVRQDLIQHINSG-DFGWKADNYSQFWGMTVE 190

Query: 66  Q-FKHLLGVKPTPKGLL--LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 122
           + FK  LG  P    LL    VP K+  +  K P  F A   WP+   I   LDQ +CG+
Sbjct: 191 EGFKKRLGTFPPSHSLLNMREVPGKSLPEE-KFPAIFSAIYEWPE--WIHDPLDQRNCGA 247

Query: 123 CWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 180
            WAF      +DR  IH    ++  LS  +L++C       GC+GG    AWRY   HGV
Sbjct: 248 SWAFSTASVAADRIAIHSKGQITDNLSAQNLISC-DTRNQHGCNGGSIDGAWRYLKTHGV 306

Query: 181 VTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHYSISAYR 231
           V+  C P F +           Y + +         C     K N+L+R + HY     R
Sbjct: 307 VSYACYPSFWNKHLGPSAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHY-----R 361

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--TGDVMGGHAVKLIGWGTSD 289
           ++S   DIM EI   GPV+    VYEDF  YK G+Y+H    G     H+VKL+GWG   
Sbjct: 362 VSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALP 421

Query: 290 DG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           D     + +WI AN W +SWG +GYF+I RG NEC IE+ ++A L
Sbjct: 422 DKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 103/299 (34%), Positives = 155/299 (51%), Gaps = 37/299 (12%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 90
           D+ +  ++++K VNE  +  ++A  +P+       +  HL+  +       L   +   +
Sbjct: 34  DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87

Query: 91  KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 147
           K++    +P+SFD+R  W  CS+I+ I DQ + GSCWA  A E +SDR C+     +   
Sbjct: 88  KAISNEDIPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKM 147

Query: 148 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 198
           ++D  +LACCG  CG GC+GG    AW Y    GVVT    +E   C PY       HP 
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYH-----LHP- 201

Query: 199 CE-----------PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 246
           CE            ++ TP C + C     + +   K Y  S Y ++ D + I  E+ KN
Sbjct: 202 CEITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKN 261

Query: 247 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           GPV+ +FT YEDF+ Y+ G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  W
Sbjct: 262 GPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDW 319


>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
          Length = 351

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 136/251 (54%), Gaps = 30/251 (11%)

Query: 88  THDKSLK--LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
           + D  LK  +P  FD R+ WPQC  + +I DQ +CG+CWAF     L+DR CI  +  +N
Sbjct: 111 SQDHLLKDSIPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTIN 168

Query: 144 LSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 201
             LS  D++ C    F    GC+GGY ++A  Y ++ GV  E C PY D T         
Sbjct: 169 EELSPQDMVDCSHDNF----GCEGGYLMNALDYLMNEGVTKESCTPYKDKTN-------- 216

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
                KC   C  K + +   KHY      R+ ++ E I  ++ +NGP+ V  TVYEDF 
Sbjct: 217 -----KCQYTCQNKTEEFH--KHYCKPGTLRVLTNEEQIKRDLMQNGPLMVGLTVYEDFI 269

Query: 261 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
           +Y +G YK + G+++GGHAVKL+GW T+  G+  W++ NQWN  WG  G+  I    NE 
Sbjct: 270 NYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTSWLIQNQWNDDWGEQGFGYIL--ENEV 327

Query: 321 GIEEDVVAGLP 331
           GI+   V   P
Sbjct: 328 GIDSIGVGCTP 338


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 155/318 (48%), Gaps = 31/318 (9%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKT 88
           D  +  D +I  VN   + GW A +  ++    Y+ G    L   +PT +   +    + 
Sbjct: 126 DLCLTDDELIHSVNSIHRLGWSARKYEEWWGRKYSEGLRLRLGTKEPTYR---VKTMTRL 182

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSL 146
            + +  LP SF+A   W +   IS + DQG CGS W        SDRF I       + L
Sbjct: 183 TNPTDGLPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQL 240

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG---CSHPGCEPAY 203
           S  ++L+C       GC+GG+  +AWRY    GV+ E C PY  S G     H G   A+
Sbjct: 241 SPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSLKAH 298

Query: 204 ---PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
              P P      V ++ L+     YS+S         DI AEI+ +GPV+ +  VY DF 
Sbjct: 299 GCRPAPG-----VDRDSLYTVGPAYSLSR------EADIKAEIFHSGPVQATMRVYRDFF 347

Query: 261 HYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 317
            Y  G+Y+      G   G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RGS
Sbjct: 348 SYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGS 407

Query: 318 NECGIEEDVVAGLPSSKN 335
           NECGIE+ V+A  P   N
Sbjct: 408 NECGIEDYVLASWPYVYN 425


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 169/360 (46%), Gaps = 48/360 (13%)

Query: 9   DPILCLTCFATFAEGVVSKLKLDSH--------------ILQDSIIKEVNENPKAGWKAA 54
           DP  CL     + EG V K   +S               ++Q  +I+ VN N   GW A 
Sbjct: 116 DPEGCLRDGQLYEEGSVVKENCNSCTCSGQQWKCSQLVCLIQPELIERVN-NGDYGWTAQ 174

Query: 55  RNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTIS 112
              QF   T+ + FK+ LG + P+P+ L +     +   +  LP+ F A   WP      
Sbjct: 175 NYSQFWGMTLEEGFKYRLGTLPPSPRLLSMNEMTASLPATTDLPEFFIASYKWP--GWTH 232

Query: 113 RILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPIS 170
             LDQ +C + WAF      +DR  I        +LS  +L++CC      GC+ G    
Sbjct: 233 GPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDR 291

Query: 171 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRN 221
           AW +    G+V+  C P F     ++ GC  A         + T  C     K N++++ 
Sbjct: 292 AWWFLRKRGLVSHACYPLFKDQNATNDGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQC 351

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 273
           S       YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T          
Sbjct: 352 S-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYR 406

Query: 274 VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
               HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 407 KFQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
          Length = 301

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 110/268 (41%), Positives = 134/268 (50%), Gaps = 22/268 (8%)

Query: 14  LTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 73
           L C    A   +S   +  H L D  I E+N   +  WKA RN    N  +   + LLGV
Sbjct: 5   LLCIVVLASVALSYGGVKLHPLSDEFINEINSK-QTTWKAGRNFDV-NTPISHVRRLLGV 62

Query: 74  KPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQC-STISRILDQGHCGSCWAFGAVEA 131
            P  K     +PVKTH  +L  +P+SFDAR AWP+C S I  I DQ  CGSCWAFGAVEA
Sbjct: 63  LPK-KANAPKLPVKTHAVNLDAIPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEA 121

Query: 132 LSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 182
           +SDR CIH    + + +S  DL  CC + CGDGC+GG+P  AW Y+   G+VT       
Sbjct: 122 MSDRICIHSDASVKVRISAEDLNDCC-YDCGDGCNGGWPDLAWSYWSSTGIVTGGLYGVD 180

Query: 183 EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 236
           E C  Y     C H        C     TP C + C   + L   S     SAY I    
Sbjct: 181 EGCKAY-SIKPCDHHVDGNLGPCGDIQRTPACKKSCDSTSDLEYKSDLRRGSAYSIPKSE 239

Query: 237 EDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
             I  EI  NGPVE  + VY DF  YK+
Sbjct: 240 SQIQTEIMTNGPVEADYDVYSDFLTYKA 267


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 163/346 (47%), Gaps = 34/346 (9%)

Query: 14  LTCFATFAEGVVSKLKL--DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           ++C ++F   V  +  +  ++  L  S +     N +  WKA  +       + + K + 
Sbjct: 13  VSCTSSFHPSVSYRPTIPENARKLSGSDLTSYVNNHQKLWKAETSRMTFQEKMARVKDIK 72

Query: 72  GVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
            +K + +  ++G   + +   L +P  FD+R  WP+C+ I  + DQ  CGS     AVE 
Sbjct: 73  FIK-SHEDQMVG-DSENNQVLLDIPTYFDSRQKWPECTQIGAVRDQSDCGSAAHLVAVEL 130

Query: 132 LSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT-- 182
            SDR CI      N  LS  D L+CC  L   CGDG  CDG +P    +++  HG+ T  
Sbjct: 131 ASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGG 190

Query: 183 -------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHY 225
                          CD  + +   S P   P Y TP C   C   N  W    +  KH+
Sbjct: 191 NYEDQFGCKPYSIYPCDKKYPNGTTSVPC--PGYHTPTCEEHCTS-NITWPIAYKQDKHF 247

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 285
             + Y +     DI  EI  NGPV  SF +Y+DF  YKSG+Y H  GD  GG   K+IGW
Sbjct: 248 GKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGW 307

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G  D G  YW+  +QW   +G +G+ +  RG NE  IE  V+A LP
Sbjct: 308 GV-DSGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 352


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 105/271 (38%), Positives = 139/271 (51%), Gaps = 33/271 (12%)

Query: 87  KTHDKSLK--LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D S K  +P+ FDAR  +  C+  I  + DQG+C S WA       +DR CI     
Sbjct: 54  KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 113

Query: 144 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY  
Sbjct: 114 FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-K 167

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 239
           +  C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I
Sbjct: 168 NRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 227

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             EI  +GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N
Sbjct: 228 QQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMN 287

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            WN +WG DG FKI RG N C IE  V+AG+
Sbjct: 288 SWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/322 (34%), Positives = 159/322 (49%), Gaps = 36/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           ++Q  +I+ VNE    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 47  LVQPGLIEHVNEG-DFGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLP 104

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 148
           ++  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 105 ETTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSP 162

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 163 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 221

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 222 GKRHATKPCPNNFEKSNRIYQCSP-----PYRVSSNETEIMREIMQNGPVQAIMQVHEDF 276

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 277 FHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGE 336

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 337 NGYFRILRGVNESDIEKLIIAA 358


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 109/297 (36%), Positives = 161/297 (54%), Gaps = 34/297 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS--LKLP 96
           +I+E+N + +  WKA  N       +G     LG+ P P      +  K H  S  + +P
Sbjct: 26  VIQEIN-SEQISWKAETNCLDIKSRLG----FLGLHPDPN---YKIQTKQHKISRIISIP 77

Query: 97  KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           +SFDAR  WP+C   I +I +QG+CGSCWAF + E ++DR CI     +    S  +LL 
Sbjct: 78  ESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLT 137

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           CC         GGY  +AW Y+++ G+ +     Y  S GC  P  E ++   +   +CV
Sbjct: 138 CCKDCGCGC-KGGYIKNAWDYYINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECV 192

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
           K               Y + ++   I  EI  NGPV   + V+EDFA +KSGVY + +G 
Sbjct: 193 K--------------FYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGK 238

Query: 274 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 329
            +G H+VK+IGWGT ++G  YW++AN W   WG   G+FK++RG+NEC IE+++ AG
Sbjct: 239 FVGRHSVKVIGWGT-EEGIPYWLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAG 294


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 99/288 (34%), Positives = 150/288 (52%), Gaps = 29/288 (10%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG-VP----VKTHDKSLKLPKSFDARSAW 105
           WKA    +F N T  +F+ +L ++P   G   G +P     +  + +  +P  FD R  +
Sbjct: 31  WKAGMPKRFENITEDEFRGML-IRPDILGAGSGSLPPSSVTEIQEPADPIPSQFDFRDEY 89

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 162
           PQC  ++ ++DQG CG CWAF A+    DR C+  G++   +  S   L++C       G
Sbjct: 90  PQC--VTPVMDQGSCGGCWAFSAIGVFGDRRCVA-GIDKEGVPYSQQYLISCS--TENHG 144

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGG     W +    G  T EC  Y D          P      C   C   +Q+    
Sbjct: 145 CDGGDFWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI---- 191

Query: 223 KHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAV 280
           + Y    Y +++ + + IM  +   GPV+    VY D ++Y+SGVYKH  G + +G HA+
Sbjct: 192 QLYKAHGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHAL 251

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           +++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 252 EMVGYGTTDDGTDYWIIRNSWGADWGENGYFRIVRGVNECRIEDEIYA 299


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 105/271 (38%), Positives = 138/271 (50%), Gaps = 33/271 (12%)

Query: 87  KTHDKSLK--LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D S K  +P+ FDAR  +  C+  I  + DQG+C S WA       +DR CI     
Sbjct: 54  KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 113

Query: 144 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY  
Sbjct: 114 FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-K 167

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 239
           +  C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I
Sbjct: 168 NRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 227

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             EI   GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N
Sbjct: 228 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMN 287

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            WN +WG DG FKI RG N C IE  V+AG+
Sbjct: 288 SWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 96/235 (40%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG 156
           FD+R  WP C  +  I DQG+CGSC++F + E +SDRFCI  +  +N+ LS  DL+ C  
Sbjct: 6   FDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSW 63

Query: 157 FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN 216
           +    GC+GG P   + Y    G+V++ C PY    G +H  C P +    C      K 
Sbjct: 64  Y--SFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKC-PDF----CYNN---KT 113

Query: 217 QLWRNSKHYSISAYRINSDPED-------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
           + +++ KH++   Y +    ED       I  EI  +GPV   F VY DF  YKSGVY+H
Sbjct: 114 KSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVYRH 173

Query: 270 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
            TG   G HAVK+IGWGT ++G DYW++AN W  ++G  G+FKI RG     +EE
Sbjct: 174 QTGSFEGIHAVKIIGWGT-ENGVDYWLIANSWGTTFGLQGFFKIVRGGKFIHLEE 227


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/237 (39%), Positives = 128/237 (54%), Gaps = 21/237 (8%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D   ++P  FDAR  W +C TI  + DQG+CGS WA     A +DR C+  +   N  LS
Sbjct: 23  DNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGDFNQLLS 82

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGC 194
             ++  CC   CG+GC+GGYPI AW+ F +HG+VT       E C+PY      +D  G 
Sbjct: 83  AEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGK 141

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           +    +P  P  KC +KC     +  N  H Y+   Y +      I  ++   GP+E SF
Sbjct: 142 NTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGPIEASF 199

Query: 254 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 309
            VY+DF +YKSG+Y K      +GGH+VKLIGWG  + G  YW++ N WN  WG  G
Sbjct: 200 DVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGDKG 255


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/322 (33%), Positives = 160/322 (49%), Gaps = 35/322 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++Q  +I+ VN+    GW+A    QF   T+ + FK+ LG + P+P  L +     +   
Sbjct: 152 LVQPELIERVNKG-DYGWRAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTASLPA 210

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 211 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 268

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW +    G+V+  C P F +   ++ GC  A       
Sbjct: 269 NLISCCP-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASRSDGRG 327

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF 
Sbjct: 328 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 382

Query: 261 HYKSGVYKHITGDV---------MGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
           HYK+G+Y+HIT            +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 383 HYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGE 442

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 443 NGYFRILRGVNESDIEKLIIAA 464


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/301 (34%), Positives = 148/301 (49%), Gaps = 32/301 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK---THD 90
           +L +S++  VN +P + W A   P+         + L   K T     +G   +   T  
Sbjct: 2   VLAESVVDIVNNDPSSTWVATEYPR---------EILTLAKMTAMISQIGNGFEGEWTFA 52

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 150
           ++   P SFD R  WP       + +Q  CGSCWA  A E +  R  I       +S  D
Sbjct: 53  ENENAPASFDCRQKWP--GKAEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQD 110

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           L++C       GC+GGY    W +    G+ TE+C PY   +G            P C  
Sbjct: 111 LVSCESN--NMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSG----------RVPTCPS 158

Query: 211 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
           KC   + + R+   +  S    NS  + +M E+  NGPV   F V+EDF +YKSG+Y+H 
Sbjct: 159 KCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSGIYQHK 213

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           TG   G H V L+GWGT ++G  YW+L N W   WG  G+F+I+RG+N+C I+E   +GL
Sbjct: 214 TGKSKGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGL 272

Query: 331 P 331
           P
Sbjct: 273 P 273


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 169/359 (47%), Gaps = 50/359 (13%)

Query: 9   DPILCLTCFATFAEGVVSKLKLDSH--------------ILQDSIIKEVNENPKAGWKAA 54
           DP  CL     + EG V K   +S               ++Q  +I+ VN+    GW A 
Sbjct: 116 DPEGCLRDGQAYEEGSVIKENCNSCTCSGQQWKCSQLVCLVQPELIERVNKG-DYGWTAQ 174

Query: 55  RNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTI 111
              QF   T+ + FK+ LG  P P  LLL +   T     +  LP+ F A   WP     
Sbjct: 175 NYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEMTASLPATTDLPEFFIASYKWP--GWT 231

Query: 112 SRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISA 171
              LDQ +C + WAF      +DR    +  NLS    +L++CC      GC+ G    A
Sbjct: 232 HGPLDQKNCAASWAFSTASVAADRIXGRYTANLS--PQNLISCCA-KNRHGCNSGSIDRA 288

Query: 172 WRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNS 222
           W +    G+V+  C P F     ++ GC  A         + T  C     K N++++ S
Sbjct: 289 WWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS 348

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------V 274
                  YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           
Sbjct: 349 -----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRK 403

Query: 275 MGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           +  HA+KL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 404 LQTHAIKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 116/337 (34%), Positives = 164/337 (48%), Gaps = 43/337 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 445 NGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 104/276 (37%), Positives = 138/276 (50%), Gaps = 37/276 (13%)

Query: 91  KSLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 147
           +    P++FD+ + WP+C+  I  I DQ +CG CWAF   EA SDR CI  G  + + LS
Sbjct: 20  RGGAAPEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLS 79

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------------CDPYFDSTGCS 195
             D+   C     DGCDGG  I+ W Y    G VT              C  +F +  C 
Sbjct: 80  AQDV---CFNANVDGCDGGQIITPWTYVAKAGAVTGGQYNGTGPFGAGLCADWF-APHCH 135

Query: 196 HPGCE-------------PAYPTPKCVRKC----VKKNQLWRNSKHYSISAYRINSDPED 238
           H G               P+  +P+  + C       +  +   KH      +  S    
Sbjct: 136 HHGPRGDDPYPAEGDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAA 195

Query: 239 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 298
           IMA I + GPVE +FTVYEDF +Y  G+Y H+TG+  GGHAVK +GWG  ++G  YW +A
Sbjct: 196 IMAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGV-ENGTKYWKVA 254

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           N WN  WG  GYF+I RGSNE GIE+ V      +K
Sbjct: 255 NSWNPYWGEAGYFRILRGSNEGGIEDQVTGSHADAK 290


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 123/365 (33%), Positives = 168/365 (46%), Gaps = 46/365 (12%)

Query: 3   PTKLIMDPILCLTCF---------ATFAEGVVSKLKLDSH--------ILQDSIIKEVNE 45
           P  +   P L + CF         A F +   S   ++SH        +++ S+IK++N+
Sbjct: 114 PENIRTPPSLQVGCFTDEQHHGEGAIFKDNCNSCKCVNSHWKCSSEICLVRPSLIKQIND 173

Query: 46  NPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARS 103
               GWKA    QF    + + +   LG  P P  LL   PV  +  +    P+ F A  
Sbjct: 174 G-NYGWKAHNYSQFWGMNLKEGYNSRLGTFPPPAALLDMKPVTENIIAEDDFPEFFVAWH 232

Query: 104 AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD 161
            WP    I   LDQ +C + WAF      +DR  IH     +  LS   L++C       
Sbjct: 233 EWP--GWIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLISC-DTRNQY 289

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQ 217
           GC GG    AW Y   +G+V+  C P F      T C       A    + ++ C  +  
Sbjct: 290 GCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQTSCEMSSVFDAEGKRQAIQPCPNR-- 347

Query: 218 LWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TG 272
            W  S H       YRI+S   DIM EI +NGPV+    VY+DF  YKSG+YKHI    G
Sbjct: 348 -WEPSNHIYQCGLPYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEG 406

Query: 273 DVMGGH-----AVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGIE 323
                H     ++K++GWGT  D E     +WI AN W  SWG +GYF+I RG NEC IE
Sbjct: 407 KTQNRHQKKPHSIKIVGWGTLRDAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIE 466

Query: 324 EDVVA 328
           + V+A
Sbjct: 467 KTVIA 471


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 112/322 (34%), Positives = 157/322 (48%), Gaps = 36/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I+ VN+    GW A    QF   T+   FK  LG  P P  LLL +   T    
Sbjct: 143 LVRPELIENVNKG-DYGWIAQNYSQFWGMTLEDGFKFRLGTLP-PSPLLLSMNEMTASLP 200

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 201 KTTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 258

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 259 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASRSDGR 317

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YRI+S+  +IM EI +NGPV+    V+EDF
Sbjct: 318 GKRHATKPCPNNIEKSNRIYQCS-----PPYRISSNETEIMKEIMQNGPVQAIMQVHEDF 372

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYKSG+Y+H+            +  HAVKL+GWGT        E +WI AN W +SWG 
Sbjct: 373 FHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEKFWIAANSWGKSWGE 432

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 433 NGYFRILRGVNESDIEKLIIAA 454


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 102/300 (34%), Positives = 149/300 (49%), Gaps = 32/300 (10%)

Query: 35  LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK---THDK 91
           L +S++  VN +P + W A   P+    T  + + ++          +G   +   T  +
Sbjct: 1   LAESVVDIVNNDPSSTWVATEYPR-EILTPAKMRAMIS--------QIGNGFEGEWTFAE 51

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 151
           +   P SFD R  WP       + +QG CGSCWA  A E +  R  I       +S  DL
Sbjct: 52  NENAPASFDCRQKWP--GKAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDL 109

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           ++C       GC+GGY    W +    G+ TE+C PY   +G            P C  K
Sbjct: 110 VSCESN--NMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSG----------RVPTCPSK 157

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
           C   + + R+   +  S    NS  + +M E+  NGPV   F V+EDF +Y+SGVY+H T
Sbjct: 158 CKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSGVYQHKT 212

Query: 272 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           G   G H V L+GWGT ++G  YW+L N W   WG  G+F+I+RG+N+C I+E   +GLP
Sbjct: 213 GRSQGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGLP 271


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 104/287 (36%), Positives = 148/287 (51%), Gaps = 27/287 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 105
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 162
           PQC  +   LDQG CGSCWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 281
            +  +S     S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           ++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 157/322 (48%), Gaps = 36/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I++VN+    GW A    QF   T+   FK  LG  P P  +LL +   T    
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSLMLLSMNEMTASLP 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
            +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 385 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 445 NGYFRILRGVNESDIEKLIIAA 466


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 118/355 (33%), Positives = 169/355 (47%), Gaps = 43/355 (12%)

Query: 16  CFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK 74
           C +    G   K    + ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  
Sbjct: 137 CNSCTCSGQQWKCSQHACLVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTL 195

Query: 75  PTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
           P P  LLL +   T    ++  LP+ F A   WP        LDQ +C + WAF      
Sbjct: 196 P-PSPLLLSMNEVTASLAETTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVA 252

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
           +DR  I        +LS  +L++CC      GC+ G    AW Y    G+V+  C P F 
Sbjct: 253 ADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFK 311

Query: 191 STGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 241
               ++ GC  A         + T  C     K N++++ S       YR++S+  +IM 
Sbjct: 312 DQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMR 366

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SD 289
           EI +NGPV+    V+EDF +YK+G+Y+HIT              HAVKL GWGT      
Sbjct: 367 EIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHG 426

Query: 290 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
             E +WI AN W +SWG +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 108/321 (33%), Positives = 157/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG + P+P  L +     +   
Sbjct: 155 LVQPELIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW +    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF 
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 385

Query: 261 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+HIT           +  HAVKL GWGT        E +WI AN W  SWG +
Sbjct: 386 HYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 111/317 (35%), Positives = 154/317 (48%), Gaps = 30/317 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLL--LGVPVKTHD 90
           +++  +I  +N     GWKA    QF   T+ + F+  LG  P    LL    +P  +  
Sbjct: 160 LVRPDLIHHINSG-DYGWKADNYTQFWGMTLEEGFRKRLGTLPPSHSLLNMKAIPGSSVP 218

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           +  K P+ F A  AWP    I   LDQ +CG+ WAF      +DR  IH    ++  LSV
Sbjct: 219 EE-KFPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSV 275

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK- 207
            +L++C       GC+GG    AWRY   HGVV+  C P F       P     Y + + 
Sbjct: 276 QNLISC-DTGNQRGCNGGSIDGAWRYLTTHGVVSYACYPSFWKHHLDSPSENQCYVSSEY 334

Query: 208 --------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                   C       N+L+R   HY     R++S   DIM EI   GPV+    VYEDF
Sbjct: 335 GKNHTNGPCPNALEDSNRLYRCGSHY-----RVSSKETDIMEEIMAKGPVQAIMKVYEDF 389

Query: 260 AHYKSGVYKHI--TGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKI 313
             YK G+Y+H    G     H+VKL+GWG+    +   + +WI AN W + WG +GYF+I
Sbjct: 390 FLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRI 449

Query: 314 KRGSNECGIEEDVVAGL 330
            RG NEC IE+ ++  L
Sbjct: 450 LRGQNECDIEKLILTTL 466


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 113/317 (35%), Positives = 153/317 (48%), Gaps = 30/317 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           +++  +I  +N     GWKA    QF   T+ + F+  LG  P P   LL +        
Sbjct: 160 LVRPDLIHHINSG-DYGWKADNYTQFWGMTLEEGFRKRLGTLP-PSHSLLNMEAIPGSSL 217

Query: 93  L--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           L  K P+ F A  AWP    I   LDQ +CG+ WAF      +DR  IH    ++  LSV
Sbjct: 218 LEEKFPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSV 275

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK- 207
            +L++C       GC GG    AWRY   HGVV+  C P F       P     Y + + 
Sbjct: 276 QNLISC-DTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPSFWKHSLDSPSENHCYVSSEY 334

Query: 208 --------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                   C       N+L+R + HY     RI+S   DIM EI   GPV+    VYEDF
Sbjct: 335 GKNHTNGPCPNALEDSNRLYRCASHY-----RISSKETDIMEEIMAKGPVQAIMKVYEDF 389

Query: 260 AHYKSGVYKHI--TGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKI 313
             YK G+Y+H    G     H+VKL+GWG+    +   + +WI AN W + WG +GYF+I
Sbjct: 390 FLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRI 449

Query: 314 KRGSNECGIEEDVVAGL 330
            RG NEC IE+ ++  L
Sbjct: 450 LRGQNECDIEKLILTTL 466


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 157/322 (48%), Gaps = 36/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I++VN+    GW A    QF   T+   FK  LG  P P  +LL +   T    
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSLMLLSMNEMTASLP 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
            +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 385 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 445 NGYFRILRGVNESDIEKLIIAA 466


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V+EDF 
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V+EDF 
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFF 385

Query: 261 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 100/287 (34%), Positives = 146/287 (50%), Gaps = 26/287 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCS 109
           WKA    +  N T   FK L+  K  P+G +  +  + T++    +P  FD R  +PQC 
Sbjct: 31  WKAGIPERLKNLTETDFKRLVSAK-DPRGQIPTLHLIHTYESEDPIPDHFDFREEYPQC- 88

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDG- 165
            I+ ++D G C S WA   VEA   R C++ G++      S   +L+C      +GC   
Sbjct: 89  -ITEVIDMGTCSSSWAHSPVEAFGHRRCMN-GVDQEATRYSAQYILSCA---TTNGCLAF 143

Query: 166 -GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 224
            G  + +W +    G+  E C  Y D     +   E +YP P     C   + L      
Sbjct: 144 PGQGVVSWDFIATTGIPLESCVKYTD-----YDKTESSYPCPSL---CNDNSSL----VL 191

Query: 225 YSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
           Y    Y  +  +PE +   I   GP++  FTVYEDFA+Y  G+Y H+ G   G  +V+++
Sbjct: 192 YKSDGYEGVGFNPEKLRRAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIV 251

Query: 284 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           G+GTSD+G+DYWI+ N W  +WG DGYF+I RG NEC IEE V   +
Sbjct: 252 GYGTSDEGQDYWIVKNYWGSNWGEDGYFRIVRGQNECQIEEAVYGAI 298


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/306 (35%), Positives = 157/306 (51%), Gaps = 19/306 (6%)

Query: 34  ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 91
           +++  +++EVN + P  GW+A    +F   T+     L LG     + +    PV+    
Sbjct: 140 LIEPELMEEVNLQGPTLGWQAGNYSEFWGRTLRDGVELRLGTLNPSQSMYKMNPVRRIYD 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP+ FDAR+ WP+   IS I DQG CG+ WA    +  SDRF I      ++ LS  
Sbjct: 200 PDALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C       GC GGY   AW +    G+V +EC P+   TG  +  C     +   V
Sbjct: 258 HLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPW---TG-RNDQCRLRKRSNLNV 312

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C K     R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  YK+GVY+H
Sbjct: 313 AGCRKPPNPLRQELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYKNGVYRH 371

Query: 270 ITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGI 322
                +   G H++++IGWG           YW++AN W R WG +G F+I+RG+NEC I
Sbjct: 372 SRSAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQRGTNECEI 431

Query: 323 EEDVVA 328
           E  V+A
Sbjct: 432 ESYVLA 437


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 115/337 (34%), Positives = 163/337 (48%), Gaps = 43/337 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG  P P  LLL +   T    
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
           K+  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+      AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCAKK-RRGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            +YK+G+Y+HIT              HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 344
           +GYF+I RG NE  IE+ ++A          ++TSAD
Sbjct: 445 NGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPQLIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 109/305 (35%), Positives = 150/305 (49%), Gaps = 17/305 (5%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH 89
           D+ I+ D +I  VN      W+A    QF    +     + LG  P         P++ +
Sbjct: 124 DACIISDDVIYGVNRG--NSWRAYNYTQFYGKKLRDGIIYKLGTMPLSHETRRMGPIR-Y 180

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMNLSLSV 148
           DK +  P+ FDAR  WP  + IS +LDQG CGS WA       SDRF I   G    +  
Sbjct: 181 DKDIPYPRDFDARRRWP--NFISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLS 238

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
             +L  C      GC GG+   AW +   HG+V EEC PY  +T        P  P    
Sbjct: 239 PQVLLSCNIRRQQGCRGGHIDVAWNFARGHGLVDEECFPYKAATTSC-----PFRPKANL 293

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
           +    +     R S+ Y +      +   DIM +I ++GPV    TV++DF HY  G+Y+
Sbjct: 294 IEDGCRPPVRQRTSR-YKVGPPGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYR 352

Query: 269 HIT-GD--VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
               GD  + G H+V+++GWG  D G+ YW++AN W   WG +GYF+I RGSNE GIE  
Sbjct: 353 RSPYGDNTLQGLHSVRIVGWG-EDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESF 411

Query: 326 VVAGL 330
           VV  L
Sbjct: 412 VVTVL 416


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 103/287 (35%), Positives = 147/287 (51%), Gaps = 27/287 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 105
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 162
           PQC  +   LDQG CG CWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 281
            +  +S     S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           ++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/247 (40%), Positives = 129/247 (52%), Gaps = 24/247 (9%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV- 148
           + S  +P SFD R  +PQC  I+ + DQGHCGSCWAF A  A  DR C+  G++ S  V 
Sbjct: 73  EPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQ-GLD-SAGVP 128

Query: 149 --NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST-GCSHPGCEPAYPT 205
                   C +L   GC GG   S W +   HG  T EC PY D+    S P        
Sbjct: 129 YSQQYTISCDYL-DLGCAGGLSFSVWTFLTEHGTTTLECVPYTDANKDISSP-------- 179

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
             C   C   +++ R  K      Y  N     IM  +  +GPV+ S  VY DF +Y+SG
Sbjct: 180 --CPDACADGSEI-RLVKADGCLDYSGNVTA--IMQALANDGPVQASMAVYRDFLYYRSG 234

Query: 266 VYKHITGDVMGGHAVKLIGWGTSDDGED--YWILANQWNRSWGADGYFKIKRGSNECGIE 323
           VY+H+ G  +  HAV++IG+G +DD +   YWI+ N     WG +GYF I RGSNEC IE
Sbjct: 235 VYRHVYGSQISSHAVEIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIE 294

Query: 324 EDVVAGL 330
             V +GL
Sbjct: 295 SAVYSGL 301


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/312 (34%), Positives = 151/312 (48%), Gaps = 18/312 (5%)

Query: 34  ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDK 91
           +++ SI + +N N    GW A+   +F  + + +  K  LG     + ++   PV+    
Sbjct: 16  LIEPSITEAINSNYANYGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVRRIYD 75

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 149
              LP+ FD+   WP    +S I DQG CGS WA       SDRF I       ++LS  
Sbjct: 76  PNSLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQ 133

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C        C+GGY   AW Y    G+V E+C PY      ++  C          
Sbjct: 134 HLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRIPRRGDLVT 188

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C     + R SK+    AYR+ ++  DIM EI  +GPV+ +  VY DF  YK G+Y+H
Sbjct: 189 ANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRH 247

Query: 270 I---TGDVMGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIE 323
               T D  G H+V+++GWG     E    YW +AN W   WG +GYF+I RGSNEC IE
Sbjct: 248 SPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 307

Query: 324 EDVVAGLPSSKN 335
             V+      +N
Sbjct: 308 SFVLGTWAEVEN 319


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 149/315 (47%), Gaps = 34/315 (10%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 76
           + +    ++ +   S+I  +N N   GWKA    +F N T+ Q + +L G+      + T
Sbjct: 32  IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSTDEDT 91

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P+       ++  +  + +P +FDAR+ W  C  +  I DQ  CG+CWAF A   L+ R 
Sbjct: 92  PR-------MENIETRMDIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRL 142

Query: 137 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 194
           CI      N+ LS    + C        C GGY   +W +  + G   + C PY    G 
Sbjct: 143 CIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGRGT 200

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
              G  P     +C    +  ++       Y     R  +   +I   I   G V+  FT
Sbjct: 201 FSSGTCPT----QCKIASMSMSK-------YKAKNTRYITGINNIKTAIMTYGSVQAGFT 249

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY D   YKSGVYKH+   V+GGHAV LIG+G  + G +YW+ AN W  +WG  GYFKI 
Sbjct: 250 VYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNYWLAANSWGANWGMSGYFKIA 308

Query: 315 RGSNECGIEEDVVAG 329
           +G  E GIE  V AG
Sbjct: 309 QG--EGGIENQVYAG 321


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/321 (32%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++Q  +I+ VN+    GW A    QF   T+ + FK+ LG + P+P  L +     +   
Sbjct: 159 LIQPELIERVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTPSLPA 217

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 218 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRFTANLSPQ 275

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++  C  A       
Sbjct: 276 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNDCAMASRSDGRG 334

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V++DF 
Sbjct: 335 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFF 389

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK G+Y+H+T           +  HA+KL GWGT        E +WI AN W +SWG +
Sbjct: 390 HYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQGRKEKFWIAANSWGKSWGEN 449

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 450 GYFRILRGVNESDIEKLIIAA 470


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 103/287 (35%), Positives = 147/287 (51%), Gaps = 27/287 (9%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 105
           WKA    +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +
Sbjct: 31  WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89

Query: 106 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 162
           PQC  +   LDQG CG CWAF A+    DR C   G++   +S S   L++C   L   G
Sbjct: 90  PQC--VKPALDQGSCGGCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144

Query: 163 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 222
           CDGG     W +    G  T EC  Y D       G   A P P          QL++  
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 281
            +  +S     S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252

Query: 282 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           ++G+GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 253 IVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 156/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 151/315 (47%), Gaps = 34/315 (10%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 76
           + +    ++ +   S+I  +N N   GWKA    +F N T+ Q + +L G+      + T
Sbjct: 69  IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDT 128

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P+       +   +  + +P +FDAR+ W  C  +  I DQ  CG+CWAF A   L+ R 
Sbjct: 129 PR-------MANIETRIDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRL 179

Query: 137 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 194
           CI      N+ LS    + C        C GGY   +W +  + G   + C PY    G 
Sbjct: 180 CIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDSCIPYASGRG- 236

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                   + +  C  +C  K      SK+ + +   I S   +I   I   G V+  FT
Sbjct: 237 -------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFT 286

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY D   YKSGVYKHI   V+GGHAV LIG+G  + G +YW+ AN W  +WG  GYFKI 
Sbjct: 287 VYRDLTGYKSGVYKHIENTVLGGHAVALIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIA 345

Query: 315 RGSNECGIEEDVVAG 329
           +G  E GIE  V AG
Sbjct: 346 QG--EGGIENQVYAG 358


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 155/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVHPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 152/317 (47%), Gaps = 34/317 (10%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 76
           + +    ++ +   S+I  +N N   GWKA    +F N T+ Q + +L G+      + T
Sbjct: 17  IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDT 76

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P+       +   +  + +P +FDAR+ W  C  +  I DQ  CG+CWAF A   L+ R 
Sbjct: 77  PR-------MANIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRL 127

Query: 137 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 194
           CI      N+ LS    + C        C GGY   +W +  + G   + C PY    G 
Sbjct: 128 CIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGGG- 184

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                   + +  C  +C  K      SK+ + +   I S   +I   I   G V+  FT
Sbjct: 185 -------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFT 234

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY D   YKSGVYKH+   V+GGHAV LIG+G  + G +YW+ AN W  +WG  GYFKI 
Sbjct: 235 VYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIA 293

Query: 315 RGSNECGIEEDVVAGLP 331
           +G  E GIE  V AG P
Sbjct: 294 QG--EGGIENQVYAGEP 308


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 107/312 (34%), Positives = 151/312 (48%), Gaps = 18/312 (5%)

Query: 34  ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDK 91
           +++ SI + +N N    GW A+   +F  + + +  K  LG     + ++   PV+    
Sbjct: 142 LIEPSITEAINSNYANYGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVRRIYD 201

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 149
              LP+ FD+   WP    +S I DQG CGS WA       SDRF I       ++LS  
Sbjct: 202 PNSLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQ 259

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C        C+GGY   AW Y    G+V E+C PY      ++  C          
Sbjct: 260 HLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRIPRRGDLVT 314

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C     + R SK+    AYR+ ++  DIM EI  +GPV+ +  VY DF  YK G+Y+H
Sbjct: 315 ANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRH 373

Query: 270 I---TGDVMGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIE 323
               T D  G H+V+++GWG     E    YW +AN W   WG +GYF+I RGSNEC IE
Sbjct: 374 SPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 433

Query: 324 EDVVAGLPSSKN 335
             V+      +N
Sbjct: 434 SFVLGTWAEVEN 445


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 106/329 (32%), Positives = 157/329 (47%), Gaps = 49/329 (14%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 90
           +++  II+ VN     GWKAA   +    T+ +  ++ LG +   + ++    + +    
Sbjct: 164 LIEPDIIQAVNRG-NYGWKAANYSELYGMTLNEGIRYRLGTQRPSRTVMNMNEIQMNMDP 222

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 148
           ++  LP  F++   WP    I   LDQG+C + WAF      SDR  I     M   LS 
Sbjct: 223 QTDNLPPYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSP 280

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
            +L++C     G GC GG    AW Y    GVVTE+C PY           +P + TP  
Sbjct: 281 QNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTEDCYPY-----------QPPHQTPAE 328

Query: 209 VRKCVKKN-----------------QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           V +C+ ++                 Q + N  + S   YR++S+ ++IM EI  NGPV+ 
Sbjct: 329 VGRCMMQSRSVGRGKRQATQRCPNTQNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQA 388

Query: 252 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWGTSDD----GEDYWILAN 299
              V+EDF  YK+G+YKH              G H+V++ GWG   +       YWI AN
Sbjct: 389 IMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDGTSRKYWIAAN 448

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            W ++WG +GYF+I RG NEC IE  V+ 
Sbjct: 449 SWGKNWGENGYFRIVRGENECEIETFVIG 477


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 154/321 (47%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     +  GC  A       
Sbjct: 272 NLISCCS-KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATSNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSANKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 107/316 (33%), Positives = 158/316 (50%), Gaps = 24/316 (7%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A  +    
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330

Query: 210 RKCVKK---NQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
           ++   K   N + ++++ Y  S  YR++S+  +IM EI +NGPV+    V EDF HYK+G
Sbjct: 331 KRDATKPCPNNVEKSNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390

Query: 266 VYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKI 313
           +Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +GYF+I
Sbjct: 391 IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANFWGKSWGENGYFRI 450

Query: 314 KRGSNECGIEEDVVAG 329
            RG NE  IE+ V+A 
Sbjct: 451 LRGVNESDIEKLVIAA 466


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 106/315 (33%), Positives = 152/315 (48%), Gaps = 34/315 (10%)

Query: 24  VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 76
           + +    ++ +   S+I  +N N   GWKA    +F N T+ Q + +L G+      + T
Sbjct: 17  IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDT 76

Query: 77  PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           P+       + + +  + +P +FDAR+ W  C  +  I DQ  CG+CWAF A   L+ R 
Sbjct: 77  PR-------MASIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRL 127

Query: 137 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 194
           CI      N+ LS    + C        C GGY   +W +  + G   + C PY    G 
Sbjct: 128 CIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGRG- 184

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                   + +  C  +C  K      SK+ + +   I S   +I   I   G V+  FT
Sbjct: 185 -------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFT 234

Query: 255 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           VY D   YKSGVYKH+   V+GGHAV LIG+G  + G +YW+ AN W  +WG  GYFKI 
Sbjct: 235 VYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIA 293

Query: 315 RGSNECGIEEDVVAG 329
           +G  E GIE  V AG
Sbjct: 294 QG--EGGIENQVYAG 306


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 111/325 (34%), Positives = 146/325 (44%), Gaps = 73/325 (22%)

Query: 12  LCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVGQFKH 69
           L     A       ++ +LD   L D  I+++N +    WKA RN +   S Y + +   
Sbjct: 3   LAFIALAAVVSCTFAQPELD--FLSDEYIEQLN-SKNLPWKAGRNFERDTSLYNIQRLLS 59

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           +  + P  +       +   D    LP+ FDAR  W +C +I  I DQ  CGSCW     
Sbjct: 60  VGTINPPSEF----ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCW----- 110

Query: 130 EALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF 189
                                           GC   YP+               C+P  
Sbjct: 111 --------------------------------GC-MSYPLP-------------RCNP-- 122

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNG 247
                    C+  Y  P C ++C K + L +   KHY+  AYRI S  E  I  EI KNG
Sbjct: 123 --------SCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNG 174

Query: 248 PVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 306
           PV  SFTVY DF HY SGVYK      ++GGHAV++IGWG  +    YW+++N WN  WG
Sbjct: 175 PVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWG 234

Query: 307 ADGYFKIKRGSNECGIEEDVVAGLP 331
             G FKI RG NECGIEE++ AGLP
Sbjct: 235 DQGLFKIWRGKNECGIEEEITAGLP 259


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 106/299 (35%), Positives = 145/299 (48%), Gaps = 23/299 (7%)

Query: 50  GWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           GW A    QF   T+ +  ++ LG       ++    +  +  +  LP  F+A   WP  
Sbjct: 175 GWTAGNYSQFWGMTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDILPSHFNAAEKWP-- 232

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
             +   LDQG+C   WAF      SDR  I     M  SLS  +LL+C       GC GG
Sbjct: 233 GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGG 291

Query: 167 YPISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNS 222
               AW Y    GVV+E C P+   ++ G S P    +    +  R+      NQ + ++
Sbjct: 292 RVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSN 351

Query: 223 KHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GD 273
           + Y S  AYR+ S  +DIM E+Y+NGPV+    V+EDF  YKSG+Y+             
Sbjct: 352 EIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHR 411

Query: 274 VMGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
             G H+VK+ GWG     DG+   YW+ AN W R WG DGYF+I RG NEC IE  +V 
Sbjct: 412 RHGTHSVKITGWGEERGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 470


>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
          Length = 345

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 114/338 (33%), Positives = 169/338 (50%), Gaps = 59/338 (17%)

Query: 24  VVSKLKLDSHILQDSIIK------------EVNENPKAGWKAARNPQFSNYTVGQFKHLL 71
           +V+ L  + H ++  +I             E+ ENP    K+ ++ +     +G  K   
Sbjct: 18  LVNGLNFNKHPVRQEVIDRIKNSNVSWTPFEIEENPFKN-KSLQSMRNMGGNLGYIKEES 76

Query: 72  GVKPTPKGL--------------LLGVPVKTHDKSLK------LPKSFDARSAWPQCSTI 111
           G++   K L              L G  +   D+ L       LP +++ ++A+P C   
Sbjct: 77  GIQGNIKHLKSKFFQELKKMGHKLKGEHIHVQDEGLNPKLGASLPTAYNTKTAFPSCP-- 134

Query: 112 SRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPI 169
             ILDQ +CGSCWA  AV  L +RFCI  G  +N+  S  D+++C   L    C+GGY  
Sbjct: 135 HTILDQANCGSCWAHAAVTMLQNRFCIKSGGSINMQFSRQDMVSCD--LGNAACNGGYLS 192

Query: 170 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY--SI 227
           S+ +Y    GVV+E+C  Y  + G S          P+C  +C  K+  +   K Y    
Sbjct: 193 SSVQYLQTEGVVSEQCLAYASADGNS---------VPRCNYRCDDKSLEY---KKYGCKY 240

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--GGHAVKLIGW 285
           ++ +I +  EDI  EIY NGPV V F VY+DF+ Y +G+Y+ +T D +  GGHAV L GW
Sbjct: 241 NSMKILTTYEDIKEEIYTNGPVMVGFVVYDDFSSYSTGIYE-VTPDSVEEGGHAVTLNGW 299

Query: 286 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           G  D+G  YWI  NQW  +WG  G+F+I  G  E GI+
Sbjct: 300 GY-DNGRLYWIGQNQWQNTWGESGFFRIYAG--EAGID 334


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 107/275 (38%), Positives = 136/275 (49%), Gaps = 41/275 (14%)

Query: 87  KTHDKSLK--LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D + K  +PK FDAR  +  C+  I  + DQG+C S WA       +DR CI     
Sbjct: 54  KTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGK 113

Query: 144 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +L++C     GD    GCDGG    AW + +  G+VT       E C PY  
Sbjct: 114 FTDNLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPY-K 167

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSD 235
           +  C H G      C     T    C  KCV KN        L++ S  Y  S     ++
Sbjct: 168 NRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TN 223

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I  EI   GPV     VYE+F  YK GVYK   G+++G H VKLIGWG  + G +YW
Sbjct: 224 VKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYW 283

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           +  N WN +WG DG FKI RG N C IE  V+AGL
Sbjct: 284 LAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGL 318


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 111/321 (34%), Positives = 162/321 (50%), Gaps = 20/321 (6%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           +++  +I  VN N + GW A     F   T+ +   +  G     + +   +PVK   K 
Sbjct: 129 LVEPGVISAVNSNRELGWSATNYSMFWGKTLDEGITYKTGTLLPHRTVKRMMPVKVKSKG 188

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
            KLP SFDAR+ WP    IS   DQG CG+ WA       SDR+ I       + LS   
Sbjct: 189 -KLPNSFDARNKWP--GWISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQH 245

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCV 209
           LL+C       GC GG+   AW +    G+V + C P+  + T C  P   P +     +
Sbjct: 246 LLSCNKGQ--RGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPK-RPNFDALSSI 302

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
                 + L R+  +    AY+I  D +DIM EI ++GPV+ +  VY+DF  YKSGVY  
Sbjct: 303 CPPSLGSNL-RSELYRVGPAYKIQ-DEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVYTK 360

Query: 270 ITGDV----MGGHAVKLIGWGTSDD--GE--DYWILANQWNRSWGADGYFKIKRGSNECG 321
              +      G H+VK++GWG   +  G+   YW+ AN W + WG +G+FKI+RG+NEC 
Sbjct: 361 SNTERESSNFGYHSVKILGWGEETNIYGQPIKYWLAANSWGQQWGENGFFKIRRGTNECE 420

Query: 322 IEEDVVAGLPSSKNLVKEITS 342
           IEE V+A    + +  +EI +
Sbjct: 421 IEEFVLAAWAETNDPSREIIT 441


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/328 (32%), Positives = 155/328 (47%), Gaps = 40/328 (12%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           IL L      A+ +VS+ +L         I+ +N +    W AA   +F N T  +F+ +
Sbjct: 2   ILALLLAVVCAKPLVSRAELRR-------IQALNPS----WVAAMPKRFENVTEDEFRGM 50

Query: 71  LGVKP----TPKGLLLGVPVK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           L + P       G +   P+K  +D +  LP  FD R  +P C  +S + DQG CG CWA
Sbjct: 51  L-INPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGGCWA 107

Query: 126 FGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           F A+     R C   G++   +  S   L++C       GC GG     W +    G  T
Sbjct: 108 FSAIGMFGSRRCA-VGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWSFLTQTGATT 164

Query: 183 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RINSDPEDIMA 241
            EC  Y D        C    PT      C   +Q+    + Y    Y +++     IM 
Sbjct: 165 AECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQVSKSVPAIMQ 211

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQ 300
            +   GPV+    VY D  +Y  GVY+H  G +  G HA++++G+GT+DDG DYW + N 
Sbjct: 212 MLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNS 271

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVA 328
           W   WG DGYF+I RG NEC IE+++ A
Sbjct: 272 WGSDWGEDGYFRIVRGVNECRIEDEIYA 299


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/331 (32%), Positives = 162/331 (48%), Gaps = 53/331 (16%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 90
           +++  II  VN     GWKAA   QF   ++ +  ++ LG +   + ++    + +K   
Sbjct: 139 LIEADIIHAVNRG-NYGWKAANYSQFFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDP 197

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 148
           ++  LP+ F++   WP  + I   LDQG+C + WAF      SDR  I     M   LS 
Sbjct: 198 QNDHLPRYFNSSEKWP--NKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 255

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
            +L++C     G GC GG    AW Y    GVVTE C PY           +P    P  
Sbjct: 256 QNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTENCYPY-----------QPPQQAPAE 303

Query: 209 VRKCVKKNQL-----------------WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           V +C+ +++                  + N  + S   Y+++S+ ++IM EI +NGPV+ 
Sbjct: 304 VGRCMMQSRAVGRGKRQATQRCPNTYNYHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQA 363

Query: 252 SFTVYEDFAHYKSGVYKHITGDV----------MGGHAVKLIGWGTSDDGE----DYWIL 297
              V+EDF  YK+G+YKH   DV           G H+V++ GWG   D +     YWI 
Sbjct: 364 IMEVHEDFFVYKNGIYKHT--DVSSTKPPQYRKHGTHSVRITGWGEDKDYDGTPRKYWIA 421

Query: 298 ANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           AN W ++WG +G+F+I RG+NEC IE  V+ 
Sbjct: 422 ANSWGKNWGENGFFRIARGANECEIEAFVIG 452


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/319 (33%), Positives = 153/319 (47%), Gaps = 23/319 (7%)

Query: 31  DSHILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKT 88
           D+ +++   I+ +N N  + GW A  +  F    +     + LG     K +L   P+K 
Sbjct: 119 DACLVEPEAIQAINGNSAQFGWTAGNHSDFWGRKLEDGLVYRLGTLEPEKFVLAMHPIKQ 178

Query: 89  HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSL 146
                 LP SFD R  W    T+  + DQG CG+ WAF      +DR  I    +    L
Sbjct: 179 KYDRNTLPMSFDGRIEWR--DTLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPL 236

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---- 202
           S+ +LLAC       GC+GG+   AW Y    GVV EEC PY          C+      
Sbjct: 237 SMQNLLAC-NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRGN 295

Query: 203 YPTPKCV------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
             T KC       RK  + ++  R     S  AYRI    +DIM EI ++GPV+ +  V+
Sbjct: 296 LATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRVH 355

Query: 257 EDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGY 310
            DF  Y+ GVY++   +     G H+V+++GWG      +   YW++AN W R WG DGY
Sbjct: 356 PDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDGY 415

Query: 311 FKIKRGSNECGIEEDVVAG 329
           F+I RG NE  IE+ V+A 
Sbjct: 416 FRIVRGENESDIEKFVLAA 434


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 117/197 (59%), Gaps = 20/197 (10%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWA  +  A+SDR CI       + LS  D+LACC + CG GC+GG+P+ AW+YF   G
Sbjct: 1   SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEG 59

Query: 180 VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKH 224
           VVT         C PY +   C   G EP Y        TPKC + C +   + ++  KH
Sbjct: 60  VVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKH 118

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           +  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + GGHAVK+IG
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIG 178

Query: 285 WGTSDDGEDYWILANQW 301
           WG  + G  YW++AN W
Sbjct: 179 WG-KEXGTPYWLIANSW 194


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 157/329 (47%), Gaps = 49/329 (14%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 90
           +++  +I  VN     GW+AA   QF   T+ +  ++ LG +   K ++    + +    
Sbjct: 142 LIEPDVISAVNRG-NYGWRAANYSQFYGMTLDEGIRYRLGTQRPAKTIMNMNEIQMNMDP 200

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 148
           +  +LP  F++   WP    I   LDQG+C + WAF      SDR  I     M   LS 
Sbjct: 201 ERDQLPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 258

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
            +L++C     G GC GG    AW +    GVVTE+C PY            P   TP  
Sbjct: 259 QNLISCDTRNQG-GCTGGRIDGAWWFLRRRGVVTEDCYPY-----------RPPQQTPAE 306

Query: 209 VRKCVKKNQL-----------------WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           + +C+ +++                  ++N  + S   YR++++ ++IM EI  NGPV+ 
Sbjct: 307 LGRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQA 366

Query: 252 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWGTSD--DG--EDYWILAN 299
              V+EDF  YKSG+YKH              G H+VK+ GWG     DG    YWI AN
Sbjct: 367 IMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRKYWIAAN 426

Query: 300 QWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            W ++WG +GYF+I RG NEC IE  V+ 
Sbjct: 427 SWGKNWGEEGYFRIARGENECEIEAFVIG 455


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 105/321 (32%), Positives = 155/321 (48%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FK-HLLGVKPTPKGLLLGVPVKTHDK 91
           +++  +I++VN+    GW A    QF   T+   FK HL  + P+P  L +     +   
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFHLGTLPPSPMLLSMNEMTASLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P       ++ GC  A       
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLSKDQNATNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V EDF 
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385

Query: 261 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYK+G+Y+H+T           +  HAVKL GWGT        E +W+ AN W +SWG +
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWVAANSWGKSWGEN 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
 gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
          Length = 268

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/280 (38%), Positives = 149/280 (53%), Gaps = 26/280 (9%)

Query: 11  ILCLTCFAT-FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           +LC    AT F  G  S L    H L  S+I+++N +    WKA    +F   T+ + + 
Sbjct: 8   VLCFLLIATTFVCGQFSALDKPVHEL--SLIQKINSDSSIRWKATTYKKFEGMTLREARK 65

Query: 70  LLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            LG V  +P   +  +P K   K+LK    FDAR  W  C  I  I +Q  CGSCWAF A
Sbjct: 66  YLGTVIISP---INNLPKKKMPKNLKAASHFDAREKWEDC--IHEIRNQEECGSCWAFSA 120

Query: 129 VEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 186
            EA SDR CI  +  +N+ LS   +++C       GCDGGY  +AW +  + G+ ++EC 
Sbjct: 121 SEAFSDRLCIATNGSVNIVLSPQYMVSCDA--TDYGCDGGYLNNAWNFLANTGIPSDECV 178

Query: 187 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-NSDP-EDIMAEIY 244
           PY   +G  H         P C  K  KK Q   + K Y +S   I N D  EDI  +I 
Sbjct: 179 PY--QSGSGH--------VPSC-SKLNKKCQDGSDIKLYKVSKKSIANLDSIEDIQKDIQ 227

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           +NG ++  F+VY+DF  YKSGVY H+TG + GGHA+K+IG
Sbjct: 228 ENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGGHAIKVIG 267


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 157/319 (49%), Gaps = 29/319 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   ++  VN     GW+A+   QF   T+ +  ++ LG +KP    + +       D+
Sbjct: 192 LINGDMMDAVNRG-NYGWRASNYSQFWGMTLDEGIQYRLGTIKPPTSVMNMNELQMNMDE 250

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
           +  LP  F+A   W     I   LDQG+C   WAF      SDR  IH    M  +LS  
Sbjct: 251 NDVLPSYFNAADKW--SGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQ 308

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-----YP 204
           +LL+C       GC+GG    AW +    GVVT+EC P F +   +H    PA       
Sbjct: 309 NLLSC-NTRHQQGCNGGRIDGAWWFLRRRGVVTDECYP-FSNQETNHSPNAPACMMHSRS 366

Query: 205 TPKCVRKCVKKNQLWR---NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           T +  R+ + +    R   N  + S  AYR++S+ ++IM E+ +NGPV+    V+EDF  
Sbjct: 367 TGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFM 426

Query: 262 YKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DG--EDYWILANQWNRSWGADG 309
           Y++G+Y+H              G H+VK+ GWG     DG  + YWI AN W + WG  G
Sbjct: 427 YRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHG 486

Query: 310 YFKIKRGSNECGIEEDVVA 328
           YF+I RG NEC IE  VV 
Sbjct: 487 YFRITRGENECEIETFVVG 505


>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
          Length = 301

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 101/296 (34%), Positives = 151/296 (51%), Gaps = 35/296 (11%)

Query: 40  IKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH------DKSL 93
           +KE+ +   + W  A + +F N TV +F+  L     P   L  +  +TH       K+ 
Sbjct: 21  LKELQQLATS-WTPAIHDRFRNMTVDEFRARL----IPVENLRSLRTETHVSQLNLGKTK 75

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN---D 150
           +LPK +D R     C  +  + DQ  CGSCWAF AV   +DR C  +G++ S  V+    
Sbjct: 76  ELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATFADRRCA-YGLD-SKQVHYSEQ 131

Query: 151 LLACCGFLCGDG-CDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKC 208
            +  C F  GDG C+GG+  + W++    GV   +C  YF   TG              C
Sbjct: 132 YVVSCDF--GDGACNGGWLSNVWKFLTKTGVPKLDCLKYFSGMTG----------DRESC 179

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
           +  C   + +      + I+      D + +M  +  +GP++V+F VY DF +Y SGVY+
Sbjct: 180 ITHCTDGSPVELYQASHVIN---YGMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQ 236

Query: 269 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           H+ G + GGHAV+++G+G  + G  YWI+ N W   WG  GYF+I R  NECGIEE
Sbjct: 237 HVNGMMEGGHAVEMVGYGIDESGLKYWIIRNSWGPDWGEGGYFRIIRRVNECGIEE 292


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 103/303 (33%), Positives = 142/303 (46%), Gaps = 32/303 (10%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W A    QF   T+ + FK+ LG  P    LL    V      + LP+ F A   WP   
Sbjct: 121 WTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTAVPAIIDLPEFFVAYYKWP--G 178

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 167
                LDQ +C + WAF      +DR  I        +LS  +L++CC      GC  G 
Sbjct: 179 WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCSSGS 237

Query: 168 PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQL 218
              AW Y    G+V+  C P+      ++  C  A         + T  C     K N++
Sbjct: 238 IDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASRSDGRGKRHATKPCPNNIEKSNRI 297

Query: 219 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG------ 272
           ++ S       YR++S+  +IM EI  NGPV+    V+EDF HYKSG+Y+H+T       
Sbjct: 298 YQCS-----PPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSE 352

Query: 273 --DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 326
               +  HAVKL GWGT        E +WI+AN W  SWG +GYF+I RG NE  IE+ +
Sbjct: 353 KYQKLQTHAVKLTGWGTLRGAQGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLI 412

Query: 327 VAG 329
           +A 
Sbjct: 413 IAA 415


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 155/320 (48%), Gaps = 33/320 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I+ +N+    GW A    QF   T+ + F   LG + P+P  L +         
Sbjct: 155 LVRPELIEHINKG-DYGWTAENYSQFWGMTLEEGFTFRLGTLAPSPMLLSMNEVTAALPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
              LP+ F A   WP        LDQ +C + WAF      +DR  I       ++LS  
Sbjct: 214 KTDLPEFFIASYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHP----GCEP 201
           +L++CC      GC GG    AW Y    G+V+  C P F     + GC+      G   
Sbjct: 272 NLISCC-LKHRYGCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMASRSDGRGK 330

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
            + T  C     K N++++ S       YR++S+   IM EI KNGPV+    V+EDF +
Sbjct: 331 RHATTPCPNNIEKSNRIYQCS-----PPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFFY 385

Query: 262 YKSGVYKHITGDV--------MGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADG 309
           YK+G+Y+H+T  +        +  HAVKL GWGT        E +WI AN W +SWG +G
Sbjct: 386 YKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFWIAANSWGKSWGENG 445

Query: 310 YFKIKRGSNECGIEEDVVAG 329
           YF+I RG NE  IE+ ++A 
Sbjct: 446 YFRILRGVNESDIEKLIIAA 465


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 154/321 (47%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I+ VN     GW A    QF   T+ + +K  LG + P+P  L +     T   
Sbjct: 147 LVRPELIENVNTR-DYGWTAHNYSQFWGMTLEEGYKFRLGTLPPSPTLLSMNEMTVTLPS 205

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP+ F +   WP        LDQ +C + WAF      +DR  I      +  LS  
Sbjct: 206 QTDLPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQ 263

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC GG    AW Y    G+V+  C P F     ++ GC+ A       
Sbjct: 264 NLISCC-VKNRHGCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASRSDGRG 322

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF 
Sbjct: 323 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 377

Query: 261 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGAD 308
           HYKSG+Y+HI            +  HAVKL GWG         E +WI AN W +SWG +
Sbjct: 378 HYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKFWIAANSWGKSWGEN 437

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 438 GYFRILRGVNESDIEKLIIAA 458


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 107/328 (32%), Positives = 154/328 (46%), Gaps = 40/328 (12%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           IL L      A+ +VS+ +L         I+ +N      W AA   +F N T  +F+ +
Sbjct: 2   ILALLLAVVCAKPLVSRAELRR-------IQALNPP----WVAAMPKRFENVTEDEFRGM 50

Query: 71  LGVKP----TPKGLLLGVPVK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           L + P       G +   P+K  +D +  LP  FD R  +P C  +S + DQG CG CWA
Sbjct: 51  L-INPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGGCWA 107

Query: 126 FGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           F A+     R C   G++   +  S   L++C       GC GG     W +    G  T
Sbjct: 108 FSAIGMFGSRRCA-VGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWSFLTQTGATT 164

Query: 183 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RINSDPEDIMA 241
            EC  Y D        C    PT      C   +Q+    + Y    Y +++     IM 
Sbjct: 165 AECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQLSKSVPAIMQ 211

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQ 300
            +   GPV+    VY D  +Y  GVY+H  G +  G HA++++G+GT+DDG DYW + N 
Sbjct: 212 MLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNS 271

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVA 328
           W   WG DGYF+I RG NEC IE+++ A
Sbjct: 272 WGSDWGEDGYFRIVRGVNECRIEDEIYA 299


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 103/294 (35%), Positives = 141/294 (47%), Gaps = 20/294 (6%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
           W A    QF   T+ +  ++ LG       ++    +  +  +  LP  F+A   WP   
Sbjct: 191 WTAGNYSQFWGMTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDILPSHFNAAEKWP--G 248

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY 167
            +   LDQG+C   WAF      SDR  I     M  SLS  +LL+C       GC GG 
Sbjct: 249 LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGR 307

Query: 168 PISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSK 223
              AW Y    GVV+E C P+   ++ G S P    +    +  R+      NQ + +++
Sbjct: 308 VDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNE 367

Query: 224 HY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDV 274
            Y S  AYR+ S  +DIM E+Y+NGPV+    V+EDF  YKSG+Y+H             
Sbjct: 368 IYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRR 427

Query: 275 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
            G H+VK+ G G       YW+ AN W R WG DGYF+I RG NEC IE  +V 
Sbjct: 428 HGTHSVKITG-GRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 480


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 106/272 (38%), Positives = 136/272 (50%), Gaps = 37/272 (13%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 151
           +P SFDAR A+ +C   I  + DQ  C SCWA   V+A S R CI  G   N  LS  +L
Sbjct: 83  IPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGEL 142

Query: 152 LACCGFL--C-GDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCS 195
           LACC     C   GC GG    AW +   HG+ T             + C PY +   C+
Sbjct: 143 LACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPY-NFPRCA 201

Query: 196 H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISA--YRINSDPEDIMAEI 243
           H        P  + +Y TP C+ +C   K        +H++  A  Y  N     I  EI
Sbjct: 202 HYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG-IRSIKKEI 260

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
            K+GP   SF  YEDF  YKSGVYK+ +G  +  H V+LIGWGT + G DYW+  N WN 
Sbjct: 261 MKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGT-EKGVDYWLAKNDWNE 319

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 335
            W   G FKI +G  +CGI  D+V G P++ N
Sbjct: 320 EWADLGTFKIAQG--DCGI-NDLVLGAPAALN 348


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 131/203 (64%), Gaps = 14/203 (6%)

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF----- 189
           +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY      
Sbjct: 2   VNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCE 61

Query: 190 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                S P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGP
Sbjct: 62  HHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 121

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
           VE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +
Sbjct: 122 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 180

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           G+FKI RG + CGIE ++VAG+P
Sbjct: 181 GFFKILRGQDHCGIESEIVAGMP 203


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 139/297 (46%), Gaps = 32/297 (10%)

Query: 70  LLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           L G      GL    P V   + S+ +P S+++  A+ +C     IL QG CGSCWAF  
Sbjct: 68  LSGSSEENIGLCASTPSVANLNTSMPIPDSYNSHEAYSKCK--PDILQQGSCGSCWAFAT 125

Query: 129 VEALSDRFCI---HFGMNLSLSVNDLLACCGFLC----GD-------------GCDGGYP 168
              L+ R CI     G    L+   L++C   +C    GD             GCDGGYP
Sbjct: 126 TGVLAQRMCIKSEQIGQGYELAPQALVSCTDQICYTKAGDRCSSPSSTCYCSLGCDGGYP 185

Query: 169 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 228
             A+R+    G+  E C  Y    G     C         V +C   +    N       
Sbjct: 186 DGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECTATSNATVNGDR---C 239

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HITGDVMGGHAVKLIGWG 286
            Y  +SD E I  +I ++GPV  S+ V+EDF  Y SGVY       D +G HAV ++GWG
Sbjct: 240 YYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWG 299

Query: 287 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 343
             +D   YW++ N W   +G DGYFKI RG+NEC IE  +V  L +++ +V   TS 
Sbjct: 300 V-EDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSLVNTEGVVFASTSG 355


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 155/308 (50%), Gaps = 23/308 (7%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTV--GQFKHLLGVKPTPKGLLLGVPVKTHDK 91
           + +  +I EVN  P   W+A    +F+  T+  G    L  + P+     +    + +D 
Sbjct: 139 LQEPDLIDEVNAMP-LNWRARNYSEFNGRTLKDGMRLRLGTLNPSRSVYRMNAVRRIYDP 197

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 149
              LP+ FD+R+ WP+   IS+I DQG CG+ WA  + +  SDRF I       + LS  
Sbjct: 198 E-SLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSAQ 254

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C       GC GG+   AW +    G+V E C P+  ST      C     T    
Sbjct: 255 HLLSC-NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKAST----ETCRLRKRTDLRS 309

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C       R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SGVYKH
Sbjct: 310 AGCAPPPNPLRTELYKVGPAYRL-ANETDIMQEILTSGPVQATMRVYQDFFSYESGVYKH 368

Query: 270 -ITGDVMGG--HAVKLIGWG------TSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            +T ++     H+V++IGWG      + +    YW++AN W + WG +G F+I++G+NEC
Sbjct: 369 SVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQKGTNEC 428

Query: 321 GIEEDVVA 328
            IE  V+ 
Sbjct: 429 EIESFVLG 436


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 100/280 (35%), Positives = 144/280 (51%), Gaps = 27/280 (9%)

Query: 58  QFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAWPQCSTIS 112
           +F N T  +F+ +L ++P       G L  + + +  +    +P  FD R  +PQC  + 
Sbjct: 4   RFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEYPQC--VK 60

Query: 113 RILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPI 169
             LDQG CG CWAF A+    DR C   G++   +S S   L++C   L   GCDGG   
Sbjct: 61  PALDQGSCGECWAFSAIGVFGDRRCA-MGIDKEAVSYSQQHLISCS--LENFGCDGGDFQ 117

Query: 170 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 229
             W +    G  T EC  Y D       G   A P P          QL++   +  +S 
Sbjct: 118 PTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS- 169

Query: 230 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTS 288
               S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++++G+GT+
Sbjct: 170 ---KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTT 225

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 226 DDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 265


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 155/322 (48%), Gaps = 37/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I++VN+    GW A    QF   T+   FK  LG  P P  +LL +   T    
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTXPLP 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
            +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGE 443

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 444 NGYFRILRGVNESDIEKLIIAA 465


>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
 gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
          Length = 257

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 90/238 (37%), Positives = 122/238 (51%), Gaps = 18/238 (7%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN-DLL 152
            +P+SFDAR+ WP C  I  IL+Q  CGSCWAF A E LSDR CI       + ++   L
Sbjct: 30  SIPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQAL 87

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 212
             C      GC+GG P  AW Y   HG+ T  C PY    G              CV+  
Sbjct: 88  VSCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKNS 137

Query: 213 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
              N+ +   +   ++  +  +  E I  +I K GP++ +  VY DF  Y SGVY    G
Sbjct: 138 CVDNEQYTLYRAKPLT-LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPG 196

Query: 273 -DVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
             ++GGHA+K++GWG      ++YWI+AN W  SWG DG+F I    ++CGI  D  A
Sbjct: 197 SSLLGGHAIKIVGWGFDQASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACA 252


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 155/322 (48%), Gaps = 37/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I++VN+    GW A    QF   T+   FK  LG  P P  +LL +   T    
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTAPLP 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
            +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGE 443

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 444 NGYFRILRGVNESDIEKLIIAA 465


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 167/366 (45%), Gaps = 48/366 (13%)

Query: 3   PTKLIMDPILCLTCFATFAEGVVSKLKLDS------------HI--LQDSIIKEVNENPK 48
           P K  +DP  C      + EG V K   +S            H+  +   +I+ +N+   
Sbjct: 109 PLKQPLDPEGCSRNSQHYEEGSVVKENCNSCTCSGRQWNCSQHVCLVHPELIEHINKG-D 167

Query: 49  AGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWP 106
            GW A    QF   T+ + FK  LG + P+P  L +     T      LP+ F +   WP
Sbjct: 168 YGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPTLLSMNEMTATFPARADLPEVFISSYKWP 227

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 164
                   LDQ +C + WAF      +DR  I        +LS  +L++CC      GC+
Sbjct: 228 --GWTHGPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKK-RHGCN 284

Query: 165 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKK 215
            G    AW +    G+V+  C P F     ++  C  A         + T  C     K 
Sbjct: 285 SGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKS 344

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-- 273
           N++++ S       YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+     
Sbjct: 345 NRIYQCS-----PPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNE 399

Query: 274 ------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
                  +  HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE
Sbjct: 400 ESEKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 324 EDVVAG 329
           + ++A 
Sbjct: 460 KLIIAA 465


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/329 (32%), Positives = 161/329 (48%), Gaps = 31/329 (9%)

Query: 20  FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKG 79
           FA  V+  + + + +    ++  +N+N    +KA  NP    Y  G+        P  K 
Sbjct: 4   FATLVLFLIPVAASLSGQELVDYINKN--GLFKAVYNPSAGAYHFGRIN-----DPLRKS 56

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTI-SRILDQGHCGSCWAFGAVEALSDRFCI 138
            L       +D S ++P+SFDA   WP+C+ + + I DQ +CGSCWA  +   +SDR C+
Sbjct: 57  TLKKRTEADYDLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICV 116

Query: 139 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS 191
                + +S++ + A    + GDGC+GG    A+  F+ +G  T       + C PY   
Sbjct: 117 ATNGKVKVSISGI-ATASCVGGDGCNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPY-PF 174

Query: 192 TGCSH-------PGCE--PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMA 241
             C+H       P C+  P Y    C  +C K  ++ +    +Y    Y   SD   I  
Sbjct: 175 KHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-SDEAPIQR 233

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYWILANQ 300
           EI  NGPV VSFTVYE F +Y  G+Y+   G+ + G HAV+++GWG  ++G  YW +AN 
Sbjct: 234 EIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGV-ENGTKYWKIANS 292

Query: 301 WNRSWGADGYF-KIKRGSNECGIEEDVVA 328
           WN  WG +        G +E  IE+  VA
Sbjct: 293 WNEQWGRERLLPHTPAGVDESDIEDGGVA 321


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/317 (34%), Positives = 152/317 (47%), Gaps = 37/317 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH--DKSLKL 95
           +I+ +N+    GW A    QF   T+ + FK  LG  P P   LLG+   T      + L
Sbjct: 160 LIEHINKG-DYGWTAQNYSQFWGMTLEEGFKFRLGTLP-PSPALLGMNEVTAALPAKIDL 217

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 153
           P+ F A   WP        LDQ +C + WAF      +DR  I        +LS  +L++
Sbjct: 218 PEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNLIS 275

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YP 204
           CC      GC GG    AW Y    G+V+  C P F     ++ GC  A         + 
Sbjct: 276 CCARK-RHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATN-GCAMASRSDGRGKRHA 333

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           T  C     K N++++ S       YR++S+   IM EI +NGPV+    V+EDF  YK+
Sbjct: 334 TTPCPNHIEKSNRIYQCS-----PPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKT 388

Query: 265 GVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFK 312
           G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG +GYFK
Sbjct: 389 GIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGENGYFK 448

Query: 313 IKRGSNECGIEEDVVAG 329
           I RG NE  IE+ ++A 
Sbjct: 449 ILRGVNESDIEKLIIAA 465


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 155/344 (45%), Gaps = 64/344 (18%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV------------KPTPKGLL 81
           +  D  I+ +N+  ++ WKA  + QF   T  + K + G             K   K   
Sbjct: 103 VNNDRYIQALNK-AQSTWKATAHKQFEGMTFAELKRITGSYRRSYQKTRNLKKQQAKLRA 161

Query: 82  LGVPVKT----------HDKSLKLPKSFDARSAWPQCST---ISRILDQGHCGSCWAFGA 128
           +     T             + KL  S      W   +    +  + +Q  CGSC+AF +
Sbjct: 162 MNADKVTLFNGKTGQFESQDAEKLRASLPTEFDWTNVNGRDFVVPVRNQEQCGSCYAFSS 221

Query: 129 VEALSDRFCIHFGMNLS----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 184
            +    R  +    NL+     S  D++ C  +    GCDGG+P    +Y + +G+  E 
Sbjct: 222 SDMFGSR--VRIPSNLTQVPVYSPQDIVDCSAY--SQGCDGGFPFLVGKYAMDYGLTVES 277

Query: 185 CDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEI 243
           CDPY              +   KC  +C V + Q   +S +Y +  Y  NS    +M EI
Sbjct: 278 CDPY------------QGHDLGKCSNQCPVNRQQRLHSSNYYFVGGYYGNSHELSMMHEI 325

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG----------------HAVKLIGWGT 287
           Y+NGP+ + F VY D  +YK GVYKH+T + +                  HAV ++GWG 
Sbjct: 326 YQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLMVGWGV 385

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            ++G  YW + N W+ +WG +GYFKI RGS+ECG+E D  AG+P
Sbjct: 386 -ENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)

Query: 195 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F
Sbjct: 8   SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI
Sbjct: 68  SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 126

Query: 314 KRGSNECGIEEDVVAGLPSSKNLVKEI 340
            RG + CGIE +VVAG+P +    ++I
Sbjct: 127 LRGQDHCGIESEVVAGIPRTDQYWEKI 153


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/339 (31%), Positives = 161/339 (47%), Gaps = 32/339 (9%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHIL--QDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-F 67
           + C  C         ++L+ ++ +   +  +I+++NE    GW+A     F    +    
Sbjct: 115 VDCNRCTCQKVSEREARLQCENRVCINRPELIRQINEG-NFGWQATNYSIFYGKLLEDGI 173

Query: 68  KHLLGV----KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 123
           ++ LG     +PT +   L +      K  +LP+ FDAR  W     +  + DQG C + 
Sbjct: 174 RYRLGTHQPERPTAEMNELHL-----KKREQLPEEFDARIRWS--GLVHGVRDQGDCANS 226

Query: 124 WAFGAVEALSDRFCIH-FGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           WAF      SDR  I   G++ + LS  DL++C        C GG+P   WR+ +++G V
Sbjct: 227 WAFSTAAVASDRLSIQSRGVDKVELSPQDLMSCLNGGRRVVCQGGHPDRGWRFLLNYGGV 286

Query: 182 TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 240
           +EEC PY      ++  C  P    P    +C          KH+S   YR+ ++ EDIM
Sbjct: 287 SEECYPYEGVHSSANATCRIPRRRDPIEDARCPTGRT---EQKHFSTPPYRVPANEEDIM 343

Query: 241 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWGTSDDGE 292
            EIY NGPV+    V EDF  Y+SGVY+H              G H+V+++GWG      
Sbjct: 344 QEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQY 403

Query: 293 ---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
               YW+ AN W   WG +GYF+I RG +E  IE  V+A
Sbjct: 404 RPIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLA 442


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 153/321 (47%), Gaps = 34/321 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           +++  +I+ VN+    GW A    QF   T+ +  K  LG + P+P  L +     +   
Sbjct: 155 LVRPELIEYVNKG-DYGWTAKNYSQFWGMTLEEGLKFRLGTLPPSPMLLSMNEVTPSLPA 213

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS  
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQ 271

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 202
           +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A       
Sbjct: 272 NLISCCT-KNRHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGCAMASRSDGRG 330

Query: 203 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
             + T  C     K N +++ S       YR++S+  +IM EI +NGPV+    V+EDF 
Sbjct: 331 KRHATKPCPNNIEKSNVIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 385

Query: 261 HYKSGVYKHITG--------DVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWGAD 308
           HYK+G+Y+H+            +  HAVKL GWG         E +W+ AN W +SWG D
Sbjct: 386 HYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGED 445

Query: 309 GYFKIKRGSNECGIEEDVVAG 329
           GYF+I RG NE  IE+ ++A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 154/322 (47%), Gaps = 37/322 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           +++  +I+ VN+    GW A    QF   T+   FK  LG  P P  +LL +   T    
Sbjct: 155 LVRPELIEHVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTAPLP 212

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 148
            +  LP+ F A   WP        LDQ +C + WAF      +DR  I        +LS 
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 202
            +L++CC      GC+ G    AW Y    G+V+  C P F     ++ GC  A      
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328

Query: 203 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
              + T  C     K N++++ S       YR++S   +IM EI +NGPV+    V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383

Query: 260 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGA 307
            HYK+G+Y+H+T           +  HAVKL GWGT        E +WI AN W +SWG 
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGE 443

Query: 308 DGYFKIKRGSNECGIEEDVVAG 329
           +GYF+I RG NE  IE+ ++A 
Sbjct: 444 NGYFRILRGVNESDIEKLIIAA 465


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 160/348 (45%), Gaps = 39/348 (11%)

Query: 10  PILCLTCFATFAEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQ- 66
           P  C      + EG V K   +S         I++VN+    GW A    QF   T+   
Sbjct: 117 PEGCFKDGQHYEEGSVIKENCNSXXXXXXXXXIEQVNKG-DYGWTAQNYSQFWGMTLEDG 175

Query: 67  FKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           FK  LG  P P  +LL +   T     +  LP+ F A   WP        LDQ +C + W
Sbjct: 176 FKFRLGTLP-PSPMLLSMNEMTAPLPATTDLPEFFVASYKWP--GWTHGPLDQKNCAASW 232

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           AF      +DR  I        +LS  +L++CC      GC+ G    AW Y    G+V+
Sbjct: 233 AFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVS 291

Query: 183 EECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 233
             C P F     ++ GC  A         + T  C     K N++++ S       YR++
Sbjct: 292 HACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVS 345

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKLIGW 285
           S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  HAVKL GW
Sbjct: 346 SSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGW 405

Query: 286 GT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
           GT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 406 GTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/328 (32%), Positives = 156/328 (47%), Gaps = 20/328 (6%)

Query: 13  CLTCFATFAEGVVSKLKLDSHILQD-SIIKEVNENPKAGWKAARNPQF--SNYTVGQFKH 69
           C TC  T    +   L   +  LQ+ S+I EVN      W+A    +F     + G    
Sbjct: 113 CNTCKCTAVSRLAEVLCEQNRCLQEQSLIDEVNSISSLNWRARNYSEFWGKRLSEGVKLR 172

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           L  + P+     +    + +D    LP+ FDAR+ W +   IS + DQG CG+ WA    
Sbjct: 173 LGTLNPSNSVYRMNSVRRVYDPE-SLPREFDARTRWRR--QISGVDDQGWCGASWAISTA 229

Query: 130 EALSDRFCIHF-GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
           +  SDRF +   G +  L     L  C      GCDGGY   AW +    G+V E+C P+
Sbjct: 230 QVASDRFAVMSKGTDSVLLSAQHLLSCNKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPW 289

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                  +  C+    T      C       R   +    AYR+ ++  DIM EI  +GP
Sbjct: 290 KGV----YEQCKLQKRTNLEAAGCRAPANPLRKELYKVGPAYRLGNE-TDIMREILTSGP 344

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWG---TSDDGE--DYWILANQ 300
           V+ +  VY+DF  Y+SG+Y H     +   G H+V++IGWG   ++D G    YW++ N 
Sbjct: 345 VQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNS 404

Query: 301 WNRSWGADGYFKIKRGSNECGIEEDVVA 328
           W + WG +G F+I+RG NEC IE  VVA
Sbjct: 405 WGQEWGENGLFRIRRGINECDIESFVVA 432


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 102/307 (33%), Positives = 145/307 (47%), Gaps = 19/307 (6%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           ++   II E+N     GW A    +F   T     K  LG     +     +PV  H   
Sbjct: 172 LMDQEIINEINYLESPGWIARNYSKFWGRTFDDGLKLRLGTINPSQSTRQMLPVTRHYNP 231

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
             LP+ FD+R  W   + I+ + DQG CG+ WA   V+  SDRF I       + LS   
Sbjct: 232 NDLPREFDSRIQWG--NDITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQH 289

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           L++C       GC GGY   AW +    GVV E+C P+          C           
Sbjct: 290 LISC-NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSG---RSDKCRIPRRGKLSDA 345

Query: 211 KCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
            C ++N     ++ Y +  AYR+ ++  DIM EI  +GPV+ +  V+ DF HY+SG+Y H
Sbjct: 346 GCQRRNSYNLRNEMYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVHRDFFHYESGIYVH 404

Query: 270 ---ITGDVMGGHAVKLIGWGTSDDGED-----YWILANQWNRSWGADGYFKIKRGSNECG 321
                    G H+V+++GWG      +     +W +AN W R WG DGYF+I RG+NEC 
Sbjct: 405 SRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECE 464

Query: 322 IEEDVVA 328
           IE  V+ 
Sbjct: 465 IESFVLG 471


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 99/306 (32%), Positives = 153/306 (50%), Gaps = 19/306 (6%)

Query: 34  ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 91
           +++  +++E+N + P  GW+A+   +F   T+ +   L LG     + +    PV+    
Sbjct: 198 LIESELMEELNLQGPTLGWQASNYSEFWGRTLLEGVELRLGTLNPSQSVYKMNPVRRIYD 257

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP+ FD+R+ W +   IS + DQG CG+ WA    +  +DRF I      +  LS  
Sbjct: 258 PDALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGAEDAELSAQ 315

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C       GC GGY   AW +    G+V ++C P+    G     C+         
Sbjct: 316 HLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNG----QCKLRKRNNLQA 370

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C K     R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  YK+G+Y+H
Sbjct: 371 AGCRKPPNPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYKNGIYRH 429

Query: 270 ITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGI 322
                +   G H+V++IGWG           YW++ N W  +WG +G FKI+RG+NEC I
Sbjct: 430 SQSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVVNSWGYNWGENGLFKIQRGTNECEI 489

Query: 323 EEDVVA 328
           E  V+A
Sbjct: 490 ESYVLA 495


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 152/323 (47%), Gaps = 25/323 (7%)

Query: 31  DSHILQDSIIKEVNENPKA-GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           D  +  + ++K++N   ++ GWKA    ++    Y  G+   L    P  K   +     
Sbjct: 121 DVCLTDNELLKQLNHLERSIGWKATNYSEWWGHKYDEGKVMRLGTFYPKIKVKSMSRLTN 180

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 145
             D    LP  FDA + WP    I ++ DQG CGS WA       SDRF I       + 
Sbjct: 181 GLDH---LPTHFDATNYWP--GFIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQ 235

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           L+   +++C       GC GG+  +AW Y    G V EEC PY  +    H  C+     
Sbjct: 236 LAPQQIVSCVRR--SQGCSGGHLDTAWSYLRKVGTVNEECYPYISA----HNVCKIRPSD 289

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
                 C    ++ R + +    A+ +N++  DIM EI K+GPV+    V+ DF  YKSG
Sbjct: 290 TLITANCELPMKVDRTNMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFSYKSG 348

Query: 266 VYKHITGDV-----MGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGS 317
           +Y+H           G H+V+LIGWG    G +   YWI  N W   WG +G F+I RGS
Sbjct: 349 IYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTWWGENGRFRILRGS 408

Query: 318 NECGIEEDVVAGLPSSKNLVKEI 340
           NEC IE  V+A LP     VK++
Sbjct: 409 NECEIESYVLASLPYVHQQVKDL 431


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 108/323 (33%), Positives = 158/323 (48%), Gaps = 29/323 (8%)

Query: 31  DSHI--LQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPV 86
           + HI  L   +I+ VN NPK GWKA  N +F  S      F+  + ++      +  +  
Sbjct: 23  NEHIEPLFGKLIEYVNRNPKFGWKAGTNHRFRSSKDIEKMFRKYIEIENIQTKHIKTI-- 80

Query: 87  KTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
            +H+  ++++P+SFDAR  W  CSTI +I D+  C + WA   V+++SDR CI     ++
Sbjct: 81  -SHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRIS 139

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 196
           + LS  D ++ CGF    GC  G  +    Y++ +G+VT         C PY       H
Sbjct: 140 VQLSARDAIS-CGF--SPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYH 196

Query: 197 PGCE------PAYPTPKCVRKCVK-KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
           P           +  P+C  +C    N+ + + K Y    Y +    EDI  EI  NGPV
Sbjct: 197 PESRFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPV 256

Query: 250 EVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
             S +V  DF  YKSGVY        +G   +++IGWG  +    YW+ AN WN  WG +
Sbjct: 257 IASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGY-EGKIPYWLCANSWNEEWGDN 315

Query: 309 GYFKIKRGSNECGIEEDVVAGLP 331
           GY KI+RG     IE  V A +P
Sbjct: 316 GYVKIQRGVQAGYIESYVRAPIP 338


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 107/330 (32%), Positives = 161/330 (48%), Gaps = 20/330 (6%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSH-ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQFK 68
           + C TC  T  +     L  ++  +++  +++EVN+  P  GW+     +F   T+    
Sbjct: 116 VNCNTCKCTLVDKRAEVLCEENRCLIEPELLEEVNQQEPILGWQVGNYSEFWGRTLRDGV 175

Query: 69  HL-LGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 127
            L LG     + +    PVK       LP+ FD+R+ W +   IS I DQG CG+ WA  
Sbjct: 176 ELRLGTLNPSQSVYKMNPVKRIYDPDALPREFDSRTRWSR--DISGIHDQGWCGASWAVS 233

Query: 128 AVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 185
             +  SDR+ I         LS   LL+C       GC GGY   AW +    G+V +EC
Sbjct: 234 TADVASDRYSIMSKGAEAPELSAQQLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKEC 292

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
            P+       +  C+    +      C K +   R   +    AYR+ ++  DIM EI  
Sbjct: 293 YPWSGK----NDQCKLRKRSTLKAAGCRKPSHPLRTELYKVGPAYRLGNE-TDIMQEILT 347

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILA 298
           +GPV+ +  VY+DF  YKSG+Y+H     +   G H+V++IGWG           YW++A
Sbjct: 348 SGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVA 407

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           N W  +WG +G FKI++G+NEC IE  V+A
Sbjct: 408 NSWGYNWGDNGLFKIQKGTNECEIESYVLA 437


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 102/306 (33%), Positives = 154/306 (50%), Gaps = 19/306 (6%)

Query: 34  ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 91
           +++  +++E++ + P  GW+A    +F   T+     L LG     + +    PV+    
Sbjct: 140 LIEPELMEEIHLQGPTLGWQAGNYSEFWGRTLKDGVQLRLGTLNPSQSVYKMNPVRRIYD 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP+ F++R+ WP+   IS I DQG CG+ WA    +  SDRF I       + LS  
Sbjct: 200 PDALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
            LL+C       GC GGY   AW +    G+V EEC P+   TG  +  C     +    
Sbjct: 258 HLLSC-NNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPW---TG-RNDQCRLRKRSNLKT 312

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
             C       R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SGVY+H
Sbjct: 313 AGCQNPPNSLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYQSGVYRH 371

Query: 270 ITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGI 322
                +   G H+V++IGWG           YW++AN W  +WG +G F+I++G+NEC I
Sbjct: 372 SRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQKGTNECEI 431

Query: 323 EEDVVA 328
           E  V+A
Sbjct: 432 ESYVLA 437


>gi|290973645|ref|XP_002669558.1| predicted protein [Naegleria gruberi]
 gi|284083107|gb|EFC36814.1| predicted protein [Naegleria gruberi]
          Length = 343

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 108/332 (32%), Positives = 159/332 (47%), Gaps = 52/332 (15%)

Query: 23  GVVSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-------HLLG 72
            +V+  ++ SH   I    +I  +N NPK+ WKA    +F+N TVG+FK       H   
Sbjct: 4   AIVAMGEMASHHEPIHDHHVIHSINNNPKSSWKAKVYEKFANMTVGEFKQKYLGAIHEEA 63

Query: 73  VKPTPKG---LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF--- 126
           + P+ K    ++ G P      +   P +FD+R  WPQC  +  + +Q  CGSCWAF   
Sbjct: 64  ITPSSKSRFSIVTGPPT-----AYTPPTNFDSRQKWPQC--VHTVRNQLDCGSCWAFWIE 116

Query: 127 -----GAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
                 A + LSDRFCI  +  +N+ +S    + C   +   GC GG     W +  + G
Sbjct: 117 FNDLVSATKVLSDRFCIASNGSVNVIMSPQYQIDCN--MDNLGCSGGSLPKTWNFLTNVG 174

Query: 180 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 239
            V+E+C PY ++                C  KCV      +    Y   +Y      + I
Sbjct: 175 SVSEQCRPYKNND------------DDDCPSKCVDG----KAPSFYKAKSYASIKGLDSI 218

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILA 298
           M EI   GPV  S TVY+D   Y+SGVY H+TG+ +GGHA+ +IG+G  S   + YWI+A
Sbjct: 219 MYEIQNYGPVHASLTVYKDLMSYQSGVYSHLTGNEIGGHAIVIIGFGMDSLSKKPYWIIA 278

Query: 299 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           N W  +      +KI   SN   + +D+  G 
Sbjct: 279 NSWGENGSIPTSYKI---SNAPRLRDDLHDGF 307


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 107/326 (32%), Positives = 151/326 (46%), Gaps = 43/326 (13%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N     GW A  +  F   T+ +  ++ LG V+PT   + +         
Sbjct: 140 LVNPDLIDAINRG-NYGWTAGNHSVFWGMTLDEGIRYRLGTVRPTSSVMNMNEIQMVMSP 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F A + WP    I   LDQG+C   WAF      SDR  IH    M+ +LS  
Sbjct: 199 DETLPSAFSASNKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 204
           +LL+C       GC GG    AW +    G+V+  C P+ +     H G  PA P     
Sbjct: 257 NLLSC-NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEG---DHNGAAPAAPCMMHS 312

Query: 205 ----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 254
                     T  C       N +++     +   YR++S  +DIM E+ +NGPV+    
Sbjct: 313 RHMGRGKRQATAHCPNSRTHANHIYQ-----ATPPYRLSSHEKDIMKELMENGPVQALLE 367

Query: 255 VYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWN 302
           V+EDF  YKSG+YKH    +         G H+VK+ GWG     DG+   YW  AN W 
Sbjct: 368 VHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYWTAANSWG 427

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVA 328
            +WG +GYF+I RG+NEC IE  VV 
Sbjct: 428 PTWGENGYFRIVRGANECDIESFVVG 453


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 148/323 (45%), Gaps = 46/323 (14%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           ++Q+ I+K VN   +  W A     F   T+    ++ LG     K +     +    K 
Sbjct: 125 LIQEDILKRVNAG-RYTWSARNYSNFWGRTLEDGMRYRLGTLFPDKSVQNMNEILM--KP 181

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 150
            +LP SFDAR  WP    I  + DQG C S W+       +DR  I     +N+ LS   
Sbjct: 182 RELPSSFDAREKWPL--YIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQ 239

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           LL+C       GC+GGY   AW Y    GVV+E C PY +S     PG            
Sbjct: 240 LLSCNQHR-QRGCEGGYLDRAWWYIRKLGVVSELCYPY-ESGATQQPG------------ 285

Query: 211 KCVKKNQLWRNSKH------------YSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           +C      +R   H            Y ++  YR++S  +DIM EI  NGPV+ +F VYE
Sbjct: 286 ECRIPKSAYRTGAHIDCPSGAADPSVYRMTPPYRVSSREQDIMTEIITNGPVQATFLVYE 345

Query: 258 DFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWG 306
           DF  Y  GVY+H+           V G H+V++IGWG   ++     YW+ AN W   WG
Sbjct: 346 DFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEWG 405

Query: 307 ADGYFKIKRGSNECGIEEDVVAG 329
            DG F+I RG N C IE  V+  
Sbjct: 406 EDGLFRILRGENHCEIESFVIGA 428


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 153/324 (47%), Gaps = 40/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK-PTPKGLLLGVPVKTHDK 91
           +++D +I+E+N     GW+AA   QF   T+ +  +  LG K PT   + +       + 
Sbjct: 138 LIEDDMIQEINRR-DYGWRAANYSQFWGMTLDEGLRFRLGTKRPTRTIMNMNEMQMNMNG 196

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
           +  LP  F+A   WP    I   LDQG+C + WAF      SDR  I     M   LS  
Sbjct: 197 NDHLPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQ 254

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +L++C      DGC GG    AW +    GVVT++C P+        P  + A    +C+
Sbjct: 255 NLISC-DTRHQDGCAGGRIDGAWWFMRRRGVVTQDCYPF-------SPPEQSAVEVARCM 306

Query: 210 RKC-------------VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
            +                 +  + N  + S   YR++++  +IM EI  NGPV+    V+
Sbjct: 307 MQSRAVGRGKRQATAHCPNSHSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVH 366

Query: 257 EDFAHYKSGVYKHITGDVM--------GGHAVKLIGWGTSDD----GEDYWILANQWNRS 304
           EDF  YKSG+++H   +            H+V++ GWG   D       YWI AN W ++
Sbjct: 367 EDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWIGANSWGKN 426

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG DGYF+I RG NEC IE  V+ 
Sbjct: 427 WGEDGYFRIARGVNECDIETFVIG 450


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 113/365 (30%), Positives = 162/365 (44%), Gaps = 47/365 (12%)

Query: 3   PTKLIMDPILCLTCFATFAEGVVSK------------LKLDSHI--LQDSIIKEVNENPK 48
           P +  +DP  C      + EG V K             K   H+  +   +I  +N+   
Sbjct: 110 PLQQPLDPEGCSRDSQHYEEGSVIKENCNFCTCSGQQWKCSQHVCLVLPELIDHINKG-D 168

Query: 49  AGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQ 107
            GW A    QF   T+ + FK  LG  P    LL    +        LP+ F A   WP 
Sbjct: 169 YGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASYPRADLPEVFIASYKWP- 227

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 165
                  LDQ +C + WAF      +DR  I        +LS  +L++CC      GC+ 
Sbjct: 228 -GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNS 285

Query: 166 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKN 216
           G    AW +    G+V+  C P F     ++  C  A         + T  C     K N
Sbjct: 286 GSIDRAWWFLRKRGLVSHACYPLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSFEKSN 345

Query: 217 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--- 273
           ++++ S       YRI+S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+      
Sbjct: 346 RIYQCS-----PPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEE 400

Query: 274 -----VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
                 +  HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+
Sbjct: 401 PEKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEK 460

Query: 325 DVVAG 329
            ++A 
Sbjct: 461 LIIAA 465


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/309 (34%), Positives = 155/309 (50%), Gaps = 42/309 (13%)

Query: 31  DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH 89
           D+ ++ + ++ +VN+     W+A   P+F+   +     + LG  P      L V V ++
Sbjct: 127 DTCMMSEDLVNDVNQQGTT-WRATTYPEFNEKKLKDGLIYKLGTFP------LNVTVISY 179

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FGM-NLSLS 147
            K  + P  FDAR  W     IS I DQ  CGS WA      + DRF I  FG  N+ +S
Sbjct: 180 SKDGQYPDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMS 237

Query: 148 VNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
              LL+C   L G  GC+GG    A+ +   HG+V+E+C PY                  
Sbjct: 238 SQTLLSC--HLKGQRGCNGGNLDIAFDFVKTHGLVSEQCFPY------------------ 277

Query: 207 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
                 V + ++  + + Y +      S  EDIM +I  +GP     TVY+DF HY+ G+
Sbjct: 278 ---EGAVTQCRIGNDCRRYRVGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFFHYREGI 334

Query: 267 YKHIT-GDVM--GGHAVKLIGWGTSDDGED-YWILANQWNRSWGADGYFKIKRGSNECGI 322
           Y+H   GD +  G H+V+++GWG  +D ED YWI+AN W  SWG  GYF+I RG +  GI
Sbjct: 335 YRHTRHGDQLMRGLHSVRIVGWG--EDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGI 392

Query: 323 EEDVVAGLP 331
           E  V+  LP
Sbjct: 393 ESSVLTVLP 401


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 153/310 (49%), Gaps = 29/310 (9%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 90
           ++Q+ ++ ++ ++ +  W      QF   T+    +H LG       L     V+  +  
Sbjct: 21  LIQEDLLMKI-QSGRYTWTGRNYSQFWGRTLKDGIRHRLGT------LFPERSVQNMNEM 73

Query: 91  --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 146
             K  +LP SFDAR  WP    I  I DQG C S WA       +DR  +      N++L
Sbjct: 74  IVKPRELPTSFDARQKWP--DFIHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVAL 131

Query: 147 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
           S    L+C       GC+GGY   AW Y    GVV+EEC PY   T      C       
Sbjct: 132 SAQQFLSCNQHR-QKGCEGGYLDRAWWYIRKFGVVSEECYPYISGTTRKPEICYMQKSKH 190

Query: 207 KCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
              R+C   +    NS+ Y  + +YR++S  +DIM+EI  NGPV+ +F V+ DF  + +G
Sbjct: 191 ANGRQCPSGHP---NSRVYRTTPSYRVSSREQDIMSEILTNGPVQATFRVHGDF--FIAG 245

Query: 266 VYKH---ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
           VYKH   +  ++ G H+V+L+GWG   ++     YWI AN W  +WG +G F+I RG N 
Sbjct: 246 VYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGENGTFRILRGENH 305

Query: 320 CGIEEDVVAG 329
           C IE  V+  
Sbjct: 306 CEIESFVIGA 315


>gi|290991959|ref|XP_002678602.1| predicted protein [Naegleria gruberi]
 gi|284092215|gb|EFC45858.1| predicted protein [Naegleria gruberi]
          Length = 286

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 105/325 (32%), Positives = 143/325 (44%), Gaps = 53/325 (16%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           ++CL   A        K   +  I   +++++VN     GW+A   P F N  +  F+  
Sbjct: 11  VICLLLLAVTFLFAEEKDFWNKPIQTRALVEQVNSQVGVGWRATSYPHFDNMKLSDFRKY 70

Query: 71  LGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           LGV    +       V+    K   LP+ FDAR  WP C  I+ I +Q  CGSCWAF A 
Sbjct: 71  LGVHNFTEPTRSKFNVRAELTKVRNLPEQFDARKEWPHC--ITPIRNQEQCGSCWAFSAS 128

Query: 130 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 187
             LSDRFC++    + + LS   +L C      + C+GG   +AW++ V  G+ T+ C P
Sbjct: 129 AVLSDRFCVYSNGSVQVMLSPEYMLECSA--QNNACNGGTLHAAWQFLVSVGIPTDSCVP 186

Query: 188 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 247
           Y    G              C  KC    Q    SK Y  +A +   +  +IM EI  +G
Sbjct: 187 YSSGNG----------TVGHCPSKCTVPGQ---TSKFYKAAAAKKLENMVEIMTEIKTHG 233

Query: 248 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
            V+V+  VY D   YKSGVY H+T                                 WG 
Sbjct: 234 SVQVAIAVYRDLFSYKSGVYHHVT---------------------------------WGL 260

Query: 308 DGYFKIKRGSNECGIEEDVVAGLPS 332
           DGYF I RG NECG  +DV AG P+
Sbjct: 261 DGYFWILRGHNECGFGKDVWAGKPA 285


>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
          Length = 426

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/349 (31%), Positives = 160/349 (45%), Gaps = 68/349 (19%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKPTP------KGLL 81
           K +S      ++++VN++P+  WKA  N     N + G FK+              +   
Sbjct: 65  KRESDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYG-FKYTRNQTAVEEYMEHIRKFF 123

Query: 82  LGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 135
               +K H + L+      LPK FDAR  WP C +IS + +QG CGSC+A  A    SDR
Sbjct: 124 ESDAMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDR 183

Query: 136 FCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EECDPYFD 190
            CIH        LS  D++ CC  +CG+ C GG P+ A  Y+V+ G+VT   + C PY  
Sbjct: 184 ACIHSNGTFKSLLSEEDIIGCCS-VCGN-CYGGDPLKALTYWVNQGLVTGGRDGCRPYSF 241

Query: 191 STGCSHPGCEPAY-----PTPKCVRKC--VKKNQLWRNSKHYSISAYRI----------- 232
              C  P C PA          C+R+C  +   Q +   KH++  AY +           
Sbjct: 242 DLSCGVP-CSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSLYPRSMTVSPDG 300

Query: 233 --------------NSDPEDIMAEIYKN---------GPVEVSFTVYEDFAHYKSGVYKH 269
                         + + E +    Y+N         GP  ++F V E+F HY SGV++ 
Sbjct: 301 KERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRP 360

Query: 270 ITGD-----VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
              D     ++  H V+LIGWG SDDG+ YW+  N +   WG +G FKI
Sbjct: 361 FPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI 409


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 164/366 (44%), Gaps = 48/366 (13%)

Query: 3   PTKLIMDPILCLTCFATFAEGVVSK------------LKLDSHI--LQDSIIKEVNENPK 48
           P +   DP  C      + EG V K             K   H+  +   +I  +N+   
Sbjct: 109 PFQQPSDPEGCFRDSQHYEEGSVVKENCNSCTCSGQQWKCSQHVCLVHPELIDHINKG-D 167

Query: 49  AGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWP 106
            GW A    QF   T+ + FK  LG + P+P  L +     +      LP+ F A   WP
Sbjct: 168 YGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASFPPRADLPEIFIASYKWP 227

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 164
                   LDQ +C + WAF      +DR  I        +LS  +L++CC      GC+
Sbjct: 228 --GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCN 284

Query: 165 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKK 215
            G    AW +    G+V+  C P F     ++  C  A         + T  C     K 
Sbjct: 285 SGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKS 344

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--- 272
           N++++ S       YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+     
Sbjct: 345 NRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNE 399

Query: 273 -----DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
                  +  HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE
Sbjct: 400 EPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 324 EDVVAG 329
           + ++A 
Sbjct: 460 KLIIAA 465


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/251 (36%), Positives = 127/251 (50%), Gaps = 32/251 (12%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH--------FG 141
           + S  +P +FD R  +PQC  I+ + DQG+CG+CWAF A  A  DR C+         + 
Sbjct: 99  EPSGPIPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYS 156

Query: 142 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 201
              ++S +DL          GC GG   + W +   HG  T EC  Y D+       C P
Sbjct: 157 QQYTVSCDDLDL--------GCAGGTSFNVWTFLTEHGTTTLECVRYTDADKDLSSPC-P 207

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
           A        + VK +     S + +            IM  +  +GPV+   +VY DF +
Sbjct: 208 ALCDDGSEIQLVKADGCLDYSGNVTA-----------IMQTLANDGPVQAVMSVYRDFLY 256

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGED--YWILANQWNRSWGADGYFKIKRGSNE 319
           Y+ GVYKH+ G  +  HAV++IG+GT+DD E   YWI+ N    +WG +GYF I RGSNE
Sbjct: 257 YRGGVYKHVYGIQISSHAVEIIGYGTTDDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNE 316

Query: 320 CGIEEDVVAGL 330
           C IE  V +GL
Sbjct: 317 CDIESAVYSGL 327


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 164/366 (44%), Gaps = 48/366 (13%)

Query: 3   PTKLIMDPILCLTCFATFAEGVVSK------------LKLDSHI--LQDSIIKEVNENPK 48
           P +   DP  C      + EG V K             K   H+  +   +I  +N+   
Sbjct: 109 PFQQPSDPEGCFRDSQHYEEGSVVKENCNSCTCSGQQWKCSQHVCLVHPELIDHINKG-D 167

Query: 49  AGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWP 106
            GW A    QF   T+ + FK  LG + P+P  L +     +      LP+ F A   WP
Sbjct: 168 YGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASFPPRADLPEIFIASYKWP 227

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 164
                   LDQ +C + WAF      +DR  I        +LS  +L++CC      GC+
Sbjct: 228 --GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCN 284

Query: 165 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKK 215
            G    AW +    G+V+  C P F     ++  C  A         + T  C     K 
Sbjct: 285 SGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKS 344

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--- 272
           N++++ S       YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+     
Sbjct: 345 NRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNE 399

Query: 273 -----DVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
                  +  HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE
Sbjct: 400 EPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 324 EDVVAG 329
           + ++A 
Sbjct: 460 KLIIAA 465


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 161/346 (46%), Gaps = 43/346 (12%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQF--------- 59
           I C  C    + G + + + D  +  ++  +I  VN +   GW+A RN  F         
Sbjct: 128 INCNECVCQKSYGSLYEWQCDDEVCLIRKEVIDHVNSH-NPGWQA-RNYTFLWGMTLKDG 185

Query: 60  SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 119
             Y +G FK        P+G++  +     D    +P  FDAR  WP  S I  + DQG+
Sbjct: 186 IKYRLGTFK--------PQGMIEEMSSLKVDADEVMPDEFDAREEWP--SFIHPVQDQGN 235

Query: 120 CGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CG+ +AF      +DR  IH G  L   LS   L++C       GC+GG+   AW     
Sbjct: 236 CGASYAFSTSTVAADRLSIHSGGELKDMLSAQYLISCTTDHHQKGCEGGHVDRAWWQLRR 295

Query: 178 HGVVTEECDPYFDSTGCSHPG--CEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINS 234
            G V+++C PY  S   + PG      Y  PK   +C     +   SK Y  S  YRI +
Sbjct: 296 VGTVSKDCYPY-TSGDTNDPGKCLMSKYKLPKKNIECPVGQGI--TSKLYQASPPYRIAA 352

Query: 235 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG---------HAVKLIGW 285
              +IM EI  NGPV+    V +DF  Y+ GVYKH                 H+V++IGW
Sbjct: 353 KEREIMNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGW 412

Query: 286 GTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           GT   G+D   YW+ AN W R WG  G+F+I RGS+E  IE  VV 
Sbjct: 413 GTDYTGDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 109/313 (34%), Positives = 159/313 (50%), Gaps = 24/313 (7%)

Query: 30  LDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           +D++IL D++  + N   + GW A    +F    Y  G  +  LG   + + +L   P+K
Sbjct: 133 VDTYIL-DTLRHQAN---RFGWSAGNYSEFWGRRYDEG-LQLRLGTLHSKRKILQMKPLK 187

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-- 145
              +  KL +S+DAR  W   + IS  +DQG CG+ WA   V+  +DRF I     +S  
Sbjct: 188 AAFQRGKLRRSYDAREVWG--NYISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDV 245

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC--EPA 202
           LS   LL+C   L   GC GG+   AW +    G++TEEC P+    + C+ P    E  
Sbjct: 246 LSPQHLLSC-NNLNQQGCQGGHLTRAWNWIRKFGLITEECYPWQGRMSTCAVPKKKKETM 304

Query: 203 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
              P  VR     ++  +   H     YR+ ++ E IM EI  +GPV+    V  DF  Y
Sbjct: 305 AQCPSRVRS--NNDRTTKTRLHRVGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMY 361

Query: 263 KSGVYK---HITGDVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRG 316
           KSGVYK     +G   G H+V+++GWG    G     YWI +N W   WG +GYF+I +G
Sbjct: 362 KSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKG 421

Query: 317 SNECGIEEDVVAG 329
            +EC IE+ V+A 
Sbjct: 422 VDECEIEDFVIAA 434


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 158/329 (48%), Gaps = 21/329 (6%)

Query: 31  DSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 89
           D  ++ D+++++++   ++ GW+A    ++  +   + K        PK  +  +   T+
Sbjct: 122 DVCLVDDALLRQLHHLERSIGWQATNYSEWWGHKYDEGKTFRLGTFYPKFKVKSMSRLTN 181

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLS 147
            +   LP  FDA + WP    I  + DQG CGS WA       SDRF I       + L+
Sbjct: 182 GQE-HLPTHFDATTYWP--GFIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLA 238

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
              +++C       GC GG+  +AW Y    G V +EC PY  +       C+       
Sbjct: 239 PQQIISCVRR--SQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQN----ACKIRPSDTL 292

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
               C    ++ R + +    A+ +N++  DIM EI K+GPV+    V+ DF  YKSG+Y
Sbjct: 293 ITANCDLPTKVDRTNMYKMGPAFSLNNE-TDIMIEIKKHGPVQAILRVHRDFFSYKSGIY 351

Query: 268 KHIT----GDVMGG-HAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNE 319
           +H      GD   G H+V+LIGWG   +G +   YW+  N W R WG +G F+I RG NE
Sbjct: 352 RHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVRGQNE 411

Query: 320 CGIEEDVVAGLPSSKNLVKEITSADMFED 348
           C IE  V+A LP     VK +      ++
Sbjct: 412 CEIESYVLASLPYVHQQVKPMRQVGELQE 440


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 81/183 (44%), Positives = 104/183 (56%), Gaps = 19/183 (10%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWAFGA EA+SDR CI       +++S +D+L+CCG  CG+GC+GGYPI AW+Y+V  G
Sbjct: 1   SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60

Query: 180 VVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKKNQL-WRNSK 223
           + T         C PY     C H        P     Y TP C  KC+   +  + + K
Sbjct: 61  ICTGGSYESQSGCKPY-PIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDK 119

Query: 224 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
           HY  SAY +      I  EI  NGPVE ++TVYEDF  Y  GVY H  G  +GGHAV+++
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179

Query: 284 GWG 286
           GWG
Sbjct: 180 GWG 182


>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
          Length = 422

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 162/348 (46%), Gaps = 66/348 (18%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNP----------QFSNYTVGQFKHLLGVKPTPK 78
           K +S      ++++VN++P+  WKA  N           +++       +++  ++   +
Sbjct: 61  KRESDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFE 120

Query: 79  GLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
              +   ++  D  KS  LPK+FDAR  WP C +IS + +QG CGSC+A  A    SDR 
Sbjct: 121 SDAMKRHLEELDNYKSSDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRA 180

Query: 137 CIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EECDPYFDS 191
           CIH        LS  D++ CC  +CG+ C GG P+ A  Y+V+ G+VT   + C PY   
Sbjct: 181 CIHSNGTFKALLSEEDIIGCCS-VCGN-CYGGDPLKALTYWVNQGLVTGGRDGCRPYSFD 238

Query: 192 TGCSHPGCEPAY-----PTPKCVRKC--VKKNQLWRNSKHYSISAYRI------------ 232
             C  P C PA          C+R+C  +   Q +   KH++  AY +            
Sbjct: 239 LSCGVP-CSPATFFEAEEKRTCMRRCQNIYYQQRYEEDKHFATFAYSLYPRSMTVSPDGK 297

Query: 233 -------------NSDPEDIMAEIYKN---------GPVEVSFTVYEDFAHYKSGVYKHI 270
                        + + E +    Y+N         GP  ++F V E+F HY SGV++  
Sbjct: 298 ERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPF 357

Query: 271 TGD-----VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 313
             D     ++  H V+LIGWG S+DG  YW+  N +   WG +G FKI
Sbjct: 358 PLDGFDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI 405


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 93/245 (37%), Positives = 129/245 (52%), Gaps = 18/245 (7%)

Query: 100 DARSAWPQCSTISR---ILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 154
           D    +     ISR   +L + H    WA  +  ++SDR CI     M + LS  +L++C
Sbjct: 3   DQHGLYLTSRVISRKYPLLPREHYTELWAVASAASISDRTCIQTNGTMKVQLSAIELISC 62

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPYF-----DSTGCSHPGC-EPAYPT 205
                G  C  G+   +W Y++ +G+VT +   C PY        +  S+P C    Y  
Sbjct: 63  SKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTA 120

Query: 206 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
           P C + C     + ++  KHY    Y +  +  DI  EI  NGPVE    V+ DF +YKS
Sbjct: 121 PPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKS 180

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
           GVY+HITG ++  H+V++IGWG  +D   YW+ AN WN  WG +GYFKI RGSNEC IE 
Sbjct: 181 GVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYFKILRGSNECEIES 239

Query: 325 DVVAG 329
            V AG
Sbjct: 240 FVNAG 244


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/322 (33%), Positives = 152/322 (47%), Gaps = 35/322 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+    ++ LG ++P+   + +         
Sbjct: 141 LVDPDMIAAINQG-NYGWQAGNHSAFWGMTLDSGIRYRLGTIRPSSSVMNMNEIYTVLAP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LPK+F+A   WP  + I   LDQG+C   WAF      SDR  IH   +++  LS  
Sbjct: 200 GEVLPKAFEASKKWP--NMIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 204
           +LL+C       GC GG    AW +    GVV++ C P+   +G       PA P     
Sbjct: 258 NLLSCDTHH-QQGCQGGRLDGAWWFLRRRGVVSDHCYPF---SGHEQAEAGPATPCMMHS 313

Query: 205 ------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                   +  R+C   +    N  +    AYR+ SD ++IM E+ +NGPV+    VYED
Sbjct: 314 RAMGRGKRQATRRCPNSHDD-ANEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYED 372

Query: 259 FAHYKSGVYKHITGDV--------MGGHAVKLIGWGTS--DDGE--DYWILANQWNRSWG 306
           F  YKSG+Y H    +         G H+VK+ GWG     DG    YW  AN W  SWG
Sbjct: 373 FFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDGRTLKYWTAANSWGPSWG 432

Query: 307 ADGYFKIKRGSNECGIEEDVVA 328
             GYF+I RGSNEC IE  V+ 
Sbjct: 433 ERGYFRILRGSNECDIESFVLG 454


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 103/165 (62%), Gaps = 1/165 (0%)

Query: 169 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 227
           +S   Y  + G +  E  P       +   C+    TP CV+KC +  ++ +    H+  
Sbjct: 18  VSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 77

Query: 228 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 287
           SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG 
Sbjct: 78  SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 137

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
            +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 138 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 182


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 93/251 (37%), Positives = 121/251 (48%), Gaps = 15/251 (5%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
           ++ + FDAR  WPQC TI  + D G+    WA+     L+DR CI  +   N  LS  +L
Sbjct: 85  QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144

Query: 152 LACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK--- 207
           + C G      G   G  +  W Y   HG+V+     Y  + GC      P    P    
Sbjct: 145 IFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPPIGNIPTHLY 200

Query: 208 ---CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
              C  +C   N +     H  +S Y      EDI  E+   GPV V F VY+DF  YKS
Sbjct: 201 NHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKS 260

Query: 265 GVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           GVY      + +  H  KLIGWG  ++G DYW+L N W   WG +G FKIKRG+NE  +E
Sbjct: 261 GVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNSWGNEWGQNGLFKIKRGTNEVHVE 319

Query: 324 EDVVAGLPSSK 334
           + V AG P  K
Sbjct: 320 DYVYAGEPEIK 330


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/303 (33%), Positives = 143/303 (47%), Gaps = 22/303 (7%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 97
           +I E+N +    W+A    +F   T+ +  K  LG     + +     V+       LP+
Sbjct: 146 LIDEIN-SLDLSWRARNYSEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVRRIYDPESLPR 204

Query: 98  SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC 155
            FDAR  WP+   IS I DQG CG+ WA  A    SDRF +      ++ LS   LL+C 
Sbjct: 205 EFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSAQHLLSC- 261

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
                  C GGY   AW Y    G+V E+C P+  +       C+    T      C   
Sbjct: 262 NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNA----QCKLRKRTDLKTAGCRPP 317

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-- 273
               R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SG+YKH      
Sbjct: 318 VNPLRTELYKVGPAYRLGNE-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEH 376

Query: 274 -VMGGHAVKLIGWGTSDDGE-------DYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
              G H+V++IGWG              YW++ N W + WG  G F+I+RG+NEC IE  
Sbjct: 377 YAFGYHSVRIIGWGEDTSAHRHHNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESF 436

Query: 326 VVA 328
           VVA
Sbjct: 437 VVA 439


>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
 gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
          Length = 410

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 165/361 (45%), Gaps = 70/361 (19%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKP 75
           +  +   V  K + D ++ +  ++++VN++P+  WKA  N     N + G FK+      
Sbjct: 39  YRRYVTDVNDKRENDEYLRK--LVRQVNDSPETTWKAKFNKFGVKNRSYG-FKYTRNQTA 95

Query: 76  TP------KGLLLGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSC 123
                   +       +K H + L+      LPK FDAR  WP C +IS + +QG CGSC
Sbjct: 96  VEEYMEHIRKFFESDAMKRHLEELENYKSSDLPKHFDARQKWPNCPSISNVPNQGGCGSC 155

Query: 124 WAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 181
           +A  A    SDR CIH        LS  D++ CC  +CG+ C GG P+ A  Y+V+ G+V
Sbjct: 156 FAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCS-VCGN-CYGGDPLKALTYWVNQGLV 213

Query: 182 T---EECDPYFDSTGCSHPGCEPAY-----PTPKCVRKC--VKKNQLWRNSKHYSISAYR 231
           T   + C PY     C  P C PA          C+R+C  +   Q +   KH++  AY 
Sbjct: 214 TGGRDGCRPYSFDLSCGVP-CSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYS 272

Query: 232 I-------------------------NSDPEDIMAEIYKN---------GPVEVSFTVYE 257
           +                         + + E +    Y+N         GP  ++F V E
Sbjct: 273 MYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPE 332

Query: 258 DFAHYKSGVYKHITGD-----VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 312
           +F HY SGV++    D     ++  H V+LIGWG S DG+ YW+  N +   WG +G FK
Sbjct: 333 EFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFK 392

Query: 313 I 313
           I
Sbjct: 393 I 393


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/197 (43%), Positives = 108/197 (54%), Gaps = 18/197 (9%)

Query: 122 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWA  A E +SDR C+         LS  D+LACCG  CG GC+GGY   AW Y  + G
Sbjct: 1   SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYGCNGGYSARAWLYARNSG 60

Query: 180 VVT----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKH 224
           V +    +E   C PY      +      +  C +  Y TP C + C     + +   K 
Sbjct: 61  VCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKI 120

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y+  AYR++SD   I AEI+  GPV+ SF  YEDFAHYKSG+Y H  G   GGHAVK+IG
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIG 180

Query: 285 WGTSDDGEDYWILANQW 301
           WG  ++G   WI+AN W
Sbjct: 181 WGV-ENGTKXWIVANSW 196


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/322 (32%), Positives = 154/322 (47%), Gaps = 35/322 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +         
Sbjct: 142 LVDPDMINAINQG-DYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLAP 200

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH   +++  LS  
Sbjct: 201 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQ 258

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 204
           +LL+C   L   GC GG+   AW +    GVV++ C P+   +G       PA P     
Sbjct: 259 NLLSC-DTLHQQGCRGGHLDGAWWFLRRRGVVSDHCYPF---SGREQAEAGPAPPCMMHS 314

Query: 205 ------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
                   +  R+C   +    N  +    AYR+ SD ++IM E+ +NGPV+    V+ED
Sbjct: 315 RAMGRGKRQATRRCPNSHTD-ANDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHED 373

Query: 259 FAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWILANQWNRSWG 306
           F  YK G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  SWG
Sbjct: 374 FFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPSWG 433

Query: 307 ADGYFKIKRGSNECGIEEDVVA 328
             G+F+I RGSNEC IE  V+ 
Sbjct: 434 ERGHFRILRGSNECDIESFVLG 455


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 125/256 (48%), Gaps = 25/256 (9%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 151
           +LP  F++   WP    I   LDQG+C + WAF      SDR  I     M   LS  +L
Sbjct: 7   QLPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNL 64

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-------DSTGCSHPGCEPAYP 204
           ++C     G GC GG    AW Y    GVVTE+C PY        + + C          
Sbjct: 65  ISCDTRNQG-GCAGGRLDGAWWYLRRRGVVTEDCYPYRPPQQTPAELSRCMMQSRSVGRG 123

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
             +  ++C   N  ++N  + S   YR+++  ++IM EI  NGPV+    V+EDF  Y S
Sbjct: 124 KRQATQRCPNTNN-YQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNS 182

Query: 265 GVYKHITGDVM--------GGHAVKLIGWGTSD--DG--EDYWILANQWNRSWGADGYFK 312
           G+YKH              G H+VK+ GWG     DG    YWI AN W ++WG +GYF+
Sbjct: 183 GIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRKYWIAANSWGKNWGENGYFR 242

Query: 313 IKRGSNECGIEEDVVA 328
           I RG NEC IE  V+ 
Sbjct: 243 IARGENECEIEAFVIG 258


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 108/331 (32%), Positives = 153/331 (46%), Gaps = 25/331 (7%)

Query: 31  DSHILQDSIIKEVNENPKA-GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 87
           D  +  D ++++++   ++ GWKA    ++    Y  G+   L   +P      +    +
Sbjct: 123 DVCLADDDLLRQLHHLERSIGWKATNYSEWWGHKYDEGKVLRLGTFQPR---FRVKAMKR 179

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNL-S 145
             +K   LP  FDA   W     ++   DQG CGS WAF      SDRF I   G  +  
Sbjct: 180 LSNKGGHLPTRFDASEHWT--GLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQ 237

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           L+   +LAC       GC GG+  +AW+Y    GVV EEC PY  +        +    T
Sbjct: 238 LAPQQMLACVRR--QQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDDTLIT 295

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
             C    VK N   R   +    A+ +N++  DIMAEI   G V+    VY DF  Y+SG
Sbjct: 296 ANCELP-VKVN---RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSG 350

Query: 266 VYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGS 317
           +Y+H        +    H+V+LIGWG    G D   YWI  N W + WG +G F+I RGS
Sbjct: 351 IYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGENGRFRILRGS 410

Query: 318 NECGIEEDVVAGLPSSKNLVKEITSADMFED 348
           NEC IE  V+A  P     V+ I      ++
Sbjct: 411 NECDIESYVLASNPYVHEHVQAIRKVGELQE 441


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 159/324 (49%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++ + +IK +N+    GW+A  +  F   T+ +  ++ LG V+P+     +         
Sbjct: 36  LVDEDMIKAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSSVTNMNEIHTVLGP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP++F+A   WP  + I   LDQG+C   WAF      SDR  IH   ++S  LS  
Sbjct: 95  GEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+      S  G + A P P C+
Sbjct: 153 NLLSC-DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCYPF------SGHGRDEAVPAPPCM 205

Query: 210 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +  N     +  Y ++ AYR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 206 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 265

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+SG+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 266 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTVKYWTAANSWGPA 325

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 326 WGERGHFRIVRGANECDIESFVLG 349


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/251 (37%), Positives = 121/251 (48%), Gaps = 15/251 (5%)

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 151
           ++ + FDAR  WPQC TI  + D G+    WA+     L+DR CI  +   N  LS  +L
Sbjct: 85  QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144

Query: 152 LACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK--- 207
           + C G      G   G  +  W Y   HG+V+     Y  + GC      P    P    
Sbjct: 145 IFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPPIGNIPTHLY 200

Query: 208 ---CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
              C  +C   N +     H  +S Y      EDI  E+   GPV V F VY+DF  YKS
Sbjct: 201 NHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKS 260

Query: 265 GVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           GVY      + +  H  KLIGWG  ++G DYW+L N W   WG +G FKIKRG+NE  +E
Sbjct: 261 GVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNFWGNEWGQNGLFKIKRGTNEVHVE 319

Query: 324 EDVVAGLPSSK 334
           + V AG P  K
Sbjct: 320 DYVYAGEPEIK 330


>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain 1; AltName: Full=Dipeptidyl peptidase I
           heavy chain 1; Contains: RecName: Full=Dipeptidyl
           peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
           peptidase I heavy chain 2; Contains: RecName:
           Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
           Full=Dipeptidyl peptidase I heavy chain 3; Contains:
           RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
           AltName: Full=Dipeptidyl peptidase I heavy chain 4;
           Contains: RecName: Full=Dipeptidyl peptidase 1 light
           chain; AltName: Full=Dipeptidyl peptidase I light chain;
           Flags: Precursor
 gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
          Length = 435

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 154/306 (50%), Gaps = 30/306 (9%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
            +K +N   K+ W A R  ++   T+      +G +  P+     +  + H++  +LP S
Sbjct: 149 FVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTS 207

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCG 156
           +D R+     + +S + +Q  CGSC+AF +   L  R  I      +  LS  ++++C  
Sbjct: 208 WDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 266

Query: 157 FLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
           +    GC+GG+P + A +Y    G+V E C PY    G   P C+P      C R     
Sbjct: 267 Y--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR----- 311

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ 269
              + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H      
Sbjct: 312 ---YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDP 368

Query: 270 ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
                +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA
Sbjct: 369 FNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVA 428

Query: 329 GLPSSK 334
             P  K
Sbjct: 429 ATPIPK 434


>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
          Length = 459

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 154/306 (50%), Gaps = 30/306 (9%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
            +K +N   K+ W A R  ++   T+      +G +  P+     +  + H++  +LP S
Sbjct: 173 FVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPTS 231

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCG 156
           +D R+     + +S + +Q  CGSC+AF +   L  R  I      +  LS  ++++C  
Sbjct: 232 WDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 290

Query: 157 FLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
           +    GC+GG+P + A +Y    G+V E C PY    G   P C+P      C R     
Sbjct: 291 Y--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR----- 335

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ 269
              + +S++Y +  +    +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H      
Sbjct: 336 ---YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDP 392

Query: 270 ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
                +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA
Sbjct: 393 FNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVA 452

Query: 329 GLPSSK 334
             P  K
Sbjct: 453 ATPIPK 458


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 38/318 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 96
           +I+ +N+    GW A    QF   T+ + F+  LG + P+P  L +     T  ++  LP
Sbjct: 158 LIEHINKG-DYGWTAQNYSQFWGMTLEEGFRFRLGTLPPSPVLLSMNEMRATLPETTDLP 216

Query: 97  KSFDA--RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           + F A  + AW      S  +   +C + WAF      +DR  I        +LS  +L+
Sbjct: 217 EFFIAFLQMAWMD----SWAIGSKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLI 272

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE---------PAY 203
           +CC      GC+ G    AW Y    G+V+  C P F     S+  C            +
Sbjct: 273 SCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRH 331

Query: 204 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
            T  C     K N++++ S       YR++S+  +IM EI +NGPV+    V+EDF HYK
Sbjct: 332 ATRPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYK 386

Query: 264 SGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYF 311
           +G+Y+H+            +  HAVKL GWGT        E +WI AN W +SWG +GYF
Sbjct: 387 TGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYF 446

Query: 312 KIKRGSNECGIEEDVVAG 329
           +I RG NE  IE+ ++A 
Sbjct: 447 RILRGVNESDIEKLIIAA 464


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 142/303 (46%), Gaps = 22/303 (7%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 97
           +I E+N      W+A    +F   T+ +  K  LG     + +     V+       LP+
Sbjct: 146 LIDEINSQ-DLSWRARNYSEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVQRIYDPESLPR 204

Query: 98  SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC 155
            FDAR  WP+   IS I DQG CG+ WA       SDRF +      ++ LS   LL+C 
Sbjct: 205 EFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLLSC- 261

Query: 156 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
                  C GGY   AW Y    G+V E+C P+  +    +  C+    T      C   
Sbjct: 262 NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGT----NVQCKLRKRTDLKTAGCRPP 317

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-- 273
               R   +    AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SG+YKH      
Sbjct: 318 VNPLRTELYKVGPAYRLGNE-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEH 376

Query: 274 -VMGGHAVKLIGWGTSDDGE-------DYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
              G H+V++IGWG              YW++ N W + WG  G F+I+RG+NEC IE  
Sbjct: 377 YAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESF 436

Query: 326 VVA 328
           VVA
Sbjct: 437 VVA 439


>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 308

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 93/294 (31%), Positives = 135/294 (45%), Gaps = 35/294 (11%)

Query: 50  GWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 109
            WKA    +  N T   FK +L          +  PV+  +    +P  FD R  +PQC 
Sbjct: 30  AWKAGIPERLKNLTKNDFKKMLSAGSPRTQSSIVRPVRVPENEDPVPDHFDFREEYPQC- 88

Query: 110 TISRILDQGHCGSCWAFGAVEALSDRFCI--------HFGMNLSLSVNDLLACCGFLCGD 161
            I+ ++D G C S WA+ AV+A S R C+         +     LS +    C GF   +
Sbjct: 89  -ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYILSCSSTNGCFGFSTRE 147

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 219
                    AW +    G+  E C  Y  +D T  S P          C   C   + L 
Sbjct: 148 SI-------AWDFIATTGIPLESCVKYTDYDQTQ-SRP----------CPSTCDDDSFL- 188

Query: 220 RNSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
              + Y    Y  +  + E +   +   GP++  FTVYEDF +Y  G+Y +  G+ +G  
Sbjct: 189 ---EVYKPDGYEGVGLNCERLKRAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVGFL 245

Query: 279 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           +V+++G+GTSD+G+DYWI+ N W   WG DGYF+I RG NEC IE      + S
Sbjct: 246 SVEIVGYGTSDEGQDYWIVKNYWGPGWGEDGYFRIVRGQNECQIENSAYGAIIS 299


>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
          Length = 259

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 94/266 (35%), Positives = 134/266 (50%), Gaps = 26/266 (9%)

Query: 73  VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
           +KP P    L + +     +  LP SFD+   WP C   +R  +QG CGSC+AF A   +
Sbjct: 11  IKPQPSSYSLNLNITQKLLASNLPLSFDSTVEWPDCIHATR--NQGSCGSCYAFAASGMM 68

Query: 133 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 188
           SDR CI     +NL LS  +L++C       GC GG+  +   Y + +G+ +E C PY  
Sbjct: 69  SDRLCIKSNGQINLVLSPQELVSC--DYQNYGCSGGWMTNTLYYLMSYGIPSETCLPYDM 126

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
           F+S             T  C  +C   N  +   K    ++ +I SDPE IM +I +NGP
Sbjct: 127 FNSE------------TKACSGRCDSPNYEYTRHKCKKGTS-KIMSDPETIMRDIMENGP 173

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 308
             V+F  +EDF ++  G+YK+ +G  + GHA KL GWG    G  YWI  NQ+   WG  
Sbjct: 174 SIVAFQAFEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGRLYWIGQNQFGLGWGGR 233

Query: 309 ---GYFKIKRGSNECGIEEDVVAGLP 331
              G++KI  G  E G    V + +P
Sbjct: 234 GDYGFYKIYDG--EVGFGSAVWSCIP 257


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 153/318 (48%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +         
Sbjct: 141 LVDPDMINTINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLSP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G +      + P 
Sbjct: 258 NLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPM 316

Query: 206 PKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+   +   NQ+  N  +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           +SG+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 QSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG+NEC IE  V+ 
Sbjct: 437 FRIVRGANECDIESFVLG 454


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 156/320 (48%), Gaps = 35/320 (10%)

Query: 29  KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK---GLLLGVP 85
           K   +I     I ++N + ++ W A   P++ ++T+ +     G    PK   G  L + 
Sbjct: 163 KHRKYIPNKDYINQIN-SAQSLWTATEYPEYEDFTLAELNMRSGRPTVPKSFAGPRLRMK 221

Query: 86  ----VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--H 139
                +  D+ +  PK FD R+     + +S + +QG CGSC+AF ++     R  +   
Sbjct: 222 RDRLSRNSDEFIYFPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSK 280

Query: 140 FGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 198
             +   +S  D+++C  +    GC GG+P + A +Y    G+V E C PY    G   P 
Sbjct: 281 NSVKRVMSPQDVVSCSEY--AQGCAGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPC 335

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
            E      KC R           + +Y +  +    +   +M E+ KNGP+ +SF VY D
Sbjct: 336 KETK---SKCRRHST--------TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGD 384

Query: 259 FAHYKSGVYKHI-TGDV-----MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 311
           F HYK G+Y+H   GD      +  HAV L+G+GT    G+DYWI+ N W   WG +G+F
Sbjct: 385 FKHYKGGIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFF 444

Query: 312 KIKRGSNECGIEEDVVAGLP 331
           +I RG +EC IE + VA  P
Sbjct: 445 RILRGVDECSIENEAVAVTP 464


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 94/249 (37%), Positives = 122/249 (48%), Gaps = 32/249 (12%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           LP SFDAR  +  C+  I  + +QG C +CWA  AV   +DR CI  G  ++  LS+  L
Sbjct: 145 LPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYL 204

Query: 152 LACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------EE------CDPYFDSTGC 194
            +CC    G    +GC  G       +  +HG+VT       EE      C PY     C
Sbjct: 205 TSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPY-PFPKC 263

Query: 195 SH-PGCEPAYPT-------PKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIY 244
           +H PG E  YP        P C   C  K      +   H + S  R+   PE I  EI+
Sbjct: 264 NHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIF 323

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPV    T+YEDF  YKSGVY H TG ++  H +KLIGWG  + G++YW+  N WN  
Sbjct: 324 DNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGV-ESGQEYWLAVNAWNEE 382

Query: 305 WGADGYFKI 313
           WG  G  K+
Sbjct: 383 WGDHGMIKL 391


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 154/324 (47%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 20  LVDRDMIDAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVASMNEIHTVLGP 78

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 79  GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 136

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG+  SAW +    GVV++ C P F   G +  G     P P+C+
Sbjct: 137 NLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYP-FSGQGRTETG-----PAPRCM 189

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   +Q+  N  +    AYR+ S  ++IM E+ +NGPV+    V+
Sbjct: 190 MHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 249

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DGE--DYWILANQWNRS 304
           EDF  Y++G+Y H    +         G H+VK+ GWG     DG    YW  AN W  +
Sbjct: 250 EDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPA 309

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 310 WGERGHFRIVRGANECDIESFVLG 333


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 91/256 (35%), Positives = 126/256 (49%), Gaps = 32/256 (12%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           +P SFD R+ W  C  +S + +Q  CGSCWA      L+DR CI    N+   LS   L+
Sbjct: 46  IPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103

Query: 153 ACCGFL-------CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYP 204
            C G         C +GC GG+   A    ++ G+V++EC  Y  S   S P  C+   P
Sbjct: 104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSP 163

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
                           N+  Y  ++ R     +D   EI  NGPV  +F +Y DF  +K 
Sbjct: 164 I--------------SNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKW 209

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
            VY   +   +  HAV+++GWGT+ DG DYWI AN W   WG  GYFKI+RGS+E   EE
Sbjct: 210 DVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE 269

Query: 325 DVV------AGLPSSK 334
             +      A +P+S+
Sbjct: 270 GFITVTADTASVPTSQ 285


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 104/291 (35%), Positives = 139/291 (47%), Gaps = 56/291 (19%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW-------------------------- 124
           +SL L + FDAR  WP+C  I  I DQ  C  CW                          
Sbjct: 56  ESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSH 115

Query: 125 --------AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 174
                   A  +   ++DR CI +       LS  +L +CC   CG GC+GG+P+ A++Y
Sbjct: 116 WLFISTFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKY 174

Query: 175 FVHHGVVTEECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVK--KNQLWRNSKHYS 226
           +   GV T    PY   +GC      P      A  TP C  KC+   K +L ++ ++Y 
Sbjct: 175 WNEIGVPTG--GPYGSKSGCKPFSIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYG 231

Query: 227 ISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAV 280
            S Y I S  +    I  EI  +GPV  +  ++E F +YKSGVY   K      +G HAV
Sbjct: 232 ESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAV 291

Query: 281 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE-DVVAGL 330
           KLIGWG       YW++ N WN ++G  G FKI+RG+NECGIE   V AGL
Sbjct: 292 KLIGWGEQKR-IPYWLVVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGL 341


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 98/279 (35%), Positives = 134/279 (48%), Gaps = 28/279 (10%)

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSL----KLPKSFDARSAWPQCSTISRILDQGHCGSCWA 125
           LLG +         +  KT D       ++ K FDAR  WPQC TI  + ++G+    WA
Sbjct: 57  LLGTRGVEAATKSKMLYKTRDPRYIIDNQIHKEFDARKRWPQCKTIGEVHNEGNELLSWA 116

Query: 126 FGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY--PISAWRYFVHHGVV 181
           + A    +DR CI    N +  LS  +L++C G       + GY   +  W YF  HG+V
Sbjct: 117 YAATGVFADRMCIATNGNYNQLLSTEELISCSGI---KEREDGYVNRVLVWEYFKTHGLV 173

Query: 182 TEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAY---RI 232
           +     Y  + GC        Y +        CV  C  K+ +  N  H  +S +   RI
Sbjct: 174 S--GGKYNTNEGCQPSKVPTVYNSQTKIYKRTCVEYCYGKDTINYNHDHVKVSNHYFIRI 231

Query: 233 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDG 291
               +DI  E+   GPV V F +++D   YKSGVY K         H  KLIGWG  ++G
Sbjct: 232 ----KDIQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGV-ENG 286

Query: 292 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
            DYW+L N W   WG +G FKIKRG++EC +E  V AGL
Sbjct: 287 VDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAGL 325


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 68/137 (49%), Positives = 93/137 (67%), Gaps = 13/137 (9%)

Query: 199 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 258
           CE  Y T             ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ D
Sbjct: 2   CEAGYSTS------------YKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 49

Query: 259 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           F  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N
Sbjct: 50  FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGEN 108

Query: 319 ECGIEEDVVAGLPSSKN 335
            CGIE ++VAG+P ++ 
Sbjct: 109 HCGIESEIVAGIPRTQQ 125


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/313 (32%), Positives = 148/313 (47%), Gaps = 27/313 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 96
           +I  +N     GW A  +  F   T+ +  ++ LG V+P    + +            LP
Sbjct: 144 LINAINHG-NYGWTAGNHSAFWGMTLEEGIQYRLGTVRPASSVMNMNEIQMVMAPQETLP 202

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 154
            +F+A   WP    I   LDQG+C   WAF      SDR  IH    M  +LS  +LL+C
Sbjct: 203 LAFNASDKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC 260

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVR 210
                  GC GG    AW +    G+V+  C P+     D+T  + P    +    +  R
Sbjct: 261 -DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKR 319

Query: 211 KCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           +       ++   N  + +   YR++SD +DIM E+ +NGPV+    V+EDF  YKSG+Y
Sbjct: 320 QATAHCPNSRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGIY 379

Query: 268 KHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKR 315
           KH    +         G H+VK+ GWG     DG+   YW  AN W  +WG  G+F+I R
Sbjct: 380 KHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILR 439

Query: 316 GSNECGIEEDVVA 328
           G+NEC IE  VV 
Sbjct: 440 GANECDIESFVVG 452


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/255 (35%), Positives = 123/255 (48%), Gaps = 24/255 (9%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP++FDA   WP    I   LDQG+C   WAF      SDR  IH    M  SLS  +LL
Sbjct: 57  LPRNFDAAQKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQNLL 114

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 212
           +C       GC+GG    AW +    G+V+++C P       + P    + P  +  R+ 
Sbjct: 115 SC-NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYPLASQNSIAEPCRMYSRPMGRGKRQA 173

Query: 213 V-------KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
                     +  + N  + S   YR++S+ +DIM EI +NGPV+    V+EDF  YK G
Sbjct: 174 TGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFLYKDG 233

Query: 266 VYKHITGD--------VMGGHAVKLIGWGT--SDDGE--DYWILANQWNRSWGADGYFKI 313
           +Y+H              G H+VK+ GWG     +G    +W  AN W  +WG  G F+I
Sbjct: 234 IYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRI 293

Query: 314 KRGSNECGIEEDVVA 328
            RG NEC IE  VV 
Sbjct: 294 LRGCNECDIESFVVG 308


>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 315

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 147/308 (47%), Gaps = 39/308 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYT-----VGQFKHLLG-VKPTPKGLLLGVPVK 87
           +L    +  +N  PK  W A  + +F   T     +    HL+  +       L G   K
Sbjct: 17  MLNSRTLAHINSLPKH-WTAGISEKFRALTRDDIELMTMSHLVHFLDANAHSHLAGRTEK 75

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---L 144
             + +   P+SFD R  +PQC  +    DQGHCGSCWAF +  A  D  C+  G++   +
Sbjct: 76  --NINYDYPESFDFREEYPQC--LLPTYDQGHCGSCWAFASSRAFGDTRCMQ-GLDPVPV 130

Query: 145 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 204
             S   L++C   L   GC GG       +    G+ T+ C PY D         E A+ 
Sbjct: 131 LYSPQYLVSCS--LQNMGCTGGTMEDVGDFLRDTGIATDTCVPYVD---------EDAHW 179

Query: 205 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
            P C   CV  + + R  +   +   R + + E +M  I  NGP+  S  +YEDF +Y+S
Sbjct: 180 EP-CPVSCVDGSPI-RTVQ--LMDFVRYDGNLEAMMEAIAMNGPIHASMMIYEDFMYYQS 235

Query: 265 GVYKHITGDVMGGHAVKLIGWGTSDDGE---------DYWILANQWNRSWGADGYFKIKR 315
           G+Y  I G   G HA++L+G+GT   G+         DYWI  N W   WG +GYF+I R
Sbjct: 236 GIYHFIYGSGCGMHAIELVGYGTDISGDSEAGEEVRVDYWIARNSWGEDWGENGYFRIVR 295

Query: 316 GSNECGIE 323
           G+NECGIE
Sbjct: 296 GNNECGIE 303


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 109/347 (31%), Positives = 165/347 (47%), Gaps = 39/347 (11%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKH 69
           + C  C  T  E    +   +  ++ + +I+ +N     GW+A  +  F   T+ +  ++
Sbjct: 79  LRCEECNLTCHEKERWECDQEPCLVDEDMIEAINHG-DYGWRAGNHSAFWGMTLDEGIRY 137

Query: 70  LLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            LG V+P+     +            LP++F+A   WP  + I   LDQG+C   WAF  
Sbjct: 138 RLGTVRPSSFVANMNEIHTVLGPGEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFST 195

Query: 129 VEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 186
               SDR  IH   ++S  LS  +LL+C       GC GG    AW +    GVV++ C 
Sbjct: 196 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 254

Query: 187 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNSKHYSIS-AYRIN 233
           P+      S  G + A P P C+          R+   +  N     +  Y ++ AYR+ 
Sbjct: 255 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 308

Query: 234 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 285
           S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+VK+ GW
Sbjct: 309 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 368

Query: 286 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           G  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 369 GEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 415


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 153/319 (47%), Gaps = 34/319 (10%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYT-------VGQFKHLLGVKPTPKGLLLGVPV 86
           +++ ++I+ +NE    GW A      SN+T       +  +K+ LG    P  +     +
Sbjct: 227 LVRPNVIEAINEG-DFGWTA------SNFTFLWGLTQLEGYKYKLGTARVPDEVRNMNAM 279

Query: 87  KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI---HFGMN 143
                S  LPK+FD+R+ WP   ++ R  DQ + G+ WAF     LSDR  I   +F + 
Sbjct: 280 HPLSVSSNLPKTFDSRTKWPGSLSLPR--DQENEGTSWAFSTTSVLSDRLAIQSKNFTV- 336

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 203
           + LS   L++C  F   +G  G      W Y    GVV+  C P   S      G     
Sbjct: 337 VELSPQHLVSC--FSSHEG-RGERLDRTWWYLRKKGVVSTVCYPESRSKSTQGIGSCGLV 393

Query: 204 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 263
                   C   N +  N  + +   YR++S+ E+IM EI++NGPV+    V  DF  YK
Sbjct: 394 AHSSGAHICPNGNVISSNEIYKTSPVYRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYK 453

Query: 264 SGVYKHITGDVM--------GGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFK 312
           SGVY     D +          H+VK+IGWG   +  +   YWI+ N W  +WG  GYF+
Sbjct: 454 SGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWIVQNSWGANWGEGGYFR 513

Query: 313 IKRGSNECGIEEDVVAGLP 331
           I++G NECGIEE ++A  P
Sbjct: 514 IRKGVNECGIEEMILAAWP 532


>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 515

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 84/198 (42%), Positives = 104/198 (52%), Gaps = 18/198 (9%)

Query: 86  VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 145
           V     ++ +P  FDAR  W +C +I  I  Q  CGSCWAFGAVEA+SDR CIH G    
Sbjct: 72  VNNRFSNVDIPMQFDARKYWLKCPSIREIRGQSSCGSCWAFGAVEAMSDRLCIHSGAKYQ 131

Query: 146 --LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 190
             LS  DLL+CC + CG GCDGG+P  AW Y+   G+VT         C  Y       D
Sbjct: 132 KGLSAVDLLSCC-WKCGYGCDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPSCSHD 190

Query: 191 STGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 249
             G  HP C    Y TP+C +KC      +      + S+Y +     +IM EI  NGPV
Sbjct: 191 ERG-RHPLCPSEIYHTPRCTKKCDTDKLHYSAELTKANSSYNVLDSDREIMMEIMNNGPV 249

Query: 250 EVSFTVYEDFAHYKSGVY 267
           E  F VYEDF  Y+ G+Y
Sbjct: 250 EAVFDVYEDFLQYEKGIY 267


>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
          Length = 311

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 95/295 (32%), Positives = 134/295 (45%), Gaps = 50/295 (16%)

Query: 55  RNPQFSNYTVGQFKHLLGVKPTPKGLL-----LGVPVKT--------------------- 88
           +NP   N+T  Q K +LGVK TP G          P KT                     
Sbjct: 19  KNP-MKNFTTEQLKKILGVK-TPAGYFDANYGQQSPSKTTSAYTFSAPKSPVSARGTSGT 76

Query: 89  ----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
                  + ++P S+D R+ +P C   +RI DQ  CGSCWAF     L  R+C+      
Sbjct: 77  DYLNRQVAKQMPSSYDVRTVYPMCE--NRIKDQAQCGSCWAFATTNVLEYRYCMATKGKK 134

Query: 145 --SLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 201
              LS  +L++C  F     GCDGGY    + Y    GV TE+C PY    G        
Sbjct: 135 YPELSPQNLISC--FNSASWGCDGGYIDQTFLYLEMMGVNTEQCMPYKSGDG-------- 184

Query: 202 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
                 C  KC     L+ N  +    + +     +     ++  GP+   F V+EDF +
Sbjct: 185 --NMTACPSKCANGENLYMNKYYCRPGSTQYMRGEQQFKNYLFNKGPMVAVFDVFEDFIN 242

Query: 262 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           Y  G+Y  ++GD +G HAVKL+G+G  ++  +Y+I  NQW + WG DGYF+IK G
Sbjct: 243 YGGGIYNKVSGDKLGKHAVKLLGYGV-ENSTNYYIGVNQWGKDWGEDGYFRIKAG 296


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/277 (35%), Positives = 143/277 (51%), Gaps = 31/277 (11%)

Query: 57  PQFSNYTVGQFKHLLGVKPTPKGLL-LGVPVKTHDKSLK----LPKSFDARSAWPQCSTI 111
           P+     VG  K L GV+     L+    P  T   S K     P+S+D R  +P C  I
Sbjct: 102 PELPKRFVG--KSLDGVRAMLGPLIDTSRPTITMKHSTKPPVGAPESYDFREEYPHC--I 157

Query: 112 SRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYP 168
           + ++DQG CGSCWAF +++  +D  C   G++   +S SV  +L C       GC+GG P
Sbjct: 158 TEVVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDCD--RKDHGCNGGEP 214

Query: 169 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 228
           ++A+ +  + G V   C  Y          C       KC      +N +       + S
Sbjct: 215 VNAFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV-------ATS 262

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
             +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+++G+G +
Sbjct: 263 GAKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVT 318

Query: 289 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           D G DYW + N W   WG DGYF+I RG +ECGIE++
Sbjct: 319 DSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEQE 355


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/233 (37%), Positives = 128/233 (54%), Gaps = 24/233 (10%)

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 152
           P+S+D R  +P C  I+ ++DQG+CGSCWAF +V+  +D  C   G++   +S SV  +L
Sbjct: 141 PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRC-RSGLDATGVSYSVQYVL 197

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 212
            C       GC+GG P++A+ +  + G V   C  Y          C       KC    
Sbjct: 198 DC--DRKDHGCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGS 250

Query: 213 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
             +N +       + S  +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G
Sbjct: 251 AVENVV-------ATSGSKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWG 299

Query: 273 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
             +GGHAV++IG+G +D G DYW + N W   WG DGYF+I RG +ECGIE +
Sbjct: 300 LWLGGHAVEIIGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEHE 352


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 154/324 (47%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 140 LVDRDMIDAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVASMNEIHTVLGP 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG+  SAW +    GVV++ C P F   G +  G     P P+C+
Sbjct: 257 NLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYP-FSGQGRTETG-----PAPRCM 309

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   +Q+  N  +    AYR+ S  ++IM E+ +NGPV+    V+
Sbjct: 310 MHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 369

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DGE--DYWILANQWNRS 304
           EDF  Y++G+Y H    +         G H+VK+ GWG     DG    YW  AN W  +
Sbjct: 370 EDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPA 429

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 430 WGERGHFRIVRGANECDIESFVLG 453


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 151/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 89  LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 147

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 148 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 205

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+             A PTP+C+
Sbjct: 206 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 258

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    V+
Sbjct: 259 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 318

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 319 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 378

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 379 WGERGHFRIVRGTNECDIETFVLG 402


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 150/317 (47%), Gaps = 21/317 (6%)

Query: 45  ENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVK-THDKSLKLPKSFDAR 102
           ++ + GW A     F   T     K  LG    P+ +L  VP+K    +  +LP SFD R
Sbjct: 170 QSRQFGWSAKNYSVFWGVTYDNGLKWRLGTLQPPEKILQVVPLKAVFHQDYQLPSSFDLR 229

Query: 103 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 160
             +     I+  +DQG CG+ WA    +  +DRF I     M  +LS   LL+C   L  
Sbjct: 230 KVFG--DKITDPIDQGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-Q 286

Query: 161 DGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 219
            GC GG+  SAW + +  G+VTEEC P+   +T C+               +  K + L 
Sbjct: 287 RGCQGGHLTSAWNWVMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLR 346

Query: 220 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MG 276
           R    Y ++        E IM EI   G V+    V ++F  Y+SGVYK    D+    G
Sbjct: 347 RVGLMYRVAT------EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTG 400

Query: 277 GHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            H V+++GWG          YWI++N W   WG  GYF+I +G+NEC IE+ VVA +P  
Sbjct: 401 YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMPDI 460

Query: 334 KNLVKEITSADMFEDAS 350
            N    I+     E+AS
Sbjct: 461 DNFCN-ISDQSFRENAS 476


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 103/313 (32%), Positives = 150/313 (47%), Gaps = 27/313 (8%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 96
           +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       +    LP
Sbjct: 1   MIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNPGEVLP 59

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 154
            +F+A   WP    I   LDQG+C   WAF      SDR  IH    M   LS  +LLAC
Sbjct: 60  TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLAC 117

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVR 210
                  GC GG    AW +    GVV++ C P+     D  G + P    +    +  R
Sbjct: 118 DTHHQ-QGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKR 176

Query: 211 KCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           +   +  N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y
Sbjct: 177 QATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY 236

Query: 268 KHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKR 315
            H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I R
Sbjct: 237 SHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVR 296

Query: 316 GSNECGIEEDVVA 328
           G NEC IE  V+ 
Sbjct: 297 GVNECDIESFVLG 309


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)

Query: 171 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 216
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 217 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 275
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 276 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           GGHAVK+IGWG  + G  YW++AN W+  WG  G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 151/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+             A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 429

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 430 WGERGHFRIVRGTNECDIETFVLG 453


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/282 (33%), Positives = 139/282 (49%), Gaps = 33/282 (11%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK----LPKSFDARSAWP 106
           W    + +F   ++ + K +LG       +    P  T   S K     P+S+D R  +P
Sbjct: 33  WVPELSKRFEGKSLDEVKAMLGPL-----INTSRPAITRRHSTKPPVGAPESYDFRDEYP 87

Query: 107 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGC 163
            C  I+ ++DQG CGSCWAF +++  +D  C   G++   +S SV  +L C       GC
Sbjct: 88  HC--ITEVVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDC--DRKDHGC 142

Query: 164 DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK 223
           +GG P  A+ +    G V   C  Y          C      PK          ++  S 
Sbjct: 143 NGGEPTKAFDFLHSTGTVLTSCVDYTAGADNVVKFC------PKTCDDGSAVENVFAASG 196

Query: 224 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
             S SA  +          +  +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+++
Sbjct: 197 SKSGSAIDV----------LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVV 246

Query: 284 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 325
           G+G +D G DYW + N W   WG DGYF+I RGS+ECGIE++
Sbjct: 247 GYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGSDECGIEQE 288


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 86/193 (44%), Positives = 111/193 (57%), Gaps = 22/193 (11%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWAFGAVEA+SDR CI       ++LS  DLL+CC   CG GC+GG P+SAW+++V  G
Sbjct: 1   SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEG 59

Query: 180 VVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNS 222
           +VT         C PY     C H        P     +PTPKC + C      + ++  
Sbjct: 60  IVTGSNHSTNAGCKPY-PFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKED 118

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
           K++  SAY + +  E I  EI   GPVEV+F VYEDF +Y  G+Y H  G + GGHAVK+
Sbjct: 119 KYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKM 178

Query: 283 IGWGTSDDGEDYW 295
           IGWG  D+G  YW
Sbjct: 179 IGWGI-DNGVPYW 190


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 95/269 (35%), Positives = 134/269 (49%), Gaps = 47/269 (17%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 152
           LP +FD+R  WP C +I  I +QG+C S +A  A  A SDR CI      N  +S   ++
Sbjct: 61  LPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQII 120

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE----- 200
           +CC +LCG GCDGG    +W Y+  HG V+       + C PY      + P C+     
Sbjct: 121 SCC-YLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPY------TIPPCKLMNEK 173

Query: 201 -PAYP--------TPKCVRKCVKKNQLWR------NSKHYSISAYRINSDPEDIMAEIYK 245
            P +         TP C +KC   N            K+Y +S Y         M +I+ 
Sbjct: 174 PPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSPYM-------AMKDIFD 226

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWGTSDDGEDYWILANQWN 302
           NGP+   F +Y D   YKSGVY++      D    H+VK+ GWG  ++G  YW++AN + 
Sbjct: 227 NGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWG-EENGVPYWLVANSFG 285

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             WG +G FKI RG++ C  +E + AGLP
Sbjct: 286 TDWGYNGTFKISRGNDGCFFQEKMYAGLP 314


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 92/270 (34%), Positives = 143/270 (52%), Gaps = 37/270 (13%)

Query: 75  PTPKGLLLGVPVKTHD---KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
           P PK   L     TH+   K+  LPKS+D R+     + +S + +Q +CGSC+AF ++  
Sbjct: 319 PRPKSAPL-----THEILQKTSTLPKSWDWRNV-NGVNYVSPVRNQANCGSCYAFASLGM 372

Query: 132 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 188
           L  R  I    +    LS  ++++C  +    GC+GG+P +   +Y    G+V EEC PY
Sbjct: 373 LESRIRIKTNNSQVPVLSPQEIVSCSEY--SQGCEGGFPYLIGGKYAQDFGLVEEECFPY 430

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
                        AY +P   +KC +    +  S+++ +  +    +   +  E+ +NGP
Sbjct: 431 ------------QAYDSPCTPKKCSR----YYTSEYHYVGGFYGGCNEALMKHELIQNGP 474

Query: 249 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQW 301
           + V+F VY+DF HY++G+Y H           +  HAV L+G+GT +  GEDYWI+ N W
Sbjct: 475 LTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTNHAVLLVGYGTDEKTGEDYWIVKNSW 534

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
             SWG +GYF+I RG++EC IE   VA  P
Sbjct: 535 GTSWGENGYFRILRGTDECAIESIAVAATP 564


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 157/324 (48%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++ + +I+ +N     GW+A  +  F   T+ +  ++ LG V+P+     +         
Sbjct: 208 LVDEDMIEAINHG-DYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLGP 266

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP++F+A   WP  + I   LDQG+C   WAF      SDR  IH   ++S  LS  
Sbjct: 267 GEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQ 324

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+      S  G + A P P C+
Sbjct: 325 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPF------SGHGRDEAVPAPPCM 377

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   + +  N  +    AYR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 378 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 437

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+SG+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 438 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPA 497

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 498 WGERGHFRIVRGANECDIESFVLG 521


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 124/250 (49%), Gaps = 33/250 (13%)

Query: 87  KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D S K  +P+ FDAR  +  C+  I  + DQG+C S WA       +DR CI     
Sbjct: 16  KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 75

Query: 144 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +L++C     GDG    CDGG    AW   ++ G+VT       E C PY +
Sbjct: 76  FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKN 130

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 239
              C H G      C     T    C +KCV KN    + +  H +   Y  + ++ + I
Sbjct: 131 RP-CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             EI   GPV     VYE+F  YK G+YK  TG+++G H VKLIGWG   DG +YW+  N
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMN 249

Query: 300 QWNRSWGADG 309
            WN +WG DG
Sbjct: 250 SWNSNWGNDG 259


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 106/339 (31%), Positives = 158/339 (46%), Gaps = 27/339 (7%)

Query: 13  CLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLL 71
           C+    T  E    +   +  ++   IIK +N+    GW+A  +  F   T+ +  ++ L
Sbjct: 115 CVILGRTCQENRQWQCDQEPCLVDPDIIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRL 173

Query: 72  G-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 130
           G ++P+   + +       +    LP +F+A   WP  + I   LDQG+C   WAF    
Sbjct: 174 GTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAA 231

Query: 131 ALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 188
             SDR  IH    M   LS  +LL+C       GC GG    AW +    GVV++ C P+
Sbjct: 232 VASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPF 290

Query: 189 ----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMA 241
                D  G + P    +    +  R+      N    N+  Y ++  YR+ S+ ++IM 
Sbjct: 291 SGRERDEAGPAPPCMMHSQAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMK 350

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDG 291
           E+ +NGPV+    V+EDF  YK G+Y H    +         G H+VK+ GWG  T  DG
Sbjct: 351 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 410

Query: 292 E--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
               YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 411 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 449


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 154/324 (47%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 36  LVDPDMIKAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVANMNEIHTVLGP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP++F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 95  GEVLPRAFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+       H   E A P P+C+
Sbjct: 153 NLLSC-DTHNQQGCQGGRLDGAWWFLRRRGVVSDHCYPF-----SGHERNE-AGPAPRCM 205

Query: 210 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +  N     +  Y ++ AYR+ S+ +DIM E+ +NGPV+    V+
Sbjct: 206 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVH 265

Query: 257 EDFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+SG+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 266 EDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTAANSWGPG 325

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 326 WGERGHFRIVRGANECDIESFVLG 349


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 92/247 (37%), Positives = 120/247 (48%), Gaps = 14/247 (5%)

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC 154
           K FDAR  WP+C TI  + ++G+    WA+     L+DR CI  + G N  LS  +L++C
Sbjct: 67  KEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNGGYNKLLSTEELISC 126

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--- 211
            G    +G       S W Y   HGVV+     Y  + GC      P    PK + K   
Sbjct: 127 SGIKENNGSVPS-ERSIWEYLKSHGVVS--GGKYNSNDGCQPFKFPPIANIPKHLHKHTC 183

Query: 212 ---CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY- 267
              C   + +  N  H  +  Y       DI  E+   GPV V F V +DF  YKSGVY 
Sbjct: 184 DDHCYGNSTINYNHDHVRVRNY-YTIRTRDIQKEVQTYGPVVVRFMVCDDFFLYKSGVYA 242

Query: 268 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
           K      +     KLIGWG  ++G DYW++ N W   WG  G FKIK G+N+CG+E  V 
Sbjct: 243 KSDKAKGIRTQYAKLIGWGV-ENGVDYWLVINSWGHEWGQKGLFKIKSGTNQCGVESFVY 301

Query: 328 AGLPSSK 334
           AGLP  K
Sbjct: 302 AGLPEIK 308


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 145/301 (48%), Gaps = 26/301 (8%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+   + +            LP +F+A   WP  
Sbjct: 126 WRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLSPGEVLPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLW 219
               AW +    GVV++ C P+     D  G +      + P  +  R+   +   NQ+ 
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQ 302

Query: 220 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---- 275
            N  +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +     
Sbjct: 303 ANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEG 362

Query: 276 ----GGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
               G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVL 422

Query: 328 A 328
            
Sbjct: 423 G 423


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 153/318 (48%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPT 205
           +LL+C       GC GG+   AW +    GVV++ C P+     D  G   P    +  T
Sbjct: 258 NLLSCNTHH-QQGCRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSRAT 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++ AYR+ S+  +IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H   ++         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDGRKLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|403355865|gb|EJY77523.1| Cathepsin B [Oxytricha trifallax]
          Length = 299

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 89/246 (36%), Positives = 124/246 (50%), Gaps = 29/246 (11%)

Query: 79  GLLLGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 132
           G  LG+     +++ K      LP S+D R+A P C+    +L+Q  CGSCW+F A   L
Sbjct: 54  GTALGIESSPDNQNTKKKLTTTLPSSYDYRTAHPGCT--HAVLNQQSCGSCWSFAATSML 111

Query: 133 SDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 190
            DR C+H    +N+ LS  D+++C       GC GG+      Y V HGVVT +C  Y  
Sbjct: 112 QDRLCLHSNGAVNVQLSQQDMVSC--DFDNAGCSGGWLSHTINYLVVHGVVTSQCLAYAS 169

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGP 248
             G             +C  +C   N  +   K Y    ++ ++ +  E++M EIY NGP
Sbjct: 170 VDGAGR----------ECSFRCDDANTEY---KKYGCKFNSLKMTTSKEEMMEEIYLNGP 216

Query: 249 VEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 307
           V V F VY DF  Y  G Y+   +  + GGHAV + GWG  + G  YWI  NQW  +WG+
Sbjct: 217 VMVGFIVYSDFMSYGGGYYEVSPSASISGGHAVIVHGWGY-NGGRLYWIAQNQWGTTWGS 275

Query: 308 DGYFKI 313
            GYF I
Sbjct: 276 SGYFNI 281


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 105/324 (32%), Positives = 154/324 (47%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 36  LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 95  GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+      S    + A PTP C+
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPF------SGRERDEAGPTPPCM 205

Query: 210 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 206 MHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVH 265

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  YK G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 266 EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 325

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG NEC IE  V+ 
Sbjct: 326 WGERGHFRIVRGVNECDIESFVLG 349


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 36  LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 95  GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 212 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 271

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 331

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 332 FRIVRGVNECDIESFVLG 349


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 129/263 (49%), Gaps = 23/263 (8%)

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIH 139
           LLG P K   K L  P +FDAR  +  C+  I  + DQ  C +CW   +   L+DR CI 
Sbjct: 26  LLG-PTKPELKDL--PSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIK 82

Query: 140 FGMNLS--LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-EECDP---YF 189
            G      LSV    +CC    G     GC GG  +    +  +HG+VT +E  P     
Sbjct: 83  SGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLS 142

Query: 190 DSTGC---SHPGCEPA-YPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEI 243
            + GC     P C+ A Y +P C  KC  K      +   H + S  R+ + P++I  EI
Sbjct: 143 SADGCWPYPFPKCKHAGYSSPACQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEI 202

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           + NGPV    ++YED   YK+GVY H TG   G H +K+IGWG  + G+DYW+  N WN 
Sbjct: 203 FTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV-ESGQDYWLAVNSWNE 261

Query: 304 SWGADGYFKIKRGSNECGIEEDV 326
            WG  G  K+  G    GIE  V
Sbjct: 262 EWGDHGMIKLAVG--RTGIENSV 282


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 153/318 (48%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 36  LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 95  GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+   +  N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 212 GRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 271

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 331

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 332 FRIVRGVNECDIESFVLG 349


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 106/316 (33%), Positives = 154/316 (48%), Gaps = 38/316 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           ++Q+ I++ +  + +  W +A    F   T+ + F + LG       LL    VK  ++ 
Sbjct: 251 LIQEDILERM-LHERNSWTSANYSTFWGKTLDEGFSYRLGT------LLPEKSVKNMNEI 303

Query: 93  LK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--S 145
           L      LP+SFDAR  WP  S I  + DQG C S WAF      +DR  I  G      
Sbjct: 304 LIEMSNFLPESFDARERWP--SFIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNP 361

Query: 146 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 205
           LSV  LL+C       GC+GGY   AW       VV++EC  Y  S   + PG E   P 
Sbjct: 362 LSVQQLLSC-NQARQRGCNGGYLDRAW------CVVSDECYTY-TSGQTNQPG-ECHIPR 412

Query: 206 PKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
              +   ++      +++ Y ++  YRI+++  +IM EI  NGPV+ +F V+EDF  YKS
Sbjct: 413 TAYLDGEIRCPSGSADNRVYKMTPPYRISTNEREIMTEIMANGPVQATFLVHEDFFMYKS 472

Query: 265 GVYKHI--------TGDVMGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKI 313
           GVY+H+             G H+V+++GWG          YW+ AN W   WG +G F+I
Sbjct: 473 GVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLCANSWGEEWGENGLFRI 532

Query: 314 KRGSNECGIEEDVVAG 329
            RG N C IE  ++  
Sbjct: 533 LRGENHCDIESFIIGA 548


>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 305

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 152/321 (47%), Gaps = 35/321 (10%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-----FKHLL 71
           FA     V+S      H+L     K +N      W+A    +F+N T  +     F H  
Sbjct: 3   FAALVVAVLSTPFYSPHLL-----KYLNTKEGKLWEAGIPAKFANRTHDEVTKMFFPHAF 57

Query: 72  GVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                P+    GV +   D       P   D R   P+C       DQ  C  C+AF  +
Sbjct: 58  LKPNIPR--YYGVNITEDDLYPPDGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATI 113

Query: 130 EALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECD 186
            ALS R CI       +SLSV  +++C     G+ GC GG   S+W +    GVV  +C 
Sbjct: 114 GALSTRRCIAKLDSQAVSLSVQHMVSCDN---GEAGCLGGEFESSWAFLETEGVVKSDCL 170

Query: 187 PYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           PY    TG S           +C   C +   L  ++ HY  ++    ++  +IM  +  
Sbjct: 171 PYTSGETGNSG----------ECPMMC-QDGTLVEDAFHYKAASASPLNNYNEIMVSLLA 219

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           +GPV+  F V+EDF +Y  G+Y  + G  +GGHAV ++G+G+ +D  DYWI+ N W   W
Sbjct: 220 DGPVQTGFYVHEDFLYYVGGIYHKVYGSSLGGHAVLIVGYGSMND-HDYWIVRNSWGPDW 278

Query: 306 GADGYFKIKRGSNECGIEEDV 326
           G +GYF+I RG+NECGIE++ 
Sbjct: 279 GENGYFRILRGTNECGIEKNA 299


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 92/251 (36%), Positives = 120/251 (47%), Gaps = 24/251 (9%)

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC 154
           K FDAR  WP+C TI  + ++G+    WA+ A   L+DR CI  + G N  LS  +L++C
Sbjct: 74  KEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNGGYNKLLSTEELISC 133

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP-------- 206
            G    +G      I  W Y   HGVV+            S+ GC+P    P        
Sbjct: 134 SGIKETNGNVNERSI--WEYLKSHGVVS-------GGKYNSNDGCQPFKFPPIANILTHL 184

Query: 207 --KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
              C   C     +  N  H  +  Y        I  E+   GPV V F V +DF  YKS
Sbjct: 185 QHTCDDHCYGNTSINYNHDHVRVRNY-YTIRTGYIQKEVQTYGPVAVQFKVCDDFLLYKS 243

Query: 265 GVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           GVY K     V+     KLIGWG  ++G DYW++ N W   WG  G FKIKRG+N+CG+E
Sbjct: 244 GVYVKSDNAKVIRTQYAKLIGWGV-ENGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVE 302

Query: 324 EDVVAGLPSSK 334
             V AG+P  K
Sbjct: 303 SVVYAGVPEIK 313


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 104/324 (32%), Positives = 153/324 (47%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP C+
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPTPPCM 310

Query: 210 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  YK G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 371 EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGVNECDIESFVLG 454


>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
 gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
          Length = 461

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 92/306 (30%), Positives = 150/306 (49%), Gaps = 29/306 (9%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 98
            +K +N   K+ W A    ++   T+       G +  P+     +    H+K L+LP S
Sbjct: 174 FVKAINTIQKS-WTATTYTEYKTLTLRDMMRKGGGRRIPRPKPAPLTADIHEKMLRLPAS 232

Query: 99  FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCG 156
           +D R+     + +S + +Q  CGSC+AF ++  L  R  I      +  LS  ++++C  
Sbjct: 233 WDWRNV-HGTNFVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQTPILSPQEVVSCSQ 291

Query: 157 FLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 215
           +    GC+GG+P + A +Y    G+V E C PY  +         P  P   C R     
Sbjct: 292 Y--AQGCEGGFPYLIAGKYAQDFGLVEEACFPYMGAD-------FPCKPKKDCFR----- 337

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ 269
              + +S ++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y H      
Sbjct: 338 ---YYSSDYHYVGGFYGGCNEALMKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTGLRDP 394

Query: 270 ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
                +  HAV L+G+GT +  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA
Sbjct: 395 FNPFELTNHAVLLVGYGTDTASGMDYWIVKNSWGAGWGENGYFRIRRGTDECAIESIAVA 454

Query: 329 GLPSSK 334
             P  K
Sbjct: 455 ATPVPK 460


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 122/244 (50%), Gaps = 26/244 (10%)

Query: 88  THDKSLKLPKSFDARSAWPQCSTISRILDQGH-CGSCWAFGAVEALSDRFCIHFGMNLS- 145
           T D S  LP SFD+R  W  C   S + DQG  C SCWA  A   L+DR C+  G  +  
Sbjct: 27  TFDAS-NLPASFDSRQKWSDC--FSPVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKK 83

Query: 146 -LSVNDLLAC--CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 202
            LS  +L+ C   G L   GC GG   +   YF  +GVVTE+C+ Y             A
Sbjct: 84  VLSPQELIDCDRNGNL---GCGGGRLDTPLAYFRDNGVVTEKCESY------------KA 128

Query: 203 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
                C   C         +K++S   YR++S  E   A+IY NGP+   F +Y D  +Y
Sbjct: 129 TQASSCSNTCDDGTSFSNTTKYHSKDCYRLSS-IEQAKADIYLNGPIIAVFDLYTDIYNY 187

Query: 263 KSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           KSGVY K  +      HA ++IGWG  +DG  YW+ AN W   WG  G FKI+ G+NE G
Sbjct: 188 KSGVYIKSDSATYKETHAGRVIGWGV-EDGVQYWLAANSWGTGWGQQGLFKIRSGTNEVG 246

Query: 322 IEED 325
            E +
Sbjct: 247 FEAN 250


>gi|308163309|gb|EFO65659.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 309

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 92/289 (31%), Positives = 137/289 (47%), Gaps = 24/289 (8%)

Query: 51  WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 110
           WKA    +  + T   FK +L          +  P+   +     P  FD R  +PQC  
Sbjct: 31  WKAGIPERLKSLTKSDFKRMLSADSPRTQPSMVRPIHVPESEDPAPDHFDFREEYPQC-- 88

Query: 111 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGY 167
           I+ ++D G C S WA  AV+A S R C+  G++      S   +L+C      +GC G  
Sbjct: 89  ITEVIDIGLCSSSWAHSAVDAFSHRRCLT-GLDQEATRYSAQYILSCAS---TNGCFGFS 144

Query: 168 PIS--AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 225
                AW +    GV  E C  Y D     +   + ++P P     C   + L    + Y
Sbjct: 145 TQGDIAWDFIATTGVPLESCVKYTD-----YNETQSSWPCPSV---CNDNSFL----EIY 192

Query: 226 SISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
               Y  +  + E +   +   GP++  F VYEDF +Y  G+Y H  G+  G  +V+++G
Sbjct: 193 KPDGYEGVGFNSERLKRAVAFRGPMQAMFAVYEDFTYYLEGIYSHTYGNRAGFLSVEIVG 252

Query: 285 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
           +GTSD+G+DYWI+ N W   WG DGYF+I RG +EC IEE     + +S
Sbjct: 253 YGTSDEGQDYWIVKNYWGPDWGEDGYFRIVRGQDECQIEEATYGAIINS 301


>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 233

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 114/202 (56%), Gaps = 15/202 (7%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           +  ++ F      V ++       L D +I  +NE+P AGWKA ++ +F  +++   + L
Sbjct: 6   VYIVSLFTLLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARIL 63

Query: 71  LGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
           +G +     +       V  HD ++++P  FD+R  WP C +IS+I DQ  CGSCWAFGA
Sbjct: 64  MGARKEDAEMKRKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGA 123

Query: 129 VEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 186
           VEA++DR CI    G +  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT   +
Sbjct: 124 VEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSE 182

Query: 187 PYFDSTGCSHPGCEPAYPTPKC 208
                   +H GC+P YP PKC
Sbjct: 183 E-------NHTGCQP-YPFPKC 196


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 153/318 (48%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+   +  N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEALPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++ AYR+ S+  +IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H   ++         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 152/324 (46%), Gaps = 38/324 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++  ++IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   +Q+  N  +     YR+ SD ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGINECDIETFVLG 454


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 153/323 (47%), Gaps = 37/323 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 92
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG    P  ++    + T   S
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLEEGIRYRLGTNRPPSSVMNMNEIYTGLGS 199

Query: 93  LK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
            + LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHP-------- 197
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P        
Sbjct: 258 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPAPPCMMHSRAM 316

Query: 198 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 257
           G      T +C    V  N +++ +      AYR+ S+ ++IM E+ +NGPV+    V+E
Sbjct: 317 GRGKRQATARCPNSHVHANDIYQVTP-----AYRLGSNEKEIMKELLENGPVQALMEVHE 371

Query: 258 DFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSW 305
           DF  Y+ G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +W
Sbjct: 372 DFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAW 431

Query: 306 GADGYFKIKRGSNECGIEEDVVA 328
           G  G+F+I RG+NEC IE  V+ 
Sbjct: 432 GERGHFRILRGTNECDIESFVLG 454


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSTFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
          Length = 215

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/214 (38%), Positives = 111/214 (51%), Gaps = 20/214 (9%)

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 147
           D   ++P+ FDAR  W +C TI  + DQG+C S WA     A +DR C+  +   N  LS
Sbjct: 1   DNYQEIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLS 60

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGC 194
             ++  CC   CG+GC GGYPI AW+ F  HG+VT       E C+PY      +D  G 
Sbjct: 61  AEEITFCC-HTCGNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGN 119

Query: 195 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           +    +P     +C R C     L  +  H Y+   Y +      I  ++   GP+E SF
Sbjct: 120 NTCSGQPMESNHRCTRMCYGNQDLDFDQDHRYTRDHYYLTY--RGIQKDVINYGPIEASF 177

Query: 254 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 286
            VY+DF  YKSG+Y K      +GGH+VKLIGWG
Sbjct: 178 DVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG 211


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 116/239 (48%), Gaps = 20/239 (8%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 150
           + +P +FDAR+ W  C  +  I DQ  CG+CWAF A   L+ R CI      N+ LS   
Sbjct: 1   MDIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEY 58

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
            + C        C GGY   +W +  + G   + C PY    G         + +  C  
Sbjct: 59  QVQC--DTMNKACQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPT 108

Query: 211 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 270
           +C   +    +   Y     R  +   +I   I   G V+  FTVY D   YKSGVYKH+
Sbjct: 109 QCKIASM---SMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHV 165

Query: 271 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 329
              V+GGHAV LIG+G  + G +YW+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 166 VSTVLGGHAVALIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 221


>gi|21697|emb|CAA46813.1| cathepsin B [Triticum aestivum]
          Length = 130

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 69/120 (57%), Positives = 81/120 (67%), Gaps = 2/120 (1%)

Query: 12  LCLTCFATFAEGVVSKLKLDSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 69
           +CLTC       +V   + D    I+Q  II+ VN +P AGW A  NP  +NYT+ QFKH
Sbjct: 11  VCLTCVCATYLQLVGAARRDHSLGIIQKDIIQTVNNHPNAGWTAGHNPYLANYTIEQFKH 70

Query: 70  LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
           +LGVKPTP GL   V  KTH +S +LPK FDARS W  CSTI +ILDQGHCGSCWAFGAV
Sbjct: 71  MLGVKPTPPGLRAAVRTKTHSRSEQLPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAV 130


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/202 (41%), Positives = 112/202 (55%), Gaps = 26/202 (12%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWA     A+SDR CI       + +S  D+++CC + CG GC+GG+PI AW+Y V  G
Sbjct: 1   SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEG 59

Query: 180 VVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWRNS---- 222
           VVT      +EC   ++   C + G EP Y        TP C ++C      ++NS    
Sbjct: 60  VVTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRC---RPGYKNSYMMD 116

Query: 223 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 282
           K Y  SAY + +    I  +I +NGPV   F VYEDF +YKSG+Y+H  G   GGHAVK+
Sbjct: 117 KRYGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKV 176

Query: 283 IGWG---TSDDGEDYWILANQW 301
           IGWG   T +    YWI+AN W
Sbjct: 177 IGWGEEXTENGTIPYWIIANSW 198


>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
           gigas]
          Length = 464

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 104/288 (36%), Positives = 135/288 (46%), Gaps = 24/288 (8%)

Query: 50  GWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQ 107
           GW+ A   +F N T  Q     +G++   +   +      H  S  +LP  FDAR  W  
Sbjct: 149 GWQTANYTRFWNLTFTQGISEHVGIETESRAKNMS---SLHSYSRDQLPIHFDARINWT- 204

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDG 165
            S I  + DQ +C S WAF  V+  +DR  I     L+  LS   L++C       GC G
Sbjct: 205 -SWIHPVRDQKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLVSCNTGRGQRGCRG 263

Query: 166 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 225
           G    AW +    G++TEEC PY  S G     C     T      C   N        Y
Sbjct: 264 GSTEKAWWFVKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLY 313

Query: 226 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIG 284
               YR+  D EDI AEIY+NGPV+ +F V  DF  Y+SGVY+H   D+     +V++IG
Sbjct: 314 VTPPYRVRQDEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIG 373

Query: 285 WG----TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
           WG           YWI  N W   WG  G F+I RG N  GIEE+V+A
Sbjct: 374 WGEKTNKKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/194 (42%), Positives = 109/194 (56%), Gaps = 22/194 (11%)

Query: 122 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 179
           SCWA  +  A+SDR CI       + +S  D+++CC + CG GC GG+ I AW YF   G
Sbjct: 1   SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQG 59

Query: 180 VVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKH 224
           VVT         C PY +   C +   EP Y        TP+C R+C +   + + + KH
Sbjct: 60  VVTGGNYNTKGSCRPY-EIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKH 118

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 284
           Y  +AY++    E I  EI +NGPV   FTVYEDFAHYK G+YKH +G   GGHAVK+IG
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIG 178

Query: 285 WGTSDDGED---YW 295
           WG+   G +   YW
Sbjct: 179 WGSEQKGSEKIPYW 192


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 146 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 204

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 205 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 262

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 263 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 321

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 322 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 381

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 382 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 441

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 442 FRIVRGVNECDIESFVLG 459


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 59/113 (52%), Positives = 89/113 (78%), Gaps = 1/113 (0%)

Query: 219 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
           ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H++G++MGGH
Sbjct: 6   YKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGH 65

Query: 279 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           A++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 66  AIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 94/252 (37%), Positives = 125/252 (49%), Gaps = 18/252 (7%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           K  +LP+ FDAR  W     I  I DQG CGS WA       SDR  I     +N SLS 
Sbjct: 254 KPRELPEHFDARDKWGH--LIHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSS 311

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 312 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 369

Query: 209 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY
Sbjct: 370 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 429

Query: 268 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           +H         +    G H+V+++GWG   ++     YW+ AN W   WG DGYFKI RG
Sbjct: 430 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRG 489

Query: 317 SNECGIEEDVVA 328
            N C IE  V+ 
Sbjct: 490 ENHCEIESFVIG 501


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 44/312 (14%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYT-------VGQFKHLLGVKPTPKGLLLGVPVKTHDK 91
            I+++N + ++ W+A   P++  +T        G  K  L  +P P      V  +T   
Sbjct: 179 FIEQIN-SAQSSWQAGVYPEYEKFTRNDLIRRAGGRKSRLPHRPRPAP----VSEETRLA 233

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 149
           + +LP+SFD R      + +S I DQG CGSC+AF ++  L  R  +  +      LS  
Sbjct: 234 AAQLPESFDWRKVM-GLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQ 292

Query: 150 DLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTP 206
           ++++C  +    GC+GG+P + A +Y    GVV EEC PY   DS+      C   Y T 
Sbjct: 293 EIVSCGKY--SQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYAT- 349

Query: 207 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
                            +  +  +    + E +  E+ KNGP+ V+F VY DF HYK GV
Sbjct: 350 ----------------NYRYVGGFYGGCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGV 393

Query: 267 YKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
           Y+H           +  HAV L+G+G   + G  +W + N W   WG +G+F+I+RG++E
Sbjct: 394 YEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWTVKNSWGEKWGEEGFFRIRRGTDE 453

Query: 320 CGIEEDVVAGLP 331
           C IE   VA  P
Sbjct: 454 CAIESIAVAADP 465


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 152/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 141 LVDQDMISAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLVP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
             +LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GERLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+             A P P+C+
Sbjct: 258 NLLSCDKHN-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGQER------NEAGPEPRCM 310

Query: 210 RKCV-----KKNQLWRNSKH-------YSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                    K+  + R   H       Y ++ AYR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQAIARCPNHHVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 371 EDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGTNECDIESFVLG 454


>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
           griseus]
 gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
          Length = 465

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 151/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPDMINAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTALGR 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP++F+A   WP  + I   LDQG+C   WAF      SDR  IH   +++  LS  
Sbjct: 199 GEVLPRAFEASEKWP--NLIQEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     +  G S      +   
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFVGREQNEAGTSSRCMMHSRAM 315

Query: 206 PKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+   +    Q+  N  +    AYR+ SD ++IM E+ +NGPV+    V+EDF  Y
Sbjct: 316 GRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLY 375

Query: 263 KSGVYKHI--------TGDVMGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGY 310
           +SG+Y H              G H+VK+ GWG     DG    YW  AN W   WG  G+
Sbjct: 376 QSGIYSHTPISQGRPEQYRRHGTHSVKITGWGEEKLPDGRTIKYWTAANSWGPWWGERGH 435

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG+NEC IE  V+ 
Sbjct: 436 FRIVRGTNECDIESFVLG 453


>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
          Length = 463

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 162/339 (47%), Gaps = 41/339 (12%)

Query: 11  ILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           I+      +  E   S+L   +H      ++ +N   K+ W A    ++   T+ +    
Sbjct: 150 IMNTAHLQSLKEKYSSRLYKYNH----EFVEAINAVQKS-WTATTYMEYETLTLREMIRR 204

Query: 71  LG--VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 128
            G   +  P+     V  +  +K L LP S+D R+ +   + +S + +Q  CGSC++F +
Sbjct: 205 GGGHSRRIPRTSPAPVTAEIREKVLHLPTSWDWRNVY-GTNFVSPVRNQASCGSCYSFAS 263

Query: 129 VEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEEC 185
           V  L  R  I      +  LS  ++++C  +    GCDGG+P + A +Y    G+V E C
Sbjct: 264 VGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEAC 321

Query: 186 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEI 243
            PY   TG   P              C+ K   +R   S+++ +  +    +   +  E+
Sbjct: 322 FPY---TGTDSP--------------CMLKEDCFRYYTSEYHYVGGFYGGCNEALMKLEL 364

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYW 295
             NGP+ V+F VY DF HY+ G+Y H TG         +  HAV L+G+GT    G DYW
Sbjct: 365 VHNGPMAVAFEVYNDFLHYQEGIYHH-TGLTDPFNPFELTNHAVLLVGYGTDPATGMDYW 423

Query: 296 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           I+ N W  +WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 424 IVKNSWGTAWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
          Length = 463

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 157/317 (49%), Gaps = 47/317 (14%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTH 89
           + +K +N   K+ W A    ++   T+G          + +   KPTP      +  +  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTP------LTAEIQ 225

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 147
            K L LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS
Sbjct: 226 QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILS 284

Query: 148 VNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 206
             ++++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P         
Sbjct: 285 SQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--------- 330

Query: 207 KCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
                C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++
Sbjct: 331 -----CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385

Query: 265 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 317
           G+Y H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG+
Sbjct: 386 GIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445

Query: 318 NECGIEEDVVAGLPSSK 334
           +EC IE   VA  P  K
Sbjct: 446 DECAIESIAVAATPIPK 462


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 36  LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVLNP 94

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 95  GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ +++M E+ +NGPV+    V+EDF  Y
Sbjct: 212 GRGKRQATAHCPNSHVNNNDIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLY 271

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 331

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 332 FRIVRGVNECDIESFVLG 349


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 92/252 (36%), Positives = 127/252 (50%), Gaps = 18/252 (7%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           K  +LP+ FD+R  W     I+ ++DQG CGS WA       SDR  I     +N SLS 
Sbjct: 194 KPRELPEHFDSRDKWGH--LINPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSS 251

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 252 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 309

Query: 209 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY
Sbjct: 310 DRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 369

Query: 268 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           +H         +    G H+V+++GWG   ++     YW+ AN W   WG DGYFKI RG
Sbjct: 370 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRG 429

Query: 317 SNECGIEEDVVA 328
            N C IE  V+ 
Sbjct: 430 DNHCEIESFVIG 441


>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
          Length = 462

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 148/310 (47%), Gaps = 38/310 (12%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV----KPTPKGLLLGVPVKTHDKSLK 94
            +K +N   +  W A    +   Y + Q     G     +P P  L  G+      K+L 
Sbjct: 176 FVKAIN-TVQDSWTATIYEEHEKYNMDQMIKRSGAHSFPRPKPAPLTHGIL----QKALT 230

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 152
           LP S+D R+     + +S + +Q  CGSC+AF ++  L  R  I    + +  LS   ++
Sbjct: 231 LPSSWDWRNV-NGVNYVSPVRNQASCGSCYAFASMAMLEARIRILTNNSKTPVLSTQQIV 289

Query: 153 ACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C  +    GCDGG+P + A +Y    GVV E C PY    G   P C P      C R 
Sbjct: 290 SCSEY--SQGCDGGFPYLIAGKYVQDFGVVEENCFPYL---GHDSP-CSPK----NCTRY 339

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH-- 269
            V        S ++ +  +    +   +  E+ +NGP+ V+F VY DF HY+ GVY H  
Sbjct: 340 YV--------SDYHYVGGFYGACNEALMKLELVENGPMAVAFEVYNDFIHYQKGVYHHTG 391

Query: 270 ----ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 324
                    +  HAV L+G+GT +  GE YWI+ N W   WG DGYF+I RG++ECGIE 
Sbjct: 392 LRDSFNPFEITNHAVLLVGYGTDEKTGEHYWIVKNSWGSYWGEDGYFRILRGTDECGIES 451

Query: 325 DVVAGLPSSK 334
             V+  P  K
Sbjct: 452 IAVSATPIPK 461


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 142/315 (45%), Gaps = 32/315 (10%)

Query: 50  GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKT----HDKSLKLPKSFDARS 103
           GWKA    ++    Y  G+   L   +P        +PVK      ++   LP  FDA  
Sbjct: 252 GWKAGNYSEWWGRKYDEGKVLRLGTFQPK-------IPVKAMKRLSNRGGPLPSHFDAAD 304

Query: 104 AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD 161
            WP+    +R  DQG CGS WA       SDRF I       + L+   LLAC       
Sbjct: 305 HWPRLVGEAR--DQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLLACVRR--QQ 360

Query: 162 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 221
            C GG+  +AW+Y    GVV +EC PY  +       C+           C     + R 
Sbjct: 361 ACSGGHLDTAWQYLRRVGVVNDECYPYIAAKN----QCKINDGDTLVSANCELPANVNRT 416

Query: 222 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----DVMG 276
           + +    AY +N++  DIM EI + G V+    VY DF  Y++G+Y+H        +   
Sbjct: 417 AMYRMGPAYSLNNE-TDIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSA 475

Query: 277 GHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            H+V+LIGWG    G D   YWI  N W   WG +G F+I RG+NEC IE  V+A  P  
Sbjct: 476 YHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNPYV 535

Query: 334 KNLVKEITSADMFED 348
              V+ + +    ++
Sbjct: 536 HQHVQTVRNVGDLQE 550


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 151/321 (47%), Gaps = 35/321 (10%)

Query: 17  FATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-----FKHLL 71
           FA     V+S      H+L     K +N+     W+A    +F+N T  +     F H  
Sbjct: 3   FAVLVVAVLSTPFYSPHLL-----KYLNKKENKLWEAGIPAKFANRTHDEVTKMFFPHAF 57

Query: 72  GVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 129
                P+    GV +   D       P   D R   P+C       DQ  C  C+AF  +
Sbjct: 58  LRPNIPR--YYGVNITEDDLYPPAGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATL 113

Query: 130 EALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECD 186
            ALS R CI       +SLSV  +++C     G+ GC GG   S+W +    G V  +C 
Sbjct: 114 GALSTRRCIAKLDPQAVSLSVQHMVSCDS---GEAGCQGGEFESSWAFLETEGAVKSDCL 170

Query: 187 PYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 245
           PY    TG S           +C   C     +  ++ HY  ++    S+  +IM  +  
Sbjct: 171 PYTSGETGKSG----------ECPTTCQDGTPV-ESAFHYKAASASRLSNYNEIMVSLLA 219

Query: 246 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
           +GPV+  F V+EDF +Y  G+Y  + G  +GGHAV ++G+G+ ++  DYWI+ N W   W
Sbjct: 220 DGPVQTGFYVHEDFLYYVGGIYHKVYGTSLGGHAVLIVGYGSMNN-HDYWIVRNSWGSDW 278

Query: 306 GADGYFKIKRGSNECGIEEDV 326
           G +GYF+I RG+NECGIE++ 
Sbjct: 279 GENGYFRILRGTNECGIEKNA 299


>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
          Length = 239

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 120/238 (50%), Gaps = 37/238 (15%)

Query: 68  KHLLGVK----PTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGH 119
           K LLG K    P    + +    KT+D     S K+PK+FDAR  W QC TI R+ DQG 
Sbjct: 15  KRLLGSKGVQIPNKNNMHM---YKTNDVAYISSGKIPKTFDARKKWVQCDTIGRVRDQGQ 71

Query: 120 CGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 177
           CGSCWA     A +DR CI      N  LS +++  CC + CG GCDGGYPI AW+ F  
Sbjct: 72  CGSCWAVSTSSAFADRLCIATDGDFNELLSADEITFCC-YTCGFGCDGGYPIKAWKQFSR 130

Query: 178 HGVVTEECDPYFDSTGCSHPGCEPAYPTPK-----------CVRKCVKKNQ--LWRNSKH 224
           HG+VT      FDS      GCEP    P            C  KC   NQ   +     
Sbjct: 131 HGLVT---GGDFDSG----EGCEPYRVPPSGSNSSNSYNHFCRGKCYGDNQNISYSEDHR 183

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 281
           Y+   Y ++ +   I  ++   GP+E SF VY+DF  YKSGVY K      +GGHAVK
Sbjct: 184 YTRDYYYLSYNA--IQKDVLLYGPIEASFEVYDDFMIYKSGVYVKSENATHLGGHAVK 239


>gi|403359042|gb|EJY79178.1| Cysteine protease [Oxytricha trifallax]
          Length = 366

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 88/294 (29%), Positives = 133/294 (45%), Gaps = 22/294 (7%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 92
            ++ +S I   N  P AG++   N  ++N+T+   K L           +G      D+ 
Sbjct: 45  QVIDESQILVHNGQPNAGFQQGANSFYTNWTLSNAKSLFQ-NSLSDTQNIGPCKSKDDEE 103

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 152
             +P+ +D R  +P C  +  +++QG+C S +   A+  ++DR C      + LS  +LL
Sbjct: 104 TIIPEKYDWREVYPDC--VQPVVNQGNCSSSYITAALSTVADRICQTTKKPIQLSAQELL 161

Query: 153 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 212
            C        CDGGY    + +    G + E+C PY    G             +C    
Sbjct: 162 DCDK--SSYQCDGGYVSRTFNWGKRKGFIPEQCYPYTGVVG-------------ECEDDH 206

Query: 213 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 272
           ++ N+   N+  Y +  Y + SD   +  EI KNGPV     +Y DF  YK GVY H T 
Sbjct: 207 LETNECRVNNMFYRVIDYCLASDELGLKKEILKNGPVVAQMVIYTDFLTYKEGVY-HRTE 265

Query: 273 DVM---GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 323
           D     G H VK++GW    DG D+WI+ N W   WG DGY KI       G++
Sbjct: 266 DAFKFNGQHVVKIVGWDRQGDGNDFWIVENSWGSDWGEDGYVKILASDKSTGLD 319


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 158/330 (47%), Gaps = 46/330 (13%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-------VK 74
           E + S+L   +H      +K++NE  K+ W A   P++   T+       G       ++
Sbjct: 157 EMLTSRLYNYNH----DFVKQINEVQKS-WTATAYPEYEGMTIEDLIRRAGGRNSRIPMR 211

Query: 75  PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 134
           P P       P+ T +K   LP  +D R+     + ++ + +Q  CGSC+AF ++  L  
Sbjct: 212 PRP------APLPTDEKYQGLPTEWDWRNI-AGYNFVTPVRNQASCGSCYAFSSMGMLES 264

Query: 135 RFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDS 191
           R  I   ++    LS   +++C  +    GC+GG+P + A +Y   +G+V E   PY   
Sbjct: 265 RIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY--- 319

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           TG   P          C  K     Q +  ++++ +  +    +   +  E+   GP+ V
Sbjct: 320 TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGPLSV 367

Query: 252 SFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRS 304
           +F VY+DF HY+SGVY H           +  HAV L+G+GT    GE YWI+ N W  S
Sbjct: 368 AFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGES 427

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           WG  GYF+I+RG++EC IE   V+  P  K
Sbjct: 428 WGEKGYFRIRRGTDECAIESIAVSAEPIIK 457


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 151/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 141 LVDQDMINAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLRP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEAAEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A P P+C+
Sbjct: 258 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFVGREQ------DEAGPAPRCM 310

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   + +  N  +    AYR+ ++ ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATARCPSSHVHANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 371 EDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGANECDIESFVLG 454


>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 487

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 104/317 (32%), Positives = 148/317 (46%), Gaps = 21/317 (6%)

Query: 45  ENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVK-THDKSLKLPKSFDAR 102
           ++ + GW A     F   T     K  LG    P+ +L  VP+K    +  +LP SFD R
Sbjct: 170 QSRQFGWSAKNYSVFWGVTYDNGLKWRLGTLQPPEKILQVVPLKAVFHQDYQLPSSFDLR 229

Query: 103 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 160
             +     I+  +DQG CG+ WA    +  +DRF I     M  +LS   LL+C   L  
Sbjct: 230 KVFG--DKITDPIDQGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-Q 286

Query: 161 DGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 219
            GC GG+  SAW + +  G+VTEEC P+   +T C+               +  K + L 
Sbjct: 287 RGCQGGHLTSAWNWVMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLR 346

Query: 220 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMG 276
           R    Y ++        E IM EI   G V+    V ++F  Y+SGVY+      G   G
Sbjct: 347 RVGLMYRVAT------EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTG 400

Query: 277 GHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 333
            H V+++GWG          YWI++N W   WG  GYF+I +G+NEC IE+ VVA +   
Sbjct: 401 YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMADI 460

Query: 334 KNLVKEITSADMFEDAS 350
            N    I+     E+AS
Sbjct: 461 GNFC-SISDKSFRENAS 476


>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 156/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
 gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
          Length = 467

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 152/324 (46%), Gaps = 38/324 (11%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++  ++IK +N     GW+A  +  F   T+ +  ++ LG ++P+   + +        +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRCRGVVSDNCYPF-----SGREQNDEASPTPRCM 310

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   + +  N  +     YR+ SD ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSHVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W   
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPW 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGTNECDIETFVLG 454


>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Nomascus leucogenys]
          Length = 467

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 152/318 (47%), Gaps = 27/318 (8%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +IK +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +       + 
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVLNP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 205
           +LL+C       GC GG    AW +    GVV++ C P+     D  G + P    +   
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316

Query: 206 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 262
            +  R+      N    N+  Y ++  YR+ S+ +++M E+ +NGPV+    V+EDF  Y
Sbjct: 317 GRGKRQATAHCPNSHVNNNDIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLY 376

Query: 263 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGY 310
           K G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G+
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGH 436

Query: 311 FKIKRGSNECGIEEDVVA 328
           F+I RG NEC IE  V+ 
Sbjct: 437 FRIVRGVNECDIESFVLG 454


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 91/257 (35%), Positives = 129/257 (50%), Gaps = 21/257 (8%)

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 149
           ++++P+SFDAR  W  CSTI +I D+  C + WA   V+++SDR CI     +++ LS  
Sbjct: 25  NMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSAR 84

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE-- 200
           D ++ CGF    GC  G  +    Y++ +G+VT         C PY       HP     
Sbjct: 85  DAIS-CGF--SPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFL 141

Query: 201 ----PAYPTPKCVRKCVK-KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
                 +  P+C  +C    N+ + + K Y    Y +    EDI  EI  NGPV  S +V
Sbjct: 142 DCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASISV 201

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
             DF  YKSGVY        +G   +++IGWG  +    YW+ AN WN  WGA+GY KI+
Sbjct: 202 NTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGY-EGKIPYWLCANSWNEEWGANGYVKIQ 260

Query: 315 RGSNECGIEEDVVAGLP 331
           RG     IE  V A +P
Sbjct: 261 RGVQAGYIESYVRAPIP 277


>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
          Length = 463

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 156/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 108/344 (31%), Positives = 157/344 (45%), Gaps = 44/344 (12%)

Query: 6   LIMDPILCLTCFATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 65
           L +  I+ L+C+ T      +KL  D +++Q +I  E N       KA  N   ++    
Sbjct: 5   LFLMSIMLLSCYLTEQ----AKLSRD-NMIQTNI--ETNT-----LKALDNIDLNS---A 49

Query: 66  QFKHLL-----GVKPTPKGLLLGVPVKTHDKSL----KLPKSFDARSAWPQCSTISRILD 116
           + +HL+     GV  T K  LL    KT D       K+ K FDAR  W QC TI  + +
Sbjct: 50  KEEHLMLLGKRGVAATFKSKLL---YKTRDPRYVAYGKISKEFDARKHWSQCKTIGEVYN 106

Query: 117 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 174
            G+    WA+    A +DR C+  +   N  LS   L++C G       D      AW++
Sbjct: 107 DGNSDLSWAYATTGAFADRMCVATNGSYNQLLSTEQLISCSGIKSNAMADD----QAWKF 162

Query: 175 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSIS 228
           F   G+V+     Y  + GC      P +  PK      C   C   + +  N  H  +S
Sbjct: 163 FKKQGLVS--GGKYNTNDGCQPSKIPPIFNLPKKIYNRTCDNFCYGNSLIDYNHDHVKVS 220

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGT 287
            Y  +   ++I  E+   GPV   F++Y+D   Y SGVY        +   + KLIGWG 
Sbjct: 221 -YTYHVLYKNIQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGV 279

Query: 288 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
            ++G DYW+L N W   WG +G FKIKRG++EC       AG+P
Sbjct: 280 -ENGVDYWLLVNSWGNEWGQNGLFKIKRGTDECQFGRHTYAGVP 322


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 94/255 (36%), Positives = 126/255 (49%), Gaps = 24/255 (9%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           K  +LP+ FDAR  W     I  + DQG CGS WA       SDR  I     +N SLS 
Sbjct: 198 KPRELPEHFDARDKWGH--LIHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSS 255

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC---EPAYPT 205
             LL+C       GC+GGY   AW Y    GVV + C PY          C   +  Y  
Sbjct: 256 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTN 314

Query: 206 PKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 264
            + +R C   +Q   +S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  
Sbjct: 315 RQGLR-CPSGDQ---DSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAG 370

Query: 265 GVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKI 313
           GVY+H         +    G H+V+++GWG   ++     YW+ AN W   WG DGYFKI
Sbjct: 371 GVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKI 430

Query: 314 KRGSNECGIEEDVVA 328
            RG N C IE  V+ 
Sbjct: 431 LRGENHCEIESFVIG 445


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 89/256 (34%), Positives = 121/256 (47%), Gaps = 32/256 (12%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           LP  FDAR  +  C   I  + DQG CG+CWA    E L+DR CI     +   LS   +
Sbjct: 33  LPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGYV 92

Query: 152 LACC----GFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPY------ 188
            +CC    G L   GC+GG  + A  +   HGVVT             + C PY      
Sbjct: 93  TSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCN 152

Query: 189 -FDSTGCSHPGCEPAY--PTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEI 243
              + G  +P C+     P P C   C  K   +      H + S  ++ +D + I  EI
Sbjct: 153 HVPTEGTGYPKCKDVVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEI 212

Query: 244 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 303
           + NGPV  +F +Y+DF +YKSGVY   T +V   H +K+IGWG +D   +YW+  N WN 
Sbjct: 213 FDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWG-ADSVREYWLAMNAWNE 271

Query: 304 SWGADGYFKIKRGSNE 319
            WG  G  K+  G N 
Sbjct: 272 EWGDHGLIKMAFGKNR 287


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 101/324 (31%), Positives = 150/324 (46%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 141 LVDQDMINAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTSMNEIHTVLGP 199

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG+   AW +    GVV++ C P+           + A P P+C+
Sbjct: 258 NLLSC-DTHNQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPAPRCM 310

Query: 210 ----------RKCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+       +++  N  +    AYR+ S  ++IM E+ +NGPV+    V+
Sbjct: 311 MHSRAMGRGKRQATAHCPNSRVHTNDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 370

Query: 257 EDFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ GVY H              G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 371 EDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 430

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 431 WGERGHFRIVRGANECDIESFVLG 454


>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
          Length = 463

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 155/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLSVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
          Length = 466

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 159/331 (48%), Gaps = 35/331 (10%)

Query: 18  ATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK--- 74
           A   EG+  K     +      +K +N   K+ W A    ++   T+ +     G +   
Sbjct: 156 AAHLEGLQEKYSNRLYKYNHDFVKAINAVQKS-WTATTYLEYETLTLREMIRRSGGRRQR 214

Query: 75  -PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
            P PK   L   +  H+K L+LP S+D R+     + ++ + +Q  CGSC++F ++  L 
Sbjct: 215 LPRPKPAPLTAEI--HEKLLRLPTSWDWRNV-HGTNFVTPVRNQASCGSCYSFASMGMLE 271

Query: 134 DRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFD 190
            R  I      S  LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY  
Sbjct: 272 ARIRILTNNTQSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY-- 327

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
            TG   P C       K    C++    +  S+++ +  +    +   +  E+  +GP+ 
Sbjct: 328 -TGTDSP-C-------KMKEDCIR----YYTSEYHYVGGFYGGCNEALMKLELVHHGPMA 374

Query: 251 VSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNR 303
           V+F VY+DF HY  G+Y H           +  HAV L+G+GT    G DYWI+ N W  
Sbjct: 375 VAFEVYDDFLHYNQGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPKTGLDYWIVKNSWGT 434

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           SWG  GYF+I+RG++EC IE   +A  P  K
Sbjct: 435 SWGEQGYFRIRRGTDECAIESIAMAATPIPK 465


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 74/171 (43%), Positives = 100/171 (58%), Gaps = 15/171 (8%)

Query: 174 YFVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 219
           Y V  G+VT         C PY        T   +P C    Y TP+C +KC K  +  +
Sbjct: 9   YLVKRGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPY 68

Query: 220 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 279
              K+Y    Y + S+ + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA
Sbjct: 69  EQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHA 128

Query: 280 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 330
           +++IGWG  +    YW++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 129 IRIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 178


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 134/278 (48%), Gaps = 25/278 (8%)

Query: 72  GVKPTPKGLLLGVPVKTHDKSL-------KLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           GV+ T K  +L    KT ++         ++ + FDAR  WP C TI  + + G+    W
Sbjct: 63  GVEATSKSKMLH---KTRNRRCFRVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 182
           A+      +DR CI      N  LS  +L++C G +  D          W Y  +HG+V+
Sbjct: 120 AYVPTGVFADRMCIATNGTYNQLLSTEELISCSG-IKEDEFGSVNDDYVWEYLKNHGLVS 178

Query: 183 EECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDP 236
                Y  + GC      P    P       C ++C   N +  N  H  I  +  + + 
Sbjct: 179 --GGKYNTNNGCQPSKIPPIGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIEY 235

Query: 237 EDIMAEIYKNGPVEVSFTVYE-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGEDY 294
           EDI  E+   GPV ++F V++ DF  YKSGVY+  T  + +     KLIGWG  ++G DY
Sbjct: 236 EDIQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVDY 294

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           W+L N W   WG +G FKIKRG++EC IE  V AG P 
Sbjct: 295 WLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
          Length = 274

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/258 (35%), Positives = 123/258 (47%), Gaps = 25/258 (9%)

Query: 32  SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 90
           ++ L++S I+ +N+     W A  N   S      F  +LG K           + KTHD
Sbjct: 17  AYFLEESYIEMINDVATT-WTAGVNFDPST-PEKDFIKMLGSKGVEAAKNASAHMFKTHD 74

Query: 91  -----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 143
                 +  +P++FDAR  W  C TI  + DQGHCGSCWA     A +DR C+  +   N
Sbjct: 75  VANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFN 134

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------D 190
             LS  ++  CC   CG GC+GGYPI AW+YF  HG+VT       E C+PY       D
Sbjct: 135 ELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQD 193

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
             G S    +P     +C R C     L  N  H     Y   +    I  ++   GP+E
Sbjct: 194 EEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNEDHRFTRDYYYLT-YGSIQKDVMNYGPIE 252

Query: 251 VSFTVYEDFAHYKSGVYK 268
            SF VY+DF  YKSGVY+
Sbjct: 253 ASFDVYDDFPSYKSGVYQ 270


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 136/279 (48%), Gaps = 27/279 (9%)

Query: 72  GVKPTPKGLLLGVPVKTHDKSL-------KLPKSFDARSAWPQCSTISRILDQGHCGSCW 124
           GV+ T K  +L    KT ++         ++ + FDAR  WP C TI  + + G+    W
Sbjct: 63  GVEATSKSKMLH---KTRNRRCFSVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSW 119

Query: 125 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVV 181
           A+      +DR CI      N  LS  +L++C G    + G    Y +  W Y  +HG+V
Sbjct: 120 AYVPTGVFADRMCIATNGTYNQLLSTEELISCSGIKEDEFGSVNDYYV--WEYLKNHGLV 177

Query: 182 TEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSD 235
           +     Y  + GC      P    P       C ++C   N +  N  H  I  +  + +
Sbjct: 178 S--GGKYNTNNGCQPSKIPPIGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIE 234

Query: 236 PEDIMAEIYKNGPVEVSFTVYE-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGED 293
            EDI  E+   GPV ++F V++ DF  YKSGVY+  T  + +     KLIGWG  ++G D
Sbjct: 235 YEDIQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVD 293

Query: 294 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 332
           YW+L N W   WG +G FKIKRG++EC IE  V AG P 
Sbjct: 294 YWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/197 (40%), Positives = 107/197 (54%), Gaps = 19/197 (9%)

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 196
           +L  CC   CG GC GGYPI AW+ F +HG+VT       E C+PY      +D  G + 
Sbjct: 1   ELTFCC-HTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNT 59

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 255
              +P     +C R C    +L  +  H Y+   Y +      I  ++   GP+E SF V
Sbjct: 60  CAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDV 117

Query: 256 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 314
           Y DF  YKSG+Y+       +GGHAVKLIGWG    G  YW++ N WN  WG +G FKI+
Sbjct: 118 YSDFPSYKSGIYERTENATYLGGHAVKLIGWG-EQYGIPYWLMVNSWNEDWGDNGLFKIR 176

Query: 315 RGSNECGIEEDVVAGLP 331
           RG+NECG++    AG+P
Sbjct: 177 RGTNECGVDNSTTAGVP 193


>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
          Length = 463

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 154/314 (49%), Gaps = 43/314 (13%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 90
            +K +N   K+ W A    ++   T+        G  + +   KP P      +  + H+
Sbjct: 174 FVKAINAIQKS-WTATTYMEYETLTLREMIRRGGGHSRRIPRPKPAP------LTAEIHE 226

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           K L LP S+D R+     + ++ + +Q  CGSC++F ++  L  R  I      +  LS 
Sbjct: 227 KLLHLPASWDWRNV-HGTNFVTPVRNQASCGSCYSFASMGMLEARIRILTNNTQTPILSP 285

Query: 149 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
            ++++C  +    GCDGG+P + A +Y    G+V E C PY   TG   P C+P      
Sbjct: 286 QEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP-CKPK---ED 336

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
           CVR        + +S+++ +  +    +   +  E+  +GP+ V+F VY DF HY+ G+Y
Sbjct: 337 CVR--------YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYNDFLHYRKGIY 388

Query: 268 KH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            H           +  HAV L+G+GT    G DYWI+ N W   WG DGYF+I+RG++EC
Sbjct: 389 YHTGLRDPFNPFELTNHAVLLVGYGTDPVSGMDYWIVKNSWGIGWGEDGYFRIRRGTDEC 448

Query: 321 GIEEDVVAGLPSSK 334
            IE   VA  P  K
Sbjct: 449 AIESIAVAATPIPK 462


>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
           leucogenys]
          Length = 463

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 154/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY+ G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYEKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|403365594|gb|EJY82586.1| Cathepsin B [Oxytricha trifallax]
          Length = 333

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 84/269 (31%), Positives = 134/269 (49%), Gaps = 21/269 (7%)

Query: 55  RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP--VKTHDKSLKLPKSFDARSAWPQCSTIS 112
            NP F  Y    F+ LLG+      L L      K     + +PK++D+R  +  C  I 
Sbjct: 64  ENP-FKGYAKEDFQSLLGISKRAPSLFLADSSFYKPKANGVTIPKTYDSRKIYKNC--IH 120

Query: 113 RILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 170
            +LDQ  C +CWAF   + +SDRFCI  +   ++ LS  +L++C       GC  G    
Sbjct: 121 GVLDQVKCSACWAFAIAQVVSDRFCIVSNSTTDVVLSYQNLISCVNPKIF-GCKIGVIDV 179

Query: 171 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-- 228
           A++Y    G+++++C PY    G       P      C  KC   N    +++ Y     
Sbjct: 180 AFQYMEKTGIMSDQCMPYTAQEG-------PNATIEACRTKC---NNASDSNRKYQCKKG 229

Query: 229 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 288
           ++++    +DI A +   G + V+F V+EDF +Y+ G+Y++ TG+++G HA KLIGWG  
Sbjct: 230 SFKVAQGADDIKAMLVDKGSIFVTFDVFEDFFNYRRGIYRYTTGELVGYHACKLIGWGYD 289

Query: 289 -DDGEDYWILANQWNRSWGADGYFKIKRG 316
                +Y+I+ N W   WG  G+F +  G
Sbjct: 290 WFRDTNYYIIENSWGTEWGMKGFFNVAVG 318


>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
          Length = 463

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/332 (30%), Positives = 163/332 (49%), Gaps = 37/332 (11%)

Query: 18  ATFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG----V 73
            T  E +V K     +    + +K +N   K+ W A    ++   T+ +     G    +
Sbjct: 153 TTHLENLVEKYSNKLYKYDHNFVKAINAIQKS-WTATTYMEYETLTLKEMIRRRGGFNQL 211

Query: 74  KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 133
            P PK + L   ++   K L+LP S+D R+     + ++ + +QG CGSC++F +V  L 
Sbjct: 212 VPRPKPVPLTAEIQR--KILQLPASWDWRNV-NGINFVTPVRNQGSCGSCYSFASVGMLE 268

Query: 134 DRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFD 190
            R  I      +  LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY  
Sbjct: 269 ARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEESCFPY-- 324

Query: 191 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 250
             G   P C       K  + CV+    +  S+++ +  +    +   +  E+ ++GP+ 
Sbjct: 325 -KGIDVP-C-------KVKKDCVR----YYTSEYHYVGGFYGGCNEALMKLELVQHGPMA 371

Query: 251 VSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYWILANQWN 302
           V+F VY+DF HY  G+Y H TG         +  HAV L+G+GT    G DYWI+ N W 
Sbjct: 372 VAFEVYDDFLHYHKGIY-HRTGLRDPFNPFELTNHAVLLVGYGTDPVSGRDYWIVKNSWG 430

Query: 303 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
             WG DGYF+I RG++EC IE   +A  P  K
Sbjct: 431 TGWGEDGYFRILRGTDECAIESIAMAATPIPK 462


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/324 (31%), Positives = 148/324 (45%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW A  +  F   T+ +  ++ LG ++P+     +         
Sbjct: 128 LVDQDMINAINQG-NYGWWAGNHSAFWGMTLDEGIRYRLGTMRPSSSVTNMNEIHTVLRP 186

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  
Sbjct: 187 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 244

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A P P+C+
Sbjct: 245 NLLSC-DTHNQRGCHGGRLDGAWWFLRRRGVVSDHCYPFVGREQ------DEAGPAPRCM 297

Query: 210 RKCVKKNQLWR-------------NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                  +  R             N  +    AYR+ S+ ++IM E+ +NGPV+    V+
Sbjct: 298 MHSRAMGRGKRQATARCPSSHAHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 357

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+SG+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 358 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 417

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 418 WGERGHFRIVRGANECDIESFVLG 441


>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
          Length = 469

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 154/318 (48%), Gaps = 35/318 (11%)

Query: 29  KLDSHILQD--SIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVK-PTPKGLLLGV 84
           +L   + Q+    +  +N   KA WKA    ++   T V  FK   G   P P+     +
Sbjct: 168 RLPKKLYQNHPDFVSTINSAQKA-WKATTYEEYETLTLVEMFKRSGGRSFPNPRPKPAPL 226

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 144
             +  +++  LPKS+D R      + +S + +Q  CGSC++F ++  L  R  I    + 
Sbjct: 227 SPELANQASSLPKSWDWRDVH-GVNYVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQ 285

Query: 145 S--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCE 200
           +  LS   +++C  +    GCDGG+P + A +Y    GVV E+C PY    T C      
Sbjct: 286 TPILSTQQIVSCSEY--SQGCDGGFPYLIAGKYTQDFGVVEEDCFPYTARDTQC------ 337

Query: 201 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
              P  +C R        +  S +  +  +    +   +  E+ ++GP+ V+F VY DF 
Sbjct: 338 --VPKKECPR--------YYASDYQYVGGFYGGCNEALMKLELVRHGPMAVAFEVYNDFL 387

Query: 261 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 313
           HY+ GVY H           +  HAV L+G+GT    G DYWI+ N W  +WG DGYF+I
Sbjct: 388 HYREGVYHHTGLRDPFNPFELTNHAVLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFRI 447

Query: 314 KRGSNECGIEEDVVAGLP 331
           +RGS+EC IE   VA  P
Sbjct: 448 RRGSDECAIESIAVAATP 465


>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
          Length = 463

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 155/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIF 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 121/250 (48%), Gaps = 33/250 (13%)

Query: 87  KTHDKSLKL--PKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D S K+  P+ FDAR  +  C+  I  + DQG+C S WA       SDR CI     
Sbjct: 16  KTVDISYKIDIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQ 75

Query: 144 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +LL+C     GD    GCDGG    AW   +  G+VT       E C PY  
Sbjct: 76  FTDNLSAQNLLSC-----GDEEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPY-K 129

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 239
              C+H G      C     T    C  KCV KN    + +  H +   Y  + ++ + I
Sbjct: 130 IRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189

Query: 240 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 299
             EI   GPV     VYE+F  YK G+YK   G+++G H VKLIGWG   DG +YW+  N
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWLAMN 249

Query: 300 QWNRSWGADG 309
            WN +WG +G
Sbjct: 250 SWNSNWGTNG 259


>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
 gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 155/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIF 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
 gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
          Length = 569

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/328 (30%), Positives = 147/328 (44%), Gaps = 67/328 (20%)

Query: 50  GWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG-----VPVKTHD-------------- 90
           GW A   PQF   T     +L G     K L LG      P+   D              
Sbjct: 255 GWSAQAYPQFEEMTEADLINLSG---GWKSLFLGHWNKWRPIGLDDAESFESTSDNFAIA 311

Query: 91  ------KSLKLPKSFDARSAWPQC---STISRILDQGHCGSCWAFGAVEALSDRFCIHFG 141
                 +  KLPK+FD    W      + +  + +Q  CGSC+AF AV A+  R  I   
Sbjct: 312 NQELLNQVEKLPKNFD----WSNVDGENYVPDVKNQMACGSCYAFAAVTAIESRIRIQSR 367

Query: 142 MNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 199
            N+   L+V D+++C  +     C GG P +  R+     +V E C PY  S   +    
Sbjct: 368 NNVREPLAVQDIVSCSPY--AQKCHGGIPYAVGRHLRDFNLVPESCFPYKGSENVA---- 421

Query: 200 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 259
                   C  KC     + + +K+  +S Y   S+  ++M EIY++GP+  S+ +Y DF
Sbjct: 422 --------CSSKCKNPEYIVKVTKYRYVSDYYGGSNYANMMKEIYEHGPISASYLIYPDF 473

Query: 260 AHYKSGVYKH-----------ITGDVMG----GHAVKLIGWGTS-DDGEDYWILANQWNR 303
            +Y  G+YKH           I  ++ G     H+V + GWG     GE YW + N W+ 
Sbjct: 474 KYYSKGIYKHSGKGYPMKTDRINREMNGWEPTTHSVVITGWGEDPKTGEKYWNVLNSWSE 533

Query: 304 SWGADGYFKIKRGSNECGIEEDVVAGLP 331
           SWG +G F+IKRG++EC IE + VA  P
Sbjct: 534 SWGENGRFRIKRGNDECAIEAEGVAFYP 561


>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
          Length = 455

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 146/316 (46%), Gaps = 47/316 (14%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH--------- 89
            I+ +N+  ++ WKA   P+   +T  +  +  G      G    +P++ H         
Sbjct: 166 FIETINK-VQSSWKAVPYPELETFTREELFNRAG------GFASRIPIRVHPTNVDPELA 218

Query: 90  DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 147
            K+  LP+ +D R+     + +S + +QG CGSC+ F  +  L  R  I    + S  LS
Sbjct: 219 KKAAALPELWDWRNV-EGVNFVSPVRNQGSCGSCYCFATMGMLEARLRILTNNSQSPVLS 277

Query: 148 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPT 205
              +++C  +    GCDGG+P    +Y    G+V E C PY   DS       C   Y  
Sbjct: 278 PQQVVSCSEY--SQGCDGGFPYLTGKYVQDFGIVDESCFPYMGKDSPCGISQSCRRGYA- 334

Query: 206 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 265
                           +++  +  +        +M E+ KNGP+ V+  VY DF  YK G
Sbjct: 335 ----------------AEYKYVGGFYGGCSEAAMMVELVKNGPMAVALEVYSDFMSYKGG 378

Query: 266 VYKH--ITGDV----MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSN 318
           +Y H  +T  V    +  HAV L+G+G     G+ YWI+ N W  SWG DGYF+I+RGS+
Sbjct: 379 IYHHTGLTDHVNPFELTNHAVLLVGYGRCHMTGQKYWIVKNSWGSSWGEDGYFRIRRGSD 438

Query: 319 ECGIEEDVVAGLPSSK 334
           EC IE   VA  P  K
Sbjct: 439 ECAIESIAVAASPIPK 454


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 144/301 (47%), Gaps = 26/301 (8%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+   + +       +    LP +F+A   WP  
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEALPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 220
               AW +    GVV++ C P+     D  G + P    +    +  R+      N    
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVN 302

Query: 221 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 274
           N+  Y ++ AYR+ S+  +IM E+ +NGPV+    V+EDF  YK G+Y H   ++     
Sbjct: 303 NNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPER 362

Query: 275 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
               G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+
Sbjct: 363 YRRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVL 422

Query: 328 A 328
            
Sbjct: 423 G 423


>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
 gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYKEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKVL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNI-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/119 (54%), Positives = 85/119 (71%), Gaps = 1/119 (0%)

Query: 216 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 275
           N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++
Sbjct: 3   NVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALL 62

Query: 276 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 63  GGHAVRLLGWGEENN-VPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 120


>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
          Length = 455

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 155/314 (49%), Gaps = 43/314 (13%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTV--------GQFKHLLGVKPTPKGLLLGVPVKTHD 90
            +K +N   K+ W AA   ++   T+        G  + +   KP P      +  +   
Sbjct: 166 FVKAINAIQKS-WTAAPYAEYETLTLKEMIRRGGGHSRRIPRPKPAP------ITAEIQK 218

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
           K L LPKS+D R+     + ++ + +QG CGSC++F ++  +  R  I      +  LS 
Sbjct: 219 KILHLPKSWDWRNV-HGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 277

Query: 149 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 207
            ++++C  +    GC+GG+P + A +Y    G+V E+C PY   TG   P C       K
Sbjct: 278 QEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP-C-------K 324

Query: 208 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
               C +    + +S+++ +  +    +   +  E+   GP+ V+F VY DF HY+ GVY
Sbjct: 325 LKEGCFR----YYSSEYHYVGGFYGGCNEALMKLELVHRGPMAVAFEVYNDFLHYRQGVY 380

Query: 268 KH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
            H           +  HAV L+G+GT +  G DYWI+ N W  SWG DGYF+I+RG++EC
Sbjct: 381 HHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGEDGYFRIRRGTDEC 440

Query: 321 GIEEDVVAGLPSSK 334
            IE   +A  P  K
Sbjct: 441 AIESIALAATPIPK 454


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 150/315 (47%), Gaps = 46/315 (14%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-------VKPTPKGLLLGVPVKTHDK 91
            +K++N   K+ W A+  P++   ++       G       V+P P       P+ T  K
Sbjct: 170 FVKQINTVQKS-WTASVYPEYEGMSIEDLVRRAGGRNSRIPVRPRP------APMPTDQK 222

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 149
              LP  +D R+     + +S + +QG CGSC+AF ++  L  R  I   ++    LS  
Sbjct: 223 YQGLPNEWDWRNI-AGFNFVSPVRNQGSCGSCYAFASMGMLESRIQIQSQLSQKPILSPQ 281

Query: 150 DLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
            +++C  +    GCDGG+P + A +Y    G+V E   PY    G   P           
Sbjct: 282 QVVSCSNY--SQGCDGGFPYLIAGKYLNDFGIVEESDFPYI---GSDSP----------- 325

Query: 209 VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
              C  K+  Q +  ++++ +  +    +   +  E+   GP+ V+F VY+DF HY+SGV
Sbjct: 326 ---CTLKDSYQRYYTAEYHYVGGFYGGCNEAYMKLELVLGGPLSVAFEVYDDFIHYRSGV 382

Query: 267 YKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNE 319
           Y H           +  HAV L+G+GT    GE YWI+ N W  SWG  G+F+I+RGS+E
Sbjct: 383 YHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDE 442

Query: 320 CGIEEDVVAGLPSSK 334
           C IE   V+  P  K
Sbjct: 443 CAIESIAVSANPIIK 457


>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 470

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 155/323 (47%), Gaps = 46/323 (14%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-------VK 74
           E + S+L   +H      +K++NE  K+ W A   P++   T+       G       ++
Sbjct: 157 EMLTSRLYNYNH----DFVKQINEVQKS-WTATAYPEYEGMTIEDLIRRAGGRNSRIPMR 211

Query: 75  PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 134
           P P       P+ T +K   LP  +D R+     + ++ + +Q  CGSC+AF ++  L  
Sbjct: 212 PRP------APLPTDEKYQGLPTEWDWRNI-AGYNFVTPVRNQASCGSCYAFSSMGMLES 264

Query: 135 RFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDS 191
           R  I   ++    LS   +++C  +    GC+GG+P + A +Y   +G+V E   PY   
Sbjct: 265 RIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY--- 319

Query: 192 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 251
           TG   P          C  K     Q +  ++++ +  +    +   +  E+   GP+ V
Sbjct: 320 TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGPLSV 367

Query: 252 SFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRS 304
           +F VY+DF HY+SGVY H           +  HAV L+G+GT    GE YWI+ N W  S
Sbjct: 368 AFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGES 427

Query: 305 WGADGYFKIKRGSNECGIEEDVV 327
           WG  GYF+I+RG++EC IE   V
Sbjct: 428 WGEKGYFRIRRGTDECAIESIAV 450


>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
 gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
 gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
 gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
 gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
 gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
          Length = 463

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKLL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
 gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
          Length = 273

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/265 (39%), Positives = 134/265 (50%), Gaps = 33/265 (12%)

Query: 87  KTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 143
           K + K+L LP+SFDAR+ WP C+  I    DQG+CGSCWA    E +SDR CI  G  ++
Sbjct: 10  KFNPKALGLPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEID 69

Query: 144 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 196
             LS   LLAC       GC+GG    A+ +   +GVVT         C PY  +  C H
Sbjct: 70  AELSPFQLLACA--QGSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAP-CHH 126

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED----IMAEIYKNGPV-EV 251
           P CE  +PTP C   CV  +     +   S     I   P      +  EIY NGPV   
Sbjct: 127 P-CE-VFPTPACPATCVGGSNDGVQNGKASFKVKAIVDCPSFDYGCVANEIYHNGPVSSY 184

Query: 252 SFTVYEDFAHYKSGVYKHI-----TGDVMGGHAVKLIGWGTSD----DGED-YWILANQW 301
           +  +YE+F  YKSGV++        G   GGH VK+IGWG +D    +GE  YWI+ N W
Sbjct: 185 AGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEGEGYYWIVVNSW 244

Query: 302 NRSWGADGYFKIKRGSNECGIEEDV 326
             +WG DG  +I  G  E GI   V
Sbjct: 245 -LNWGDDGVGRIAVG--EVGIGAGV 266


>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
          Length = 316

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 165/340 (48%), Gaps = 44/340 (12%)

Query: 14  LTCFATFAEGVVSKLKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 70
           + C+A    G+VS  +  S+ L     + +K +N   K+ W A    ++   T+G     
Sbjct: 1   MMCWA--GTGLVSPERRYSNRLYKYDHNFVKAINAIQKS-WTATTYMEYETLTLGDMIRR 57

Query: 71  LGVK----PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 126
            G      P PK   L   ++   K L LP S+D R+     + +S + +Q  CGSC++F
Sbjct: 58  SGGHSRKIPRPKPAPLTAEIQ--QKILHLPTSWDWRNV-HGINFVSPVRNQASCGSCYSF 114

Query: 127 GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTE 183
            ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +Y    G+V E
Sbjct: 115 ASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEE 172

Query: 184 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMA 241
            C PY   TG   P              C  K   +R  +S+++ +  +    +   +  
Sbjct: 173 ACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKL 215

Query: 242 EIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDY 294
           E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+GT S  G DY
Sbjct: 216 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDY 275

Query: 295 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           WI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 276 WIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 315


>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
          Length = 463

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 154/313 (49%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K  
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPAPLTAEIQ--QKIF 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F +V  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P ++A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY++G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W  SWG DGYF+I RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIHRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 148/319 (46%), Gaps = 38/319 (11%)

Query: 39  IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 96
           I+    +N    W+A  +  F   T+ +  ++ LG ++P+   + +       +    LP
Sbjct: 114 ILGTYWDNCNRCWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLP 173

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 154
            +F+A   WP  + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C
Sbjct: 174 TAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC 231

Query: 155 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----- 209
                  GC GG    AW +    GVV++ C P+           + A PTP C+     
Sbjct: 232 DTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRA 284

Query: 210 -----RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 261
                R+      N    N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  
Sbjct: 285 MGRGKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFL 344

Query: 262 YKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADG 309
           YK G+Y H    +         G H+VK+ GWG  T  DG    YW  AN W  +WG  G
Sbjct: 345 YKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERG 404

Query: 310 YFKIKRGSNECGIEEDVVA 328
           +F+I RG NEC IE  V+ 
Sbjct: 405 HFRIVRGVNECDIESFVLG 423


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 73/198 (36%), Positives = 110/198 (55%), Gaps = 19/198 (9%)

Query: 122 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----- 174
           SCWA  + EA+SD  C+     + + +S +D+L+CCG  CG GC GG+ I A+++     
Sbjct: 1   SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60

Query: 175 --FVHHGVVTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSK 223
             +         C P   S    +   +P Y        PTPKC + C +K  + ++  K
Sbjct: 61  CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDK 120

Query: 224 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 283
           H++  AY + ++   I  EIYKNGPV  +F VY+DF++YK G+Y H  G   G HAVK++
Sbjct: 121 HFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 180

Query: 284 GWGTSDDGEDYWILANQW 301
           GWG  ++  DYW++AN W
Sbjct: 181 GWG-RENATDYWLIANSW 197


>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
          Length = 463

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   + L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QRIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|226472634|emb|CAX71003.1| hypotherical protein [Schistosoma japonicum]
          Length = 458

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 158/326 (48%), Gaps = 46/326 (14%)

Query: 28  LKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLG 83
           L+LD + L       IK +N    + WKA   P++S YT+ + +   G  + T K   + 
Sbjct: 147 LQLDENQLYKVDTKFIKAINAKQNS-WKATIYPEYSKYTIKEMRRRAGGSRSTFKRQNVQ 205

Query: 84  VPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWAFGAVEALSDRF 136
           +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+AF +  A+  R 
Sbjct: 206 LPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYAFASTAAIEARI 264

Query: 137 CI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG 193
            +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V E+C+PY   TG
Sbjct: 265 RLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVEEKCNPY---TG 319

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                C        C R        +  + ++ I  Y   ++ + +  E+ KNGP  V F
Sbjct: 320 VKSGTCNRLL---GCTR--------YYTTDYHYIGGYYGATNEDLMKLELVKNGPFPVGF 368

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDGE-DYWILANQW 301
            VY DF  YKSGVY H   D++  H           AV L+G+G  +     YW + N W
Sbjct: 369 EVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSSNLPYWKIKNSW 426

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVV 327
            + WG +GYF+I RGS+ECG+E   +
Sbjct: 427 GQYWGEEGYFRILRGSDECGVESIAI 452


>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
 gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 154/315 (48%), Gaps = 39/315 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSKNL 336
           IE   VA  P  K L
Sbjct: 450 IESIAVAATPIPKLL 464


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 90/252 (35%), Positives = 125/252 (49%), Gaps = 18/252 (7%)

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 148
           K  +LP+ FDAR  W     I  + DQG CGS W+       SDR  I     +N +LS 
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237

Query: 149 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 208
             LL+C       GC+GGY   AW Y    GVV + C PY  S     PG          
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295

Query: 209 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 267
            R+ ++     ++S  + ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355

Query: 268 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRG 316
           +H         +    G H+V+++GWG   ++     YW+ AN W   WG DGYFK+ RG
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG 415

Query: 317 SNECGIEEDVVA 328
            N C IE  V+ 
Sbjct: 416 ENHCEIESFVIG 427


>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
 gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
 gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
 gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 154/315 (48%), Gaps = 39/315 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSKNL 336
           IE   VA  P  K L
Sbjct: 450 IESIAVAATPIPKLL 464


>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
          Length = 463

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRRLPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 NLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P             
Sbjct: 289 VSCSKY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY+ G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G  YWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
          Length = 462

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 163/333 (48%), Gaps = 44/333 (13%)

Query: 25  VSKLKLDSHILQDS---------IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG--- 72
           V+   L+SH+ + S          +K +N   K+ W A    ++   T+ +     G   
Sbjct: 150 VNTAYLESHLEKYSNRLYKYDHKFVKAINAVQKS-WTATTYKEYETLTLREMARRRGGHN 208

Query: 73  -VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEA 131
            + P PK   L   ++   K L+LPKS+D R      + +S + +QG+CGSC++F ++  
Sbjct: 209 QIIPRPKPAPLSAEIQ--QKILQLPKSWDWRDV-HGMNFVSPVRNQGYCGSCYSFASMGM 265

Query: 132 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 188
           L  R  I      +  LS  ++++C  +    GC+GG+P + A +Y    G V E C PY
Sbjct: 266 LEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGFVEESCFPY 323

Query: 189 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 248
              TG   P C       K    C++    +  S+++ +  +    +   +  E+ ++GP
Sbjct: 324 ---TGTDAP-C-------KMKEDCMR----YYTSEYHYVGGFYGGCNEALMKLELVQHGP 368

Query: 249 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQW 301
           + V+F V +DF HY  G+Y H           +  HAV L+G+GT S +G DYWI+ N W
Sbjct: 369 MAVAFEVCDDFMHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSANGMDYWIVKNSW 428

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 334
             SWG  GYF+I RG++EC IE   +A  P  K
Sbjct: 429 GTSWGEKGYFRILRGTDECAIESIAMAATPIPK 461


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 86/255 (33%), Positives = 119/255 (46%), Gaps = 29/255 (11%)

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC 154
           K FDAR  WPQC TI  + ++G+    WA+      +DR CI  +   N  LS  +L++C
Sbjct: 89  KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 148

Query: 155 CGFLCGDGC---DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP----- 206
            G          DG     AW YF  HG+V+        S   ++ GC+P+   P     
Sbjct: 149 SGIKASANGWVRDG----LAWEYFKTHGLVSG------GSIYNTNDGCQPSKIPPVCNLP 198

Query: 207 ------KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
                  CV  C   + +  N  H  +  Y  +  P+DI  E+   GPV  +  +Y+D  
Sbjct: 199 TKINKRTCVDYCYGNDTIKYNHDHVKVRYY-YHVKPKDIQKEVQTYGPVTAALNLYDDIF 257

Query: 261 HYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
            +KSGVY        +    VKLIGWG  ++G DYW+L N W   WG +G  KIKRG   
Sbjct: 258 LHKSGVYTLTKNAKYVRLQYVKLIGWGV-ENGVDYWLLVNSWGNEWGQNGLLKIKRGKYG 316

Query: 320 CGIEEDVVAGLPSSK 334
           C +E  V A +P  K
Sbjct: 317 CAVESFVYAAVPKIK 331


>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
           [Loxodonta africana]
          Length = 468

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 149/324 (45%), Gaps = 39/324 (12%)

Query: 34  ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 91
           ++   +I  +N+    GW+A  +  F   T+ +  ++ LG ++P+   + +         
Sbjct: 142 LVDQDMINAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIHTVLGP 200

Query: 92  SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 149
              LP +F+A   WP  + I   LDQG C   WAF      SDR  IH    M   LS  
Sbjct: 201 GEVLPMAFEASKKWP--NLIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 258

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 209
           +LL+C       GC GG    AW +    GVV++ C P+           + A P P C+
Sbjct: 259 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHER------DKAGPVPPCM 311

Query: 210 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                     R+   +   + +  N  +    AYR+ ++ ++IM E+ +NGPV+    V+
Sbjct: 312 MHSRAMGRGKRQATSRCPNSHVHGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVH 371

Query: 257 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRS 304
           EDF  Y+ G+Y H              G H+VK+ GWG  T  DG    YW  AN W  +
Sbjct: 372 EDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 431

Query: 305 WGADGYFKIKRGSNECGIEEDVVA 328
           WG  G+F+I RG+NEC IE  V+ 
Sbjct: 432 WGERGHFRIVRGANECDIESFVLG 455


>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 156 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 212

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 213 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 271

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 272 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 313

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 314 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 372

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 373 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 432

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 433 IESIAVAATPIPK 445


>gi|290980380|ref|XP_002672910.1| predicted protein [Naegleria gruberi]
 gi|284086490|gb|EFC40166.1| predicted protein [Naegleria gruberi]
          Length = 302

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 89/297 (29%), Positives = 139/297 (46%), Gaps = 26/297 (8%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH--DKSLKL 95
           ++++ +NENPK+ +KA    +F   ++ +  +L       K  +  V  +     K L +
Sbjct: 22  TLVRRINENPKSPFKAKLYERFD--SIAKLINLSRRNGGRKFSMKTVQSRKFKLSKGLAI 79

Query: 96  PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLA 153
           P  +D R  W QC  +  I ++G CG+ WA      +SDR CI         LS   +L 
Sbjct: 80  PPEYDLRKNWYQC--VGDIQNEGQCGAVWAMAPSATVSDRMCIQSNAKFQERLSSQYILE 137

Query: 154 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 213
           C       GC+GGY  + + + ++ GV TE+C PY        P          C   C 
Sbjct: 138 CD--TRDFGCNGGYMNTEFEFELNRGVPTEKCVPYIAFNMTLQP----------CPTSCF 185

Query: 214 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG- 272
              Q     K  S+      +   D+   I + G +     VY+DF +Y SGVY+H    
Sbjct: 186 NSTQPMVLYKTKSVQNV---TGELDMQQAILQGGSIMTEMDVYQDFIYYSSGVYEHDPSF 242

Query: 273 -DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 328
              +     +++GWG S +G +YWI+AN W ++WG DGY  ++RG+NE  IE+D  A
Sbjct: 243 TQPIAKTVARIVGWG-SLNGVNYWIVANVWGKTWGLDGYVLVRRGTNESNIEKDAYA 298


>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
          Length = 446

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 151/321 (47%), Gaps = 41/321 (12%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-----VKPTPKGL 80
           S++K   +      I+++NE   + WKA    ++    +       G     V    +GL
Sbjct: 150 SQMKSSVYKPNPDYIRQLNE-ASSTWKATIYAEYEGMHLIDLHRRNGGSRSRVSSPGRGL 208

Query: 81  LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 140
           L     +T   ++ LP+S+D R+       +S + +QG CGSC+AF ++     R  +  
Sbjct: 209 L---KEETKMAAVNLPESWDWRNV-DGVDFVSPVRNQGGCGSCYAFSSMAMNEARIRV-M 263

Query: 141 GMNLSLSV---NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSH 196
             N  + V    D++ CC +    GCDGG+P +   +Y    G+V E CDPY        
Sbjct: 264 SNNTQMPVFSPQDIVDCCQY--SQGCDGGFPYLVGGKYAEDFGLVDESCDPYVGED---- 317

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
                        RKC   +   R +  Y        +  E  M    + GP+ VSF VY
Sbjct: 318 -------------RKCKSTSCSRRYATRYRYVGGYYGACNEQEMKLALQRGPLSVSFMVY 364

Query: 257 EDFAHYKSGVYKH--ITGDV----MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 310
           +DF HYKSGVY+H  +T       +  HAV L+G+G +D+G  YWI+ N W + WG +GY
Sbjct: 365 DDFMHYKSGVYRHSGLTDKYNPFEITNHAVLLVGYG-ADEGTKYWIVKNSWGKGWGEEGY 423

Query: 311 FKIKRGSNECGIEEDVVAGLP 331
           F+I RG++EC IE   V   P
Sbjct: 424 FRILRGADECAIESIAVETFP 444


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 143/301 (47%), Gaps = 26/301 (8%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+   + +       +    LP +F+A   WP  
Sbjct: 126 WQAGNHSTFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 220
               AW +    GVV++ C P+     D  G + P    +    +  R+      N    
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 302

Query: 221 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 274
           N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +     
Sbjct: 303 NNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 362

Query: 275 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
               G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVL 422

Query: 328 A 328
            
Sbjct: 423 G 423


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 143/301 (47%), Gaps = 26/301 (8%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+   + +       +    LP +F+A   WP  
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 220
               AW +    GVV++ C P+     D  G + P    +    +  R+      N    
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 302

Query: 221 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 274
           N+  Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +     
Sbjct: 303 NNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 362

Query: 275 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
               G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVL 422

Query: 328 A 328
            
Sbjct: 423 G 423


>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
 gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
 gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
 gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
 gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
 gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
 gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
          Length = 463

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
          Length = 464

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 89/251 (35%), Positives = 128/251 (50%), Gaps = 32/251 (12%)

Query: 95  LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 152
           LP+ +D R+     + +  + +QG CGSC+AF ++  L  R  +     + ++LS  D++
Sbjct: 230 LPEEWDWRNV-SGVNYVPVVKNQGSCGSCYAFSSMGMLESRLRVATKNQVQVNLSPQDIV 288

Query: 153 ACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 211
           +C  +    GC+GG+P + A +Y   HGVV EEC PY   TG     C  A    KC R 
Sbjct: 289 SCSAY--SQGCEGGFPYLIAGKYAQDHGVVAEECYPY---TG-RDSACSAA---KKCQRS 339

Query: 212 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 271
            V        +K+  +  Y    + E +   + ++GP+ VSF VY DF HY  GVY    
Sbjct: 340 YV--------AKYRYVGGYYGACNEELMKMSLVESGPLSVSFEVYSDFMHYAGGVYHRTD 391

Query: 272 GDV----------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 320
           G            +  HAV L+G+GT S   E YWI+ N W   WG DG+F+I+RG +EC
Sbjct: 392 GLFNKINEFNPFELTNHAVLLVGYGTDSQTKEKYWIVKNSWGTKWGEDGFFRIRRGVDEC 451

Query: 321 GIEEDVVAGLP 331
           GIE   V   P
Sbjct: 452 GIESIAVEVTP 462


>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
          Length = 463

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|1582221|prf||2118248A prepro-cathepsin C
          Length = 463

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 102/172 (59%), Gaps = 17/172 (9%)

Query: 121 GSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHH 178
           GSCWAFGA EA+SDR CIH    +S+ ++  DLLACC   CG GC+GGYP +AW ++   
Sbjct: 1   GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCD-SCGMGCNGGYPSAAWDFWTDV 59

Query: 179 GVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 224
           G+V+         C PY          G   P       TP+C+ +C       ++  KH
Sbjct: 60  GLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKH 119

Query: 225 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 276
           Y  S+Y + SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +G
Sbjct: 120 YGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171


>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
           niloticus]
          Length = 455

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 154/329 (46%), Gaps = 47/329 (14%)

Query: 26  SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 85
           S+L    +      I  +N   K+ WKAA  P+   YT+ + ++  G      G    +P
Sbjct: 153 SRLPQKRYKHSMDFIDVINSVQKS-WKAAPYPEHEMYTLQELQYRAG------GPASRIP 205

Query: 86  VKTHDKSLK---------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 136
           V+     +K         LP+ +D R+     + +S + +Q  CGSC++F  +  L  R 
Sbjct: 206 VRVRPAPVKADVAKMASALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMGMLEARI 264

Query: 137 CIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTG 193
            I    +   +LS   +++C  +    GCDGG+P    +Y    G+V E C PY   +T 
Sbjct: 265 RILTNNSDAPTLSPQQVVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTP 322

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
           C  P                +K Q    +++  +  +        +M E+ KNGP+ V+F
Sbjct: 323 CGVP----------------QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAF 366

Query: 254 TVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSW 305
            VY DF +YK G+Y H TG         +  HAV L+G+G     G++YWI+ N W   W
Sbjct: 367 EVYPDFMNYKEGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGW 425

Query: 306 GADGYFKIKRGSNECGIEEDVVAGLPSSK 334
           G +GYF+I+RG++EC IE   VA  P  K
Sbjct: 426 GEEGYFRIRRGNDECAIESIAVAANPIPK 454


>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
          Length = 460

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 152/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 170 NFVKALNAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRRLPRPKPAPLSAEIQ--QKIL 226

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 227 NLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 285

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    GVV E C PY   TG   P             
Sbjct: 286 VSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP------------- 327

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HY  G+Y 
Sbjct: 328 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYH 386

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G  YWI+ N W  SWG DGYF+I+RG++EC 
Sbjct: 387 HTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECA 446

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 447 IESIAVAATPIPK 459


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 87/255 (34%), Positives = 120/255 (47%), Gaps = 29/255 (11%)

Query: 97  KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC 154
           K FDAR  WPQC TI  + ++G+    WA+      +DR CI  +   N  LS  +L++C
Sbjct: 34  KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 93

Query: 155 CGFLC---GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP----- 206
            G      G   DG     AW YF  HG+V+        S   ++ GC+P+   P     
Sbjct: 94  SGIKASANGWVRDG----LAWEYFKTHGLVSG------GSIYNTNDGCQPSKIPPVCNLP 143

Query: 207 ------KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 260
                  CV  C   + +  N  H  +  Y  +  P+DI  E+   GPV  +  +Y+D  
Sbjct: 144 TKINKRTCVDYCYGNDTIKYNHDHVKVRYY-YHVKPKDIQKEVQTYGPVTAALNLYDDIF 202

Query: 261 HYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 319
            +KSGVY        +    VKLIGWG  ++G DYW+L N W   WG +G  KIKRG   
Sbjct: 203 LHKSGVYTLTKNAKYVRLQYVKLIGWGV-ENGVDYWLLVNSWGNEWGQNGLLKIKRGKYG 261

Query: 320 CGIEEDVVAGLPSSK 334
           C +E  V A +P  K
Sbjct: 262 CAVESFVYAAVPKIK 276


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/178 (44%), Positives = 103/178 (57%), Gaps = 16/178 (8%)

Query: 127 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 182
           GAVEA+SDR CIH     N SLS  DLL+CC   CG GCDGG+P  AW ++  HG+VT  
Sbjct: 1   GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGG 59

Query: 183 --EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 231
             EE   C PY        S G   P     YPTPKCV+ C      ++  K  + ++Y 
Sbjct: 60  SKEEPAGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYN 119

Query: 232 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 289
           ++     IM EI  NGPVE +F V+EDF  YKSG+Y H  G  +GGHA++++GWG  +
Sbjct: 120 VHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/287 (33%), Positives = 138/287 (48%), Gaps = 38/287 (13%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           LP +F+A+  +  C+  I  I DQ  C +CWA  +V   +DR CI  G  ++  LS+  L
Sbjct: 39  LPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYL 98

Query: 152 LACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGC 194
            +CC    G    DGC  G       +  +HG+VT             + C PY     C
Sbjct: 99  TSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPY-PFPKC 157

Query: 195 SH-PGCEPAYPTPKCVRK---------CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 244
           +H PG +  YP  +C  K         C   +       H + S  R+   PE I  EI+
Sbjct: 158 NHVPGMKVKYP--RCGSKVGRLAAPSHCDGLHCRRAGDVHRAKSWGRLPISPEKIKQEIF 215

Query: 245 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 304
            NGPV    T++EDF  YKSGVY++ TG ++G H +KLIGWG  + G++YW+  N WN  
Sbjct: 216 DNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGV-EAGQEYWLAVNSWNEE 274

Query: 305 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 351
           WG  G  K+  G N   ++E+    +P  +  V E+    M  ++ A
Sbjct: 275 WGDQGKIKLAVGKN--ALDEESRQQVP--RRAVNELDEDAMMAESGA 317


>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
           C
          Length = 441

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 149 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 205

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 206 FLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 264

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C  +    GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 265 VSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 306

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 307 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 365

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 366 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 425

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 426 IESIAVAATPIPK 438


>gi|115803127|ref|XP_791043.2| PREDICTED: dipeptidyl peptidase 1-like [Strongylocentrotus
           purpuratus]
          Length = 482

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 148/315 (46%), Gaps = 37/315 (11%)

Query: 33  HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV--KTHD 90
           H   D  I+ +N++  + WKA    ++ N T+G  +   G K   +      P   +T  
Sbjct: 186 HRRNDKFIEGINKHQDS-WKATYYDRYVNLTLGDMRRRAGGKLWKRVWPDVSPTDERTKQ 244

Query: 91  KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 148
            +  LP+ FD R        +S + DQG CGSC+AF +      R  +    N+   +S 
Sbjct: 245 AASNLPEKFDWRDV-GGIDYVSPVRDQGICGSCYAFASTATQESRLRVMTNNNVKVVMSP 303

Query: 149 NDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG-CSHPGCEPAYPTP 206
            ++++C  +    GC+GG+P + A +Y    G+V E C PY +    C    C       
Sbjct: 304 QEVVSCSEY--AQGCEGGFPYLIAGKYGQDFGLVDETCYPYRERDAPCRQVSC------- 354

Query: 207 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 266
                     + +R S+++ I  +    + + +  E+ ++GP+ +SF VY+DF  Y+ G+
Sbjct: 355 ----------RRFRTSEYHYIGGFYGACNEDLMRLELLRSGPLAISFEVYDDFLFYRGGI 404

Query: 267 YKHI-TGDVMG-----GHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRG 316
           Y H+   D         H V ++G+G   +    GE YWI+ N W   WG  GYF+I+RG
Sbjct: 405 YHHVPMYDRFNPWETTNHVVTIVGYGHKGNNPKKGEKYWIVQNTWGSEWGERGYFRIRRG 464

Query: 317 SNECGIEEDVVAGLP 331
            NEC IE   VA  P
Sbjct: 465 DNECNIETLAVATTP 479


>gi|226472628|emb|CAX71000.1| hypotherical protein [Schistosoma japonicum]
          Length = 458

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 157/326 (48%), Gaps = 46/326 (14%)

Query: 28  LKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLG 83
           L+LD + L       IK +N    + WKA   P++S YT+ + +   G  +   K   + 
Sbjct: 147 LQLDENQLYKVDTKFIKAINAKQNS-WKATIYPEYSKYTIKEMRRRAGGSRSAFKRQNVQ 205

Query: 84  VPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWAFGAVEALSDRF 136
           +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+AF +  A+  R 
Sbjct: 206 LPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYAFASTAAIEARI 264

Query: 137 CI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG 193
            +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V E+C+PY   TG
Sbjct: 265 RLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVEEKCNPY---TG 319

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                C        C R        +  + ++ I  Y   ++ + +  E+ KNGP  V F
Sbjct: 320 VKSGTCNRLL---GCTR--------YYTTDYHYIGGYYGATNEDLMKLELVKNGPFPVGF 368

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDGE-DYWILANQW 301
            VY DF  YKSGVY H   D++  H           AV L+G+G  +     YW + N W
Sbjct: 369 EVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSSNLPYWKIKNSW 426

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVV 327
            + WG +GYF+I RGS+ECG+E   +
Sbjct: 427 GQYWGEEGYFRILRGSDECGVESIAI 452


>gi|226472626|emb|CAX70999.1| hypotherical protein [Schistosoma japonicum]
          Length = 458

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 56/331 (16%)

Query: 28  LKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLG 83
           L+LD + L       IK +N   +  WKA   P++S YT+ + +   G  +   K   + 
Sbjct: 147 LQLDENQLYKVDTKFIKAINAK-QNSWKATIYPEYSKYTIKEMRRRAGGSRSAFKRQNVQ 205

Query: 84  VPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWAFGAVEALSDRF 136
           +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+AF +  A+  R 
Sbjct: 206 LPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYAFASTAAIEARI 264

Query: 137 CI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG 193
            +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V E+C+PY   TG
Sbjct: 265 RLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVEEKCNPY---TG 319

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN----SDPEDIMA-EIYKNGP 248
                C                N+L   +++Y+   + I     +  ED+M  E+ KNGP
Sbjct: 320 VKSGTC----------------NRLLGCTRYYTTDYHYIGGYYGATNEDLMKLELVKNGP 363

Query: 249 VEVSFTVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDGE-DYWI 296
             V F VY DF  YKSGVY H   D++  H           AV L+G+G  +     YW 
Sbjct: 364 FPVGFEVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSSNLPYWK 421

Query: 297 LANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
           + N W + WG +GYF+I RGS+ECG+E   +
Sbjct: 422 IKNSWGQYWGEEGYFRILRGSDECGVESIAI 452


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/307 (31%), Positives = 142/307 (46%), Gaps = 38/307 (12%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+     +            LP +F+A   WP  
Sbjct: 126 WRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTSMNEIHTVLGPGEVLPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVK-- 214
           +   AW +    GVV++ C P+           + A P P+C+          R+     
Sbjct: 243 HLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPAPRCMMHSRAMGRGKRQATAHC 296

Query: 215 -KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 273
             +++  N  +    AYR+ S  ++IM E+ +NGPV+    V+EDF  Y+ GVY H    
Sbjct: 297 PNSRVHTNDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVS 356

Query: 274 --------VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECG 321
                     G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC 
Sbjct: 357 HGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECD 416

Query: 322 IEEDVVA 328
           IE  V+ 
Sbjct: 417 IESFVLG 423


>gi|189502968|gb|ACE06865.1| unknown [Schistosoma japonicum]
          Length = 458

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 157/326 (48%), Gaps = 46/326 (14%)

Query: 28  LKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLG 83
           L+LD + L       IK +N    + WKA   P++S YT+ + +   G  +   K   + 
Sbjct: 147 LQLDENQLYKVDTKFIKAINAKQNS-WKATIYPEYSKYTIKEMRRRAGGSRSAFKRQNVQ 205

Query: 84  VPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWAFGAVEALSDRF 136
           +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+AF +  A+  R 
Sbjct: 206 LPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYAFASTAAIEARI 264

Query: 137 CI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG 193
            +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V E+C+PY   TG
Sbjct: 265 RLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVEEKCNPY---TG 319

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                C        C R        +  + ++ I  Y   ++ + +  E+ KNGP  V F
Sbjct: 320 VKSGTCNRLL---GCTR--------YYTTDYHYIGGYYGATNEDLMKLELVKNGPFPVGF 368

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDGE-DYWILANQW 301
            VY DF  YKSGVY H   D++  H           AV L+G+G  +     YW + N W
Sbjct: 369 EVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSSNLPYWKIKNSW 426

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVV 327
            + WG +GYF+I RGS+ECG+E   +
Sbjct: 427 GQYWGEEGYFRILRGSDECGVESIAI 452


>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
           niloticus]
          Length = 461

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 95/313 (30%), Positives = 149/313 (47%), Gaps = 46/313 (14%)

Query: 42  EVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK------- 94
           +V  + +  WKAA  P+   YT+ + ++  G      G    +PV+     +K       
Sbjct: 174 DVINSVQKSWKAAPYPEHEMYTLQELQYRAG------GPASRIPVRVRPAPVKADVAKMA 227

Query: 95  --LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVND 150
             LP+ +D R+     + +S + +Q  CGSC++F  +  L  R  I    +   +LS   
Sbjct: 228 SALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMGMLEARIRILTNNSDAPTLSPQQ 286

Query: 151 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCV 209
           +++C  +    GCDGG+P    +Y    G+V E C PY   +T C  P            
Sbjct: 287 VVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTPCGVP------------ 332

Query: 210 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 269
               +K Q    +++  +  +        +M E+ KNGP+ V+F VY DF +YK G+Y H
Sbjct: 333 ----QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAFEVYPDFMNYKEGIYHH 388

Query: 270 ITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
            TG         +  HAV L+G+G     G++YWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 389 -TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECA 447

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 448 IESIAVAANPIPK 460


>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
 gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
          Length = 218

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 116/236 (49%), Gaps = 33/236 (13%)

Query: 108 CSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDG 165
           C  +S I D+  CG CWAF   E +SDRFC+     +N  LS   L++C       GC  
Sbjct: 1   CKQLSLIRDEQQCG-CWAFVVAEVVSDRFCVSSKTKVNEVLSPQYLISCDS--NNGGCSY 57

Query: 166 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 225
           GY  +A+++  + G+VTE C P+    G            P C +KC+  N         
Sbjct: 58  GYFDTAFQFVENQGIVTENCFPFVSGEGNY---------IPPCPKKCLAYNPF------- 101

Query: 226 SISAYRINSD----PEDIMA---EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 278
             + +++N+     P+DI      I   G +  S  +Y DF  Y+ GVY+H+ G+ M  H
Sbjct: 102 --TLFKVNNSRAFLPQDIQGMQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTH 159

Query: 279 AVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 331
           +V+++GWG +   +    YWI  N W   WG  G+F I RGSNEC IE DV    P
Sbjct: 160 SVRIVGWGITSPQQGSIPYWICGNNWTEEWGMQGWFWILRGSNECNIELDVWETTP 215


>gi|226472638|emb|CAX71005.1| hypotherical protein [Schistosoma japonicum]
          Length = 457

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 157/326 (48%), Gaps = 46/326 (14%)

Query: 28  LKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLG 83
           L+LD + L       IK +N    + WKA   P++S YT+ + +   G  +   K   + 
Sbjct: 146 LQLDENQLYKVDTKFIKAINAKQNS-WKATIYPEYSKYTIKEMRRRAGGSRSAFKRQNVQ 204

Query: 84  VPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWAFGAVEALSDRF 136
           +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+AF +  A+  R 
Sbjct: 205 LPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYAFASTAAIEARI 263

Query: 137 CI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTG 193
            +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V E+C+PY   TG
Sbjct: 264 RLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVEEKCNPY---TG 318

Query: 194 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 253
                C        C R        +  + ++ I  Y   ++ + +  E+ KNGP  V F
Sbjct: 319 VKSGTCNRLL---GCTR--------YYTTDYHYIGGYYGATNEDLMKLELVKNGPFPVGF 367

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDGE-DYWILANQW 301
            VY DF  YKSGVY H   D++  H           AV L+G+G  +     YW + N W
Sbjct: 368 EVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSSNLPYWKIKNSW 425

Query: 302 NRSWGADGYFKIKRGSNECGIEEDVV 327
            + WG +GYF+I RGS+ECG+E   +
Sbjct: 426 GQYWGEEGYFRILRGSDECGVESIAI 451


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/254 (36%), Positives = 122/254 (48%), Gaps = 41/254 (16%)

Query: 87  KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 143
           KT D + K  +PK FDAR  +  C+  I  + DQG+C S WA       +DR CI     
Sbjct: 18  KTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGK 77

Query: 144 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 190
            +  LS  +L++C     GD    GCDGG    AW + +  G+VT       E C PY +
Sbjct: 78  FTDNLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 132

Query: 191 STGCSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSD 235
              C H G      C     T    C  KCV KN        L++ S  Y  S     ++
Sbjct: 133 RP-CDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TN 187

Query: 236 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 295
            + I  EI   GPV     VYE+F  YK GVYK   G+++G H VKLIGWG  + G +YW
Sbjct: 188 VKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYW 247

Query: 296 ILANQWNRSWGADG 309
           +  N WN +WG +G
Sbjct: 248 LAMNSWNSNWGTNG 261


>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
          Length = 460

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 157/325 (48%), Gaps = 31/325 (9%)

Query: 22  EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG--VKPTPKG 79
           +G+  K     +      +K +N   K+ W A    ++   T+ +     G   +  P+ 
Sbjct: 154 QGLQDKYSNRPYKYNHDFVKAINAAQKS-WTATTYMEYETLTLREMIRRSGGHSRRVPRP 212

Query: 80  LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 139
               +  + H+K L+LP S+D R+     + ++ + +Q  CGSC++F +V  L  R  I 
Sbjct: 213 KPAPLTAEIHEKVLRLPTSWDWRNV-RGTNFVTPVRNQASCGSCYSFASVGMLEARIRIL 271

Query: 140 FGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSH 196
                S  LS  ++++C  +    GC+GG+P + A +Y    G+V E C PY   TG   
Sbjct: 272 TNNTQSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEETCFPY---TGTDS 326

Query: 197 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 256
           P C       K    C +    + +S+++ +  +    +   +  E+  +GP+ V+F VY
Sbjct: 327 P-C-------KLKENCFR----YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVY 374

Query: 257 EDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADG 309
           +DF HY  G+Y H           +  HAV L+G+GT    G +YW + N W  SWG +G
Sbjct: 375 DDFLHYHKGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPASGLNYWTVKNSWGTSWGENG 434

Query: 310 YFKIKRGSNECGIEEDVVAGLPSSK 334
           YF+I+RG++EC IE   +A  P  K
Sbjct: 435 YFRIRRGTDECAIESIAMAATPIPK 459


>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
 gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
          Length = 528

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 80/254 (31%), Positives = 126/254 (49%), Gaps = 33/254 (12%)

Query: 93  LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---N 149
           + +PK+FD R+   Q   +S I  QG CGSC++F     +  R  + F  N    V    
Sbjct: 292 VSIPKAFDWRNVNGQ-DFVSPIRSQGQCGSCYSFSTTAMMEARKRV-FTQNKEQPVYSPE 349

Query: 150 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC- 208
           ++++C  +    GCDGG+     ++    G++ E+CDPY   TG  H          KC 
Sbjct: 350 NIISCSFY--SQGCDGGFAYLISKWGEDFGIIAEQCDPY---TGTPH----------KCN 394

Query: 209 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
           + +     Q W N ++     Y      E++  ++ K GP+ VS  VY D  +Y SG+Y+
Sbjct: 395 LNQACSTRQYWTNYRY--TGGYYGAVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSGIYR 452

Query: 269 HITGDVMG----------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 318
           H++   +            H V ++GWG ++ GE YWI+ N W  S+G DGYF I RG +
Sbjct: 453 HVSSSKLTSPVPNPFELTNHVVLIVGWGENEKGEKYWIVKNSWGTSFGMDGYFLIARGVD 512

Query: 319 ECGIEEDVVAGLPS 332
           EC IE +  + +P+
Sbjct: 513 ECAIESENASAIPT 526


>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Nomascus leucogenys]
          Length = 436

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 143/301 (47%), Gaps = 26/301 (8%)

Query: 51  WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 108
           W+A  +  F   T+ +  ++ LG ++P+   + +       +    LP +F+A   WP  
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183

Query: 109 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 166
           + I   LDQG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242

Query: 167 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 220
               AW +    GVV++ C P+     D  G + P    +    +  R+      N    
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSHVN 302

Query: 221 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 274
           N+  Y ++  YR+ S+ +++M E+ +NGPV+    V+EDF  YK G+Y H    +     
Sbjct: 303 NNDIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 362

Query: 275 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
               G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVL 422

Query: 328 A 328
            
Sbjct: 423 G 423


>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
          Length = 255

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 114/232 (49%), Gaps = 23/232 (9%)

Query: 95  LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
           +P++FDAR  +  CS  I  + DQG+C S WA       +DR CI      +  LS  +L
Sbjct: 26  IPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 85

Query: 152 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG------ 198
           ++C G     GCDGG    AW   +  G+VT       E C PY +   C H G      
Sbjct: 86  MSC-GNEEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRP-CDHYGDSSLTN 143

Query: 199 CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSF 253
           C     T    C  KCV KN    + +  H +   Y  + ++ + I  EI   GPV    
Sbjct: 144 CSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTALM 203

Query: 254 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 305
            VYE+F  YK G+YK   G+++G H VKLIGWG  +DG +YW+  N WN +W
Sbjct: 204 YVYENFMGYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWLAMNSWNSNW 255


>gi|2599293|gb|AAC32040.1| preprocathepsin C [Schistosoma japonicum]
          Length = 458

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 159/337 (47%), Gaps = 46/337 (13%)

Query: 17  FATFAEGVVSKLKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG- 72
           F    E     L+LD + L       IK +N    + WKA   P++S YT+ + +   G 
Sbjct: 136 FQRMIEYKSPVLQLDGNQLYKVDTKFIKAINAKQNS-WKATIYPEYSKYTIKEMRRRAGG 194

Query: 73  VKPTPKGLLLGVPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILDQGHCGSCWA 125
            +   K   + +P K    +     L LPK FD  +  P+   S ++ + +Q  CGSC+A
Sbjct: 195 SRSAFKRQNVQLPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRNQKTCGSCYA 253

Query: 126 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVT 182
           F +  A+  R  +   F +   LS  D++ C  +   +GCDGG+P + A ++    G V 
Sbjct: 254 FASTAAIEARIRLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGKHGEDFGFVE 311

Query: 183 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 242
           E+C+PY   TG     C           K +   + +    HY I  Y   ++ + +  E
Sbjct: 312 EKCNPY---TGVKSGTCN----------KLLGCTRYYTTDYHY-IGGYYGATNEDLMKLE 357

Query: 243 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-----------AVKLIGWGTSDDG 291
           + KNGP  V F VY DF  YKSGVY H   D++  H           AV L+G+G  +  
Sbjct: 358 LVKNGPFPVGFEVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTNHAVLLVGYGIDNSS 415

Query: 292 E-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 327
              YW + N W + WG +GYF+I RGS+ECG++   +
Sbjct: 416 NLPYWKIKNSWGQYWGEEGYFRILRGSDECGVQSIAI 452


>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 152/313 (48%), Gaps = 39/313 (12%)

Query: 38  SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVKTHDKSL 93
           + +K +N   K+ W A    ++   T+G      G      P PK   L   ++   K L
Sbjct: 173 NFVKAINAIQKS-WTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQ--QKIL 229

Query: 94  KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 151
            LP S+D R+     + +S + +Q  CGSC++F ++  L  R  I    + +  LS  ++
Sbjct: 230 HLPTSWDWRNV-HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEV 288

Query: 152 LACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 210
           ++C       GC+GG+P + A +Y    G+V E C PY   TG   P             
Sbjct: 289 VSCSQH--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP------------- 330

Query: 211 KCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 268
            C  K   +R  +S+++ +  +    +   +  E+  +GP+ V+F VY+DF HYK G+Y 
Sbjct: 331 -CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYH 389

Query: 269 H------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 321
           H           +  HAV L+G+GT S  G DYWI+ N W   WG +GYF+I+RG++EC 
Sbjct: 390 HTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECA 449

Query: 322 IEEDVVAGLPSSK 334
           IE   VA  P  K
Sbjct: 450 IESIAVAATPIPK 462


>gi|3859607|gb|AAC72873.1| contains similarity to cysteine proteases (Pfam: PF00112, E=.21,
           N=1) [Arabidopsis thaliana]
 gi|7268204|emb|CAB77731.1| putative cysteine protease [Arabidopsis thaliana]
          Length = 129

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 64/96 (66%), Positives = 76/96 (79%)

Query: 25  VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 84
           ++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK   LGV
Sbjct: 33  LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92

Query: 85  PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 120
           P+ +HD SLKLPK+FDAR+AWPQC++I  IL    C
Sbjct: 93  PIVSHDPSLKLPKAFDARTAWPQCTSIGNILGLVLC 128


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.449 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,305,042,575
Number of Sequences: 23463169
Number of extensions: 286751831
Number of successful extensions: 548057
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5761
Number of HSP's successfully gapped in prelim test: 1425
Number of HSP's that attempted gapping in prelim test: 528402
Number of HSP's gapped (non-prelim): 8895
length of query: 351
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 208
effective length of database: 9,003,962,200
effective search space: 1872824137600
effective search space used: 1872824137600
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)