BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025695
         (249 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  439 bits (1130), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 199/237 (83%), Positives = 221/237 (93%), Gaps = 2/237 (0%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           +GHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFV
Sbjct: 140 EGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFV 199

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
           HHGVVTEECDPYFD+ GCSHPGCEP +PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP
Sbjct: 200 HHGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDP 259

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
            D+MAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+
Sbjct: 260 HDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWL 319

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN--LVKEITSADMFEDASA 249
           LANQWNR WG DGYFKI+RG+NECGIE+D VAGLPS++N  LV+E+ S D  EDA A
Sbjct: 320 LANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNLDLVREVASMDALEDAFA 376


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  436 bits (1121), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 198/238 (83%), Positives = 217/238 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIH+GMN+SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFVHHGVVTEECDPYFD  GCSHPGCEP YPTPKC RKCV KNQLW+ SKHY +  YRI+
Sbjct: 180 YFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRID 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDPE IMAEIYKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTS+DGE 
Sbjct: 240 SDPESIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEA 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
           YW+LANQWNR WG DGYFKI+RG+NECGIE DVVAGLPS++NLV+E+ S D  EDASA
Sbjct: 300 YWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVVAGLPSTRNLVREVVSVDAREDASA 357


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 192/229 (83%), Positives = 214/229 (93%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPIS
Sbjct: 118 IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPIS 177

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AWRYFV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAY
Sbjct: 178 AWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAY 237

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+  DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DD
Sbjct: 238 RVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDD 297

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           GEDYW+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS+KN+ +E
Sbjct: 298 GEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTKNIARE 346


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 192/229 (83%), Positives = 214/229 (93%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPIS
Sbjct: 117 IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPIS 176

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AWRYFV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAY
Sbjct: 177 AWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAY 236

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+  DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DD
Sbjct: 237 RVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDD 296

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           GEDYW+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS+KN+ +E
Sbjct: 297 GEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTKNIARE 345


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 192/237 (81%), Positives = 215/237 (90%), Gaps = 2/237 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCV+KCV  NQ+W+ SKHYS+SAYR+N
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVN 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG  +GGHAVKLIGWGT+DDGED
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW+LANQWNR WG DGYFKI+RG+NECGIEEDV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 300 YWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVT--DMDADAA 354


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  422 bits (1085), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 193/237 (81%), Positives = 212/237 (89%)

Query: 13  VIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           V  GHCGSCWAFGAVE+LSDRFCIH+GMNLSLSVNDLLACCG++CGDGCDGGYPI AWRY
Sbjct: 89  VPLGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRY 148

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
           FV  GVVTEECDPYFD  GCSHPGCEP +PTPKC RKC  KN+LW  SKH+S++AYRI+S
Sbjct: 149 FVQSGVVTEECDPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDS 208

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
           DP  IMAE+  NGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY
Sbjct: 209 DPHSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 268

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
           W+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS++NLV+E+   D  E ASA
Sbjct: 269 WLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTRNLVREVAKIDAHEHASA 325


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score =  422 bits (1084), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 187/232 (80%), Positives = 210/232 (90%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 122 ILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ 
Sbjct: 182 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 241

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKSGVYKHITG  +GGHAVKLIGWGTSD+GED
Sbjct: 242 SDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 301

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
           YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T  D+
Sbjct: 302 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 353


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 187/232 (80%), Positives = 210/232 (90%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ 
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKSGVYKHITG  +GGHAVKLIGWGTSD+GED
Sbjct: 240 SDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
           YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T  D+
Sbjct: 300 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 351


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 190/231 (82%), Positives = 212/231 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 125 ILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWR 184

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  NQLWR SKHY +SAY++ 
Sbjct: 185 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 244

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSDDGED
Sbjct: 245 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 304

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           YW+LANQWNRSWG DGYFKI+RG+NECGIE  VVAGLPS +N+VK IT++D
Sbjct: 305 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 355


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  419 bits (1078), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 189/231 (81%), Positives = 211/231 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 123 ILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWR 182

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  NQLWR SKHY +SAY++ 
Sbjct: 183 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 242

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSDDGED
Sbjct: 243 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 302

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           YW+LANQWNRSWG DGYFKI+RG+NECGIE  VVAGLPS +N+ K IT++D
Sbjct: 303 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKGITTSD 353


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  419 bits (1076), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 188/237 (79%), Positives = 214/237 (90%), Gaps = 2/237 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AW+
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQ 178

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCV+KCV  NQ+W+ SKHYS++AYR++
Sbjct: 179 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVS 238

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIM E+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT++DGED
Sbjct: 239 SDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGED 298

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW+LANQWNR WG DGYFKI+RG+NECGIEEDV AGLPS+KNLV+E+T  DM  DA+
Sbjct: 299 YWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVT--DMDADAA 353


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 190/231 (82%), Positives = 212/231 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 56  ILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWR 115

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  NQLWR SKHY +SAY++ 
Sbjct: 116 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 175

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSDDGED
Sbjct: 176 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 235

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           YW+LANQWNRSWG DGYFKI+RG+NECGIE  VVAGLPS +N+VK IT++D
Sbjct: 236 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 286


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 185/232 (79%), Positives = 208/232 (89%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L DRFC HF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 122 ILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ 
Sbjct: 182 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 241

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP+DIM E+YKNGPVEV+FTV+EDFAHYKSGVYKHITG  +GGHAVKLIGWGTSD+GED
Sbjct: 242 SDPQDIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 301

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
           YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T  D+
Sbjct: 302 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 353


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score =  416 bits (1070), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 188/224 (83%), Positives = 207/224 (92%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF+HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC  +NQLWR +K Y  SAYRI+
Sbjct: 180 YFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRIS 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP  IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+DDGED
Sbjct: 240 SDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           YWILANQWNR+WG DGYF I+RG NECGIEE VVAGLPSSKNL+
Sbjct: 300 YWILANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLM 343


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 188/221 (85%), Positives = 205/221 (92%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+
Sbjct: 157 QGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFI 216

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
           HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC  +NQLWR +K Y  SAYRI+SDP
Sbjct: 217 HHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDP 276

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
             IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWI
Sbjct: 277 YQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWI 336

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           LANQWNR+WG DGYF I+RG NECGIEE VVAGLPSSKNL+
Sbjct: 337 LANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLM 377


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  416 bits (1068), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 188/238 (78%), Positives = 212/238 (89%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 125 ILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 184

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF + GVVTEECDPYFD TGCSHPGCEPAYPTPKC+RKCV  NQLW  SKHYS+S Y + 
Sbjct: 185 YFSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVK 244

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+D+GED
Sbjct: 245 SNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGED 304

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
           YW+LANQWNRSWG DGYF I+RG+NECGIE++ VAGLPSS+N+ K IT +D    AS 
Sbjct: 305 YWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVITGSDDLSVASV 362


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 185/244 (75%), Positives = 215/244 (88%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           N   +  ++ QGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDLLACCGFLCGDGCDGG
Sbjct: 112 NCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGG 171

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
           YP+ AW+YFV  GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW  SKH+ 
Sbjct: 172 YPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFG 231

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
           ++AY I+SDP  IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGDVMGGHAVKLIGWG
Sbjct: 232 VNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWG 291

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
           TS+DGEDYW+LANQWNR WG DGYFKI+RG++EC IE++VVAGLPS++NL  E+  +D F
Sbjct: 292 TSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAF 351

Query: 245 EDAS 248
            DA+
Sbjct: 352 LDAA 355


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 184/248 (74%), Positives = 217/248 (87%)

Query: 1   MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 60
           + ++N   +  ++ QGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDL ACCGFLCGDG
Sbjct: 108 VAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDG 167

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           CDGGYP+ AW+YFV  GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW  S
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRS 227

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           KH+ ++AY I+SDP  IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGD+MGGHAVKL
Sbjct: 228 KHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKL 287

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 240
           IGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAGLPS++NL  E+  
Sbjct: 288 IGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDV 347

Query: 241 ADMFEDAS 248
           +D F DA+
Sbjct: 348 SDAFLDAA 355


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 187/238 (78%), Positives = 209/238 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIH+GMNLSLSVNDLLACCG++CG GCDGG PI AWR
Sbjct: 102 ILDQGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWR 161

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV  GVVTEECDPYFD  GCSHPGCEP +PTPKC RKC  KN+LW  SKH+S++AYRI+
Sbjct: 162 YFVQSGVVTEECDPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRID 221

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP  IMAE+  NGPVEV+FTVYEDFAHYKSGVYKHITGD MGGHAVKLIGWGTS+DGED
Sbjct: 222 SDPHSIMAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGED 281

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
           YW+LANQWNR WG DGYFKIKRG+NECGIE  VVAGLPS++NLV+E+   D  E A+A
Sbjct: 282 YWLLANQWNRGWGDDGYFKIKRGTNECGIEGAVVAGLPSTRNLVREVAGIDGHEHATA 339


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 182/237 (76%), Positives = 208/237 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIH  +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACCGFLCGSGCDGGYPLYAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVRKCVK NQ+W+ SK++S++AY + 
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKKSKYFSVNAYSVK 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+D+GED
Sbjct: 240 SDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGLPS+KN+ + +   D   D S
Sbjct: 300 YWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLPSTKNMGRWVMDMDADADVS 356


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 183/237 (77%), Positives = 211/237 (89%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF + GVVTEECDPYFD+TGCSHPGCEPAYPTP+C+RKCV  N+LW  SKHYS+S Y +N
Sbjct: 182 YFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPRCLRKCVSDNKLWSESKHYSVSTYTVN 241

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTS++GED
Sbjct: 242 SSPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGED 301

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW++ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSS+N+ K  T ++    AS
Sbjct: 302 YWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVDTGSNDLPVAS 358


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 180/233 (77%), Positives = 206/233 (88%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI 
Sbjct: 163 IGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIM 222

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AWRYFV +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+S++AY
Sbjct: 223 AWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAY 282

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D 
Sbjct: 283 RVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDA 342

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+   SA
Sbjct: 343 GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 395


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 182/237 (76%), Positives = 204/237 (86%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW 
Sbjct: 114 ILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 173

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEP Y TPKCV+KCV  NQLW  SKHYS+ AY +N
Sbjct: 174 YLAHHGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVN 233

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKL+GWGTS +GED
Sbjct: 234 SDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGED 293

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW+LANQWN +WG DGYFKIKRG+NECGIE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 294 YWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 350


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 182/237 (76%), Positives = 204/237 (86%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW 
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 178

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEP Y TPKCV+KCV  NQLW  SKHYS+ AY +N
Sbjct: 179 YLAHHGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVN 238

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKL+GWGTS +GED
Sbjct: 239 SDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGED 298

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW+LANQWN +WG DGYFKIKRG+NECGIE  V AGLPS+KN+V+E+T  D+  D S
Sbjct: 299 YWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 355


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 183/237 (77%), Positives = 209/237 (88%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF + GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  N+LW  SKHYS+S Y + 
Sbjct: 182 YFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVK 241

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTS +GED
Sbjct: 242 SNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGED 301

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW++ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSSKN+ +  T ++    AS
Sbjct: 302 YWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 180/233 (77%), Positives = 206/233 (88%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI 
Sbjct: 118 IGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIM 177

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AWRYFV +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+S++AY
Sbjct: 178 AWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAY 237

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D 
Sbjct: 238 RVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDA 297

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+   SA
Sbjct: 298 GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 350


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score =  402 bits (1033), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 182/221 (82%), Positives = 203/221 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI FGMN++LSVNDLLACCGF CGDGCDGGYPISAW+
Sbjct: 123 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNITLSVNDLLACCGFRCGDGCDGGYPISAWQ 182

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF + GVVTEECDPYFD TGCSHPGCEPAY TP+C+RKCV +NQLW  SKHYSI+ Y + 
Sbjct: 183 YFSYSGVVTEECDPYFDQTGCSHPGCEPAYNTPQCLRKCVGRNQLWSESKHYSINTYVVE 242

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S+P+DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+DDGED
Sbjct: 243 SNPQDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGED 302

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           YW+LANQWNRSWG DGYF I+RG+NECGIE++ VAGLPSSK
Sbjct: 303 YWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSK 343


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  402 bits (1032), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 180/234 (76%), Positives = 206/234 (88%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
             I  I GHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ 
Sbjct: 117 TSIRRILGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMG 176

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + AY
Sbjct: 177 AWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAY 236

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           RIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTSDD
Sbjct: 237 RINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDD 296

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           GEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 297 GEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 350


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 182/233 (78%), Positives = 206/233 (88%)

Query: 16  GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
           GHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF +
Sbjct: 126 GHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSY 185

Query: 76  HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 135
            GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV  N+LW  SKHYS+S Y + S+P+
Sbjct: 186 SGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQ 245

Query: 136 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 195
           DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTS +GEDYW++
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305

Query: 196 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSSKN+ +  T ++    AS
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 178/229 (77%), Positives = 205/229 (89%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           + GHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW YF
Sbjct: 144 LLGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYF 203

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
            +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + AYRIN D
Sbjct: 204 KYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPD 263

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
           P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTSDDGEDYW
Sbjct: 264 PQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYW 323

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           +LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 324 LLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 372


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 181/230 (78%), Positives = 201/230 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE L DRFCIH  M++ LSVNDLLACCGF+CGDGCDGGYPI AWR
Sbjct: 112 ILEQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV +GVVT+ECDPYFD  GC HPGCEPAYPTPKC +KC ++NQ+W+  KH+SI AYRIN
Sbjct: 172 YFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTSD GED
Sbjct: 232 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGED 291

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           YW+LANQWNR WG DGYFKI RG NECGIEE VVAG+PS+KN+V     A
Sbjct: 292 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGA 341


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 181/230 (78%), Positives = 201/230 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE L DRFCIH  M++ LSVNDLLACCGF+CGDGCDGGYPI AWR
Sbjct: 112 ILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV +GVVT+ECDPYFD  GC HPGCEPAYPTPKC +KC ++NQ+W+  KH+SI AYRIN
Sbjct: 172 YFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTSD GED
Sbjct: 232 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGED 291

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           YW+LANQWNR WG DGYFKI RG NECGIEE VVAG+PS+KN+V     A
Sbjct: 292 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGA 341


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score =  399 bits (1024), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 177/228 (77%), Positives = 203/228 (89%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE L DRFCIH  MN+SLSVNDL+ACCGF+CGDGCDGGYPIS
Sbjct: 114 IGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGYPIS 173

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW+Y V +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W+  KH+SI+AY
Sbjct: 174 AWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCKVQNQVWQEKKHFSINAY 233

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVY+HITG++MGGHAVKLIGWGTS D
Sbjct: 234 RVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSAD 293

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           G+DYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+KN V+
Sbjct: 294 GKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNTVR 341


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  398 bits (1023), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 181/237 (76%), Positives = 205/237 (86%), Gaps = 2/237 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWR
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWR 178

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVTEECDPYFD+TGCSHPGCEP YPTPKC RKCVK N LWR SKHY ++AYR++
Sbjct: 179 YFKRSGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVS 238

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            DP+ IMAE+YKNGPVEVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+ GED
Sbjct: 239 HDPQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGED 298

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           YW++ N WNR WG DGYFKI+RG+NECGIE  VVAGLPS++NL  E+   D   DAS
Sbjct: 299 YWLIVNSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL--GDAVLDAS 353


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 179/226 (79%), Positives = 202/226 (89%)

Query: 16  GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
           GHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV 
Sbjct: 1   GHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVR 60

Query: 76  HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 135
           +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+S++AYR+NSDP 
Sbjct: 61  NGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPH 120

Query: 136 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 195
           DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+L
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 180

Query: 196 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           ANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+   SA
Sbjct: 181 ANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 226


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  395 bits (1015), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 177/223 (79%), Positives = 200/223 (89%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCG+CWAF AVE+L DRFCIH  M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC  +NQ+W+ +KH+S++AYR++
Sbjct: 180 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQVWKKNKHFSVNAYRVH 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 240 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+
Sbjct: 300 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNM 342


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 177/223 (79%), Positives = 199/223 (89%), Gaps = 1/223 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+
Sbjct: 115 ILDQGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQ 174

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW  SKH+S++AYR+N
Sbjct: 175 YFKRTGVVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVN 234

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SD   IM E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+DGED
Sbjct: 235 SDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGED 294

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW+LANQWNRSWG DGYFKI RG+NECGI EDV AG+PS+KNL
Sbjct: 295 YWLLANQWNRSWGDDGYFKIIRGTNECGI-EDVTAGMPSTKNL 336


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 177/223 (79%), Positives = 199/223 (89%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCG+CWAF AVE+L DRFCIH  M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC  +NQ+W+ +KH S++AYR++
Sbjct: 180 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQVWKKNKHSSVNAYRVH 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 240 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+
Sbjct: 300 YWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMPSTKNM 342


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 178/224 (79%), Positives = 197/224 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE L DRFCIH  MN+SLS NDL+ACCGF+CGDGCDGGYPISAW+
Sbjct: 115 ILDQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVACCGFMCGDGCDGGYPISAWQ 174

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV +GVVTEECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W+  KH+SI AY++N
Sbjct: 175 YFVQNGVVTEECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWQEKKHFSIDAYQVN 234

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 235 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 294

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS KN+ 
Sbjct: 295 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSMKNIA 338


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 177/223 (79%), Positives = 198/223 (88%), Gaps = 1/223 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+
Sbjct: 115 ILDQGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQ 174

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW  SKH+S++AYR+N
Sbjct: 175 YFKRTGVVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVN 234

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SD   IM E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG  MGGHAVKLIGWGTS+DGED
Sbjct: 235 SDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGED 294

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW+LANQWNRSWG DGYFKI RG+NECGI EDV AG PS+KNL
Sbjct: 295 YWLLANQWNRSWGGDGYFKIIRGTNECGI-EDVTAGTPSTKNL 336


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 178/233 (76%), Positives = 200/233 (85%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE L DRFCIH  +N+SLS NDL+ACCGF+CGDGCDGGYPI 
Sbjct: 110 IGTILDQGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVACCGFMCGDGCDGGYPIK 169

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW+YFV  GVVTEECDPYFD  GC HPGCEPAY TPKC +KC  +NQ+W   KH+SI+AY
Sbjct: 170 AWQYFVQSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCKVQNQVWEEKKHFSINAY 229

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG VMGGHAVKLIGWGTSD 
Sbjct: 230 RVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDA 289

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           GEDYW+LANQWNR WG DGYFKI RG NECGIEE+VVAG+PS+KN+     SA
Sbjct: 290 GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMPSTKNMAGNHGSA 342


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 176/233 (75%), Positives = 199/233 (85%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +  ++ QGHCGSCWAFGAVE L DRFCIH  MN+SLSVNDLLACCGFLCG GC+GGYPIS
Sbjct: 113 IGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPIS 172

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AWRYF   GVVT+ECDPYFD  GC HPGCEPAY TPKC +KC  +N++W+  KH+S+ AY
Sbjct: 173 AWRYFRRKGVVTDECDPYFDQVGCKHPGCEPAYRTPKCEKKCKVQNEVWKEQKHFSVDAY 232

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R++S+P DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD 
Sbjct: 233 RVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDA 292

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
           GEDYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+KN+ +    A
Sbjct: 293 GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDDA 345


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 175/224 (78%), Positives = 197/224 (87%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE L DRFCIH  MN++LS NDL+ACCGF+CGDGCDGGYPISAW+
Sbjct: 76  ILDQGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFMCGDGCDGGYPISAWQ 135

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV +GVVT+ECDPYFD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+SI+AY++N
Sbjct: 136 YFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWEEKKHFSINAYQVN 195

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 196 SDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 255

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+ 
Sbjct: 256 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNIA 299


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 172/208 (82%), Positives = 191/208 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCV+KCV  NQ+W+ SKHYS+SAYR+N
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVN 239

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG  +GGHAVKLIGWGT+DDGED
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGED 299

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECG 219
           YW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 300 YWLLANQWNREWGDDGYFKIRRGTNECG 327


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 174/227 (76%), Positives = 196/227 (86%), Gaps = 2/227 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCG+CWAF AVEAL DRFCIH  M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 116 ILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 175

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC  +NQ W+ +KH+S++AYR++
Sbjct: 176 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVENQAWKENKHFSVNAYRVH 235

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
           S+P DIMAE+YKNGPVEV+FT  +  DFAHYKSGVYKHITG VMGGHAVKLIGWGTSD G
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAG 295

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           EDYW+LANQWNR WG DGYFKI RG NECGIE DV AG+PS+KN  +
Sbjct: 296 EDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMPSTKNTAR 342


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/231 (78%), Positives = 209/231 (90%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW 
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVVACCGLLCGLGCNGGFPMGAWL 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF +HGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKCV +NQLW  SKHY +SAYRIN
Sbjct: 182 YFKYHGVVTEECDPYFDNTGCSHPGCEPGYPTPKCVRKCVSENQLWGESKHYGVSAYRIN 241

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGTSDDGED
Sbjct: 242 HDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGED 301

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           YW+LANQWNRSWG DGYFKI+RG+NECGIE  VVAGLPS +N+ K++T++D
Sbjct: 302 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDVTTSD 352


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 169/224 (75%), Positives = 194/224 (86%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCG+CWAFGAVE L DRFCIH  +N+SLSVNDL+ACCGFLCGDGCDGGYPI AW+
Sbjct: 116 ILDQGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDLVACCGFLCGDGCDGGYPIFAWQ 175

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YFV +GVVT+ECDP+FD  GC HPGCEPAYPTP C +KC  +NQ+W   KH+SI AY++N
Sbjct: 176 YFVENGVVTDECDPFFDQVGCQHPGCEPAYPTPVCEKKCKVQNQVWEEKKHFSIDAYQVN 235

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP DIMAE+YKNGPVEVSF +YEDFAHYKSGVYK ITG ++GGHA KLIGWGTSD GED
Sbjct: 236 SDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGED 295

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           YW+LANQWNR WG DGYFKI RG+NECGIE DV AG+PS+KN+ 
Sbjct: 296 YWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMPSTKNIA 339


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/229 (73%), Positives = 197/229 (86%), Gaps = 1/229 (0%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           V+ ++ QGHCGSCWAFGAVEALSDRFCIH  +N++LS NDL+ACCGF+CGDGCDGGYPIS
Sbjct: 112 VQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPIS 171

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW+YF+  GVVT ECDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAY 231

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           RI+S P DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+  GD MGGHAVKL+GWGT +D
Sbjct: 232 RISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-ED 290

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           G DYW++AN WN +WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 169/229 (73%), Positives = 197/229 (86%), Gaps = 1/229 (0%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           V+ ++ QGHCGSCWAFGAVEALSDRFCIH  +N++LS NDL+ACCGF+CGDGCDGGYPIS
Sbjct: 112 VQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPIS 171

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW+YF+  GVVT ECDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAY 231

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           RI+S P DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+  GD MGGHAVKL+GWGT +D
Sbjct: 232 RISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-ED 290

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           G DYW++AN WN +WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 167/229 (72%), Positives = 194/229 (84%), Gaps = 1/229 (0%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           V  ++ QGHCGSCWAFGAVEALSDRFCIH+ +N++LS NDL+ACCGF CGDGCDGGYP+S
Sbjct: 112 VRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDGCDGGYPLS 171

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW+YF+  GVVT ECDPYFD  GC HPGCEP YPTP+CV++C  +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNSKRFSATAY 231

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           RI S P DIMAE+Y  GPVEV F VYEDFAHYKSGVYK+ITGD +GGHAVKLIGWGT ++
Sbjct: 232 RITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGT-EN 290

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           G DYW++AN WN +WG DGYFKI RGSNEC IEEDVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECSIEEDVVAGMPSTKNLVMD 339


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score =  365 bits (936), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 162/228 (71%), Positives = 195/228 (85%), Gaps = 1/228 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAF AVEALSDRFCIHF +N +LS NDL+ACCGF CG GC+GG+P+SAWR
Sbjct: 114 ILDQGHCGSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWR 173

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVT+ECDPYFD+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI 
Sbjct: 174 YFSRRGVVTDECDPYFDNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIK 232

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
           SDP +IMAE++ NGPVEVSF+VYEDFAHY++GVYKH+ G  +GGHAVKLIGWGT+DDG D
Sbjct: 233 SDPYNIMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGID 292

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 239
           YW++AN WN +WG  GYFKI RG NECGIE D VAG+PS+KNL+++ T
Sbjct: 293 YWLIANSWNTAWGEGGYFKIARGVNECGIERDPVAGMPSAKNLIQDPT 340


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score =  352 bits (902), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 160/202 (79%), Positives = 178/202 (88%)

Query: 40  MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 99
           M++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD  GC HPGCEP
Sbjct: 1   MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60

Query: 100 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
           AYPTPKC +KC ++NQ+W+  KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61  AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120

Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
           YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 180

Query: 220 IEEDVVAGLPSSKNLVKEITSA 241
           IEE VVAG+PS+KN+V     A
Sbjct: 181 IEEGVVAGMPSTKNMVPNFGGA 202


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score =  337 bits (863), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 156/230 (67%), Positives = 185/230 (80%), Gaps = 2/230 (0%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           ++ ++ QGHCGSCWAFGAVEAL+DRFCI    N+SLS NDL+ACC   CG GCDGGYP +
Sbjct: 115 IKNILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVACCS-SCGFGCDGGYPYA 173

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW YF   GVVT +CDPYFD  GC HPGCEP Y TP CV++CV  N+ WR+SKH+++  Y
Sbjct: 174 AWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTY 232

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
            +NSD  DI AEIYKNGPVEVS+TVYEDFAHYKSGVYKH+ G+V+GGHAVK IGWGT+DD
Sbjct: 233 AVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDD 292

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           G+DYWI+AN WNRSWG DG+F+I RGSNECGIE + VAG+P  K    +I
Sbjct: 293 GKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 342


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score =  336 bits (861), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 155/230 (67%), Positives = 184/230 (80%), Gaps = 2/230 (0%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           ++ ++ QGHCGSCWAFGAVEAL+DRFCI    N+SLS NDL+ACC   CG GC+GGYP +
Sbjct: 104 IKTILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVACCS-SCGFGCEGGYPYA 162

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           AW YF   GVVT +CDPYFD  GC HPGCEP Y TP CV++CV  N+ WR+SKH+++  Y
Sbjct: 163 AWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTY 221

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
            +NSD  DI AEIYKNGPVEVS+TVYEDFAHYKSGVYKH+ G V+GGHAVK IGWGT+DD
Sbjct: 222 AVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDD 281

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           G+DYWI+AN WNRSWG DG+F+I RGSNECGIE + VAG+P  K    +I
Sbjct: 282 GKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 331


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score =  335 bits (860), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 152/195 (77%), Positives = 171/195 (87%), Gaps = 2/195 (1%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCG+CWAF AVEAL DRFCIH  M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 116 ILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 175

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC  +NQ W+ +KH+S++AYR++
Sbjct: 176 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVENQAWKENKHFSVNAYRVH 235

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
           S+P DIMAE+YKNGPVEV+FT  +  DFAHYKSGVYKHITG VMGGHAVKLIGWGTSD G
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAG 295

Query: 190 EDYWILANQWNRSWG 204
           EDYW+LANQWNR WG
Sbjct: 296 EDYWLLANQWNRGWG 310


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  333 bits (855), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 149/223 (66%), Positives = 180/223 (80%), Gaps = 1/223 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L+DRFCIH   ++SLS NDLLACCGF CG GC+GGYPI AW+
Sbjct: 122 ILGQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGYGCEGGYPIRAWK 181

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF H GVVT +CDPYFD  GC+HPGC P Y TPKC ++CV  ++ W  SKH  ++AY ++
Sbjct: 182 YFKHSGVVTNKCDPYFDQKGCAHPGCYPTYETPKCEKQCVD-DEFWVQSKHLGVNAYEMS 240

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            +PED+MAE+Y NGPVEV+F VYEDFAHYK+GVYKH+ G  MGGHAVKLIGWGT+DDG D
Sbjct: 241 MEPEDLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVD 300

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW + N WN +WG DG F+I RG++ECGIE + VAGLPS K L
Sbjct: 301 YWTIVNSWNTNWGEDGLFRIVRGNDECGIESNAVAGLPSRKGL 343


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  332 bits (852), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 150/223 (67%), Positives = 176/223 (78%), Gaps = 1/223 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGA E+L+DRFCIH   ++SLS NDLLACCGF CGDGCDGGYPI AWR
Sbjct: 114 ILDQGHCGSCWAFGAAESLTDRFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWR 173

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVT +CDPYFD  GC HPGC P Y TPKCV+ CV  ++LW  SKH S++AY ++
Sbjct: 174 YFKRTGVVTSKCDPYFDQIGCGHPGCYPTYRTPKCVKHCVD-DELWVKSKHLSVNAYEVS 232

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            +PED+MAE+Y NGP+EVSF V+EDFAHYK+GVYKH+ G  +GGHAVKLIGWGT+DDG D
Sbjct: 233 KEPEDLMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVD 292

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW + N WN +WG  G F+I RG NECGIE   VAGLP  K L
Sbjct: 293 YWTIVNSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDKGL 335


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score =  329 bits (843), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 145/223 (65%), Positives = 181/223 (81%), Gaps = 1/223 (0%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+L+DRFCIH   ++SLS NDLLACCGF CGDGC+GGYPI AW+
Sbjct: 120 ILDQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGDGCEGGYPIRAWQ 179

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           YF   GVVT +CDPYFD  GC HPGC P Y TPKC ++CV  ++LW +SKH  +SAY ++
Sbjct: 180 YFKRTGVVTSKCDPYFDQKGCGHPGCYPTYDTPKCFKRCVD-DELWVSSKHLGVSAYEVS 238

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            +PE++MAE++ NGP+EV+F V+EDFAHYK+GVYKH+ G  +GGHAVKL+GWGT+DDG D
Sbjct: 239 MEPEELMAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVD 298

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           YW + N WN +WG DG F+I RG +ECGIE + VAGLPS+K L
Sbjct: 299 YWSMVNSWNTNWGEDGTFRILRGKDECGIESNAVAGLPSNKGL 341


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score =  319 bits (817), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 144/200 (72%), Positives = 164/200 (82%)

Query: 49  LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 108
            L    F  G    GGYP+ AWRY  HHGVVTEECDPYFD  GCSHPGCEPAY TPKCVR
Sbjct: 9   FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68

Query: 109 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
           KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69  KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           TG  +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGL
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGL 188

Query: 229 PSSKNLVKEITSADMFEDAS 248
           PS+KN+ + +   D   D S
Sbjct: 189 PSTKNMGRWVMDMDADADVS 208


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  310 bits (793), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 140/176 (79%), Positives = 158/176 (89%)

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
           + AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW  SKHY + 
Sbjct: 1   MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
           AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGTS
Sbjct: 61  AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           DDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 121 DDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score =  300 bits (769), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 134/166 (80%), Positives = 150/166 (90%)

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           CDGGYPISAW+YF HHGVVTEECDPYFD  GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1   CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           KHYS+  Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG  +GGHAVKL
Sbjct: 61  KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            GWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIEEDV A
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166


>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
          Length = 201

 Score =  293 bits (750), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 131/161 (81%), Positives = 148/161 (91%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GC+GGYP+SAWR
Sbjct: 37  ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCNGGYPLSAWR 96

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  +HGVVTEECDPYFD TGCSHPGCEPAY TPKCV+KCV  NQLW+ SKHYS+SAY++ 
Sbjct: 97  YLSNHGVVTEECDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQLWKKSKHYSVSAYKVK 156

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
           S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG V
Sbjct: 157 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGYV 197


>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
           unguiculata]
          Length = 195

 Score =  292 bits (747), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 130/159 (81%), Positives = 147/159 (92%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GC+GGYP+SAWR
Sbjct: 37  ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCNGGYPLSAWR 96

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y  +HGVVTEECDPYFD TGCSHPGCEPAY TPKCV+KCV  NQLW+ SKHYS+SAY++ 
Sbjct: 97  YLSNHGVVTEECDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQLWKKSKHYSVSAYKVK 156

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG
Sbjct: 157 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  239 bits (610), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 122/232 (52%), Positives = 152/232 (65%), Gaps = 18/232 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR+CI     + + +S  DLL+CCGF CGDGC+GG+P SAW+Y
Sbjct: 134 QGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLLSCCGFECGDGCNGGFPGSAWKY 193

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+VT         C PY     C H      P C     TP CV KC     + + 
Sbjct: 194 WNSDGLVTGGLYGSKTGCLPY-QIKPCEHHVPGDRPKCSEGGGTPSCVSKCKGNTTIHYN 252

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY +S+Y + SDP  I  EI  +GPVE +FTVY DF  YKSGVYKH+TG V+GGHA+
Sbjct: 253 QDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAI 312

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           +++GWG S++G  YW++AN WN  WG  GYFKI RGS+ECGIE  VVAG+P 
Sbjct: 313 RILGWG-SENGVAYWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAGIPQ 363


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  235 bits (599), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 118/241 (48%), Positives = 161/241 (66%), Gaps = 19/241 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR C+H    + + +S  DLL+CCG  CG+GC+GG+P  AW+Y
Sbjct: 104 QGSCGSCWAFGAVEAISDRICVHTNGYITIEVSAEDLLSCCGLQCGEGCNGGFPAGAWKY 163

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC-VKKNQLW 117
           ++  G+V+         C PY     C H   G  PA       TPKC +KC    +  +
Sbjct: 164 WIKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDY 222

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           ++ KHY  +AY + S  ++IMAEIYKNGPVE +F VY DF  YKSGVY+H+TGD++GGHA
Sbjct: 223 KDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
           ++++GWG  +DG  YW+ AN WN  WG +G+FKI RG + CGIE ++VAG+P ++   K+
Sbjct: 283 IRVLGWGV-EDGVPYWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAGIPRTEQYWKK 341

Query: 238 I 238
           I
Sbjct: 342 I 342


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  234 bits (598), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 123/241 (51%), Positives = 155/241 (64%), Gaps = 30/241 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAF A E+LSDR CIH G ++ LS  +L++CC   CGDGC+GGYP +A +YFV
Sbjct: 120 QSTCGSCWAFAAAESLSDRICIHTGEDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYFV 178

Query: 75  HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-----VKK-- 113
             G+VT +       C  Y     C+H       P C+   PTP+C +KC     VK+  
Sbjct: 179 KTGLVTGDLFGDNNFCQAY-SFPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPY 237

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
              L++  K YS+S     SDP+ IM EI  NGPVEV+FTVYEDF  YKSGVY+H+TG+ 
Sbjct: 238 NEDLYKGQKSYSVS-----SDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQ 292

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +GGHAVK+IGWG  +D   YW++ N WN +WG  G FKI RGSNECGIE++VV  LP  K
Sbjct: 293 LGGHAVKMIGWGVEND-TPYWLIVNSWNETWGDQGTFKILRGSNECGIEDEVVTALPQKK 351

Query: 233 N 233
            
Sbjct: 352 Q 352


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  234 bits (597), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  234 bits (597), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  234 bits (597), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
               G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 LTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  234 bits (597), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 116/234 (49%), Positives = 153/234 (65%), Gaps = 16/234 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRN 119
           +   G+V+         C PY           S P C     TPKC + C    +  ++ 
Sbjct: 162 WTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKE 221

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GDVMGGHA++
Sbjct: 222 DKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIR 281

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           ++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG+P +++
Sbjct: 282 ILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQD 334


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 118/246 (47%), Positives = 163/246 (66%), Gaps = 18/246 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 66
           V+ +  QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP
Sbjct: 96  VKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYP 155

Query: 67  ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
             AW ++   G+V+         C PY     C H      P C     TPKC + C   
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPG 214

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
            +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P + 
Sbjct: 275 MGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333

Query: 233 NLVKEI 238
              ++I
Sbjct: 334 QYWEKI 339


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  233 bits (595), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 119/252 (47%), Positives = 160/252 (63%), Gaps = 17/252 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR C+H    +S+ V+  DLL+CCGF CG G
Sbjct: 90  WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP  AWRY+   G+V+         C PY          G   P       TP+C 
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGETPRCS 209

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    +  ++  KHY I++Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+
Sbjct: 210 RHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQ 269

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG+ +GGHA++L+GWG  D+G  YW+ AN WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVTGEQVGGHAIRLLGWGV-DNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVA 328

Query: 227 GLPSSKNLVKEI 238
           G+PS++   K +
Sbjct: 329 GIPSTERYWKRV 340


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  233 bits (595), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++   DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  233 bits (595), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 24  QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 83

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 84  WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 142

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 143 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 202

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 203 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 261


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  233 bits (594), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 86  QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 145

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 146 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 204

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 205 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 264

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 265 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  233 bits (593), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 116/240 (48%), Positives = 157/240 (65%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 93  QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 152

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 153 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSPTYK 211

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y ++++  DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 212 QDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAI 271

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE +VVAG+P +    + I
Sbjct: 272 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWRNI 330


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  233 bits (593), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 116/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGP E +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  232 bits (592), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 25  QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 84

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 85  WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 143

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 144 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 203

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 204 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 255


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  232 bits (592), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG  CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSRCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  232 bits (592), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 23  QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 82

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 83  WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 141

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 142 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 201

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 202 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 253


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 118/246 (47%), Positives = 158/246 (64%), Gaps = 18/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 17  WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 76

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 77  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 135

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYK
Sbjct: 136 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 195

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VA
Sbjct: 196 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 254

Query: 227 GLPSSK 232
           G+P ++
Sbjct: 255 GIPRTQ 260


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 118/246 (47%), Positives = 158/246 (64%), Gaps = 18/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 11  WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 70

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 71  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 129

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYK
Sbjct: 130 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 189

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VA
Sbjct: 190 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 248

Query: 227 GLPSSK 232
           G+P ++
Sbjct: 249 GIPRTQ 254


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TPKC +
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSSKN 233
           +P ++ 
Sbjct: 329 IPRTQQ 334


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  232 bits (591), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 119/235 (50%), Positives = 153/235 (65%), Gaps = 16/235 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH        LS  DLL+CCG++CG+GC+GG+P +AW Y
Sbjct: 117 QGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDLLSCCGYVCGNGCNGGFPQAAWEY 176

Query: 73  FVHHGVVT------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRN 119
           +V +G+V+        C PY        + G   P       TPKC  KCV      +  
Sbjct: 177 WVQNGLVSGGLYHGTGCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQ 236

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY   AYRI ++ + IM EIYKNGPVE +F VYEDF  YKSGVY H TG  +GGHA++
Sbjct: 237 DKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIR 296

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           ++GWG  ++GE YW+  N WN  WG +G+FKIKRG NECGIE ++V G+P+S++L
Sbjct: 297 VLGWG-EENGEKYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIPASESL 350


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  232 bits (591), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TPKC +
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSSKN 233
           +P ++ 
Sbjct: 329 IPRTQQ 334


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  231 bits (590), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 22  WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 81

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TPKC +
Sbjct: 82  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 141

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 142 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 201

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 202 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 260

Query: 228 LPSSKN 233
           +P ++ 
Sbjct: 261 IPRTQQ 266


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  231 bits (589), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 117/252 (46%), Positives = 161/252 (63%), Gaps = 18/252 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI     +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  S+Y ++ + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+
Sbjct: 209 KICEPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHAV+++GWG  +DG  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVA 327

Query: 227 GLPSSKNLVKEI 238
           G+P +    K+I
Sbjct: 328 GIPCTDQYWKKI 339


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  231 bits (589), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 114/240 (47%), Positives = 155/240 (64%), Gaps = 17/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCGF CG GC+GGYP  AW+Y
Sbjct: 92  QGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCGFECGMGCNGGYPSGAWKY 151

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY        + G   P       TP+CV+KC       ++
Sbjct: 152 WTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYK 211

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY +++Y I    ++IMAEIYKNGPVE +F VY DF  YKSGVY+H++G+ +GGHA+
Sbjct: 212 QDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAI 271

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  D+G  YW+ AN WN  WG DG+F+I RG + CGIE ++VAG+P +    K +
Sbjct: 272 RILGWGV-DNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIESEIVAGIPKTSEYWKML 330


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  231 bits (589), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 115/233 (49%), Positives = 154/233 (66%), Gaps = 18/233 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 114 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 173

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C       ++
Sbjct: 174 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYTPTYK 232

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++   DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 233 QDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 292

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 293 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 344


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  231 bits (588), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 119/232 (51%), Positives = 155/232 (66%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA++DR CI   G N + +S  DLL CC   CG GC+GGYP SAW +
Sbjct: 101 QGECGSCWAFGAAEAMTDRICIATKGKNQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEF 159

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKK-NQLW 117
           F   G+VT    PY    GC              S   C  + PTPKC + C K  N  +
Sbjct: 160 FKTKGIVTG--GPYNSHKGCQPYAIPACDHHVPHSKNPCNGSLPTPKCEKVCEKGYNITY 217

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +N KHY +++Y IN+D  +IM EI  NGPVE +FTV+ DF +YKSGVY+H++G+ +GGHA
Sbjct: 218 KNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHA 277

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +K++GWG  ++   YW++AN WN SWG +G+FKI RGS+ECGIE++VVAGLP
Sbjct: 278 IKILGWGVENN-TPYWLVANSWNPSWGDNGFFKILRGSDECGIEDEVVAGLP 328


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  230 bits (587), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 115/239 (48%), Positives = 159/239 (66%), Gaps = 16/239 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 29  QGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 88

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH-----PGCEPAYPTPKCVRKCVKK-NQLWRN 119
           +   G+V+         C PY      +H     P C     TPKC + C    +  ++ 
Sbjct: 89  WTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQ 148

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++
Sbjct: 149 DKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIR 208

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           ++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 209 ILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 266


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  229 bits (583), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 117/239 (48%), Positives = 155/239 (64%), Gaps = 17/239 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CGSCWAFGA EA+SDR CIH    + + +S  DLL CC   CG GC+GGYP
Sbjct: 101 IHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDLLDCCDS-CGAGCNGGYP 159

Query: 67  ISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK- 113
            +AW Y+   G+VT       + C PY  +     T  S P C    PTPKCV  C K  
Sbjct: 160 AAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGY 219

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
            + +++ KH+    Y I+SD + I  EI+KNGPVE  FTVY DF  YKSGVY+H +GDV+
Sbjct: 220 GKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVL 279

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG +ECGIE+D+ AG+P ++
Sbjct: 280 GGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKNE 337


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 115/243 (47%), Positives = 153/243 (62%), Gaps = 16/243 (6%)

Query: 1   MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 60
           M + +   ++ +  QG CGSCWAFGAVE++SDRFCIHF  +  +S  DL+ACC   CG G
Sbjct: 86  MQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHFNQSAHISAEDLMACCE-TCGMG 144

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDST------GCSHPGCEPAYPTPKCV 107
           C+GGY  +AWRYF H G+VT       E C PY  ++      G   P       TP+C 
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQPCASKEEHTPRCS 204

Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C     + +   KH+  SAY + S  E I  EI  NGPVE +FTVY DF  YKSGVY+
Sbjct: 205 KTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVYADFPTYKSGVYQ 264

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H +G ++GGHA++++GWGT ++G  YW++AN WN  WGA GYFKI RG ++CGIE  + A
Sbjct: 265 HTSGAMLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGAMGYFKIIRGKDDCGIESQITA 323

Query: 227 GLP 229
           G+P
Sbjct: 324 GMP 326


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 117/239 (48%), Positives = 156/239 (65%), Gaps = 17/239 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CGSCWAFGA EA+SDR CIH   G+ +++S  DLL CC   CG GCDGGYP
Sbjct: 101 INLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDLLDCCDS-CGAGCDGGYP 159

Query: 67  ISAWRYFVHHGVVTEE-------CDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK- 113
            +AW Y+   G+V++        C PY  +     T  S P C    PTPKCV  C K  
Sbjct: 160 AAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGY 219

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
            + +++ KH+    Y I+S+ + I  EI+KNGPVE  FTVY DF  YKSGVY+H +GDV+
Sbjct: 220 GKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVL 279

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG +ECGIE+D+ AG+P  +
Sbjct: 280 GGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKDE 337


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 117/247 (47%), Positives = 157/247 (63%), Gaps = 18/247 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGS WAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 73  WSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 132

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 191

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ DF  YKSGVYK
Sbjct: 192 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 251

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N CGIE ++VA
Sbjct: 252 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 310

Query: 227 GLPSSKN 233
           G+P ++ 
Sbjct: 311 GIPRTQQ 317


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  228 bits (582), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 119/231 (51%), Positives = 147/231 (63%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWAFGAVEA++DR CI         +S  DLL CC F CGDGC+GGYP +AW Y
Sbjct: 112 QANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDLLTCCTFTCGDGCNGGYPAAAWEY 171

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           + + G+VT       + C PY        +TG   P C    PTP C R C +  N  + 
Sbjct: 172 WKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDIVPTPACKRSCRQGYNVTYP 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           N KH+  S+Y +    + I  EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+
Sbjct: 231 NDKHFGASSYGVRG-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAI 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K+IGWG   DG DYWI+AN WN SWG DG+F IK+G++ECGIE  VVAGLP
Sbjct: 290 KIIGWGVQ-DGTDYWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAGLP 339


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  228 bits (582), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 118/237 (49%), Positives = 152/237 (64%), Gaps = 19/237 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSL--SVNDLLACCGFLCGDGCDGGYP 66
           ++ +  QG CGSCWAFGA EA+SDR CIH G  +SL  S  DLL+CC   CG GC GGYP
Sbjct: 93  IQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAEDLLSCCD-ECGMGCSGGYP 151

Query: 67  ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
            SAW ++   G+VT         C PY  +  C H      P C+    TPKC +KC+  
Sbjct: 152 SSAWEFWTKKGLVTGGLCGSEVGCRPYSIAP-CEHHVNGTRPPCQGTQETPKCEKKCIDG 210

Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
               +   KH+   +Y + S  E IM E+YKNGPVE +FTVY DF  YK+GVY+H+TG+V
Sbjct: 211 YLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEV 270

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +GGHA+K++GWG  + G  YW+ AN WN  WG  G+FKIKRG++ECGIE ++VAG P
Sbjct: 271 LGGHAIKILGWG-EESGTPYWLAANSWNGDWGDKGFFKIKRGNDECGIESEMVAGTP 326


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 102/134 (76%), Positives = 120/134 (89%)

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           +KC  +NQ+W   KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1   KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG
Sbjct: 61  ITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAG 120

Query: 228 LPSSKNLVKEITSA 241
           +PS+KN+V+   SA
Sbjct: 121 MPSTKNMVRNYDSA 134


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  228 bits (580), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 115/247 (46%), Positives = 155/247 (62%), Gaps = 17/247 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI  +  +N+ +S  DLL CCGF CG+G
Sbjct: 113 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSAEDLLTCCGFQCGEG 172

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY          G   P       TPKC 
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGSTPKCS 232

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C       ++  KH+  S+Y + S   +IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 233 RICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQ 292

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHAV+++GWG  +DG  YW++ N WN  WG  G+FKI RG + CGIE ++VA
Sbjct: 293 HVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIESEIVA 351

Query: 227 GLPSSKN 233
           GLP ++ 
Sbjct: 352 GLPCTEQ 358


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 118/230 (51%), Positives = 145/230 (63%), Gaps = 19/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA++DR CI  +   N  LS  DL +CC   CG GC+GGYP +AW Y
Sbjct: 239 QGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDY 297

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F   G+VT       + C PY         TG   P C    PTP C   C + N  W +
Sbjct: 298 FQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDIQPTPACANSC-QNNATWSS 355

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KH+  S+Y + +D + IM EIY NGPVE S+ VY DF  YKSGVY+H+TGD +GGHAVK
Sbjct: 356 DKHFGASSYSVGTDQQSIMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVK 415

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +IGWG  D    YWI+AN WN  WG +G+F I RGS+ECGIE+ +VAG+P
Sbjct: 416 IIGWGV-DGSTPYWIVANSWNNDWGNNGFFNILRGSDECGIEDGIVAGIP 464


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 118/239 (49%), Positives = 150/239 (62%), Gaps = 22/239 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
           Q  CGSCWA  A E +SDR CI     +N+ +S  DLL+CC  G+ CGDGC+GGYPI AW
Sbjct: 97  QSDCGSCWAVAAAETISDRTCIASNGEVNVLISAEDLLSCCTGGYNCGDGCEGGYPIQAW 156

Query: 71  RYFVHHGVVT-------EECDPYFDS------TGCSHPGCEP-AYPTPKCVRKCVKKNQL 116
           RY+VH+G+VT         C PY  +       G + P C      TP+CV++C  K+  
Sbjct: 157 RYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDY 216

Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KHY  SAY I  +   I  EI +NGPVEV F VY DF  YKSG+YKH+ G  +
Sbjct: 217 AVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGREL 276

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GGHAVK++GWG  ++G  YW+ AN WN +WG  GYF+I+RG+NECGIE  VVAG+P  K
Sbjct: 277 GGHAVKILGWGV-ENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIESSVVAGIPDLK 334


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 114/232 (49%), Positives = 146/232 (62%), Gaps = 24/232 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q +CGSCWAFGAVE+L+DR CIH G ++ LS  ++L CC   CG GC+GGYP SA  Y+V
Sbjct: 115 QSNCGSCWAFGAVESLTDRHCIHLGQDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYV 173

Query: 75  HHGVVTEECDPYFDSTG---------CSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
             G+VT +    +++TG         C+H       P C    PTPKC + C     Q +
Sbjct: 174 KTGLVTGD---LYNTTGWCQAYSFAPCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY 230

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
             + H    AY +    E IM EI  NGPVE +FTVYEDF +YKSGVYKH+TG  +GGHA
Sbjct: 231 --TVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHA 288

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +K++GWG  ++   YWI+ N WN++WG +G FKI RG NECGIE  VV  LP
Sbjct: 289 IKIVGWGVENN-TPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALP 339


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  227 bits (578), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 114/248 (45%), Positives = 158/248 (63%), Gaps = 18/248 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI     +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C       ++  KH+  S+Y I+ + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TGD+MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLPSSKNL 234
           G+P + + 
Sbjct: 328 GIPCTPHF 335


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  227 bits (578), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 113/231 (48%), Positives = 143/231 (61%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVE+++DR CI    +L   +S  DL+ CC F CG GC GGYP +AW +
Sbjct: 111 QAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLMTCCLFTCGSGCSGGYPSAAWSW 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
           F   G+VT       + C PY     C H      P C    PTP C + C    N  + 
Sbjct: 171 FKTTGIVTGGNYNSSQGCQPY-SLPNCDHHVSGQYPACSGEGPTPACKKSCEAGYNNTYS 229

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           N KH+  +AY +  + + I  EI  NGPVE +FTVYED   YKSGVY+H TG V+GGHA+
Sbjct: 230 NDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAI 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K+IGWG  + G DYW +AN WN  WG +G+FKIK+G +ECGIE  +VAG+P
Sbjct: 290 KIIGWGV-ESGVDYWWVANSWNNDWGDNGFFKIKKGVDECGIESQIVAGMP 339


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  226 bits (577), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 114/248 (45%), Positives = 158/248 (63%), Gaps = 18/248 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI     +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C       ++  KH+  S+Y I+ + ++IMAEIYKNGPVE +FTVY DF  YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TGD+MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLPSSKNL 234
           G+P + + 
Sbjct: 328 GIPCTPHF 335


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 116/246 (47%), Positives = 160/246 (65%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL+CCG LCG+G
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEVSAEDLLSCCGPLCGEG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
           C+GGYP  AW+Y+   G+V+         C PY     C H      P C      TPKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCEHHVNGTRPKCTGEGGDTPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            + C    +  ++  K+Y  S+Y + S  ++IMAEIYKNGPVE +F+V+ DF  YKSGVY
Sbjct: 209 SKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           KH+ G+V+GGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE +VV
Sbjct: 269 KHVAGEVLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDNGFFKILRGEDHCGIESEVV 327

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 328 AGIPRT 333


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  226 bits (576), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 114/247 (46%), Positives = 157/247 (63%), Gaps = 17/247 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR C+H    +S+ V+  DLL+CCGF CG G
Sbjct: 90  WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP  AWRY+   G+V+         C PY          G   P       TP+C 
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGETPRCS 209

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    +  ++  KHY I++Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+
Sbjct: 210 RHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQ 269

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G+ +GGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVA 328

Query: 227 GLPSSKN 233
           G+P ++ 
Sbjct: 329 GVPRTEQ 335


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  226 bits (576), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 113/244 (46%), Positives = 157/244 (64%), Gaps = 16/244 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   ++ +  QG CGSCWAFGAV A+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEVSAEDLLTCCGSQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW +++  G+V+         C PY           S P C     TPKC +
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEHHVNGSRPQCTGEGDTPKCTK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KHY  ++Y ++++ ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GD+MGGHA++++GWG  ++   YW++AN WN  WG +G FKI RG + CGIE ++VAG
Sbjct: 270 EAGDIMGGHAIRILGWGV-ENSVPYWLVANSWNVDWGDNGLFKILRGEDHCGIESEIVAG 328

Query: 228 LPSS 231
           +P +
Sbjct: 329 IPRT 332


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  226 bits (576), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 122/245 (49%), Positives = 153/245 (62%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
           + N   V+ +  QG CGSCWAFGAVEA+SDR CI  +  +N  +S  DLLACC   CG+G
Sbjct: 105 WPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAEISAEDLLACCSS-CGEG 163

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKC 106
           C GG+P  AWRY+   G+VT       + C PY     C H       P  +    TPKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYM-IPACDHHVVGHLQPCPKEEAKTPKC 222

Query: 107 VRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            +KC    N  +++ KHY  ++Y ++S  E IM EI  NGPVE +FTVYEDF  YKSGVY
Sbjct: 223 SKKCEANYNVTYKDDKHYGKNSYSVDSV-EKIMTEIMTNGPVEAAFTVYEDFLSYKSGVY 281

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H TG  +GGHAVK++GWG  D+G  YWI+AN WN  WG  G+F I RG +ECGIE  +V
Sbjct: 282 QHRTGQELGGHAVKILGWG-EDNGTPYWIVANSWNPDWGNQGFFNILRGKDECGIESQIV 340

Query: 226 AGLPS 230
           AGLP 
Sbjct: 341 AGLPK 345


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 114/244 (46%), Positives = 156/244 (63%), Gaps = 16/244 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TP+C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSS 231
           +P +
Sbjct: 329 IPRT 332


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 113/246 (45%), Positives = 161/246 (65%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR C+H     N+ +S  DLL+CCG  CGDG
Sbjct: 91  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAEDLLSCCGSECGDG 150

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
           C+GG+P  AW ++   G+V+         C PY     C H   G  PA       TP C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACTGEEGDTPTC 209

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            +KC +  +  +++ K+Y  ++Y + S  ++IMAEIYKNGPVE +F+VYEDF HYKSGVY
Sbjct: 210 RKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVY 269

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H+ G+++GGHA++++GWG  ++G  YW+ AN WN  WG +G+FK  RG N CGIE +++
Sbjct: 270 QHVAGEMLGGHAIRILGWGV-ENGIRYWLAANSWNIDWGDNGFFKFLRGKNHCGIESEII 328

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 329 AGIPRT 334


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 116/233 (49%), Positives = 144/233 (61%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGA E+LSDR CIH G ++ LS  +LL CC   CGDGCDGG+P +A  Y+V
Sbjct: 116 QSTCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYYV 174

Query: 75  HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LW 117
           + G+VT +       C  Y  +  C+H       P C    PTP C+  C   +     +
Sbjct: 175 NTGLVTGDLYGNNSWCQAYTFAP-CAHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPY 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
               H    AY I  D + IMAEIYKNGP+EV+ TVYEDF  YK+GVY+H+TGD +GGHA
Sbjct: 234 SKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           VK++GWG  ++G  YW + N WN SWG  G FKI RG NECGIE   V  LP+
Sbjct: 294 VKMVGWGV-ENGTPYWTIVNSWNESWGDKGTFKILRGKNECGIESSCVTALPA 345


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  225 bits (573), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 118/236 (50%), Positives = 145/236 (61%), Gaps = 22/236 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
           Q  CGSCWAF A EA+SDR CI  +  +N  LS  DLL+CC   F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAW 163

Query: 71  RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
           +++V HG+VT         C PY  +       G   P C E   PTPKCV  C  KN  
Sbjct: 164 KWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNY 223

Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KH+  +AY +    E I  EI  NGP+EV+FTVYEDF  Y +GVY H  G  +
Sbjct: 224 ATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 118/245 (48%), Positives = 158/245 (64%), Gaps = 19/245 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG+CGSCWAFGA EA+SDR CI  G  ++L +S  DLL CC   CG G
Sbjct: 89  WPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLLTCCD-ECGMG 147

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GG+P +AW ++ + G+VT         C PY  +  C H      P C+    TPKCV
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCV 206

Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
            +C     L +   KH+   +Y I S  E IM E+YKNGPVE +F+VY DF  YK+GVY+
Sbjct: 207 TQCNNGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQ 266

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TGD++GGHAVK++GWG  ++G  YW++AN WN  WG  G+FKIKRG++ECGIE ++VA
Sbjct: 267 HVTGDMLGGHAVKILGWG-EENGTPYWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVA 325

Query: 227 GLPSS 231
           G P S
Sbjct: 326 GAPLS 330


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 114/252 (45%), Positives = 158/252 (62%), Gaps = 18/252 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI     +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEVSAEDMLTCCGDQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C       ++  KHY  ++Y +++  ++IMAEIYKNGPVE +F+V+ DF  YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHAV+++GWG  +D   YW++ N WN  WG  G+FKI RG + CGIE +VVA
Sbjct: 269 HVTGEMMGGHAVRILGWGVEND-TPYWLVGNSWNTDWGDHGFFKILRGRDHCGIESEVVA 327

Query: 227 GLPSSKNLVKEI 238
           G+P ++   K I
Sbjct: 328 GIPCTEQYWKRI 339


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  224 bits (572), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/231 (51%), Positives = 154/231 (66%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE++SDR CIH G    + L+ +D+L+CC + CG GC+GG+P +AW Y
Sbjct: 110 QGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPGAAWSY 168

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +V  G+VT       E C PY     C H        C    PTPKCVR C K  N  ++
Sbjct: 169 WVEKGIVTGGNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNIDFK 227

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + KHY  S+Y ++S+   I  EI KNGPVE +FTVY DF  YKSGVYK  + D +GGHA+
Sbjct: 228 DDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAI 287

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  +W++AN WN  WG  GYFKI RGSNECGIEED+VAG+P
Sbjct: 288 RILGWGV-ENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIP 337


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  +D   YW++ N WN  WG  G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVA 327

Query: 227 GLP 229
           G+P
Sbjct: 328 GMP 330


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  +++ KH+  S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  +D   YW++ N WN  WG  G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVA 327

Query: 227 GLP 229
           G+P
Sbjct: 328 GMP 330


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  224 bits (570), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 114/233 (48%), Positives = 152/233 (65%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR C+H    +S+ V+  DLL+CCGF CG GC+GGYP  AWRY
Sbjct: 102 QGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRY 161

Query: 73  FVHHGVVTEECDPYFDSTGC---SHPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +   G+V+     Y    GC   + P CE                TP+C R C    +  
Sbjct: 162 WTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPS 219

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KHY I++Y +    ++IMAEIYKNGPVE +F VYEDF  YKSGVY+H++G+ +GGH
Sbjct: 220 YKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGH 279

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWG  ++G  YW+ AN WN  WG  G+FKI RG + CGIE ++VAG+P
Sbjct: 280 AIRILGWGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVP 331


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  223 bits (569), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 118/236 (50%), Positives = 147/236 (62%), Gaps = 22/236 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
           Q  CGSCWAF A EA+SDR CI  +  +N  LS  DLL+CC  L  CG+GC+GGYPI AW
Sbjct: 105 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAW 164

Query: 71  RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
           +++V HG+VT         C PY  +       G + P C +   PTPKCV  C   N  
Sbjct: 165 KWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTY 224

Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KH+  +AY +    E I  EI KNGPVEV+FTVYEDF  Y +GVY H +G  +
Sbjct: 225 PTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASL 284

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 285 GGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHSAVAGIP 339


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  223 bits (568), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 119/230 (51%), Positives = 145/230 (63%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA +DR CI      N  +S  DLL CCGF CG GC+GG    AW +
Sbjct: 100 QGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDLLTCCGFWCGFGCNGGRLGPAWNF 159

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           F + G VT       E C PY        ++G   P CE + PTPKC R C +  N  + 
Sbjct: 160 FKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEGSEPTPKCKRSCREGYNVSYS 218

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + KH   S Y I +D E I  EIY NGPVE +FTVY DF +YKSGVYK+ TG+ +GGHA+
Sbjct: 219 DDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAI 278

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           K++GWG  ++   YW++AN WN  WG  G+FKI RGSNECGIE  VVAG+
Sbjct: 279 KILGWGVENN-VPYWLVANSWNPDWGDKGFFKILRGSNECGIEASVVAGM 327


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  223 bits (568), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 119/231 (51%), Positives = 148/231 (64%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +N+ +S  DLL CC   CG GC+GG+P SAW Y
Sbjct: 105 QGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDLLTCCD-SCGMGCNGGFPGSAWEY 163

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +V  G+VT         C PY  ++ C H      P C     TP+CV  C K  N  +R
Sbjct: 164 WVDKGLVTGGLYNSHVGCQPYTIAS-CEHHTKGKLPPCGDIVDTPQCVHMCEKGYNVSYR 222

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K++   +Y I+   + I  EI  NGPVE +FTVY DF  YKSGVY+H+TG+ MGGHAV
Sbjct: 223 ADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAV 282

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWGT + G  YW++AN WN  WG  GYFKI RGS+ECGIE  +VAGLP
Sbjct: 283 RILGWGT-ESGTPYWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAGLP 332


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  223 bits (567), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 145/245 (59%), Gaps = 19/245 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           ++  + +  +  Q HCGSCWA  A E +SDR CIH    +N+ LS  D+L+CCG  CG G
Sbjct: 105 WSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATDILSCCGTTCGRG 164

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPK 105
           C GGYPI AWRYF+ HGV T       + C PY     C H   E  Y        PTP+
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CGHHRNEIYYGECPKEIFPTPQ 223

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C + C       + + K Y  SAY + ++ + I  EI  NGPV+ +F VYEDF+ Y+SG+
Sbjct: 224 CTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSGI 283

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           Y H  G   GGHAVKLIGWG  DDG  YW+ AN WN  WG +GYF+I RG + CGIE  V
Sbjct: 284 YVHTAGRREGGHAVKLIGWGVDDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIESAV 343

Query: 225 VAGLP 229
           VAG+P
Sbjct: 344 VAGMP 348


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  223 bits (567), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 112/238 (47%), Positives = 151/238 (63%), Gaps = 19/238 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CGSCWAFGA EA+SDR CIH    + +++S  DLL CC   CG GC+GGYP
Sbjct: 100 IHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDLLTCCD-SCGAGCNGGYP 158

Query: 67  ISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
            +AW ++   G+VT       + C PY+    C H      P C    PTP+CVR C K 
Sbjct: 159 AAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CEHHTVGPLPNCTGIKPTPQCVRDCRKG 217

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + +   KHY+   Y +++D   I  EI+KNGPVE  FTVY DF  YKSGVY+  + D 
Sbjct: 218 YEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDA 277

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           +GGHA++++GWGT ++G  YW++AN WN  WG  GYFKI RG++ECGIE+D+ AG+P 
Sbjct: 278 LGGHAIRILGWGT-ENGVPYWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAGIPK 334


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  223 bits (567), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 116/253 (45%), Positives = 160/253 (63%), Gaps = 19/253 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CC   CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
           C+GG+P  AW ++   G+V+         C PY     C H      P C+     TPKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCKGEGGETPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            + C    +  ++  KHY  S+Y + S  ++IMAEIYKNGPVE +F+VY DF  YKSGVY
Sbjct: 209 SKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H+TG+ +GGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHVTGEEVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIV 327

Query: 226 AGLPSSKNLVKEI 238
           AG+P +    K+I
Sbjct: 328 AGIPRTDQYWKKI 340


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  222 bits (566), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 117/243 (48%), Positives = 151/243 (62%), Gaps = 19/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           + N + +  +  QG CGSCWAFGAVE++SDR CIH     S  +S  DLL+CC   CG G
Sbjct: 85  WPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLLSCCD-QCGFG 143

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GG+P  AW Y+   G+VT         C PY     C H      P C     TPKC 
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPY-SIAPCEHHVNGTRPPCSGEQDTPKCT 202

Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
             C+ K  + ++  KH+    Y + SD + IM E+Y NGPVE +FTVYEDF  YKSGVY+
Sbjct: 203 GVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQ 262

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG  +GGHAVK++GWG  ++G  +W++AN WN  WG +GYFKI RG +ECGIE ++VA
Sbjct: 263 HLTGSALGGHAVKILGWG-EENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVA 321

Query: 227 GLP 229
           GLP
Sbjct: 322 GLP 324


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  222 bits (566), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 113/244 (46%), Positives = 155/244 (63%), Gaps = 16/244 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TP+C +
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GD+MGGHA++++ WG  ++G  YW+ AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILVWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSS 231
           +P +
Sbjct: 329 IPRT 332


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 113/244 (46%), Positives = 154/244 (63%), Gaps = 16/244 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     TP+C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KH+  ++Y +++  ++IMAEIYKN PVE +FTV+ DF  YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GD+MGGHA++++GWG   +G  YW+ AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGVG-NGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSS 231
           +P +
Sbjct: 329 IPRT 332


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 115/244 (47%), Positives = 151/244 (61%), Gaps = 19/244 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGA EA+SDR CIH    ++  +S  DLL+CC   CG G
Sbjct: 88  WPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEISAEDLLSCCE-ECGMG 146

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GGYP +AW Y+   G+VT       + C PY     C H      P C+    TPKC 
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCEHHVNGTRPPCQGEGDTPKCQ 205

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
            KC+      +   K++    Y + S  E IM E+YKNGPVE +F+VYEDF  YKSGVY+
Sbjct: 206 TKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGVYQ 265

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TGD++GGHA+K++GWG  ++   YW+ AN WN  WG  G+FKI RG +ECGIE +VVA
Sbjct: 266 HLTGDMLGGHAIKILGWG-KENNTPYWLAANSWNTDWGNQGFFKILRGGDECGIESEVVA 324

Query: 227 GLPS 230
           G+P 
Sbjct: 325 GIPQ 328


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  222 bits (565), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 116/246 (47%), Positives = 159/246 (64%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCGF CG G
Sbjct: 90  WPNCPTIREVRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
           C+GGYP  AW+++   G+V+         C PY     C H   G  PA       TPKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V++C      ++ + KH+  ++Y + S  ++IMAEIYKNGPVE +F VY DF  YKSGVY
Sbjct: 209 VKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIV 327

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 328 AGIPKN 333


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  222 bits (565), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 112/237 (47%), Positives = 155/237 (65%), Gaps = 18/237 (7%)

Query: 18  CGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVH 75
           C   WAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++  
Sbjct: 11  CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70

Query: 76  HGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSK 121
            G+V+         C PY     C H      P C     TPKC + C    +  ++  K
Sbjct: 71  KGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 129

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           HY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++
Sbjct: 130 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 189

Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 190 GWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 245


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  222 bits (565), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 113/244 (46%), Positives = 155/244 (63%), Gaps = 16/244 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
           C+GGYP  AW ++   G+V+         C PY           S P C     T +C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTHRCNK 209

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C    +  ++  KH+  ++Y +++  ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             GD+MGGHA++++GWG  ++G  YW+ AN WN  WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328

Query: 228 LPSS 231
           +P +
Sbjct: 329 IPRT 332


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 110/243 (45%), Positives = 157/243 (64%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CC   CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCDGECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLP 229
           G+P
Sbjct: 328 GMP 330


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  221 bits (564), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 120/248 (48%), Positives = 150/248 (60%), Gaps = 22/248 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC--GFLCG 58
           F+    V  +  Q HCGSCWA  A EA+SDR CI     +N  LS  D+L CC   + CG
Sbjct: 91  FSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDILTCCIGEYYCG 150

Query: 59  DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGCEPA-YPTP 104
           DGC+GGYPI AW+Y+V +G+VT         C PY  +       G + P C  +   TP
Sbjct: 151 DGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTP 210

Query: 105 KCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
           KCV  C   +     +   KHY  +AY ++   + I +EI KNGPVEV FTVY DF  YK
Sbjct: 211 KCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYK 270

Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
           SGVY H+ G  +GGHAVKL+GWG  D+G  YW+ AN WN +WG +GYF+I RG NECGIE
Sbjct: 271 SGVYVHVAGPELGGHAVKLLGWGV-DNGTPYWLAANSWNTNWGENGYFRILRGVNECGIE 329

Query: 222 EDVVAGLP 229
             VVAG+P
Sbjct: 330 SQVVAGMP 337


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  221 bits (563), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 119/234 (50%), Positives = 151/234 (64%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG----MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
           QG CGSCWAFGAVEA+SDR CIH        + LS +DLL+CC   CG+GC+GG+P SAW
Sbjct: 114 QGSCGSCWAFGAVEAMSDRTCIHSPSGGPKRVHLSADDLLSCC-RTCGNGCNGGFPGSAW 172

Query: 71  RYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL 116
            ++V  G+VT       + C PY     C H       P  +   PTP+CV  C K   +
Sbjct: 173 SFWVKTGIVTGGNYDSDDGCMPY-PIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDV 231

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            + + KHY  S+Y + S+ + I AEI  NGPVE  FTVY DF HYKSGVY+  T + +GG
Sbjct: 232 DYHDDKHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGG 291

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HA++L+GWG  ++G  YW+ AN WN  WG  G+FKI RGS+ECGIE+DVVAGLP
Sbjct: 292 HAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGLP 344


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  221 bits (563), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 120/235 (51%), Positives = 149/235 (63%), Gaps = 23/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG-----MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
           QG CGSCWAFGAVEA+SDR CIH       +   L+ +D+L+CC   CG GC+GG+P SA
Sbjct: 140 QGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDVLSCC-TECGAGCNGGFPGSA 198

Query: 70  WRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ 115
           W Y+VH G+VT       E C PY     C H       P  +   PTP+CVR C K   
Sbjct: 199 WSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYD 257

Query: 116 L-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
           + + + KHY   AY + +  + I AEI  NGPVE  FTVYEDF HYKSGVY+  T   +G
Sbjct: 258 VDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALG 317

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GHA++L+GWG  ++G  YW+ AN WN  WG  G+FKI RGS+ECGIE D+VAGLP
Sbjct: 318 GHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIESDIVAGLP 371


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  220 bits (561), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 114/234 (48%), Positives = 153/234 (65%), Gaps = 19/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CI    N+   +S  DLL+CC   CG GC+GG+P +AW Y
Sbjct: 144 QGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSEDLLSCCSS-CGMGCNGGFPPAAWEY 202

Query: 73  FVHHGVVT-------EECDPYFDS------TGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F   G+V+       + C PY  +       G   P C    PTPKC R C K  ++ + 
Sbjct: 203 FRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEGPTPKCERTCEKGYKVKYE 261

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + K++  +AY +++D + IM EI  NGPVE +FTVY DF  YKSGVY+H++G  +GGHA+
Sbjct: 262 DDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAI 321

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +++GWG  +DG  YW++AN WN  WG +G+FKI RG NECGIE ++VAGLP  +
Sbjct: 322 RVLGWGV-EDGTPYWLVANSWNSDWGDNGFFKILRGQNECGIEGEIVAGLPKKQ 374


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  220 bits (561), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 159/246 (64%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAV A+SDR CIH    +N+ +S  DLL+CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEVSAEDLLSCCGLECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
           C+GGYP +AW+Y+   G+V+         C PY     C H      P C      TPKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGTRPQCTGEGGDTPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            + C    +  ++  KH+   +Y ++S+ ++IMAEIYKNGPVE +FTV+ DF  YK+GVY
Sbjct: 209 SKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           KH+ G+++GGHA++++GWG  ++G  YW++ N WN  WG  G+FKI RG + CGIE ++V
Sbjct: 269 KHLAGEMLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDSGFFKIVRGEDHCGIESEIV 327

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 328 AGIPRT 333


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  220 bits (561), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 117/252 (46%), Positives = 160/252 (63%), Gaps = 18/252 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI     +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C       ++  KHY  S+Y ++S  ++IMAEIYKNGPVE +FTVY DF  YKSGVY+
Sbjct: 209 KFCEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVA 327

Query: 227 GLPSSKNLVKEI 238
           G+P +    K+I
Sbjct: 328 GIPCTDQYWKKI 339


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 117/231 (50%), Positives = 149/231 (64%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR+CI     +   +S  DLL+CC   CG GC+GGYP SAW +
Sbjct: 26  QGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLLSCC-ETCGMGCNGGYPESAWDH 84

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
           +   G+VT       + C PY     C H        C+   PTPKC RKC    N  + 
Sbjct: 85  WKSKGLVTGGQYDSHKGCQPY-KIAACDHHVVGKLKPCKGDSPTPKCERKCEAGYNVSYS 143

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + KH+  SAY + SDP +I  EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+
Sbjct: 144 DDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAI 203

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKIKRG++ECGIE  +V GLP
Sbjct: 204 KILGWG-EENGTPYWLVANSWNSDWGDEGFFKIKRGNDECGIESGIVGGLP 253


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 118/246 (47%), Positives = 154/246 (62%), Gaps = 19/246 (7%)

Query: 2   PFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD 59
           P  NS H  ++  Q  CGSCWAFGA EA+SDR CIH    + +++S  DLL CC   CG 
Sbjct: 96  PHCNSIH--LIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDLLDCCDS-CGA 152

Query: 60  GCDGGYPISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCV 107
           GC+GG P +AW Y+   G+VT       + C PY  +     T  S P C    PTPKCV
Sbjct: 153 GCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCV 212

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
             C K   + +++ KH+    Y I+SD + I  EI+KNGPVE  F V  DF  YKSGVY+
Sbjct: 213 HLCRKGYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQ 272

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H + DV+GGHA++++GWGT ++G  YW+ AN WN  WG  GYFKI RG +ECGIEED+ A
Sbjct: 273 HHSDDVIGGHAIRILGWGT-ENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINA 331

Query: 227 GLPSSK 232
           G+P ++
Sbjct: 332 GIPKNR 337


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 118/254 (46%), Positives = 158/254 (62%), Gaps = 22/254 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
           ++N + ++ +  Q  CGSCWAFGAVEA+SDR CI  +  + ++LS +DLL+CC   CG G
Sbjct: 131 WSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLLSCCR-TCGFG 189

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPK 105
           C+GG P+ AW+Y+V HG+VT       + C PY     C H        P     YPTPK
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCEHHSNKTRFDPCRHDLYPTPK 248

Query: 106 CVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
           C +KCV   K + + + + Y  +AY + +D   I  EI  +GPVEV+F VYEDF HY  G
Sbjct: 249 CSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGG 308

Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           +Y H  G + GGHAVKLIGWG  D G  YW++AN WN  WG +G+F+I RG +ECGIE  
Sbjct: 309 IYVHTGGKLGGGHAVKLIGWGI-DQGTPYWLIANSWNTDWGEEGFFRILRGVDECGIESG 367

Query: 224 VVAGLPSSKNLVKE 237
           VV G+P S N+ + 
Sbjct: 368 VVGGIPKSTNIQRR 381


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 115/246 (46%), Positives = 158/246 (64%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCGF CG G
Sbjct: 90  WPNCPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
           C+GGYP  AWR++   G+V+         C PY     C H      P C+     TPKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPSCKGEEGDTPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           ++ C +     + + KH+  ++Y + S  ++IMA+IYKNGPVE +F VY DF  YKSGVY
Sbjct: 209 MKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE +VV
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVV 327

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 328 AGIPKN 333


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  220 bits (560), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 110/240 (45%), Positives = 151/240 (62%), Gaps = 17/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CI      N+ +S  DLL CCGF CG+GC+GG+P  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIRSNGLQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNF 161

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY          G   P       TPKC + C    +  ++
Sbjct: 162 WKKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYK 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+    Y + SD ++IM EIYKNGPVE +F+VY DF  YKSGVY+H+TG+++GGHAV
Sbjct: 222 EDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAV 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P + +  + I
Sbjct: 282 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTGHYSERI 340


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  220 bits (560), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 116/236 (49%), Positives = 145/236 (61%), Gaps = 22/236 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
           Q  CGSCWAF A EA+SDR CI  +  +N  LS  DLL+CC   F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAW 163

Query: 71  RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQ- 115
           +++  HG+VT         C PY  +       G + P C E   PTPKCV  C   +  
Sbjct: 164 KWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTY 223

Query: 116 --LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KH+  +AY +    E I  EI KNGP+EV+FTVYEDF  Y +GVY H  G  +
Sbjct: 224 PTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  220 bits (560), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 116/236 (49%), Positives = 145/236 (61%), Gaps = 22/236 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
           Q  CGSCWAF A EA+SDR CI  +  +N  LS  DLL+CC   F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAW 163

Query: 71  RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQ- 115
           +++  HG+VT         C PY  +       G + P C E   PTPKCV  C   +  
Sbjct: 164 KWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTY 223

Query: 116 --LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KH+  +AY +    E I  EI KNGP+EV+FTVYEDF  Y +GVY H  G  +
Sbjct: 224 PTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE   VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  219 bits (559), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 113/233 (48%), Positives = 145/233 (62%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q +CGSCWAFGA E+LSDR CIH G ++ LS  +L+ CC   CG GCDGG+P +A  Y+V
Sbjct: 116 QSNCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYYV 174

Query: 75  HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL---W 117
           ++G+VT +       C  Y     C+H       P C    PTP CV+ C   +     +
Sbjct: 175 NNGLVTGDLYGNNSWCQAY-SLAPCAHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPY 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
               H    AY I+ + + IM EI  NGP+EV+FTVYEDF  YKSGVY+H+TG  +GGHA
Sbjct: 234 PKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           VK++GWG  ++G  YWI+ N WN SWG  G FKI RG NECGIE + V  LP+
Sbjct: 294 VKMVGWGV-ENGTPYWIIVNSWNESWGDKGTFKILRGQNECGIESECVTALPA 345


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  219 bits (558), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 114/231 (49%), Positives = 149/231 (64%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    + + LS  +L++CC   CG GCDGGYP SAW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDS-CGFGCDGGYPASAWDY 161

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           + + G+V+       + C PY  +  C H      P C     TP C  +C K++ + + 
Sbjct: 162 WQNVGIVSGGNYGSKQGCQPYSIAP-CEHHVPGPRPACSGEGSTPDCRNQCDKRSGISYD 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
              +Y  SAY +  + + I AEI KNGPVE +FTVYED  +YK GVY+H+ G V+GGHA+
Sbjct: 221 KDLYYGESAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +D   YW++AN WN  WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 281 KILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLP 330


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  219 bits (558), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 112/240 (46%), Positives = 151/240 (62%), Gaps = 18/240 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           N   + ++  QG CGSCWAFGA EA+SDR CIH   N+++S  +LL+CC + CG GC+GG
Sbjct: 94  NCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNVNISAENLLSCC-YTCGFGCNGG 152

Query: 65  YPISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCV 111
           +P +AWR++ + G+V+       + C PY          G   P C     TPKC + C 
Sbjct: 153 FPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKP-CAEGGRTPKCHKTCD 211

Query: 112 KKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
            KN      K  S   S+Y I SDP+ I  +I  NGPVE +F+VY DF  YKSGVY+H+ 
Sbjct: 212 NKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGVYRHVK 271

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G ++GGHA++++GWG  + G  YW++AN WN  WG +G FKI RGS+ CGIE+ VVAGLP
Sbjct: 272 GSLLGGHAIRILGWGM-EKGTPYWLVANSWNTDWGDNGTFKILRGSDHCGIEDSVVAGLP 330


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  219 bits (557), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 116/252 (46%), Positives = 156/252 (61%), Gaps = 32/252 (12%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDG
Sbjct: 90  WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC-------------- 106
           C+GGYP  AW ++   G+V+     Y+DS    H GC P Y  P C              
Sbjct: 150 CNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP-YTIPPCEHHVNGSRPPCTGE 201

Query: 107 --VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
              R+C K  +      ++  KH+  ++Y +++  + IMAEIYKNGPVE +FTV+ DF  
Sbjct: 202 GDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYKNGPVEGAFTVFSDFLT 261

Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
           YKSGVYKH  GD+MGGHA++++ WG  ++G  YW  AN WN  WG +G+FKI RG N CG
Sbjct: 262 YKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWAAANSWNLDWGDNGFFKILRGENHCG 320

Query: 220 IEEDVVAGLPSS 231
           IE ++VAG+P +
Sbjct: 321 IESEIVAGIPRT 332


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  218 bits (556), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 111/232 (47%), Positives = 153/232 (65%), Gaps = 18/232 (7%)

Query: 23  AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           AFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW ++   G+V+
Sbjct: 122 AFGAVEAISDRICIHTNAHISVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 181

Query: 81  EE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 126
                    C PY     C H      P C     TPKC + C    +  ++  KHY  +
Sbjct: 182 GGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 240

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
           +Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+HITG++MGGHA++++GWG  
Sbjct: 241 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGV- 299

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 300 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 351


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  218 bits (556), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 115/231 (49%), Positives = 148/231 (64%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH G  +S+ ++  DLL CC   CG GC+GGYP SAW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDS-CGMGCNGGYPSSAWDF 159

Query: 73  FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
           +   G+V+         C PY  S       G   P       TP+C+ +C    +  ++
Sbjct: 160 WTKEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYK 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  S+Y +    E I AEI KNGPVE +FTVYEDF  YKSGVY+H++G V+GGHA+
Sbjct: 220 QDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +DG  YW+ AN WN  WG +G+FKI RGSN CGIE ++VAG+P
Sbjct: 280 KVLGWG-EEDGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAGIP 329


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  218 bits (555), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 118/232 (50%), Positives = 147/232 (63%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G    + L+ +D+L+CC   CG GC+GG+P +AW Y
Sbjct: 111 QGSCGSCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVLSCC-MSCGSGCNGGFPGAAWSY 169

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKK-NQLW 117
           +VH G+VT       E C PY     C H       P  +   PTP+CVR C K  N  +
Sbjct: 170 WVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDF 228

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + KHY   +Y + S+   I  EI  NGPVE  FTVY DF  YKSGVY+  T   +GGHA
Sbjct: 229 ADDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHA 288

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++L+GWG  + G  YW+ AN WN  WG  G+FKI RGS+ECGIE+DVVAG+P
Sbjct: 289 IRLLGWGV-EKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGIP 339


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  218 bits (554), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG GC+GG
Sbjct: 118 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 176

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
            P++AWRY+V  G+VT     Y  + GC     P CE               YPTPKC +
Sbjct: 177 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 234

Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY 
Sbjct: 235 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 294

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECGIE  VV 
Sbjct: 295 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 353

Query: 227 GLPSSKNLV 235
           G+P   +L 
Sbjct: 354 GIPKLNSLT 362


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  218 bits (554), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG GC+GG
Sbjct: 119 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 177

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
            P++AWRY+V  G+VT     Y  + GC     P CE               YPTPKC +
Sbjct: 178 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 235

Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY 
Sbjct: 236 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 295

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECGIE  VV 
Sbjct: 296 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 354

Query: 227 GLPSSKNLV 235
           G+P   +L 
Sbjct: 355 GIPKLNSLT 363


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGAVEA+SDR CI  H  + ++LS +DLL+CC   CG GC+GG
Sbjct: 109 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 167

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
            P++AWRY+V  G+VT     Y  + GC     P CE               YPTPKC +
Sbjct: 168 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 225

Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           KCV    ++ +   K +  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY 
Sbjct: 226 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 285

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G + GGHAVKLIGWG  DDG  YW +AN WN  WG DG+F+I RG +ECGIE  VV 
Sbjct: 286 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 344

Query: 227 GLPSSKNLV 235
           G+P   +L 
Sbjct: 345 GIPKLNSLT 353


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  217 bits (553), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 116/232 (50%), Positives = 145/232 (62%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CI      N+ +S  DL +CC   CG+GC+GG+P +AW Y
Sbjct: 111 QGACGSCWAFGAVEAMSDRICIKSQGKENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSY 169

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
           +   G+VT       + C PY     C H       P  +   PTPKC   C    N  +
Sbjct: 170 YKRDGLVTGGQYNSHQGCQPY-TIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTY 228

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY +SAY ++   E IM EI  NGPVE +FTVY DF  YKSGVYKH TG  +GGHA
Sbjct: 229 EKDKHYGMSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHA 287

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +K++GWGT ++G+DYW++AN WN  WG  G+FKI RG +ECGIE  + AG P
Sbjct: 288 IKILGWGT-ENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 116/231 (50%), Positives = 148/231 (64%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    + + +S  DLL+CC   CG GCDGG+P SAW +
Sbjct: 18  QGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCS-SCGMGCDGGFPPSAWEF 76

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +V  G+ T         C PY +   C H      P C     TPKCV  C K  N  +R
Sbjct: 77  WVDKGIATGGLWNSHIGCQPY-EIPACEHHTTGDRPPCSDIVDTPKCVHLCEKGYNTSYR 135

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + KH+   +Y I S  + I  EI+KNGPVE +F+VY DF +YKSGVY+H +G+ +GGHA+
Sbjct: 136 DDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAI 195

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  +D   YW+ AN WN  WG  GYFKI RGS+ECGIE  +VAG+P
Sbjct: 196 RVLGWGYEND-VPYWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAGIP 245


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 114/231 (49%), Positives = 146/231 (63%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +++ +S  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRICIHTKGKVSVEISSQDLLTCCDS-CGMGCNGGYPANAWEF 159

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
           +   G+VT         C PY          G   P       TP+CV +C       ++
Sbjct: 160 WTEQGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQ 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y + S+ E I +EIYKNGPVE +F VYEDF  YKSGVY+H+TG  +GGHA+
Sbjct: 220 KDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K+IGWG  ++G  YW+ AN WN  WG +G+FKI RGSN CGIE +VVAG+P
Sbjct: 280 KMIGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIP 329


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  217 bits (552), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 114/246 (46%), Positives = 158/246 (64%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR C+H    +N+ +S  DLL+CCG  CG G
Sbjct: 90  WPNCPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
           C+GGYP  AW+++   G+V+         C PY     C H   G  PA       TPKC
Sbjct: 150 CNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKC 208

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V++C +  +  +   KH+  ++Y + +  ++IMAEIYKNGPVE +F VY DF  YKSGVY
Sbjct: 209 VKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVY 268

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H TG+ +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIV 327

Query: 226 AGLPSS 231
           AG+P +
Sbjct: 328 AGVPKN 333


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  216 bits (550), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 115/251 (45%), Positives = 160/251 (63%), Gaps = 18/251 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI    ++S+ V+  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C       ++  KHY  S+Y ++S  ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLPSSKNLVKE 237
           G+P +    K+
Sbjct: 328 GIPCTDQYWKK 338


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  216 bits (550), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 114/234 (48%), Positives = 143/234 (61%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A E +SDR CI  +    LS+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNGKTQLSISADDINACCGMVCGNGCNGGYPIEAWRH 178

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
           +V  G VT     Y + TGC    +P CE           P+  YPT KC R C     L
Sbjct: 179 YVKKGYVTG--GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYAL 236

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +    H+  SAY ++    +I  EI  +GPVEV+F+VYEDF HY  GVY H  G  +GG
Sbjct: 237 TYTQDLHFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGG 296

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWG  D+G  YW+ AN WN  WG +GYF+I RG NECGIE  VV G+P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGIP 349


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  216 bits (550), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 111/223 (49%), Positives = 140/223 (62%), Gaps = 19/223 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI     +   LS  D+L+CC   CG GC+GG+P  AWR+
Sbjct: 37  QSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRF 96

Query: 73  FVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
           F  HG+ TE   PY     C H         C P+ PTPKCVR   KK       +++  
Sbjct: 97  FKMHGLTTESKYPYVFPP-CEHHINKTHYKPCGPSQPTPKCVRASEKK------PRYHGK 149

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           S Y ++  P  I AEI  NGPVE +FTVY+DF  Y+SGVY+H++G  +GGHA+K++GWG 
Sbjct: 150 SVYSVS--PAKIQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGV 207

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            + G  YW++AN WN  WG  G FKI RG +ECGIE  VVAG+
Sbjct: 208 -EAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVAGM 249


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  216 bits (549), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 113/234 (48%), Positives = 142/234 (60%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A E +SDR CI  +    +S+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNGKTQISISADDINACCGMVCGNGCNGGYPIEAWRH 178

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
           +V  G VT     Y + +GC    +P CE           P+  YPT KC   C     L
Sbjct: 179 YVKKGYVTG--GSYQEKSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPL 236

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +    H+  SAY ++  P +I  EI  +GPVEV+FTVYEDF HY  GVY H  G  +GG
Sbjct: 237 TYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG 296

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWG  D+G  YW+ AN WN  WG +GYF+I RG NECGIE  VV G P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGTP 349


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 138/230 (60%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGA EA+SDR CIH    + +S+S  DL  CC + CGDGC+GG+P  AW Y
Sbjct: 107 QASCGSCWAFGAAEAMSDRICIHSNATVKVSISTEDLNTCC-YECGDGCNGGWPAEAWAY 165

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRN 119
           +   G+VT       + C  Y     C H      P C    PTP+C ++C     +   
Sbjct: 166 WAETGIVTGGKYETKDGCKAY-TVPPCEHHTEGDLPACGDIVPTPQCKKECDAGVDIEYK 224

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
           S     SAY+ +SD   I  EI  NGPVE  F VYEDF +YKSGVY+  TG+  GGHA+K
Sbjct: 225 SDLRKGSAYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIK 284

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  +DG  YW+ AN WN  WG  GYFKI RG NECGIE D++ G+P
Sbjct: 285 ILGWGV-EDGTPYWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIP 333


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  215 bits (548), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 147/231 (63%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCW+FGAVE+++DR CIH    + + +S  DL+ CC   CG GC+GG+   AW Y
Sbjct: 104 QGSCGSCWSFGAVESITDRICIHSNGKVKVHISAEDLMTCCT-SCGMGCNGGFLPQAWHY 162

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +V++G+VT       + C PY +   C H        C    PTPKC +KC    N+ + 
Sbjct: 163 WVNNGIVTGGQYHSHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPGYNKTFN 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y I ++ + I  EI  NGPVE +FTVY DF  YKSGVY+H TG  +GGHAV
Sbjct: 222 QDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAV 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWGT ++   YW++AN WN +WG  GYFKI RG +ECGIE  +VAG+P
Sbjct: 282 KILGWGTENN-TPYWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMP 331


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  215 bits (548), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 114/252 (45%), Positives = 158/252 (62%), Gaps = 17/252 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CI    ++S+ V+  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY          G   P       TPKC 
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCS 209

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  S+Y ++S  ++IMAEI+KNGPVE +FTVY DF  YKSGVY+
Sbjct: 210 KICEPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQ 269

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+ GD+MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 328

Query: 227 GLPSSKNLVKEI 238
           G+P +    K I
Sbjct: 329 GIPCTDQYWKRI 340


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 118/234 (50%), Positives = 144/234 (61%), Gaps = 23/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA GAVEA++DR CI    N  +++S +DLL+CC   CG GCDGG P +AW Y
Sbjct: 117 QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGGDPYAAWSY 175

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 116
           +V +G+VT     Y   +GC    +P CE               YPT  C  KC     +
Sbjct: 176 WVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSI 233

Query: 117 WRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
             NS KHY  S Y +  D   I  EI  NGPVEV+F VYEDF HY SG+YKH TGD +GG
Sbjct: 234 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 293

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWGT ++G DYWI AN WN  WG +G+F+I RG +EC IE  VVAG P
Sbjct: 294 HAVKMLGWGT-ENGTDYWICANSWNSDWGENGFFRILRGVDECQIESSVVAGEP 346


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 116/235 (49%), Positives = 146/235 (62%), Gaps = 19/235 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI         LS  +L++CC   CG GC+GG+P SAW Y
Sbjct: 116 QSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLY 174

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           + + G+VT +       C PY +   C H      P C+    TP C   C    N  + 
Sbjct: 175 WKNQGIVTGDLYNTTNGCQPY-EFPPCEHHVIGPLPSCDGDVETPSCKTNCQPGYNIPYE 233

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    YRI+S+PE IM E+ +NGPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 234 KDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAV 293

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           +L+GWG  ++   YW++AN WN  WG  GYFKI RG NECGIE DV AG+P  KN
Sbjct: 294 RLLGWG-EENNVPYWLIANSWNSDWGDKGYFKIVRGKNECGIESDVNAGIPKIKN 347


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 116/233 (49%), Positives = 152/233 (65%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY-- 72
           QG CGSCWAFGAVEA+SDR+CI F   +++S  +LL+CC   CG GCDGGYP +AWR+  
Sbjct: 108 QGACGSCWAFGAVEAMSDRYCISFKEQVNISAENLLSCCE-TCGSGCDGGYPAAAWRHWA 166

Query: 73  --FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQL 116
              ++ G+VT         C PY     C H  PG    C  +  TP C R C+   ++ 
Sbjct: 167 DKLLYEGIVTGGQYDSNAGCQPY-TIPKCDHHEPGPYENCSGSQSTPSCKRSCISSYDKS 225

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +R+ KHY  ++Y I+SD   I  EI  NGPVE +F+VY DF  Y SGVY+H TG  +GGH
Sbjct: 226 YRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGH 285

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A+K++GWGT ++G  YW++AN WN SWG  G+FKI RG +ECGIE  +VAG+P
Sbjct: 286 AIKILGWGT-ENGVPYWLVANSWNPSWGDSGFFKIIRGKDECGIESSIVAGMP 337


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 112/242 (46%), Positives = 148/242 (61%), Gaps = 18/242 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 62
           + N   + ++  QG CGSCWAFGA EA+SDR CIH   N+++S  +LL+CC + CG GC+
Sbjct: 93  WPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNVNISAENLLSCC-YSCGFGCN 151

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRK 109
           GG+P +AW+Y+   G+V+         C PY D   C H        C     TPKC R 
Sbjct: 152 GGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPCEHHVNGTRQPCAEGGRTPKCHRT 210

Query: 110 CVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           C  +N      K  S   S+Y I SDP+ I  EI  NGPVE +F+VY DF + KSGVY+H
Sbjct: 211 CENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVYRH 270

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + G ++GGHA++++GWG  + G  YW++AN WN  WG  G FKI RGS+ CGIE  VV G
Sbjct: 271 VKGSLLGGHAIRILGWGV-EKGTPYWLVANSWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329

Query: 228 LP 229
           LP
Sbjct: 330 LP 331


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 148/230 (64%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    + + LS  +L++CC   CG GCDGG+P SAW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDS-CGYGCDGGFPASAWDY 161

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PGCEPAYP----TPKCVRKCVKKNQLWRN 119
           + + G+V+       + C PY  +  C H  PG  PA      TP C  +C + + +  +
Sbjct: 162 WQNEGIVSGGNYGSKQGCQPYSIAP-CEHHVPGSRPACSGGGDTPDCRNQCDEGSGISYD 220

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
             HY         + + I AEI KNGPVE +FTVYED  +YK GVY+H+ G+ +GGHA+K
Sbjct: 221 QDHYYGETVYTLDEAKQIQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIK 280

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  +D   YW++AN WN  WG +G+FKI RGS+ECGIE+ +VAGLP
Sbjct: 281 ILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGLP 329


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  214 bits (545), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 109/233 (46%), Positives = 148/233 (63%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC + CG GC+GG+P +AW Y
Sbjct: 113 QGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNY 171

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-W 117
           +   G+V+    PY  + GC              +   C+    TP CV+KC +  ++ +
Sbjct: 172 WKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
               H+  SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA
Sbjct: 230 AQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           ++++GWG  +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 290 IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 342


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/231 (48%), Positives = 148/231 (64%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH     N   S +DL++CC + CG GC+GGYP +AW Y
Sbjct: 99  QGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDLVSCC-WTCGMGCNGGYPGAAWHY 157

Query: 73  FVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WR 118
           +V  G+V+       + C PY        T  S P C+ +   TPKC + C    ++ + 
Sbjct: 158 WVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESNYKINYS 217

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           N  H+   AY I+SD + I AEI +NGPVE +F+VY DF +YK+GVY+HI G  +GGHA+
Sbjct: 218 NDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAI 277

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++ GWG  ++   YW++AN WN  WG  G FKI RGS+ CGIE  +VAGLP
Sbjct: 278 RIFGWGVENN-TPYWLIANSWNTDWGDSGTFKILRGSDHCGIESGIVAGLP 327


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 109/245 (44%), Positives = 153/245 (62%), Gaps = 19/245 (7%)

Query: 1   MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 58
           + + N   ++ +  QG CGSCWAFGA EA+SDR+CIH    +++ +S  DLL+CC   CG
Sbjct: 87  LQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEISAEDLLSCCD-ACG 145

Query: 59  DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPK 105
            GC GG+P +AW Y+   G+VT         C PY  +  C H      P C     TPK
Sbjct: 146 MGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAP-CEHHVNGTRPPCTGEGDTPK 204

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           CV +C       ++  K +    Y +    + IM E+YKNGPVE +F+VYEDF  YK+GV
Sbjct: 205 CVSECNAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGV 264

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           Y+H+TG ++GGHA+K++GWG  ++   YW++AN WN  WG +G+FKI RG +ECGIE ++
Sbjct: 265 YQHVTGQMLGGHAIKILGWG-KENNTPYWLVANSWNTDWGDNGFFKILRGKDECGIESEI 323

Query: 225 VAGLP 229
           VAG+P
Sbjct: 324 VAGIP 328


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 116/232 (50%), Positives = 143/232 (61%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CI      N  +S  DL +CC   CG+GC+GG+P +AW Y
Sbjct: 111 QGACGSCWAFGAVEAMSDRICIKSQGKENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSY 169

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
           +   G+VT       + C PY     C H       P  +   PTPKC   C    N  +
Sbjct: 170 YKKDGLVTGGQYNSHQGCLPY-TIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTY 228

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY  SAY ++   E IM EI  NGPVE +FTVY DF  YKSGVYKH TG  +GGHA
Sbjct: 229 EKDKHYGSSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHA 287

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +K++GWGT ++G+DYW++AN WN  WG  G+FKI RG +ECGIE  + AG P
Sbjct: 288 IKILGWGT-ENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/231 (48%), Positives = 147/231 (63%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +N+ +S  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSNGKVNVEISSEDLLTCCDS-CGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY  +       G   P       TP+CVR+C       + 
Sbjct: 160 WASEGLVSGGLYESHIGCRPYTIAPCEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYI 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y + SD + I  EIYKNGPVE +FTVYEDF  YK+GVY+H++G  +GGHA+
Sbjct: 220 QDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW+ AN WN  WG +GYFKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EENGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEIVAGIP 329


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 114/223 (51%), Positives = 141/223 (63%), Gaps = 18/223 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGAVE+  DR CIH G+++ LS  DL+ C      DGC+GG  +SAW +  
Sbjct: 100 QARCGSCWAFGAVESAQDRICIHKGLDVQLSFLDLVTC--DQSDDGCEGGDDVSAWNFLK 157

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNSKHYSIS 126
             GVVT+EC PY      + P C PA         TP CV++C   + L +   KH    
Sbjct: 158 KQGVVTQECKPY------TIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAK 211

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
            Y INS  E IM EI  NGPVE  F+VYEDF  YKSGVY+H TG  +GGH VK+ G+GT 
Sbjct: 212 IYSINS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTL 270

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +G +YW +AN W  SWG +G F IKRGS+ECGIE++VVAG+P
Sbjct: 271 -NGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIEDEVVAGIP 312


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 151/245 (61%), Gaps = 19/245 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  Q  CGSCWAFGAVE++SDR CIH    +++ LS  +LL+CC   CG G
Sbjct: 102 WKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLLSCCS-RCGFG 160

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCE-PAYPTPKC 106
           C+GG P  AW Y+   G+VT         C PY        ST  +H  CE   Y TP+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            + C     + + N K+Y  S+Y + SD   IM EI  NGPVE +F V++DF +YK+GVY
Sbjct: 221 YQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVY 280

Query: 166 KHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           K++TG ++GGHA+++IGWG S  +   YW+ AN WN+ WG  GYFKI RGSNECGIE  V
Sbjct: 281 KYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMV 340

Query: 225 VAGLP 229
            AGLP
Sbjct: 341 TAGLP 345


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  213 bits (543), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 116/242 (47%), Positives = 149/242 (61%), Gaps = 22/242 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           E ++ +  Q  CGSCWAFGAVEA+SDR CI  H  + +SLS +DLL+CC   CG GC+GG
Sbjct: 134 ESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGG 192

Query: 65  YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRK 109
            P++AWRY+V  G+VT         C PY     C H        P     YPTPKC ++
Sbjct: 193 DPLAAWRYWVKDGIVTGSNFTANSGCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKR 251

Query: 110 CVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           C  +  ++ +   K Y  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY H
Sbjct: 252 CNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVH 311

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             G + GGHAVKLIGWG  +DG  YW +AN WN  WG DG+F+I RG +ECGIE  VV G
Sbjct: 312 TGGKLGGGHAVKLIGWGI-EDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGG 370

Query: 228 LP 229
           +P
Sbjct: 371 IP 372


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  213 bits (543), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 113/234 (48%), Positives = 141/234 (60%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A E +SDR CI       +S+S +D+ ACCG  CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASKGQTQVSISADDINACCGMACGNGCNGGYPIEAWRH 178

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
           +V +G VT     Y + TGC    +P CE           P+  YPT KC R C     L
Sbjct: 179 YVKNGYVTG--GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSL 236

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++   H+  SAY ++    +I  EI  NGPVEV+FTVY DF  Y  GVY H  G  +GG
Sbjct: 237 TYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGG 296

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWG  D+G  YW+ AN WN  WG +GYF+I RG NECGIE  VV G+P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIEHGVVGGIP 349


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  213 bits (543), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 110/229 (48%), Positives = 144/229 (62%), Gaps = 17/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCWA GAVEA+SDR+C+ F  N+ +S  +L+ CC F CG+GC GG+   AW Y+V
Sbjct: 104 QGSCGSCWALGAVEAMSDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWV 162

Query: 75  HHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNS 120
             G+VT       E C PY     C+H  PG    C     TP+C R C       +   
Sbjct: 163 KDGLVTGGQYGSDEGCQPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEAD 221

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
            HY   AY ++ + E I  EI  NGPVE +FTVY DF  YKSGVY+H+ G  +GGHA+++
Sbjct: 222 LHYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRI 281

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +GWGT ++G  YW++AN WN SWG  GYFK+ RG ++CGIE ++VAG P
Sbjct: 282 LGWGT-ENGVPYWLIANSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTP 329


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  213 bits (542), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 114/230 (49%), Positives = 143/230 (62%), Gaps = 17/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH       SLS  DL++CCG+ CG GC GGYP +AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLVSCCGY-CGFGCQGGYPPAAWDF 160

Query: 73  FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
           +  +G+VT   + DP     +    CSH G +         Y TPKCV KC   N  +  
Sbjct: 161 WQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKCDTPNIDYET 220

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  +   Y +      IM EI  NGPVE +F VYEDF  YK GVY H TG+ +GGHA++
Sbjct: 221 DKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIR 280

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  ++G  YW++AN WN  WG DGYFK+ RG NECGIE++V AGLP
Sbjct: 281 ILGWG-EENGTPYWLIANSWNEGWGEDGYFKMLRGKNECGIEDEVTAGLP 329


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  213 bits (542), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 112/231 (48%), Positives = 144/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 162

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V       T+ C PY +   C H  PG    C     TPKC++KC    N  ++
Sbjct: 163 WKHFGLVSGGSYNSTQGCRPY-EIPPCEHHVPGNRLPCSGDTKTPKCIKKCEDNYNVAYK 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY    Y +    + I AE+YKNGPVE +FTVY D   YKSGVYKH+ GD +GGHA+
Sbjct: 222 QDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAI 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  213 bits (542), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 111/243 (45%), Positives = 150/243 (61%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG G
Sbjct: 89  WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSEDLLTCCES-CGMG 147

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP +AW ++   G+VT         C PY          G   P       TP+C+
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCI 207

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
            +C       ++  KHY  ++Y + ++   I  EIYKNGPVE +F VYEDF  YKSGVY+
Sbjct: 208 NQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G ++GGHA+K++GWG  +DG  YW+ AN WN  WG +GYFKI RGS+ CGIE +VVA
Sbjct: 268 HVSGSLIGGHAIKILGWGV-EDGVPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVA 326

Query: 227 GLP 229
           G+P
Sbjct: 327 GIP 329


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  213 bits (542), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 117/234 (50%), Positives = 150/234 (64%), Gaps = 19/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI   G++   LS  +L+ACC   CG GC+GG+P SAW Y
Sbjct: 114 QSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSY 172

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+VT +       C PY +   C H      P CE    TPKC   C    N  + 
Sbjct: 173 WKRSGIVTGDLYNPTDGCQPY-EFPPCEHHVVGPRPSCEGDVETPKCKTTCQPGYNIPYN 231

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y  + YR++S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 232 KDKWYGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAV 291

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +L+GWG  ++G  YW++AN WN  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 292 RLLGWG-EENGVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKLK 344


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  213 bits (542), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 111/234 (47%), Positives = 145/234 (61%), Gaps = 18/234 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH      + +S +DLL+CCG  CG GC+GG P +AWRY
Sbjct: 106 QGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLLSCCGLFCGFGCNGGLPENAWRY 165

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY +   C H      P C+    TPKC R+CV+  +  ++
Sbjct: 166 WAIDGIVSGGLYGSHVGCRPY-EIPPCEHHTSGNRPDCKGNSKTPKCQRQCVESFDGKYQ 224

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH++ + Y + +  EDIM EI   GPVE  F VY DF  YKSGVY+H+ G  +GGHAV
Sbjct: 225 ADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAV 284

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           K++GWG  ++G  YW+ AN WN  WG  G+FKI RG N C IE D+ AG+P  +
Sbjct: 285 KILGWG-EENGVPYWLCANSWNTDWGDGGFFKILRGYNHCKIEADINAGIPKIR 337


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  213 bits (542), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 143/230 (62%), Gaps = 12/230 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +S +DLL+CCG  CG+GC+GG
Sbjct: 99  KSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 158

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           YPI A R++   GVVT        C PY  +  C+   C P   TP C   C    +  +
Sbjct: 159 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTPSCSMSCQSGYSTAY 216

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+ +SAY +  +   I AEIY NGPVE +F+VYEDF  YKSGVYKH  G  +GGHA
Sbjct: 217 AKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHA 276

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT + G  YW++AN W  +WG  G+FKI RG ++CGIE  VVAG
Sbjct: 277 IKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAG 325


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  213 bits (541), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 146/231 (63%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    MN+ +S  DLL+CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAEDLLSCCDS-CGMGCNGGYPSAAWEF 159

Query: 73  FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
           +   G+V+         C PY  +       G   P       TP+C +KC       + 
Sbjct: 160 WTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYT 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y ++   ++I  EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+
Sbjct: 220 QDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW+ AN WN  WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAGIP 329


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  212 bits (540), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 144/230 (62%), Gaps = 14/230 (6%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q +CGSCWAFGA E +SDR CI         +S  D++ CCG  CG GCDGG
Sbjct: 101 KSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMVDCCGEYCGYGCDGG 160

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           Y I A R++V  GVVT      + C PY     C+  GC P   TP+C   C  K N  +
Sbjct: 161 YSIQALRWWVFDGVVTGGDYQGDGCKPY---QFCNSAGC-PDAVTPECALSCQSKYNTEY 216

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K++  SAY +      I  +I  NGPVE SF VYEDF  YKSGVYK+I G ++GGHA
Sbjct: 217 AKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHA 276

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT ++G  YW++AN W   WG +G+FKI+RG NECGIE +VVAG
Sbjct: 277 IKIIGWGT-ENGTAYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 325


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/230 (49%), Positives = 141/230 (61%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG+ EA+SDR CI  H    + LS +D+L+CC + CGDGCDGGYPISAW Y
Sbjct: 116 QSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDILSCC-YDCGDGCDGGYPISAWEY 174

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-W 117
           FV  GVVT       + C PY +   C H   E  Y        TP CV  C     + +
Sbjct: 175 FVETGVVTGGLYGTKDSCRPY-EIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISY 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + K +   +Y I S    I  EI   GPV  +F VYEDF HY  G+YKH++G   GGHA
Sbjct: 234 DDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           V+++GWG  + G  YW++AN WN  WG +GYF+I RGSNECGIEE+VVAG
Sbjct: 294 VRILGWG-EEKGTAYWLVANSWNTDWGENGYFRILRGSNECGIEENVVAG 342


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 116/245 (47%), Positives = 150/245 (61%), Gaps = 24/245 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI  H  + +SLS +DLL+CC   CG GC+GG P++AWRY
Sbjct: 128 QSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGGDPLAAWRY 186

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVK--KN 114
           +V  G+VT     Y  ++GC     P CE               YPTPKC +KC+    +
Sbjct: 187 WVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTD 244

Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
           + +   K Y  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY H  G + G
Sbjct: 245 KTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGG 304

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           GHAVKLIGWG  +DG  YW  AN WN  WG DG+F+I RG +ECGIE  VV G+P   ++
Sbjct: 305 GHAVKLIGWGI-EDGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSV 363

Query: 235 VKEIT 239
              ++
Sbjct: 364 SSRLS 368


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 113/243 (46%), Positives = 150/243 (61%), Gaps = 19/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI-HFGMNLS-LSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWA  AVEA+SDR C+   G  ++ +S  DL +CC   CG+G
Sbjct: 90  WANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAEDLNSCCKS-CGNG 148

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P +AW Y+   G+VT       + C PY +   C H      P C    PTP+C 
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHINGSRPACGKLEPTPRCK 207

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    N  +   KHY+ +AY ++S  + I  EI  NGPVE +FTVY DF HYKSGVY+
Sbjct: 208 KSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H +G  +GGHAVK+IGWGT +    YW++AN WN  WG  G+FKI RG +ECGIE D+VA
Sbjct: 268 HESGAELGGHAVKMIGWGT-EGSTPYWLIANSWNTDWGNMGFFKILRGQDECGIERDIVA 326

Query: 227 GLP 229
           G P
Sbjct: 327 GEP 329


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 144/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N  LS  +L++CC + CG GC+GG+P +AW +
Sbjct: 111 QGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCC-YTCGFGCNGGFPGAAWSH 169

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT       + C PY     C H      P C     TPKC++ C     + + 
Sbjct: 170 WVKKGIVTGGNFNSSQGCQPYI-IPACEHHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYT 228

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
              HY  S+Y ++   EDI  EI  NGPVE + TVYEDF  YKSGVY+H+ G  +GGHA+
Sbjct: 229 QDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAI 288

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  YW++AN WN  WG +GY K+ RG + CGIE  + AGLP
Sbjct: 289 RILGWGV-EEGVPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLP 338


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/236 (44%), Positives = 151/236 (63%), Gaps = 18/236 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           + ++  QG CGSCWAFGAVEA+SDR CIH    +++S  +LL+CC + CG GC+GG+P +
Sbjct: 100 ISLIRDQGSCGSCWAFGAVEAMSDRLCIHSNKIVNVSAENLLSCC-YSCGFGCNGGFPGA 158

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKN- 114
           AW ++   G+V+       + C PY  +  C H      P C     TPKC   C  ++ 
Sbjct: 159 AWSFWKKKGLVSGGLYGSHKGCQPYAIAP-CEHHANGTRPPCSGGGRTPKCHTFCENEDY 217

Query: 115 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   K +  S+Y + SDP+ I  EI  NGPVE +F+VY DF +YKSGVY+H+ G ++
Sbjct: 218 SLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLL 277

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHA++++GWG  ++G  YW++AN WN  WG +G FKI +GS+ CGIE  +VAGLP
Sbjct: 278 GGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLP 332


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/231 (49%), Positives = 142/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWAFGA  A+SDR CI  G      +S  DL+ CC   CG GC GGYP  AW Y
Sbjct: 110 QSNCGSCWAFGAAGAISDRICIASGGKHQPRISPEDLVDCCAD-CGMGCQGGYPAQAWEY 168

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP------TPKCVRKCVKK-NQLWR 118
           +V +G+VT       + C PY     C H    P  P      TP+CV+KC  +  + + 
Sbjct: 169 WVRNGLVTGDLYNTTDTCRPY-SFPPCEHHVVGPRKPCTGDPTTPQCVKKCQPEYPKTYE 227

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           N K Y + AY I+SD E IM ++   GP+EV F VY DF  Y SGVY+H+ G ++GGHAV
Sbjct: 228 NDKWYGLKAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAV 287

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +L+GWG  +DG DYW++AN WN  WG  GYFKI+RG NECGIE D  AG P
Sbjct: 288 RLVGWGV-EDGADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHP 337


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 111/234 (47%), Positives = 144/234 (61%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWGY 168

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
           +V  G+V+    PY  S GC              + P CE  Y  TP+C  KC    ++ 
Sbjct: 169 WVRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVD 226

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH+   AY I+ +  DI  EI  NGPVE +FTVYED   YK GVY+H+ G  +GGH
Sbjct: 227 YKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGH 286

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           A+++IGWG   D   YW++AN WN  WG +G+FKI RG + CGIE  + AGLP 
Sbjct: 287 AIRIIGWGVEKD-TPYWLIANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPK 339


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 146/231 (63%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 162

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       + C PY +   C H  PG    C     TPKCV++C    ++ ++
Sbjct: 163 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRLPCSGDTKTPKCVKECESGYKVPYK 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY    Y +    + I AE+YKNGPVE +FTVY D   YKSGVYKH+TGD +GGHA+
Sbjct: 222 QDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAI 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 107/236 (45%), Positives = 146/236 (61%), Gaps = 21/236 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH------FGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           QG CGSCWAFGA EA+SDR CI         + + LS +DLL+CC   CG GC+GG+P  
Sbjct: 118 QGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSADDLLSCCRD-CGMGCNGGFPSQ 176

Query: 69  AWRYFVHHGVVTE------------ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
           AW ++ H G+V+             E  P       + P CE   PTPKC   C ++ ++
Sbjct: 177 AWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHVNGTRPPCEGDAPTPKCKNVCQEEYKV 236

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++  KHY++  Y ++S+ + I  E+  +GPVE  F VY DF  YKSGVY+H++G ++GG
Sbjct: 237 PYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGG 296

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           HA+KL+GWG  +DG  YW+ AN WN  WG  G+FKI RG N CGIE D+VAG+P +
Sbjct: 297 HAIKLMGWG-EEDGVPYWLCANSWNTDWGEGGFFKILRGKNHCGIESDIVAGIPQN 351


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 111/232 (47%), Positives = 148/232 (63%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLTCC-MSCGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLW 117
           +   G+V+         C PY  +  C H      P C      TP+C+ KC       +
Sbjct: 160 WTKEGLVSGGLYDSHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSY 218

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  KH+  ++Y + SD E I +EI+KNGPVE +F VYEDF  YKSGVY+H++G  +GGHA
Sbjct: 219 KEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHA 278

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +K++GWG  +DG  YW+ AN WN  WG +G+FK  RGS+ CGIE +VVAG+P
Sbjct: 279 IKILGWGV-EDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIP 329


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           +T+   +  +  Q  CGSCWAFGAVEA+SDR CI         LS  +L++CC   CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P SAW Y+ + G+VT +       C PY +   C H      P C+    TP C 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHNTLGPLPVCDGDVETPPCK 222

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341

Query: 227 GLPSSK 232
           G+P  K
Sbjct: 342 GIPKIK 347


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           +T+   +  +  Q  CGSCWAFGAVEA+SDR CI         LS  +L++CC   CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P SAW Y+ + G+VT +       C PY +   C H      P C+    TP C 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341

Query: 227 GLPSSK 232
           G+P  K
Sbjct: 342 GIPKIK 347


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 110/237 (46%), Positives = 151/237 (63%), Gaps = 24/237 (10%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
           ++ +  Q  CGSCWAFGA E +SDR CI         +S  D+L+CCG  CG GC GGY 
Sbjct: 111 LKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDILSCCGSTCGKGCQGGYT 170

Query: 67  ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPA----YPTPKCVRKCVKKNQL 116
           I A +Y+++ GVVT        C PY      S P C+ +    + TP C   C +K   
Sbjct: 171 IEAMKYWMNSGVVTGGDYNGAGCMPY------SFPPCKKSPCVEFSTPSCKTTCQEKYTT 224

Query: 117 --WRNSKHYSISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
             ++N KH++ SAY++++       I  EIY NGPVE S+ V+EDF  YKSGVY H++G+
Sbjct: 225 ADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFYQYKSGVYHHVSGN 284

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++GGHAVK+IGWGT ++G DYW++AN W  S+G  G+FKI+RG+NEC IE ++VAGL
Sbjct: 285 LVGGHAVKIIGWGT-ENGVDYWLVANSWGTSFGEKGFFKIRRGTNECQIESNIVAGL 340


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 113/231 (48%), Positives = 144/231 (62%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH G  +S+ ++  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLLTCCD-ACGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWR 118
           +   G+V+         C PY           S P C      TPKCV  C    +  + 
Sbjct: 160 WTKEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYT 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  S+Y + +  E I AEI +NGPVE +F VYEDF  YKSGVY+H TG  +GGHA+
Sbjct: 220 KDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +DG  YW+ AN WN  WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EEDGVPYWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAGIP 329


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 110/244 (45%), Positives = 154/244 (63%), Gaps = 20/244 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH    +N+ LS +DL++CC + CG G
Sbjct: 90  WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSADDLVSCC-YSCGMG 148

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PGCEPA-----YPTPKC 106
           C+GG+P +AW Y+V+ G+V+       + C PY +   C H   G  P        TP C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPY-EIAPCEHHVNGTRPPCTGDDNKTPSC 207

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            ++C K  N  ++  K++   AY I+S+ + I  EI  NGPVE +F VYED   YK GVY
Sbjct: 208 KQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVY 267

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +H+ G+ +GGHA++++GWGT + G  YW++AN WN  WG +G FKI RG + CGIE  +V
Sbjct: 268 QHVKGEALGGHAIRILGWGT-EKGTPYWLIANSWNSDWGDNGTFKILRGEDHCGIESSIV 326

Query: 226 AGLP 229
           AG+P
Sbjct: 327 AGIP 330


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           +T+   +  +  Q  CGSCWAFGAVEA+SDR CI         LS  +L++CC   CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P SAW Y+ + G+VT +       C PY +   C H      P C+    TP C 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341

Query: 227 GLPSSK 232
           G+P  K
Sbjct: 342 GIPKIK 347


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 111/235 (47%), Positives = 142/235 (60%), Gaps = 23/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A E +SDR CI  +  +N  +S  DLL+CC   CGDGCDGGYP+ AWRY
Sbjct: 100 QSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLLSCCTS-CGDGCDGGYPLQAWRY 158

Query: 73  FVHHGVVT-------EECDPYFDS------TGCSHPGCEPAY--PTPKCVRKCVKKNQL- 116
           +V  G+V+         C PY  +       G + P C PA    TP+C   C  K+   
Sbjct: 159 WVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKC-PAQEEATPECASHCTSKSSYS 217

Query: 117 --WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
             +   KHY +SAY +      I  EI ++GPVE  F VY DF  YKSG+Y H++G  +G
Sbjct: 218 VAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELG 277

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GHAVK++GWG  ++G  YW++AN WN +WG  GYF+I RG NECGIE  VVAG+P
Sbjct: 278 GHAVKILGWGV-ENGTKYWLVANSWNINWGEKGYFRILRGRNECGIESAVVAGIP 331


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 113/230 (49%), Positives = 141/230 (61%), Gaps = 17/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CGDGCDGG+P  AW +
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGDGCDGGFPPMAWDF 166

Query: 73  FVHHGVVT----EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           +  HG+VT    EE   C PY        S G   P     YPTPKCV+ C      ++ 
Sbjct: 167 WKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQK 226

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  + ++Y ++     IM EI  NGPVE +F V+EDF  YKSG+Y H  G  +GGHA++
Sbjct: 227 DKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIR 286

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  ++G  YW++AN WN  WG  GY +  RG NECGIEE+  AGLP
Sbjct: 287 ILGWG-EENGVPYWLIANSWNEDWGEKGYLRFLRGHNECGIEEEATAGLP 335


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 107/237 (45%), Positives = 145/237 (61%), Gaps = 19/237 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A+SDR CI       +++S  D++ CC   CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
           F++ GVV+       + C PY     C H G       C    PTP C RKC     +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           R  K Y   AY +    + I +EI KNGPV  SF VYEDF HYKSG+YKH  G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           VK+IGWG +++  D+W++AN W+  WG  GYF+I RGSN+CGIE  + AG+  +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 114/244 (46%), Positives = 149/244 (61%), Gaps = 24/244 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + +  +  Q  CGSCWAFGAVEA+SDR CI  H  + +SLS +DLL+CC   CG GC+GG
Sbjct: 119 QSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGG 177

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
            P++AWRY+V  G+VT     Y  ++GC     P CE               YPTPKC +
Sbjct: 178 DPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 235

Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           KC+    ++ +   K Y  SAY +  D E I  E+  +GP+E++F VYEDF +Y  GVY 
Sbjct: 236 KCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 295

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G + GGHAVKL+GWG  ++G  YW  AN WN  WG DG+F+I RG +ECGIE  VV 
Sbjct: 296 HTGGKLGGGHAVKLVGWGI-ENGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVG 354

Query: 227 GLPS 230
           G+P 
Sbjct: 355 GVPK 358


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH    + +++S  D L CC  +CG GC+GG P  AW +
Sbjct: 107 QSTCGSCWAFGAVEAMSDRICIHSNATVKVNISAEDPLDCC-TICGMGCNGGMPAMAWLH 165

Query: 73  FVHHGVVTEECDPYFDSTGCSH--------------PGCEPAYPTPKCVRKCVKKNQLWR 118
           +  +G+VT     Y D+ GC                P C P  PTP C ++C   + L  
Sbjct: 166 WTVNGIVTG--GNYEDTNGCKAYSFAPCEHHVDGDLPPCGPTKPTPDCKKECDSGSSLTY 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
            +     S Y I+  P+ I  EI  NGPVE SF+VYEDF  YKSGVY+H+ G+  GGHA+
Sbjct: 224 QNDLTHGSNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAI 283

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +D   YW++AN WN  WG  GYFKI RGSNECGIE  +VAG+P
Sbjct: 284 KILGWGVEND-TPYWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIP 333


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 115/241 (47%), Positives = 145/241 (60%), Gaps = 25/241 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  AVEA+SDR CI       ++LS +DLL+CC   CG GC GG P++AW+Y
Sbjct: 145 QSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKY 203

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
           +V  G+VT     Y + +GC     P CE               YPTPKCV+KC K   +
Sbjct: 204 WVLRGIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGK 261

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++  K+Y    Y + S+ E I  EI   GPVE SF VY DF +Y  G+YKH+ G + GG
Sbjct: 262 SYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGG 321

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           HAVK++GWG  D G  YW+ AN WN  WG DGYF+I RG NECGIE  ++AG+P  K L 
Sbjct: 322 HAVKVLGWGI-DQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIP--KQLA 378

Query: 236 K 236
           K
Sbjct: 379 K 379


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 116/234 (49%), Positives = 148/234 (63%), Gaps = 19/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI   G++   LS  +L+ACC   CG GC+GG+P SAW Y
Sbjct: 114 QSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSY 172

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+VT +       C PY +   C H      P C     TPKC   C    N  + 
Sbjct: 173 WKRSGIVTGDLYNTTDGCQPY-EFPPCEHHVVGPRPSCGGDVETPKCKTTCQPGYNIPYN 231

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y  + YR++S+ E IM E+  +GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 232 KDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAV 291

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +L+GWG  ++G  YW++AN WN  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 292 RLLGWG-EENGVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKLK 344


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 111/229 (48%), Positives = 144/229 (62%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCW+FGAVEA+SDR CI  G   N+ LS  DLL CC   CG GC+GG    AW Y
Sbjct: 111 QSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDY 169

Query: 73  FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C+PY        T   +P C    Y TP+C + C +K +  + 
Sbjct: 170 WVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYT 229

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH   S+Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 230 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAI 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++IGWG  ++   YW++AN WN  WG +GYF+I RG +EC IE +V+AG
Sbjct: 290 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 145/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSSEDLLSCCP-ICGLGCNGGIPSLAWEY 163

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V       T+ C PY +   C H  PG    C     TPKC + C    N +++
Sbjct: 164 WKHFGIVSGGNYNSTQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKCQKNCENGYNVMYK 222

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y +++  + I AE+YKNGPVE +FTVY D   YKSGVYKHI GD +GGHA+
Sbjct: 223 KDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAI 282

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +D + YW++AN WN  WG +G+FKI RG N CGIE  ++AG P
Sbjct: 283 KILGWGVENDNK-YWLVANSWNTDWGDNGFFKILRGENHCGIEGSIIAGEP 332


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 138/230 (60%), Gaps = 12/230 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +S +DLL+CCG  CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           YPI A R++   GVVT        C PY  +  C+   C P   TP C   C       +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTPACSLSCQSGYTTAY 217

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSGVYKH  G  +GGHA
Sbjct: 218 AKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT + G  YW++AN W  SWG  G+FKI RG ++CGIE  VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAG 326


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  210 bits (534), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 108/233 (46%), Positives = 144/233 (61%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC   CG GC+GG+P +AW Y
Sbjct: 111 QGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHY 169

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-W 117
           +   G+V+    PY    GC              +   C+    TP CV+KC    ++ +
Sbjct: 170 WKTKGIVSG--GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPY 227

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
               H   SAY + +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA
Sbjct: 228 AQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 287

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           ++++GWG  +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 288 IRILGWGVQNGEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLPA 340


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  209 bits (533), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 138/230 (60%), Gaps = 12/230 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +S +DLL+CCG  CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           YPI A R++   GVVT        C PY  +  C+   C P   TP C   C       +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTPACSLSCQPGYTTAY 217

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSGVYKH  G  +GGHA
Sbjct: 218 AKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT + G  YW++AN W  SWG  G+FKI RG ++CGIE  VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAG 326


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  209 bits (533), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 112/229 (48%), Positives = 143/229 (62%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG    AW Y
Sbjct: 111 QSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDY 169

Query: 73  FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C+PY        T   +P C    Y TP+C + C KK +  + 
Sbjct: 170 WVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT 229

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH   S+Y + +D + I  EI K GPVE  FTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 230 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAI 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++IGWG  ++   YW++AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 290 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 337


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  209 bits (533), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 112/229 (48%), Positives = 145/229 (63%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG    AW +
Sbjct: 116 QSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDF 174

Query: 73  FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C+PY        T   +P C    Y TP+C + C KK +  + 
Sbjct: 175 WVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT 234

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH   S+Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 235 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAI 294

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++IGWG  ++   YW++AN WN  WG +GYF+I RG +EC IE +V+AG
Sbjct: 295 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAG 342


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  209 bits (533), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 114/236 (48%), Positives = 139/236 (58%), Gaps = 22/236 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
           Q HCGSCWA  A EA+SDR CI  +  +N  LS  D+L CC   F CGDGC+GGYPI AW
Sbjct: 95  QSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAW 154

Query: 71  RYFVHHGVVT-------EECDPYFDST------GCSHPGCEPAYP-TPKCVRKCVKKNQL 116
           RY+V +G+VT         C PY  +       G + P C      TPKC   C   N  
Sbjct: 155 RYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNNSY 214

Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +   KH+  SAY I    + I  EI  +GPVEV F VYEDF  YK+G+Y H+ G  +
Sbjct: 215 PIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGEL 274

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHAVK++GWG  D+G  YW+ AN WN  WG  GYF+I RG +ECGIE   VAG+P
Sbjct: 275 GGHAVKMLGWGV-DNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMP 329


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  209 bits (533), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 142/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    + + LS  +LL+CC   CGDGC GG P SAW Y
Sbjct: 104 QGSCGSCWAFGAVEAMSDRLCIHSNGKLQVHLSAENLLSCCD-SCGDGCLGGSPESAWEY 162

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY     C H      P C     TPKC ++C K   + + 
Sbjct: 163 WHKFGIVSGGNYGSKQGCQPY-SIAPCEHSIHGSSPACGGVTDTPKCKKQCEKGYSIPYD 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
            + +Y    Y I +D + I AEI KNGP+  SF VYED   YK GVY+H+ G+ +GGH +
Sbjct: 222 KAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVI 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K+ GWG  ++G  YW++AN WN  WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 282 KIFGWGI-ENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAGLP 331


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  209 bits (532), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 108/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 166

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       E C PY +   C H      P C     TP+C+ KC     + + 
Sbjct: 167 WTHKGIVSGGSYGSKEGCRPY-EVEPCEHHVNGTRPPCHSG-STPRCMHKCESGYSVDYA 224

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   AY +N +P DI  EI  NGPVE +FTVYED   YK+GVY+H+ G  +GGHA+
Sbjct: 225 KDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAI 284

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   D+   YW++ N WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 285 RILGWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISAGLP 336


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  209 bits (532), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 147/243 (60%), Gaps = 19/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWA  A EA+SDR C+  +  + + LS  +L+ACC   CG G
Sbjct: 90  WANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENLMACCE-TCGMG 148

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GG+P +AW Y+   G+VT       + C PY +   C H      P C    PTP+C 
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPY-EIAPCEHHINGSRPACGKIEPTPRCK 207

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    N  +   KHY+ SAY ++S  + I  EI  NGPVE +FTVY DF HYKSGVY+
Sbjct: 208 KTCESGYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H +G  +GGHAVK+IGWG  +    YW++AN WN  WG  G+FKI RG +ECGIE D+VA
Sbjct: 268 HESGAELGGHAVKMIGWGM-EGSTPYWLIANSWNSDWGDMGFFKILRGQDECGIERDIVA 326

Query: 227 GLP 229
           G P
Sbjct: 327 GEP 329


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  209 bits (532), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 115/246 (46%), Positives = 149/246 (60%), Gaps = 25/246 (10%)

Query: 10  EILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPI 67
           +I +++   GSCWA  AVEA+SDR CI       ++LS +DLL+CC   CG GC GG P+
Sbjct: 6   DIYILKSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPM 64

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCV 111
           +AW+Y+V  G+VT     Y + +GC     P CE               YPTPKCV+KC 
Sbjct: 65  AAWKYWVLRGIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCD 122

Query: 112 KK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           K   + ++  K+Y  S Y + S+ E I  EI   GPVE SF VY DF +Y  G+YKH+ G
Sbjct: 123 KNYGKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAG 182

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
            + GGHAVK++GWG  D G  YW+ AN WN  WG DGYF+I RG NECGIE  ++AG+P 
Sbjct: 183 SMGGGHAVKVLGWGI-DQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIP- 240

Query: 231 SKNLVK 236
            K L K
Sbjct: 241 -KQLAK 245


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  209 bits (532), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 113/242 (46%), Positives = 145/242 (59%), Gaps = 22/242 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI  +  + +SLS +DLL+CC   CG GCDGG P++AW+Y
Sbjct: 102 QSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKY 160

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQ 115
           +V  G+VT       + C PY     C H        P     YPTPKC +KC  +   +
Sbjct: 161 WVKEGIVTGSNFTMKQGCKPY-PFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEK 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +   K +  +AY +  D   I  EI  +GPVEV+F VYEDF  Y  G+Y H  G + GG
Sbjct: 220 TYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGG 279

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           HAVK++GWG  + G  YW++AN WN  WG DG+F+I RG +ECGIE  VV GLP      
Sbjct: 280 HAVKMLGWGV-EQGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTY 338

Query: 236 KE 237
           K+
Sbjct: 339 KK 340


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  209 bits (532), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 105/237 (44%), Positives = 145/237 (61%), Gaps = 19/237 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A+SDR CI       +++S  D++ CC   CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
           F++ GVV+       + C PY     C H G       C    PTP C RKC     +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           R  K Y   AY +    + I +EI +NGPV  SF VYEDF HYKSG+YKH  G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           VK+IGWG +++  D+W++AN W+  WG  GYF+I RG+N+CGIE  + AG+  +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  209 bits (532), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 144/231 (62%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CI     +S  +S  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDS-CGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+VT         C PY          G   P       TP C  KC    + L++
Sbjct: 160 WTTDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYK 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+  ++Y + S+   IMAE++KNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+
Sbjct: 220 EDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 280 KILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 111/243 (45%), Positives = 149/243 (61%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGA EA+SDR CIH    +S  +S  DLL CC   CG G
Sbjct: 89  WPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKVSVEISAEDLLTCCD-SCGMG 147

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP +AW ++   G+V+         C PY          G   P       TP+C+
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCL 207

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
            +C       +R  KHY  ++Y + SD  +I  EIYKNGPVE +FTVYEDF  YKSGVY+
Sbjct: 208 SQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FK  RGS+ CGIE ++VA
Sbjct: 268 HVSGSAVGGHAIKVLGWG-EENGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVA 326

Query: 227 GLP 229
           G+P
Sbjct: 327 GIP 329


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 109/230 (47%), Positives = 137/230 (59%), Gaps = 12/230 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +S +DLL+CCG  CG+GC+GG
Sbjct: 136 KSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 195

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           YPI A R++   GVVT        C PY     C+   C P   TP C   C       +
Sbjct: 196 YPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCTSGNC-PESKTPSCSLSCQSGYTTAY 253

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSGVYKH  G  +GGHA
Sbjct: 254 AKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 313

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT + G  YW++AN W  SWG  G+F+I RG ++CGIE  VVAG
Sbjct: 314 IKIIGWGT-ESGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESAVVAG 362


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 110/234 (47%), Positives = 144/234 (61%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 168

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
           +V  G+V+    PY  S GC              + P CE  Y  TP+C  KC    ++ 
Sbjct: 169 WVRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVD 226

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH+   AY I+ +  DI  EI  +GPVE +FTVYED   YK GVY+H+ G  +GGH
Sbjct: 227 YKTDKHFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGH 286

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           A+++IGWG   D   YW++AN WN  WG +G+FKI RG + CGIE  + AGLP 
Sbjct: 287 AIRIIGWGVEKD-IPYWLVANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPK 339


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 111/244 (45%), Positives = 149/244 (61%), Gaps = 18/244 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGA EA+SDR CIH    +S  +S  DLL CC   CG G
Sbjct: 89  WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLLTCCDG-CGMG 147

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP +AW ++   G+VT         C PY          G   P       TP C 
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCD 207

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
             C    +  ++  KH+  ++Y + S+ +DIM E+YKNGPVE +FTVYEDF  YKSGVY+
Sbjct: 208 MSCEPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +GYFKI RG + CGIE ++VA
Sbjct: 268 HVSGPALGGHAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVA 326

Query: 227 GLPS 230
           G+P 
Sbjct: 327 GIPQ 330


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 111/238 (46%), Positives = 141/238 (59%), Gaps = 17/238 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           N E ++ +  Q  CGSCWAFGA EA+SDR CI  G    +S  DLL CCG  CG GC+GG
Sbjct: 87  NCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIATGKQTRISTEDLLTCCGITCGMGCNGG 146

Query: 65  YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKC 110
           +P  AW YF + G+VT +       C PY     C H         C  + PTP CV+ C
Sbjct: 147 FPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHVDDGKYGPCGDSQPTPACVKSC 205

Query: 111 VKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
             ++ + + + K  SI +Y ++S  E I  EI   GPVE SFTVYEDF  YKSGVY+++ 
Sbjct: 206 TAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVA 265

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           G  +GGHAVK+IGWG   +   YW++ N WN  WG +G FKI RGSN  GIE  + AG
Sbjct: 266 GANLGGHAVKIIGWGVEKN-VPYWLVVNSWNEGWGENGLFKILRGSNHVGIEGGIYAG 322


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 113/242 (46%), Positives = 145/242 (59%), Gaps = 22/242 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI  +  + +SLS +DLL+CC   CG GCDGG P++AW+Y
Sbjct: 143 QSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKY 201

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQ 115
           +V  G+VT       + C PY     C H        P     YPTPKC +KC  +   +
Sbjct: 202 WVKEGIVTGSNFTMKQGCKPY-PFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEK 260

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +   K +  +AY +  D   I  EI  +GPVEV+F VYEDF  Y  G+Y H  G + GG
Sbjct: 261 TYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGG 320

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           HAVK++GWG  + G  YW++AN WN  WG DG+F+I RG +ECGIE  VV GLP      
Sbjct: 321 HAVKMLGWGV-EQGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTY 379

Query: 236 KE 237
           K+
Sbjct: 380 KK 381


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 145/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR C+  G  ++   S  DL++CC   CG GC+GG+P +AW Y
Sbjct: 107 QGSCGSCWAFGAVEAMSDRVCVASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 165

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLW 117
           +V  G+V+         C PY  +  C H      P CE     TPKCV+KC +  N  +
Sbjct: 166 WVRKGLVSGGPFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPY 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  K +  S+Y I      I  EI  NGPVE +FTVYED  HYK GVY+H+TG ++GGHA
Sbjct: 225 QKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHA 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++++GWG  ++G  YW++AN WN  WG +G+FKI RG +  GIE  + AGLP
Sbjct: 285 IRILGWGV-ENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLP 335


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 112/237 (47%), Positives = 145/237 (61%), Gaps = 18/237 (7%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 64
           + +  +  Q  CGSCWAFGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG
Sbjct: 17  KSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCE-SCGLGCEGG 75

Query: 65  YPISAWRYFVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCV 111
               AW Y+V  G+VT         C+PY        T   +P C    Y TP+C + C 
Sbjct: 76  ILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQ 135

Query: 112 KKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           KK +  +   KH   S+Y + +D + I  EI K GPVE  FTVYEDF +YKSG+YKHITG
Sbjct: 136 KKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITG 195

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + +GGHA+++IGWG  +    YW++AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 196 ETLGGHAIRIIGWGVENKA-PYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 251


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 117/235 (49%), Positives = 143/235 (60%), Gaps = 23/235 (9%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
            Q  CGSCWA GAVEA++DR CI    N  +++S +DLL+CC   CG GCDG  P +AW 
Sbjct: 1   FQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGRDPYAAWS 59

Query: 72  YFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQ 115
           Y+V +G+VT     Y   +GC    +P CE               YPT  C  KC     
Sbjct: 60  YWVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYS 117

Query: 116 LWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
           +  NS KHY  S Y +  D   I  EI  NGPVEV+F VYEDF HY SG+YKH TGD +G
Sbjct: 118 ISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLG 177

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GHAVK++GWGT ++G DYWI AN WN  WG +G+F+I RG +EC IE  VVAG P
Sbjct: 178 GHAVKMLGWGT-ENGTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEP 231


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 117/233 (50%), Positives = 148/233 (63%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A EA SDRFCI  +  +N  LS  D+L+CC   CG GCDGGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCDGGYPINAWKY 161

Query: 73  FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCV--KKNQL 116
            V  G  T         C PY      ++ G  + P C +  Y TP CV KC   K N  
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTA 221

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +++ KH+  +AY +      I AEI  +GPVE +FTVYEDF  YKSGVY H TG  +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGH 281

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWGT D+G  YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 111/232 (47%), Positives = 148/232 (63%), Gaps = 25/232 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++
Sbjct: 120 QSNCGSCWAIAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWW 178

Query: 74  VHHGVVTEECDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQL----WRNS 120
           V  G+ TE+C PY FD   CSH G    YP        TPKC   C ++N++    ++ S
Sbjct: 179 VWVGIATEDCQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTC-ERNEMDLVKYKGS 235

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
             YS+   +      ++M E+  NGP+E++  VY DF  YKSGVYKH+ GD +GGHAVKL
Sbjct: 236 TSYSVKGEK------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKL 289

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +GWGT  DG  YW +AN WN  WG  GYF I+RG+NEC IE   VAG+P+ +
Sbjct: 290 VGWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S +DL+ CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCC-HTCGFGCNGGFPGAAWSY 168

Query: 73  FVHHGVV-------TEECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WRN 119
           +   G+V       TE C PY +   C H    P  P     TP C  +C     + +  
Sbjct: 169 WTTRGIVSGGSYNSTEGCRPY-EVEPCEHHVDGPRPPCHSGSTPHCKHQCQPNYSVDYEK 227

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KH+  S+Y IN +P +I  EI  NGPVE +FTVYED   YK+GVY+H+ G  +GGHA++
Sbjct: 228 DKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIR 287

Query: 180 LIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +IGWG   + +  YW++AN WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 288 IIGWGVWGESKVPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAGLP 338


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  207 bits (528), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 109/230 (47%), Positives = 139/230 (60%), Gaps = 12/230 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +S +DLL+CCG  CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           YPI A R++   GVVT        C PY  +  C+   C P   TP C   C    +  +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTPACSLSCQSGYSTAY 217

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+  SAY +      I  EI  NGPVE +FTVYEDF  YKSGVYKH  G  +GGHA
Sbjct: 218 AKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWGT + G  YW++AN W  +WG  G+FKI RG ++CGIE  VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGAVVAG 326


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  207 bits (528), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 111/234 (47%), Positives = 146/234 (62%), Gaps = 18/234 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +  Q  CGSCWAFGAVEA+SDR CI  G   N+ LS  DLL+CC   CG GC+GG  
Sbjct: 105 IATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGIL 163

Query: 67  ISAWRYFVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKK 113
             AW ++V  G+VT         C+PY        T   +P C    Y TP+C + C KK
Sbjct: 164 GPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKK 223

Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
            +  +   KH   S+Y + +D + I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ 
Sbjct: 224 YKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEA 283

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           +GGHA+++IGWG  ++   YW++AN WN  WG +GYF+I RG +EC IE +V+A
Sbjct: 284 LGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  207 bits (528), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 114/246 (46%), Positives = 150/246 (60%), Gaps = 19/246 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           +T+   +  +  Q  CGS WAFGAVEA+SDR CI         LS  +L++CC   CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSYWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P SAW Y+ + G+VT +       C PY +   C H      P C+    TP C 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           R C    N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G ++GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341

Query: 227 GLPSSK 232
           G+P  K
Sbjct: 342 GIPKIK 347


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C    N  +R
Sbjct: 165 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYR 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    + ++S  + I AE++KNGPVE +FTVY D  +YK+GVYKH  GD +GGHAV
Sbjct: 224 KDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAV 283

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 284 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  207 bits (527), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C    N  +R
Sbjct: 165 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYR 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    + ++S  + I AE++KNGPVE +FTVY D  +YK+GVYKH  GD +GGHAV
Sbjct: 224 KDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAV 283

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 284 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  207 bits (527), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 112/231 (48%), Positives = 146/231 (63%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE +SDR CIH     N   S  +L++CC  LCG GC+GG+P +A++Y
Sbjct: 102 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLVSCC-HLCGFGCNGGFPGAAFKY 160

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +VH G+V       T+ C PY +   C H      P C     TPKCV++C     + + 
Sbjct: 161 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRPKCSEGGGTPKCVKRCENGYTVDYE 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           +  H+   AY I  D + I  EI KNGPVE +FTVY DF HYKSGVY+H  G  +GGHA+
Sbjct: 220 SDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  YW+ AN WN  WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 280 RILGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 329


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 114/237 (48%), Positives = 143/237 (60%), Gaps = 26/237 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  AVEA+SDR CI       + LS +DLL+CC   CG GC GG P++AW+Y
Sbjct: 143 QSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKY 201

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
           +V  G+VT     Y + +GC     P CE               YPTPKC ++C K   +
Sbjct: 202 WVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTK 259

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++  K+Y   AY + +D E I  EI   GPVE SF VY DF HY SG+YKH+ G V GG
Sbjct: 260 SYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGG 319

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWG  D G  YW+ AN WN  WG D   GYF+I RG++ECGIE  +VAG+P
Sbjct: 320 HAVKILGWGI-DQGVSYWLAANSWNNDWGEDVFSGYFRILRGADECGIESGIVAGIP 375


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 113/223 (50%), Positives = 144/223 (64%), Gaps = 19/223 (8%)

Query: 24  FGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 80
           FGAVE++SDR CIH G    + L+ +D+L+CC + CG GC+GG+P +AW Y+V  G+VT 
Sbjct: 120 FGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPAAAWSYWVDKGIVTG 178

Query: 81  ------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 127
                 E C PY     C H        C    PTPKCVR C K  N  +++ KHY  S+
Sbjct: 179 GNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNVDFKDDKHYGKSS 237

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y + S+   I  EI KNGPVE +FTVY DF  YKSGVYK  + D +GGHA++++GWG  +
Sbjct: 238 YSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVEN 297

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           D   YW++AN WN  WG  GYFKI RGSNECGIEED+VAG+P 
Sbjct: 298 D-VPYWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 339


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 104/212 (49%), Positives = 141/212 (66%), Gaps = 18/212 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
           +++GWG  ++G  YW++AN WN  WG +G+FK
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFK 311


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A  A+SDR CIH    M   L+  D L+CC + CG GC GGYP  AW Y
Sbjct: 108 QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 166

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
           ++  G+VT         C P+   T C H G            YPTP C R C    N+ 
Sbjct: 167 WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKT 225

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K Y  S+Y +      IM EI KNGPVEV+F +++DF  Y+SG+Y H+ G  +G H
Sbjct: 226 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 285

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           AV++IGWG  ++G +YW++AN WN  WG +GYF++ RG NECGIE +VVAG+P
Sbjct: 286 AVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMP 337


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 110/237 (46%), Positives = 143/237 (60%), Gaps = 20/237 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CI      ++  +  D+L+CC   CG+GC+GGYP++A  Y
Sbjct: 116 QGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLSCC-LTCGNGCNGGYPLAAMEY 174

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVK--KNQLW 117
           FV  G+VT       + C PY     C H      P C     TPKC  +C+     + +
Sbjct: 175 FVTRGLVTGGLYGTKDTCQPY-TLEACEHHVPGDRPPCTEGGGTPKCSHQCIPDYTTKAY 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           ++ K +   AY + +D   I  EI   GPVE +FTVY DF  YKSGVY+H +G  +GGHA
Sbjct: 234 KDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           +K+IGWGT + G+DYW++ N WN  WG  G FKI RGSNECGIE +VVA    +  L
Sbjct: 294 IKIIGWGT-EGGDDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAATVDASTL 349


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  206 bits (525), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 109/223 (48%), Positives = 135/223 (60%), Gaps = 22/223 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A E +SDR CI       LS+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRH 178

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
           +V  G VT     Y D TGC    +P CE           P+  YPT KC R C     L
Sbjct: 179 YVKKGYVTG--GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYAL 236

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++   H+  SAY ++    +I  EI  +GPVEV+FTVYEDF HY  GVY H  G  +GG
Sbjct: 237 TYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG 296

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
           HAVK++GWG  D+G  YW+ AN WN  WG +GYF+I RG NEC
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNEC 338


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  206 bits (525), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CI  G  ++   S  DL++CC   CG GC+GG+P +AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 163

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKKNQL-W 117
           +VH G+V+         C PY  +  C H      P CE     TPKCV+KC     + +
Sbjct: 164 WVHKGLVSGGPFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPY 222

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K Y   +Y I    + I  EI  NGPVE +FTVYED  HYK GVY+H+TG ++GGHA
Sbjct: 223 AKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++++GWG  ++ + YW++AN WN  WG +G+FKI RG +  GIE  + AGLP
Sbjct: 283 IRILGWGVENNTK-YWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLP 333


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  206 bits (525), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 144/231 (62%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +S  +S  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCDS-CGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+VT         C PY          G   P       TP C  KC    +  ++
Sbjct: 160 WATEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYK 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+  ++Y + S+   IMAE++KNGPVE +FTVYEDF  YKSGVY+H++G  +GGHA+
Sbjct: 220 QDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW+ AN WN  WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 280 KILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  206 bits (524), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 110/232 (47%), Positives = 145/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    ++  +S  DL++CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLVSCC-HTCGFGCNGGFPGAAWSY 168

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLW 117
           +V  G+V+       + C PY  +  C H      P CE     TPKCV+KC    N  +
Sbjct: 169 WVRKGLVSGGPFGSDQGCQPYAIAP-CEHHVNGSRPSCEGEGGKTPKCVKKCQASYNVPY 227

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K Y  S+Y I +  + I  EI  NGPVE +FTVYED  +YK GVY H+ G ++GGHA
Sbjct: 228 AKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHA 287

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++++GWG  +DG  YW++AN WN  WG +G+FKI RG +  GIE  + AGLP
Sbjct: 288 IRILGWGV-EDGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLP 338


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  206 bits (524), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 113/252 (44%), Positives = 151/252 (59%), Gaps = 39/252 (15%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +S  LS  DLL CC   CG GC+GGYP SAW +
Sbjct: 101 QGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLLTCCNS-CGMGCNGGYPSSAWNF 159

Query: 73  FVHHGVVTE-------------------ECDPYFDSTGC--------------SHPGCE- 98
           +V  G+V+                      D  F S GC              S P C  
Sbjct: 160 WVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSG 219

Query: 99  PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 157
               TP+C+ +C    +  ++  KH+  ++Y ++S+ ++I  EIYKNGPVE +FTVYEDF
Sbjct: 220 EGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDF 279

Query: 158 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 217
             YKSGVY+H++G  +GGHA+K++GWG  ++G  YW+ AN WN  WG +G+FKI RG++ 
Sbjct: 280 VLYKSGVYQHVSGSALGGHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGADH 338

Query: 218 CGIEEDVVAGLP 229
           CGIE ++VAG P
Sbjct: 339 CGIESEIVAGNP 350


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  206 bits (524), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 108/235 (45%), Positives = 145/235 (61%), Gaps = 17/235 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     +  +S  DL++CCG+ CG GC GG+P +AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDF 160

Query: 73  FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
           +   G+VT   + +P     +    CSH G +         Y TP CV+KC   +  +  
Sbjct: 161 WQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYAT 220

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  +   Y + +    IM EI  NGPVE +F VYEDF  YKSGVY H  G ++GGHA++
Sbjct: 221 DKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIR 280

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           ++GWG  ++G  YW++AN WN  WG DGYFK+ RG NECGIE++V AGLP   ++
Sbjct: 281 ILGWG-EENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPELSSI 334


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  206 bits (524), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 107/231 (46%), Positives = 141/231 (61%), Gaps = 13/231 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI      +  +SV D+L+CCG  CG GC GG
Sbjct: 111 KSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDILSCCGVSCGKGCQGG 170

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQL 116
           Y I A R++   G VT        C PY     C    C     TP C   C    K   
Sbjct: 171 YSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAPCKKDSCAQG-TTPSCKTTCQSSYKTAE 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KH+  +AY+I +    I  EIY NGPVE SF VYEDF  YKSGVY++ +G ++GGH
Sbjct: 229 YTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           AVK+IGWGT ++G DYW++AN W  ++G  G+FK++RG+NE GIE +VVAG
Sbjct: 289 AVKIIGWGT-ENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEGNVVAG 338


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  206 bits (524), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 109/243 (44%), Positives = 149/243 (61%), Gaps = 19/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH    ++  +S  DL++CC   CG G
Sbjct: 93  WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLVSCC-HTCGFG 151

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPGCEPAYPTPKCV 107
           C+GG+P +AW Y+V  G+V+       + C PY  S       G   P C     TPKCV
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGP-CNGEGKTPKCV 210

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           +KC    N  +   K +  S+Y I S  + I  E++ NGPVE +FTVYED  +YK GVY+
Sbjct: 211 KKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVYEDLLNYKEGVYQ 270

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G ++GGHA++++GWG  +D + +W++AN WN  WG +GYFKI RGS+  GIE  + A
Sbjct: 271 HTAGKMLGGHAIRILGWGVENDTK-FWLIANSWNSDWGDNGYFKILRGSDHLGIESSIAA 329

Query: 227 GLP 229
           GLP
Sbjct: 330 GLP 332


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 107/243 (44%), Positives = 148/243 (60%), Gaps = 17/243 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           +++ + + ++  Q  CGSCWAFGA EA+SDR CIH    M +++S  DLL CC   CG G
Sbjct: 95  WSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLLDCCD-TCGHG 153

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVR 108
           C GG+P +AW ++   G+V+       + C PY  +     T C  P C P   TP+CV 
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNCIPIVHTPECVH 213

Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C K  ++ ++  KH+    Y I+ D + I  EI+ NGPVE  F VY DF  YKSGVY+ 
Sbjct: 214 HCRKGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQR 273

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            + D  G HA++++GWGT ++G  YW+ AN WN +WG  GYFKI R +NECGIEE + AG
Sbjct: 274 HSNDGRGMHAIRILGWGT-ENGTPYWLAANSWNENWGDKGYFKILRRTNECGIEEHIYAG 332

Query: 228 LPS 230
           +P 
Sbjct: 333 IPK 335


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 108/233 (46%), Positives = 141/233 (60%), Gaps = 12/233 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q  CGSCWAFGA E +SDR CI         +SV D+L+CCG  CG GC GG
Sbjct: 107 KSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDILSCCGTTCGKGCQGG 166

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR 118
           Y I A R++  +G VT        C PY  +     P  E   PT K   +       + 
Sbjct: 167 YSIEAMRFWKSNGAVTGGDYNGNGCMPYSFAPCQKSPCVESTTPTCKTTCQSSYTTANYT 226

Query: 119 NSKHYSISAYRI---NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
             KHY  SAYR+   N+    I  EIY NGPVE S+ VYEDF  YKSGVY +++G ++GG
Sbjct: 227 TDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGG 286

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           HAVK+IGWGT +D  DYW++AN W   +G  G+FKI+RG+NEC IE +VVAG+
Sbjct: 287 HAVKIIGWGTEND-VDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGV 338


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  206 bits (523), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 111/232 (47%), Positives = 144/232 (62%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGIPSFAWEY 162

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC R C K+    ++
Sbjct: 163 WKHFGIVSGGNYNSSQGCLPY-EIPPCEHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYK 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + K Y    Y +    E I AEI+KNGPVE +FTVY D   YKSGVYKH  G+ +GGHA+
Sbjct: 222 SDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAI 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG PS
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEPS 332


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 115/233 (49%), Positives = 147/233 (63%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A EA SDRFCI  +  +N  LS  D+L+CC   CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161

Query: 73  FVHHGVVTEE-------CDPYF-----DSTG-CSHPGCEP-AYPTPKCVRKCVKKNQ--L 116
            V  G  T         C PY      ++ G  + P C    Y TP CV KC   N    
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVA 221

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +++ KH+  +AY +      I AEI  +GPVE +FTVYEDF  YKSGVY H TG+ +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGH 281

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWGT D+G  YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 112/231 (48%), Positives = 145/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE +SDR CIH     N   S  +L++CC  LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCC-HLCGFGCNGGFPGAAFKY 159

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +VH G+V       T+ C PY +   C H      P C     TPKC + C K   + + 
Sbjct: 160 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRPKCSEGGGTPKCAKTCEKGYIVDYE 218

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           +  H+   AY I  D + I  EI KNGPVE +FTVY DF HYKSGVY+H  G  +GGHA+
Sbjct: 219 SDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 278

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  YW+ AN WN  WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 328


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 142/230 (61%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWAFGAVEA+SDR CI         +S  DLL+CC  +CG GC GG P  AW +
Sbjct: 124 QSNCGSCWAFGAVEAISDRICIATDGRQKPHISSTDLLSCCK-ICGFGCQGGDPHQAWSF 182

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           +V +G+VT       + C PY        S G   P      PTP C + C    ++  N
Sbjct: 183 WVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYN 242

Query: 120 S-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K+Y + AY +++   D+  E+  NGP+EV+F VYEDF  YK+GVY+H TG V+GGHAV
Sbjct: 243 KDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAV 302

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +L+GWG  ++G  YW+LAN WN  WG  G+FKI RG NECGIE + VAGL
Sbjct: 303 RLLGWG-EENGVPYWLLANSWNTEWGDKGFFKIYRGRNECGIESEAVAGL 351


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 115/233 (49%), Positives = 147/233 (63%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A EA SDRFCI  +  +N  LS  D+L+CC   CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161

Query: 73  FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--L 116
            V  G  T         C PY      ++ G  + P C +  Y TP CV KC   N    
Sbjct: 162 LVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIA 221

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +++ KH+  +AY +      I AEI  +GPVE +FTVYEDF  YKSGVY H TG  +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGH 281

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWGT D+G  YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 140/231 (60%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG CWAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y
Sbjct: 112 QSRCGPCWAFAAVEAMSDRICIQSKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170

Query: 73  FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C E  Y TPKC +KC K  +  ++
Sbjct: 171 WVEEGIVTGSSKENHTGCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYK 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K+Y   +Y + S  + I  EI  +GPVE +FTVY DF +YKSG+YKH+ G V+GGHAV
Sbjct: 231 KDKYYGKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAV 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++IGWG  +    YW++AN WN  WG  GYF+I RG + CGIE  V AGLP
Sbjct: 291 RIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGLP 340


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 109/231 (47%), Positives = 141/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR C +     +   S  DLL+CC  +CG GC GG P  AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEY 163

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC +KC     + ++
Sbjct: 164 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYK 222

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y ++ D + I AE++KNGPVE +FTVY D   YKSGVYKH  GD +GGHAV
Sbjct: 223 QDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAV 282

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  +D + YW++AN WN  WG +G+FKI RG + CGIE  +V G P
Sbjct: 283 KILGWGVENDNK-YWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEP 332


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 108/243 (44%), Positives = 150/243 (61%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL+CC   CG G
Sbjct: 89  WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDS-CGMG 147

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
           C+GGYP +AW ++   G+VT         C PY          G   P       TP+C 
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCS 207

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
            +C       ++  KH+  ++Y + S+ + IMAE+ KNGPVE +FTVYEDF  YKSGVY+
Sbjct: 208 NQCETGYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQ 267

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K++GWG  + G  YW+ AN WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 268 HVSGSAVGGHAIKVLGWG-EEGGTPYWLAANSWNTDWGENGFFKILRGKDHCGIESEMVA 326

Query: 227 GLP 229
           G+P
Sbjct: 327 GVP 329


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F+CG GC GG P  AW ++
Sbjct: 120 QSNCGSCWAIAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWW 178

Query: 74  VHHGVVTEECDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ---LWRNSK 121
           V  G+ TE+C PY FD   CSH G    YP        TPKC   C +       ++ S 
Sbjct: 179 VWVGIATEDCQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTCERSEMDLVKYKGST 236

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
            YS+   +      ++M E+  NGP+E++  VY DF  YKSGVYKH+ G+ +GGHAVKL+
Sbjct: 237 SYSVKGEK------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLV 290

Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GWGT  DG  YW +AN WN  WG  GYF I+RG+NEC IE   VAG+P+ +
Sbjct: 291 GWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 111/230 (48%), Positives = 143/230 (62%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCKGGFPGQAWDY 170

Query: 73  FVHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  + 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 291 RIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
           F  +EH  + V       Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F
Sbjct: 107 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 165

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
           +CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP        TPKC 
Sbjct: 166 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 224

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
             C K        K+   ++Y +  + E +M E+  NGP+EV+  VY DF  YKSGVYKH
Sbjct: 225 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 281

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGSNECGIE   VAG
Sbjct: 282 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 340

Query: 228 LPSSK 232
            P+ +
Sbjct: 341 TPAQE 345


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
           F  +EH  + V       Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
           +CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP        TPKC 
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
             C K        K+   ++Y +  + E +M E+  NGP+EV+  VY DF  YKSGVYKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 276

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGSNECGIE   VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335

Query: 228 LPSSK 232
            P+ +
Sbjct: 336 TPAQE 340


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 109/244 (44%), Positives = 144/244 (59%), Gaps = 23/244 (9%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
           N   ++ +  Q  CGSCWAFGA EA++DR CI     +  ++S +DLL+CC   CG GCD
Sbjct: 105 NCPSIKSIRDQSSCGSCWAFGAAEAMTDRICIASKGAIQFTVSADDLLSCCD-ECGFGCD 163

Query: 63  GGYPISAWRYFVHHGVVTEECDPYFDSTGCS----------------HPGCEPAYPTPKC 106
           GG+P +AW Y+V  G+V+     Y   +GC                 HP  +  YPT  C
Sbjct: 164 GGFPYAAWNYWVEKGIVSG--GSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
             KC       + N K Y   AY + +  + I  EI  +GPVEV++ VYEDF HY  G+Y
Sbjct: 222 EHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIY 281

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           KH  G  +GGHAVK+IGWGT ++G  YWI +N WN  WG +G+F+I RG++ECGIE  VV
Sbjct: 282 KHTAGSYLGGHAVKMIGWGT-ENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVV 340

Query: 226 AGLP 229
           AGLP
Sbjct: 341 AGLP 344


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 108/230 (46%), Positives = 143/230 (62%), Gaps = 17/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    + + LS  +LL+CC   CG GC GG   +AW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLLSCCDS-CGYGCLGGSAENAWEY 161

Query: 73  FVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRN 119
           +   G+V+       + C PY       S   S P CE    TPKC ++C K   + + +
Sbjct: 162 WHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVRDTPKCKKQCEKGYGIPYGD 221

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
              Y    Y I +D + I AEI KNGP+  S  VYED   YK+GVY+H+ G+V+GGH +K
Sbjct: 222 DLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIK 281

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  +D   YW++AN WN  WG +G+FKI RGS+ECGIE+ +VAG+P
Sbjct: 282 ILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGIP 330


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
           F  +EH  + V       Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
           +CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP        TPKC 
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
             C K        K+   ++Y +  + E +M E+  NGP+EV+  VY DF  YKSGVYKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 276

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGSNECGIE   VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335

Query: 228 LPSSK 232
            P+ +
Sbjct: 336 TPAQE 340


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGA EA +DR CI     +   LS  DLL CC   CG GC+GG+P  AW +
Sbjct: 91  QSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCCE-SCGFGCNGGWPSMAWSW 149

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           F   GV T       + C+ Y +   C H      P C    PTP+CV KC +   + ++
Sbjct: 150 FHSTGVTTGGEYGSKDWCNAY-EFPKCDHHVEGKYPPCGETQPTPECVEKCQEGYPVEYK 208

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   AY + S+ E I  E+  NGP+EV F+VYEDF  YKSG+Y+H+ G  +GGHAV
Sbjct: 209 KDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAV 268

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KL+GWG  +DG +YW +AN WN  WG +GYF+I  G NECGIE D VAG+P
Sbjct: 269 KLVGWGV-EDGVEYWKIANSWNEDWGENGYFRIIAGKNECGIESDGVAGIP 318


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 105/237 (44%), Positives = 140/237 (59%), Gaps = 18/237 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CG+CWAFGAVEA+SDR CIH    + +++S  DLL CC + C  GC GG P
Sbjct: 99  IHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLLTCCDY-CRTGCKGGVP 157

Query: 67  ISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
             AW ++   G+VT       + C PY      + +TG   P      P P C R+C K 
Sbjct: 158 SYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKS 217

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + +   KHY    Y ++ D   I  EI+KNGPVE  F VY DF  YKSGVY+  +   
Sbjct: 218 YGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVR 277

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            G HA++++GWGT ++G  YW+ AN W   WG  GYFKI+RG+NECGIEED+ AG+P
Sbjct: 278 CGSHAIRILGWGT-ENGVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDINAGIP 333


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A  A+SDR CIH    M   L+  D L+CC + CG GC GGYP  AW Y
Sbjct: 16  QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 74

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
           ++  G+VT         C P+   T C H G            YPTP C R C    N+ 
Sbjct: 75  WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKT 133

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K Y  S+Y +      IM EI KNGPVEV+F +++DF  Y+SG+Y H+ G  +G H
Sbjct: 134 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 193

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           AV++IGWG  ++G +YW++AN WN  WG +GYF++ RG NECGIE +VVAG+P
Sbjct: 194 AVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMP 245


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 112/224 (50%), Positives = 139/224 (62%), Gaps = 15/224 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q +CGSCWA  AVEA+SDR+C   G+ +  +S  +LL+CC F+CG GC GG P  AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWW 178

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
           V  GV TE C PY     CSH G    YP        TPKC   C   N      K+  +
Sbjct: 179 VWVGVTTELCQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGV 235

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           S+Y I  + E +M E+  NGP+EV+  VY DF  YKSGVYKH++GD +GGHAVKL+GWG 
Sbjct: 236 SSYSIKGERE-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV 294

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
             DG  YW +AN WN  WG  GYF I+RG++ECGIE   VAG P
Sbjct: 295 -KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 115/233 (49%), Positives = 146/233 (62%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A EA SDRFCI  +  +N  LS  D+L+CC   CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161

Query: 73  FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--L 116
            V  G  T         C PY      ++ G  + P C +  Y TP CV KC  KN    
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVA 221

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KH+  +AY +      I AEI  +GPVE +FTVYEDF  YK+GVY H TG  +GGH
Sbjct: 222 YTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGH 281

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWGT D+G  YW++AN WN +WG +GYF+I RG+NECGIE  VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  204 bits (519), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 144/231 (62%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE +SDR CIH     N   S  +L++CC  LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCC-HLCGFGCNGGFPGAAFKY 159

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +VH G+V       T+ C PY +   C H      P C     TPKC + C K   + + 
Sbjct: 160 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVSGPRPKCSEGGGTPKCAKTCEKGYIVDYE 218

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           +  H+   AY I  D + I  EI  NGPVE +FTVY DF HYKSGVY+H  G  +GGHA+
Sbjct: 219 SDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 278

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  YW+ AN WN  WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 328


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  204 bits (519), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 113/250 (45%), Positives = 149/250 (59%), Gaps = 22/250 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           ++++  Q  CGSCWAFGA E +SDR CI         +SV D+L+CCG  CG GC GGY 
Sbjct: 108 IKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYS 167

Query: 67  ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWR 118
           I A R++   G VT        C PY  S       C P   TP C   C    K + ++
Sbjct: 168 IEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTPSCKTTCQSSYKTEEYK 224

Query: 119 NSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
             KHY  SAY++ +     +I  EIY  GPVE S+ VYEDF HYKSGVY + +G ++GGH
Sbjct: 225 KDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGH 284

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           AVK+IGWG  ++G DYW++AN W  S+G  G+FKI+RG+NEC IE +VVAG      + K
Sbjct: 285 AVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAG------IAK 337

Query: 237 EITSADMFED 246
             T ++ +ED
Sbjct: 338 LGTHSETYED 347


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  204 bits (519), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 143/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C    N  + 
Sbjct: 168 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCHKTCESSYNVDYH 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y ++S  + I AE+YKNGPVE +FTVY D  +YK+GVYKH  G+ +GGHA+
Sbjct: 227 KDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAI 286

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 287 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  204 bits (519), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKC-VKKNQLWR 118
           + H G+V       T+ C PY +   C H  PG    C     TPKC + C    N  ++
Sbjct: 165 WKHAGIVSGGSYNSTQGCIPY-EVPPCEHHVPGNRLPCNGDTKTPKCQKTCEAGYNVPFK 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY    Y ++ + ++I AE++KNGPVE +FTVY D   YKSGVY+H  G  +GGHAV
Sbjct: 224 KDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAV 283

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE  +V G P
Sbjct: 284 KILGWGV-ENGSKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEP 333


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  204 bits (519), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 103/237 (43%), Positives = 144/237 (60%), Gaps = 19/237 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A+SDR CI       +++S  D++ CC   CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167

Query: 73  FVHHGVVTE-------ECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
           F++ GVV+         C PY     C H G       C    PTP C ++C     +++
Sbjct: 168 FIYDGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKKECRPGVRKVY 226

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           R  K Y   AY +    + I +EI +NGPV  SF VYEDF HYKSG+YKH  G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           VK+IGWG +++  D+W++AN W+  WG  GYF+I RG+N+CGIE  + AG+  +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  204 bits (519), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 142/230 (61%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH     +   S  DLL CC   CG GC+GG P +AW Y
Sbjct: 115 QGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSS-CGFGCNGGEPGAAWDY 173

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WRN 119
           +V  G+V+       + C PY     C H       P     TP+CV++C +   + +  
Sbjct: 174 WVSTGIVSGGSYNSHQGCQPYAIEP-CEHHVNGTRKPCGEGDTPRCVKRCEEGYDVPYGK 232

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            +H+  SAY +    + I  E+  NGP E + TVY+DF HY++GVY+H++G  +GGHAV+
Sbjct: 233 DRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVR 292

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           L+GWG  +DG  YW+LAN WN  WG +GYF+I RG +ECGIE D+  GLP
Sbjct: 293 LLGWGV-EDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIESDINGGLP 341


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  203 bits (517), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 105/236 (44%), Positives = 140/236 (59%), Gaps = 14/236 (5%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           ++N   +E++  Q  CGSCWAF   E +SDR CI        ++S  D+LACCG  CGDG
Sbjct: 91  WSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDG 150

Query: 61  CDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKK 113
           C GGYPI A+R++   GVVT        C PY  +   S P       TP C   C    
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTPTCSLSCQFGY 206

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
           +  +   K + +SAY +  +   I  EI  NGPV  +FT+YED   YKSGVY+H  G ++
Sbjct: 207 STAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLL 266

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++RG NECGIE  VVAG+P
Sbjct: 267 GGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  203 bits (517), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 143/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE ++DR CIH     N   S  +L++CC  LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENLVSCC-HLCGFGCNGGFPGAAFQY 159

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +VH G+V       T+ C PY +   C H      P C     TPKC + C     + + 
Sbjct: 160 WVHSGIVSGGAFNSTQGCQPY-EIAPCEHHVSGPRPKCAEGGSTPKCHKNCESNYVVDYE 218

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           +  H+    Y ++ D   I  +I  NGPVE +FTVY DF HYKSGVY+H  G  +GGHA+
Sbjct: 219 SDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAI 278

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  +DG  YW+ AN WN  WG +GYFKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EEDGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEISAGLP 328


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  203 bits (516), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 107/233 (45%), Positives = 142/233 (60%), Gaps = 20/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S  DL++CC   CG GC+GG+P +AW Y
Sbjct: 112 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       E C PY +   C H      P C+    TP C  +C     + + 
Sbjct: 171 WTHKGIVSGGSYNSNEGCRPY-EIEPCEHHVNGTRPPCKNGR-TPSCKHQCESSYSVDYA 228

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y I  +P +I  EI  NGPVE +FTVYED   YKSGVYKH+ G  +GGHA+
Sbjct: 229 KDKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAI 288

Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           +++GWG   D +  YW++ N WN  WG +G+F+I RG + CGIE  + AGLP+
Sbjct: 289 RILGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISAGLPA 341


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 141/230 (61%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   +  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 74  QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCKGGFPGQAWDY 132

Query: 73  FVHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  + 
Sbjct: 133 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 192

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 193 QDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 252

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++IGWG  +    YW++AN WN  WG  G F+I RG +EC IE  VVAGL
Sbjct: 253 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 301


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 143/224 (63%), Gaps = 18/224 (8%)

Query: 23  AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           AFGA EA+SDR CIH    +S  LS  DLL+CC   CG GC+GGYP +AW ++   G+V+
Sbjct: 25  AFGASEAMSDRICIHSNAKISVELSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVS 83

Query: 81  EE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSIS 126
                    C PY           S P C      TP+CV +C       ++  KHY  +
Sbjct: 84  GGLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKT 143

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
           +Y ++SD +DI  EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+K++GWG  
Sbjct: 144 SYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-E 202

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           ++G  YW+ AN WN  WG +G+FKI RGSN CGIE ++VAG+P+
Sbjct: 203 ENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 140/229 (61%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  A    SDR CI  G  +  +LS   L  CC + CG+GCDGG P SAW +
Sbjct: 113 QSNCGSCWAVSAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPESAWYF 171

Query: 73  FVHHGVVT-------EECDPY-FDSTGCSHPGCEPAYP-TPKC-VRKCVKKN--QLWRNS 120
           F+ HG+VT       + C PY     G     C    P TP C ++ C   N  + +R  
Sbjct: 172 FMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDTPDCSIKTCTNSNYSKNYRAD 231

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
            HY  + Y ++   EDIM ++YKNGPV+ +F VY DF +YKSGVY +  G + GGHA+K+
Sbjct: 232 LHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKI 291

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +GWG  DDG  YW+ AN W+RSWG +G F+I RG+NEC IE+ V+AG+P
Sbjct: 292 LGWGV-DDGTKYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMP 339


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 141/230 (61%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C    Y TP+C +KC K  +  + 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K+Y    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA+
Sbjct: 231 QDKNYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAI 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++IGWG  + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 291 RIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 140/230 (60%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   +  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCQGGFPGVAWDY 170

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C    Y TP+C + C K  +  + 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y + ++ + I  +I   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++IGWG  +    YW++AN WN  WG  G F++ RG +EC IE DVVAGL
Sbjct: 291 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 114/245 (46%), Positives = 147/245 (60%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
           F  +EH  + V       Q +CGSCWA  AVEA+SDR+C   G+ +  +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
           +CG GC GG P  AW ++V  G+ TE C PY     CSH G    YP        TPKC 
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
             C K        K+   ++Y +  + E +M E+  NGP+EV+  VY DF  YKSG YKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKH 276

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++GD++GGHAVKL+GWGT   G  YW +AN WN  WG  GYF I+RGSNECGIE   VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335

Query: 228 LPSSK 232
            P+ +
Sbjct: 336 TPAQE 340


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  202 bits (513), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 111/239 (46%), Positives = 146/239 (61%), Gaps = 16/239 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
           N   ++++  Q +CGSCWAF A E +SDR CI         +S  D+L+CCG  C +GC 
Sbjct: 97  NCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSCNNGCQ 156

Query: 63  GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKK 113
           GGY I A +Y+++ GVVT        C PY     CS   C+     P C   C    K 
Sbjct: 157 GGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCS--TCKEPKDAPSCKTTCQASYKA 213

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
              +R     S +A   N+  + I  EIY NGPVEV++ VY+DF HYKSGVY H+ GD  
Sbjct: 214 KSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKP 272

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            GHAVK+IGWGT +   DYW++AN W+ ++G +G+FKI+RG+NECGIEE+VVAGLP SK
Sbjct: 273 SGHAVKIIGWGT-EKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPKSK 330


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 109/246 (44%), Positives = 153/246 (62%), Gaps = 20/246 (8%)

Query: 1   MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCG 58
           + + N   ++ +  QG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL+CC   CG
Sbjct: 87  LQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCES-CG 145

Query: 59  DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAY-PTP 104
            GC+GGYP +A  ++   G+V+         C PY     C H      P C+     TP
Sbjct: 146 MGCNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTP 204

Query: 105 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
           +C  +C       ++  KH+   +Y + SD ++IM E+YKNGPVE +FTVYEDF  YKSG
Sbjct: 205 QCTNQCEPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSG 264

Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           VY+H++G  +GGHA+K++GWG  + G  YW+ AN WN  WG +G+FKI RG + CGIE +
Sbjct: 265 VYRHVSGSAVGGHAIKVLGWG-EEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESE 323

Query: 224 VVAGLP 229
           +VAG+P
Sbjct: 324 MVAGIP 329


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 106/231 (45%), Positives = 140/231 (60%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +   +S  DLL CC   CG GCDGG P + W++
Sbjct: 105 QGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLLTCCTN-CGHGCDGGAPGAGWKH 163

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------------TPKCVRKCVKK-NQLWR 118
           ++  G+V+    P+    GC     EP                TPKC++KC+   N  + 
Sbjct: 164 WIEKGLVSG--GPFGSDQGCRPYTIEPCVHVENGAQSPCKDSITPKCIKKCLPGYNVPYA 221

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K +  S Y I +D   I  EI+ NGPVE +FTV++DFA YK G+Y+H +G++ G HAV
Sbjct: 222 KDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAV 281

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG  ++G  YW+ AN WN  WG +GYFKI RGSN   IE  +VAGLP
Sbjct: 282 RILGWGV-ENGTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLP 331


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 112/267 (41%), Positives = 156/267 (58%), Gaps = 47/267 (17%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N + ++++  Q +CG+CWAFGA E +SDR CI  G      +SV D+L+CCG  CG+GC 
Sbjct: 88  NCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISVEDILSCCGSSCGEGCK 147

Query: 63  GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC------ 110
           GGYP+   +++++ GVVT        C PY     CS   CE +  TP C +KC      
Sbjct: 148 GGYPLEGLKFWMNSGVVTGGDYNGTGCQPY-TFPPCSS--CEASKSTPSCQKKCQTGYLE 204

Query: 111 --VKKNQLWRNSKH---------YSI--------SAYRINSDPED----------IMAEI 141
              K ++ + N +          Y +        SAYR+++              I  EI
Sbjct: 205 ATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEI 264

Query: 142 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 201
           Y NGPVEVS+ V+EDF  YKSGVY +++G + G HAVK+IGWGT ++  DYW++AN W  
Sbjct: 265 YNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGT-ENKVDYWLVANSWGT 323

Query: 202 SWGADGYFKIKRGSNECGIEEDVVAGL 228
            +G  G+FKI+RG+NECGIEE+VVAGL
Sbjct: 324 DFGEKGFFKIRRGTNECGIEENVVAGL 350


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 112/250 (44%), Positives = 144/250 (57%), Gaps = 32/250 (12%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACC--GFLCGDGCD 62
           E ++ +  Q +CGSCWAFG VEA+SDR CI  G      +S  +LL+CC   F CG GC+
Sbjct: 100 ESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGMGCN 159

Query: 63  GGYPISAWRYFVHHGVVT------------EECDPYFDSTGCSH------PGCE--PAYP 102
           GGY   AW Y+V  G+V+             EC PY     CSH        C   P + 
Sbjct: 160 GGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPY-SFPPCSHHVQGEYQACTDLPQFN 218

Query: 103 TPKCVRKCVKKNQLWRNSK----HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 158
           TPKC  +C   +Q  +NS     H  +S+Y +    E I AEIY+ G    SF VY DF 
Sbjct: 219 TPKCYTEC--NSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFL 276

Query: 159 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
            Y SGVY++ +G  MGGHA+K++GWG  ++G  YW+ AN WN SWG +G+FKI RGSNEC
Sbjct: 277 TYSSGVYQNTSGSYMGGHAIKMLGWGV-ENGTPYWLCANSWNSSWGENGFFKILRGSNEC 335

Query: 219 GIEEDVVAGL 228
           GIE  +VAG 
Sbjct: 336 GIESGMVAGF 345


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 113/232 (48%), Positives = 135/232 (58%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CG GC GGYP  AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDY 166

Query: 73  FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
           +  HG+VT       D +GC     P CE              YPTP+CV++C   +  +
Sbjct: 167 WKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGY 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K  +  +Y I +    IM EI   GPVE  FT+YEDF  Y SGVY H  G  M GHA
Sbjct: 225 LEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHA 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V+++GWG   +   YW++AN WN  WG +GY K  RG NECGIE+DV AGLP
Sbjct: 285 VRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAGLP 335


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 111/244 (45%), Positives = 143/244 (58%), Gaps = 21/244 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           ++N   ++ +  QG CGSCWAFGAVE++SDR CI      N  +S  DL +CC   CG+G
Sbjct: 124 WSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAEDLTSCC-RSCGNG 182

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKC 106
           C+GG+   AW Y+   G+VT       + C PY     C H       P  +    TP C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKACDHHVVGKLQPCSKKEEHTPVC 241

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
             +C    N  +   KHY  +AY +    + IM EI  NGPVE +FTVY DF  YKSGVY
Sbjct: 242 KHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGAFTVYADFPQYKSGVY 300

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           KH TG  +GGHA+K++GWGT + G+DYW++AN WN  WG  G FKI RG +ECGIE  + 
Sbjct: 301 KHTTGSPLGGHAIKIMGWGT-EGGDDYWLVANSWNPDWGNQGTFKILRGRDECGIESQIA 359

Query: 226 AGLP 229
           AG P
Sbjct: 360 AGEP 363


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 107/235 (45%), Positives = 143/235 (60%), Gaps = 17/235 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     +  +S  DL++CCG+ CG GC GG+P  AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPIAWDF 160

Query: 73  FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
           +   G+VT   + +P     +    CSH G +         Y TP CV+KC   +  +  
Sbjct: 161 WQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYAT 220

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  +   Y + +    IM EI  NGPVE +F VYEDF  YKSGVY H  G ++GGHA++
Sbjct: 221 DKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIR 280

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           ++GWG  ++G  YW++AN WN  WG DG FK+ RG NECGIE++V AGLP   ++
Sbjct: 281 ILGWG-EENGVAYWLIANSWNDGWGEDGCFKMLRGKNECGIEDEVTAGLPELSSI 334


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 110/234 (47%), Positives = 137/234 (58%), Gaps = 24/234 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVE++SDR CI       + LS +DLL+CC   CGDGCDGG    +W Y
Sbjct: 101 QSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSCC-TSCGDGCDGGQLGPSWDY 159

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVK--KNQ 115
           + + G+VT         C PY D   C+H    P YP        TPKC + CV      
Sbjct: 160 YKNKGIVTGYLYNTTGYCKPY-DFPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTAN 218

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +    HY  S+Y +      I  EI  +GPVE +FTVY DF  Y+SGVYKH +G V+GG
Sbjct: 219 TYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGG 278

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HA+ ++GWGT + G  YW++ N WN SWG  G+FKI RG  +CGI  DVV GLP
Sbjct: 279 HAISIVGWGT-ESGSPYWLVKNSWNPSWGDGGFFKILRG--DCGINNDVVGGLP 329


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 104/236 (44%), Positives = 139/236 (58%), Gaps = 14/236 (5%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           ++N   +E++  Q  CGSCWAF   E +SDR CI        ++S  D+LACCG  CGDG
Sbjct: 91  WSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDG 150

Query: 61  CDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKK 113
           C G YPI A+R++   GVVT        C PY  +   S P       TP C   C    
Sbjct: 151 CKGRYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTPTCSLSCQFGY 206

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
           +  +   K + +SAY +  +   I  EI  NGPV  +FT+YED   YKSGVY+H  G ++
Sbjct: 207 STAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLL 266

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GGHA+K+IGWGT  +G  YW++AN W  +WG +G+ K++RG NECGIE  VVAG+P
Sbjct: 267 GGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V  G+VT         C PY        +TG  +P C E  Y TPKC +KC K  +  +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  K+Y   +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V++IGWG  +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  200 bits (509), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 144/234 (61%), Gaps = 22/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    ++   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 114 QGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLVSCC-HTCGFGCNGGFPGAAWAY 172

Query: 73  FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
           +   G+V+    PY  S GC              + P C+  +  TP C  +C K   + 
Sbjct: 173 WTRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVD 230

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH+   +Y +  + +DI  EI +NGPVE +FTVYED   YK GVY+H+ G  +GGH
Sbjct: 231 YKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGH 290

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           A++++GWG  ++   YW++AN WN  WG +G+FK+ RG + CGIE  + AGLP 
Sbjct: 291 AIRILGWGV-ENKTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAGLPK 343


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  200 bits (509), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 114/240 (47%), Positives = 157/240 (65%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE++SDR CIH    +N+ +S  D+L CCG  CG+GC+GGYP +AW +
Sbjct: 102 QGSCGSCWAFGAVESISDRICIHTNGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSSSYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  S+Y +    ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA+
Sbjct: 221 EDKHYGYSSYSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWGT ++G  YW++AN WN  WG +G+FKI RG + CGIE ++VAG+P +     +I
Sbjct: 281 RILGWGT-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTDQYWAKI 339


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 104/236 (44%), Positives = 140/236 (59%), Gaps = 17/236 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A+SDR CI       +++S  DL+ CC   CG GCDGG+ I AW Y
Sbjct: 107 QANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEY 166

Query: 73  FVHHGVVT------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLWR 118
           F + G+V+      + C   +    C H G       C     TP C +KC     +L+R
Sbjct: 167 FTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYR 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y   A+++    E I  E+ KNGPV  SF VYEDF+ YKSG+Y+H  G++ G HAV
Sbjct: 227 MDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAV 286

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           K+IGWGT ++  DYW++AN W+  WG +GYF+I RG N+CGIEE+V AGL   ++L
Sbjct: 287 KMIGWGT-ENRTDYWLIANSWHDDWGENGYFRIIRGINDCGIEENVAAGLIDVESL 341


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 111/224 (49%), Positives = 138/224 (61%), Gaps = 15/224 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q +CGSCWA  AVEA+SDR+C   G+ +  +S  +LL+CC F+CG GC GG P  AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWW 178

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
           V  GV TE C PY     CSH G    YP        TPKC   C   N      K+  +
Sbjct: 179 VWVGVTTELCQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGV 235

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           S+Y I  + E +  E+  NGP+EV+  VY DF  YKSGVYKH++GD +GGHAVKL+GWG 
Sbjct: 236 SSYSIKGERE-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV 294

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
             DG  YW +AN WN  WG  GYF I+RG++ECGIE   VAG P
Sbjct: 295 -KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY + + C H      P C     TPKC   C     + + 
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGATPKCSHVCQSSYTVDYA 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  DI  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 227 KDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   D+   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLP 338


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFTAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V  G+VT         C PY        +TG  +P C E  Y TPKC +KC K  +  +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  K+Y   +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V++IGWG  +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  199 bits (506), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 105/232 (45%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N  LS +DL++CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSADDLVSCC-HTCGFGCNGGFPGAAWSY 168

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY +   C H      P C     TP+C   C    ++ ++
Sbjct: 169 WTRKGIVSGGNFGSQQGCRPY-EIEPCEHHVNGTRPPCSSG-STPRCQHVCESSYKVDYK 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K++   +Y I ++  DI  EI  NGPVE +FTVYED   YKSGVY+H+ G  +GGHA+
Sbjct: 227 KDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   D+   YW++AN WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAGLP 338


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  199 bits (506), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 101/231 (43%), Positives = 141/231 (61%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  + EA+SD  C+     + + +S +D+L+CCG  CG GC GG+PI A+++
Sbjct: 111 QSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWPIEAYKW 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKK-NQL 116
               GVVT       + C PY     C H   +P Y        PTPKC + C +K N+ 
Sbjct: 171 MQRDGVVTGGKYRQKKVCKPY-AFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKS 229

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH++  AY + ++  +I  EIYKNGPV  +F VY+DF++YK G+Y H  G   G H
Sbjct: 230 YQEDKHFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAH 289

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           AVK++GWG  ++  DYW++AN WN  WG  GYF+I RG+NECGIE  +V G
Sbjct: 290 AVKVVGWG-RENATDYWLIANSWNTDWGESGYFRIVRGTNECGIEAQMVGG 339


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 138/234 (58%), Gaps = 28/234 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q +CGSCWAFG+ EA++DR CI    N+ +S  D+  CC   CG GC+GGYP +AW ++V
Sbjct: 109 QANCGSCWAFGSAEAMTDRICIAGKGNIHISAEDINDCCKS-CGMGCNGGYPAAAWEWYV 167

Query: 75  HHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVK------KNQ 115
             GVV+       E C PY        +TG   P C    PTPKC +KC+        N 
Sbjct: 168 DTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQP-CPAVVPTPKCEKKCLTGYPKSYSND 226

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
             R  K Y +         + IM E+  NGPV  +F VY DF  YK+GVY+H TG   GG
Sbjct: 227 KTRGKKSYGVRGV------QSIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGG 280

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK+IG+GT + G+DYW++AN WN  WG  G+FKI +G +ECGIE  +VAG P
Sbjct: 281 HAVKIIGYGT-ESGQDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAGDP 333


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 143/232 (61%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V  G+VT         C PY        +TG  +P C E  Y TPKC +KC K  +  +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K+Y   +Y + ++   I  EI  +GPVE +FTV+ DF +YKSG+YK++TG  +GGHA
Sbjct: 230 GKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V++IGWG  +    YW++AN WN  WG  GYF+I RG +ECGIE +V  GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 139/230 (60%), Gaps = 18/230 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   S  LS  DL++CC   CG GC GG+P  AW Y
Sbjct: 49  QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGQGCQGGFPGVAWDY 107

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C    Y TP+C + C K  +  + 
Sbjct: 108 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 167

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y + ++ + I  +I   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 168 QDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 227

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++IGWG  +    YW++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 228 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 276


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/226 (47%), Positives = 137/226 (60%), Gaps = 13/226 (5%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q +CGSCWA  AVEA+SDR+C   G+ +L +S   LL+CC F+CG GC GG P  AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTVAGITDLRVSTGHLLSCC-FVCGMGCQGGIPTMAWLWW 178

Query: 74  VHHGVVTEECDPY------FDSTGCSHPGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSIS 126
           V  G+ +E C PY        + G  +P C    Y TP C   C   +     +KH    
Sbjct: 179 VWVGLTSEVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTCNSTCADSHTAL--TKHKGEK 236

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
           +Y +  + E  M E+   GP EV+F VY DF  YKSGVY H TG+ +GGHAVKL+GWG  
Sbjct: 237 SYSLRGERE-YMIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGV- 294

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            +G  YW +AN WN  WG +GYF I+RG++ECGIE   VAGLPS K
Sbjct: 295 QNGTPYWKIANSWNSDWGDNGYFLIRRGTDECGIESTGVAGLPSLK 340


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  198 bits (504), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 108/225 (48%), Positives = 136/225 (60%), Gaps = 23/225 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  AVEA+SDR CI       + LS +DLL+CC   CG GC GG P++AW+Y
Sbjct: 99  QSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKY 157

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
           +V  G+VT     Y + +GC     P CE               YPTPKC ++C K   +
Sbjct: 158 WVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTK 215

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++  K+Y   AY + +D E I  EI   GPVE SF VY DF HY SG+YKH+ G V GG
Sbjct: 216 SYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGG 275

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           HAVK++GWG  D G  YW+ AN WN  WG DGYF+I RG++ECG+
Sbjct: 276 HAVKILGWGI-DQGVSYWLAANSWNNDWGEDGYFRILRGADECGM 319


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 106/230 (46%), Positives = 143/230 (62%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG VEA +DR CI     +N  LS  DL +CC   CG+GC+GG+   AW Y
Sbjct: 108 QGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNY 166

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
               G+VT       + C PY +   C H        C+   PTP+C ++C    N  + 
Sbjct: 167 LKRDGIVTGGPYNSHQGCLPY-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYS 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             +H++ + + +    E IM EI  NGPVE +FTVY DF  YKSGVY+H +G  +GGHA+
Sbjct: 226 KDEHHAKTVHAVEG-VEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAI 284

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           K +GWG ++DG+DYW++AN WN  WG +G+FKI RG +ECGIE ++VAG+
Sbjct: 285 KTLGWG-NEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAGM 333


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 111/225 (49%), Positives = 137/225 (60%), Gaps = 18/225 (8%)

Query: 23  AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           AFGAVE++SDR CIH    +S  LS  +LL+CC   CG GC GG P  AW Y+ + G+VT
Sbjct: 45  AFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVT 103

Query: 81  -------EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSI 125
                    C PY        S+  S+P CE  Y PTP+C   C     + ++  K Y  
Sbjct: 104 GGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGK 163

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           S+Y + S+   IM EI  NGPVE  F VYEDF +YKSGVYKHITG  +GGHA+++IGWG 
Sbjct: 164 SSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGI 223

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
             +   YW+ AN WN  WG  GYFKI RG+NECGIE  V AGLP+
Sbjct: 224 QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPN 268


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 105/231 (45%), Positives = 142/231 (61%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR+C +     +   S  DLL+CC  +CG GC+GG P  AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C     + + 
Sbjct: 168 WKHFGLVSGGSYNSGQGCRPY-EIPPCEHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYH 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y ++S  + I AE++KNGPVE +FTVY D  +YK+GVYKH  G+ +GGHA+
Sbjct: 227 KDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAI 286

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  Y ++AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 287 KILGWGV-ENGNKYRLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 108/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF AVEA+SDR CI      ++ LS  DLL+CC   CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V  G+VT         C PY        +TG  +P C E  Y TPKC +KC K  +  +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  K+Y   +Y + ++   I  EI  +GPVEV+FTV+ DF +YKSG+YK++TG  +G HA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V++IGWG  +    YW++AN WN  WG  GYF++ RG +ECGIE  V +GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/225 (48%), Positives = 138/225 (61%), Gaps = 18/225 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GG+P  AW ++
Sbjct: 114 QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGFPEVAWVFY 172

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
           V HG+V+E C PY F S  C+H         C   Y TPKC   C  KK  L R   ++S
Sbjct: 173 VVHGLVSEYCQPYPFPS--CAHHVNSSDLAPCSGDYKTPKCNSTCTEKKIPLIRYRGNHS 230

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
                + S  E    E+  NGP EV+F VY DF  Y  GVYKH+ GD++GGHAV+L+GWG
Sbjct: 231 Y----VLSGEEHFKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWG 286

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              +GE YW +AN WN  WG +GYF I RG NECGIE + VAG P
Sbjct: 287 EL-NGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTP 330


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  197 bits (502), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 105/246 (42%), Positives = 142/246 (57%), Gaps = 27/246 (10%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CGSCWA  A E +SDR C+    ++   +S  D+L+CCG  CG GC+GG+P
Sbjct: 99  LNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDTDILSCCGLYCGYGCNGGFP 158

Query: 67  ISAWRYFVHHGVVT-------EECDPYF------------DSTGCSHPG----CEPAYPT 103
           I AWR+F   G  T         C PY             D   C +      C     T
Sbjct: 159 IEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADT 218

Query: 104 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
           P+C R+C+    + + + ++Y  SAY +    + I  EI KNGPV  SF VYEDF HYKS
Sbjct: 219 PRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKS 278

Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
           G+YKH  G++ G HAVK+IGWG  ++  D+W++AN W++ WG  GYF+I RG NECGIE 
Sbjct: 279 GIYKHTAGELRGYHAVKIIGWG-KENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIET 337

Query: 223 DVVAGL 228
           DVVAG+
Sbjct: 338 DVVAGI 343


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score =  197 bits (502), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 105/232 (45%), Positives = 133/232 (57%), Gaps = 16/232 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 62
           ++N  ++  +  Q  CGSCWAFGAVE++SDRFCIH G ++ LS  DL+ C      +GC 
Sbjct: 80  WSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLVTC--DQSDNGCQ 137

Query: 63  GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQ 115
           GG   +A ++    G+V+ +C PY      + P C PA         TP+CV KC   + 
Sbjct: 138 GGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAPAQQPCLNFVDTPQCVEKCSNASY 191

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +    H+    Y +N     I  EI  NGPVE  F VYEDF  YKSGVY+H TG  +GG
Sbjct: 192 TYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGG 251

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           H VK+IGWGT ++ E YWI  N W   WG  G F IK G NECGIE DVVA 
Sbjct: 252 HCVKMIGWGTQNN-ELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVVAA 302


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 107/229 (46%), Positives = 136/229 (59%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WAFGAVEA+SDR CI  G   N+ LS  DLL+CC   CGDG +GG+P  AW Y
Sbjct: 89  QSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEH-CGDGFEGGFPALAWDY 147

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C E  Y TP C   C K  +  + 
Sbjct: 148 WVKEGIVTGSSKENHTSCQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYA 207

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH   S Y + +D + I  EI K GPVE +F VYEDF +YKSG+YKHITG ++  HA+
Sbjct: 208 QDKHRGKSRYNVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAI 267

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++IGWG  ++   YW++ N WN  WG +G F+I RG +EC IE +V AG
Sbjct: 268 RIIGWGV-ENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEVTAG 315


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 103/222 (46%), Positives = 134/222 (60%), Gaps = 12/222 (5%)

Query: 17  HCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
            CGSCWAF   E +SDR CI        ++S  D+LACCG  CGDGC+GGYPI A+R++ 
Sbjct: 60  QCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWN 119

Query: 75  HHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISA 127
             GVVT        C PY  +  C+   C P   TP C   C    +  +   K + +SA
Sbjct: 120 SRGVVTGGDFRGSGCRPYPFAP-CNSYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSA 177

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  +   I  EI  NGPV  +FT+YED   YKSGVY+H  G ++GGHA+K+IGWGT  
Sbjct: 178 YAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-Q 236

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +G  YW++AN W   WG +G+ K++RG NECGIE  VVAG+P
Sbjct: 237 NGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGMP 278



 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/62 (58%), Positives = 46/62 (74%), Gaps = 1/62 (1%)

Query: 144 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
           NGPVE SFTVYEDF  YK GVY++  G V+G HA+K++GWGT + G DYW++AN W    
Sbjct: 3   NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWLIANSWGAQC 61

Query: 204 GA 205
           G+
Sbjct: 62  GS 63


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 109 QGECGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY +   C H      P C     TPKC   C     + + 
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EIAPCEHHVNGTRPPCGHGGGTPKCSHVCESGYTVDYA 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  DI  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   ++   YW++ N WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLP 338


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY + + C H      P C     TPKC   C     + + 
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYA 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   D+   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 111/231 (48%), Positives = 133/231 (57%), Gaps = 21/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CG GC GGYP  AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDY 166

Query: 73  FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
           +  HG+VT       D +GC     P CE              YPTP+CV++C   +  +
Sbjct: 167 WKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGY 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K  +  +Y I +    IM EI   GPVE  FT+YEDF  Y SGVY H  G  M GHA
Sbjct: 225 LEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHA 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           V+++GWG   +   YW++AN WN  WG +GY K  RG NECGIE+DV A L
Sbjct: 285 VRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAVL 334


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  197 bits (500), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 114/240 (47%), Positives = 157/240 (65%), Gaps = 18/240 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVE++SDR CIH   ++S+ V+  DLL CCG  CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVESISDRICIHTNGHVSVEVSAEDLLTCCGGQCGDGCNGGYPAEAWNF 161

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPACTGEGDTPKCSKTCEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+  ++Y + ++  +IMAEIYKNGPVE +F+VY DF  YKSGVY+H+TGD+MGGHA+
Sbjct: 221 EDKHFGYTSYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +++GWG  ++G  YW++AN WN  WG  G+F+I RG + CGIE +VVAG+P +    ++I
Sbjct: 281 RILGWG-EENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTDQYWEKI 339


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 91/120 (75%), Positives = 105/120 (87%)

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R +SDP  IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2   RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
           GEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAG+PS KNL  E+  +D F DAS
Sbjct: 62  GEDYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDAS 121


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 108/245 (44%), Positives = 138/245 (56%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDG 60
           + N   ++ +  Q  CGSCWAF A E  SDR CI     L  S+S  DLL CC   CG+G
Sbjct: 97  WPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSISSEDLLECCA-TCGNG 155

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GGYP +AW+Y    GV T         C PY     C H      P C P  PTPKCV
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPP-CDHHVVGQYPPCGPIKPTPKCV 214

Query: 108 RKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           ++C  +   + ++   H+    Y++ ++ E I  EI  +GPV+ SF V  DF  YKSGVY
Sbjct: 215 KQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHGPVQASFRVASDFLTYKSGVY 274

Query: 166 -KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
            +       GGH+VK+IGWG  + G  YW++AN WN  WG +G FK+ RG NECGIE +V
Sbjct: 275 IRDPKLKYEGGHSVKIIGWGV-EQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIEAEV 333

Query: 225 VAGLP 229
           VAGLP
Sbjct: 334 VAGLP 338


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 105/231 (45%), Positives = 137/231 (59%), Gaps = 15/231 (6%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPI 67
           ++ +  Q  CGSCWA  A  A+SDRFC+  G+ +L +S  DLL+CC   CGDGCDGGYP 
Sbjct: 107 IKRIADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAGDLLSCC-TSCGDGCDGGYPD 165

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRN 119
            AW YF   G+V++ C PY     C H G     P        TPKC   C  K      
Sbjct: 166 EAWLYFTESGLVSDYCQPY-PFPPCKHSGGRSKNPSCHDMHFHTPKCNATCTDKRIP--V 222

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            ++++  +Y +  + ED   E+Y  GP EV+FTVYEDF  Y+SGVYKH++G  +GGHAV+
Sbjct: 223 VRYFASESYSLQGE-EDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHAVR 281

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           ++GWG   +G  YW +AN WN  WG +GY    RG +ECGIE    AG PS
Sbjct: 282 VVGWG-ERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIESQGSAGTPS 331


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY + + C H      P C     TPKC   C     + + 
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCANGSGTPKCSHVCQSSYTVDYA 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   ++   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 105/233 (45%), Positives = 136/233 (58%), Gaps = 19/233 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCC-HTCGFGCNGGFPGAAWSY 168

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
           +   G+V+         C PY +   C H        C     TPKC  +C    N  + 
Sbjct: 169 WTRKGIVSGGRYGSKTGCRPY-EIAPCEHHVNGTRAPCNHDSKTPKCQHQCEAGYNVEYS 227

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  DI  EI  NGPVE +FTVYED   YKSGVY+H  G  +GGHA+
Sbjct: 228 KDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 287

Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           +++GWG     E  YW++AN WN  WG  G+F+I RG + CGIE  + AGLP 
Sbjct: 288 RILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLPK 340


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 108/240 (45%), Positives = 142/240 (59%), Gaps = 14/240 (5%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
           + N   +  +  Q  CGSCWA  A  A+SDRFC   G+ ++ +S  DLLACC   CGDGC
Sbjct: 81  WPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGC 139

Query: 62  DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKK 113
           +GG P  AW YF   G+V++ C PY       H   +  YP        TPKC   C   
Sbjct: 140 NGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDP 199

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
                N +  S ++Y +  + +D M E++  GP EV+F VYEDF  Y SGVY H++G  +
Sbjct: 200 TIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYL 256

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+ECGIE+   AG+P + N
Sbjct: 257 GGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 315


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 108/238 (45%), Positives = 141/238 (59%), Gaps = 14/238 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA  A  A+SDRFC   G+ ++ +S  DLLACC   CGDGC+G
Sbjct: 106 NCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGCNG 164

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           G P  AW YF   G+V++ C PY       H   +  YP        TPKC   C     
Sbjct: 165 GDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTI 224

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
              N +  S ++Y +  + +D M E++  GP EV+F VYEDF  Y SGVY H++G  +GG
Sbjct: 225 PVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG 281

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           HAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+ECGIE+   AG+P + N
Sbjct: 282 HAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 338


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 19/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI  +    + +S  D+L+CCG  CG GC+GG+PI A+ Y
Sbjct: 112 QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 171

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
           F   G VT         C PY     C H G       C     TPKCVRKC     + +
Sbjct: 172 FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 230

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  +     AY + +  + I  EI KNGPV  +FTVYEDF++YK G+YKH  G   GGHA
Sbjct: 231 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 290

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWG  + G  YW++AN W+  WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 291 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 339


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 111/235 (47%), Positives = 139/235 (59%), Gaps = 26/235 (11%)

Query: 20  SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  AVEA+SDR CI       + LS +DLL+CC   CG GC GG P++AW+Y+V  G
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSG 221

Query: 78  VVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRNS 120
           +VT     Y + +GC     P CE               YPTPKC R+C K   + ++  
Sbjct: 222 IVTG--SDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKAD 279

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           K+Y   AY + +D E I  EI   GPVE SF VY DF HY  G+YKH+ G V GGHAVK+
Sbjct: 280 KYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKI 339

Query: 181 IGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLPSSK 232
           +GWG  D G  YW+ AN WN  WG D   GYF+I RG +ECGIE  +VAG+P  +
Sbjct: 340 LGWGI-DQGVSYWLAANSWNTDWGEDVFSGYFRILRGVDECGIESGIVAGIPRKE 393


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/240 (45%), Positives = 142/240 (59%), Gaps = 14/240 (5%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
           + N   +  +  Q  CGSCWA  A  A+SDRFC   G+ ++ +S  DLLACC   CGDGC
Sbjct: 82  WPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGC 140

Query: 62  DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKK 113
           +GG P  AW YF   G+V++ C PY       H   +  YP        TPKC   C   
Sbjct: 141 NGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP 200

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
                N +  S ++Y +  + +D M E++  GP EV+F VYEDF  Y SGVY H++G  +
Sbjct: 201 TIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYL 257

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           GGHAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+ECGIE+   AG+P + N
Sbjct: 258 GGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 316


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLP 229
           G+P
Sbjct: 328 GMP 330


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 108/238 (45%), Positives = 141/238 (59%), Gaps = 14/238 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA  A  A+SDRFC   G+ ++ +S  DLLACC   CGDGC+G
Sbjct: 106 NCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGCNG 164

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           G P  AW YF   G+V++ C PY       H   +  YP        TPKC   C     
Sbjct: 165 GDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTI 224

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
              N +  S ++Y +  + +D M E++  GP EV+F VYEDF  Y SGVY H++G  +GG
Sbjct: 225 PVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG 281

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           HAV+L+GWGTS +G  YW +AN WN  WG DGYF I+RGS+ECGIE+   AG+P + N
Sbjct: 282 HAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 338


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  195 bits (496), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327

Query: 227 GLP 229
           G+P
Sbjct: 328 GMP 330


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  195 bits (496), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY + + C H      P C     TPKC   C     + + 
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYA 226

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 227 KDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   ++   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 287 RILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  195 bits (496), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH G  +N   S +DL++CC   CG GC+GG+P +AW Y
Sbjct: 99  QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 157

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       + C PY + + C H      P C     TPKC   C     + + 
Sbjct: 158 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYA 216

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   +Y +  +  +I  EI  NGPVE +FTVYED   YK GVY+H  G  +GGHA+
Sbjct: 217 KDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 276

Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   ++   YW++ N WN  WG  G+F+I RG + CGIE  + AGLP
Sbjct: 277 RILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 328


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  195 bits (495), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 107/232 (46%), Positives = 138/232 (59%), Gaps = 22/232 (9%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           + ++  Q +CGSCWAFGA E++SDR+CIH  M+L +S  +L+ CC   CG+GC+GG+  +
Sbjct: 96  IGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHLLISAANLMECCRN-CGNGCEGGFLGA 154

Query: 69  AWRYFVHHGVVT-----------EECDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC 110
           AW Y+   G+VT           + C PY     C H   G +PA P     TP+CV  C
Sbjct: 155 AWNYWKQEGLVTGGLYNPSATESDTCQPY-PLPSCEHHINGSKPACPSKIAKTPECVHTC 213

Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
                  +    HY  SAY +     +I  EI  NGPVE +FTVY DF  YKSGVYK  +
Sbjct: 214 HAGYPTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHS 273

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
              +GGHAVK+IGWG  +DG  YW++AN WN  WG  GYFKI RG +ECGIE
Sbjct: 274 LRQLGGHAVKMIGWG-EEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 108/223 (48%), Positives = 132/223 (59%), Gaps = 20/223 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGA E+ +DR CIH   N+ LS  D++ C      +GC+GG   SAW +  
Sbjct: 101 QARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLR 158

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNSKHYSIS 126
             G V+EEC PY      + P C PA         TP C ++C   + L +   KH    
Sbjct: 159 KQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAK 212

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
            Y  +SD E IM EI  NGPVE  FTV+EDF  YKSGVY H TG  +GGH VKL+G+GT 
Sbjct: 213 IYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL 271

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +G DY+   NQW  SWG +G F IKRG  +CGI +DVVAGLP
Sbjct: 272 -NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 109/244 (44%), Positives = 140/244 (57%), Gaps = 24/244 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA   V A+SDR CIH    M   LS  DL++CC + CG+GC GG P +AW Y
Sbjct: 98  QSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDY 156

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
           +  +G+VT         C PY     C HPG            YPTP C   C    ++ 
Sbjct: 157 WWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQLNPCPRYTYPTPSCYPYCQAGYDKT 215

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K Y  ++Y ++     IM EI KNGPVE  F VY DFA YKSG+Y H++G   G H
Sbjct: 216 YEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 275

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           A+++IGWG  ++G  YW+ AN WN  WG +GYF+I RG++EC IE  VVAG+P    L K
Sbjct: 276 AIRIIGWGV-ENGVKYWLTANSWNVGWGENGYFRILRGTDECRIESIVVAGMP---RLQK 331

Query: 237 EITS 240
            IT+
Sbjct: 332 NITN 335


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 108/239 (45%), Positives = 141/239 (58%), Gaps = 28/239 (11%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  QG CGSCWA  A  A++DR+CI        S    D+LACC   CGDGC GGY 
Sbjct: 103 LNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACC-HACGDGCKGGYL 161

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKC---V 111
             AW+++V  GV +    PY    GC HP     YP            TPKC ++C    
Sbjct: 162 GPAWQFWVEQGVSSG--GPYNSRQGC-HP-----YPIDVCDASGEEADTPKCSKRCQSGY 213

Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
               +W++ + Y   AY I +D + IM EIY NGPV+ +F  Y+D   YKSGVY+H+ G 
Sbjct: 214 NVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGH 272

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           + GGHAVKL+GWG  ++G  YW++AN W   WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 273 MAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 11  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 70

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 71  CNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 129

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 130 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 189

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 190 HVSGEIMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVA 248

Query: 227 GLP 229
           G+P
Sbjct: 249 GMP 251


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 105/245 (42%), Positives = 142/245 (57%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  Q  C SCWA  +  A++DR CIH        LS  D+++CC + CG G
Sbjct: 96  WANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 154

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
           C+GG P  +W Y+   GVVT         C PY     CSH    PG  P     YPTPK
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 213

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C +KC    N+ +   K    S+Y +     DIM EI KNGPV+  F ++EDF  YKSG+
Sbjct: 214 CEKKCHAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGI 273

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           Y + TG ++GGHA+++IGWG  ++G  YW++AN WN  WG  GYF+++RG+NECGIE  +
Sbjct: 274 YHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARI 332

Query: 225 VAGLP 229
            AGLP
Sbjct: 333 NAGLP 337


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 11  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 70

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GG+P  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 71  CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 129

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 130 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 189

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G++MGGHA++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VA
Sbjct: 190 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 248

Query: 227 GLP 229
           G+P
Sbjct: 249 GMP 251


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 137/230 (59%), Gaps = 19/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI  +    + +S  D+L+CCG  CG GC+GG+PI A+ Y
Sbjct: 24  QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
           F   G VT         C PY     C H G       C     TPKCVRKC     + +
Sbjct: 84  FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  +     AY + +  + I  EI KNGPV  +FTVYEDF++YK G+YKH  G   GGHA
Sbjct: 143 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWG  ++G  YW++AN W+  WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 108/239 (45%), Positives = 141/239 (58%), Gaps = 28/239 (11%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  QG CGSCWA  A  A++DR+CI        S    D+LACC   CGDGC GGY 
Sbjct: 103 LNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACC-HACGDGCKGGYL 161

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKC---V 111
             AW+++V  GV +    PY    GC HP     YP            TPKC ++C    
Sbjct: 162 GPAWQFWVEQGVSSG--GPYNSRQGC-HP-----YPIDVCDASGEEADTPKCSKRCQSGY 213

Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
               +W++ + Y   AY I +D + IM EIY NGPV+ +F  Y+D   YKSGVY+H+ G 
Sbjct: 214 NVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGH 272

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           + GGHAVKL+GWG  ++G  YW++AN W   WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 273 MAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPS 330


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/234 (44%), Positives = 139/234 (59%), Gaps = 13/234 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL-SVNDLLACCGFLCGDGCD 62
           N   ++++  Q  CGSCWAFGA E +SDR CI   G    + S  DLL+CCG  CG GC 
Sbjct: 87  NCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLLSCCGNFCGYGCK 146

Query: 63  GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQ 115
           G  P+ A+R++   GVVT        C PY     C+   C  +  TP+C   C    ++
Sbjct: 147 GASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPCTKS-ETPRCSLNCQPAYSK 204

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +   K++   AY +  D   I  EI  NGPVE +F VY+DF HY+SGVY+H+ G ++GG
Sbjct: 205 AYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGG 263

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK+IGWG   +G  YW++AN W   WG +G+FK+ RG +ECGIE  +VAG P
Sbjct: 264 HAVKIIGWGI-QNGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTIVAGKP 316


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 140/232 (60%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N  LS +DL++CC  +CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCC-HICGFGCNGGFPGAAWSY 166

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V       T+ C PY +   C H      P C     TP C  KC     + + 
Sbjct: 167 WTRKGIVSGGPYGSTQGCRPY-EIAPCEHHVNGTRPPCSHG-STPSCQHKCQASYSVEYA 224

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K++   +Y +  +  +I  EI  NGPVE +FTVYED   YKSGVY+H  G  +GGHA+
Sbjct: 225 KDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 284

Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +++GWG   + +  YW++ N WN  WG +G+F+I RG + CGIE  + AGLP
Sbjct: 285 RILGWGVWGESKVPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLP 336


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 140/234 (59%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WAF AVE +SDR CI      ++ LS  DLL+CC   CG GC GG+P SAW Y
Sbjct: 112 QSRCGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDLLSCC-RECGLGCLGGFPGSAWDY 170

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V  GVVT         C PY       ++TG  +P C +  Y TPKC +KC K  +  +
Sbjct: 171 WVEEGVVTGSSGENHTGCQPYPFPKCEHNTTG-KYPACGQKIYETPKCQKKCQKGYKTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  KHY   AY + ++ + I  EI  +GPV   FTVY DF +YKSG+YKH+ G  +G H 
Sbjct: 230 KKDKHYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHT 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+++GWG  + G  YW++AN WN  WG  GYF+I RG +EC IE  V+ GLP +
Sbjct: 290 VRIVGWGV-EKGTPYWLIANSWNEGWGEKGYFRILRGKDECDIESLVIGGLPRN 342


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 105/232 (45%), Positives = 133/232 (57%), Gaps = 12/232 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + ++++  Q +CGSCWAF   E +SDR CI  +      +S  DLL CCG  CG+GCDGG
Sbjct: 97  KSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCDGG 156

Query: 65  YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
           +P  A++++   GVVT        C PY     C+   C     TP C   C       +
Sbjct: 157 FPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCNSDNCV-NLQTPPCRLSCQPGYRTTY 214

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            N K+Y  SAY +      I A+IY NGPV  +F VYEDF  YKSG+Y+HI G   GGHA
Sbjct: 215 TNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHA 274

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWGT + G  YW+  N W   WG  G F+I RG +ECGIE  +VAGLP
Sbjct: 275 VKLIGWGT-ERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAGLP 325


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 96/197 (48%), Positives = 130/197 (65%), Gaps = 16/197 (8%)

Query: 18  CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           CGSCWAFGAVEA+SDR CIH  +++ +S  DLL CCG +CGDGC+GGYP  AW ++   G
Sbjct: 1   CGSCWAFGAVEAISDRICIHTNVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG 60

Query: 78  VVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHY 123
           +V+         C PY     C H      P C     TPKC + C    +  ++  KHY
Sbjct: 61  LVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHY 119

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
              +Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GW
Sbjct: 120 GYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGW 179

Query: 184 GTSDDGEDYWILANQWN 200
           G  ++G  YW++AN WN
Sbjct: 180 GV-ENGTPYWLVANSWN 195


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  194 bits (492), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 96/211 (45%), Positives = 136/211 (64%), Gaps = 16/211 (7%)

Query: 42  LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 94
           + +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY     C H
Sbjct: 1   VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEH 59

Query: 95  ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 147
                 P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPV
Sbjct: 60  HVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 119

Query: 148 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
           E +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G
Sbjct: 120 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNG 178

Query: 208 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           +FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 179 FFKILRGQDHCGIESEVVAGIPRTDQYWEKI 209


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  194 bits (492), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 113/232 (48%), Positives = 131/232 (56%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CG GC GGYP  AW Y
Sbjct: 108 QSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDY 166

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKKNQLW 117
           +  HG+VT       D +GC     P CE              YPTP+CV+ C      +
Sbjct: 167 WGAHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPHQYYPTPECVQHCDTPGIDY 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K  +  +Y I S    IM EI   GPVE  FTVYEDF  YK GVY H  G  +  HA
Sbjct: 225 VKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHA 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++++GWG   D   YW++AN WN  WG  GY K  RG NECGIE+DV AGLP
Sbjct: 285 IRILGWGEEGD-VPYWLIANSWNEDWGEKGYMKFLRGLNECGIEDDVTAGLP 335


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  194 bits (492), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 135/206 (65%), Gaps = 16/206 (7%)

Query: 40  MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 92
           +++ +S  DLL CCG +CGDGC+GGYP  AW ++   G+V+         C PY     C
Sbjct: 1   VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPC 59

Query: 93  SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 145
            H      P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNG
Sbjct: 60  EHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNG 119

Query: 146 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 205
           PVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG 
Sbjct: 120 PVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGD 178

Query: 206 DGYFKIKRGSNECGIEEDVVAGLPSS 231
           +G+FKI RG + CGIE +VVAG+P +
Sbjct: 179 NGFFKILRGQDHCGIESEVVAGIPRT 204


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  193 bits (491), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 19/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI  +    + +S  D+L+CCG  CG GC+GG+PI A+ Y
Sbjct: 24  QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
           F   G VT         C PY     C H G       C     TPKCVRKC     + +
Sbjct: 84  FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  +     AY + +  + I  EI KNGPV  +FTVYEDF++YK G+YKH  G   GGHA
Sbjct: 143 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWG  + G  YW++AN W+  WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  193 bits (491), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 108/225 (48%), Positives = 135/225 (60%), Gaps = 18/225 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 114 QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 172

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
             HG+V+E C PY F S  C+H         C   Y TP C   C  KK  L +    Y 
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIK----YR 226

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
            +   I S  E    E+  NGP EVSF+VY DF  Y  GVYKH+TG  +GGHAV+++GWG
Sbjct: 227 GNTSYILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG 286

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              +GE YW +AN WN  WG +GYF I RG +ECGIE   VAG+P
Sbjct: 287 -ELNGEPYWKIANSWNHEWGMNGYFLIARGVDECGIEGSGVAGIP 330


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  193 bits (491), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 106/227 (46%), Positives = 133/227 (58%), Gaps = 22/227 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 114 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 172

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
             HG+V+E C PY F S  C+H         C   Y TP C   C  K      +R +  
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 230

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y +S        E    E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++G
Sbjct: 231 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVG 284

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WG   +GE YW +AN WNR WG +GYF I RG +ECGIE   VAG P
Sbjct: 285 WG-ELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  193 bits (491), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 113/266 (42%), Positives = 149/266 (56%), Gaps = 38/266 (14%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           ++++  Q  CGSCWAFGA E +SDR CI         +SV D+L+CCG  CG GC GGY 
Sbjct: 46  IKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYS 105

Query: 67  ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWR 118
           I A R++   G VT        C PY  S       C P   TP C   C    K + ++
Sbjct: 106 IEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTPSCKTTCQSSYKTEEYK 162

Query: 119 NSKHY----------------SISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 160
             KHY                  SAY++ +     +I  EIY  GPVE S+ VYEDF HY
Sbjct: 163 KDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 222

Query: 161 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           KSGVY + +G ++GGHAVK+IGWG  ++G DYW++AN W  S+G  G+FKI+RG+NEC I
Sbjct: 223 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 281

Query: 221 EEDVVAGLPSSKNLVKEITSADMFED 246
           E +VVAG      + K  T ++ +ED
Sbjct: 282 EGNVVAG------IAKLGTHSETYED 301


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  193 bits (490), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 104/245 (42%), Positives = 141/245 (57%), Gaps = 21/245 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  Q  C SCWA  +  A++DR CIH        LS  D+++CC + CG G
Sbjct: 96  WANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 154

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
           C+GG P  +W Y+   GVVT         C PY     CSH    PG  P     YPTPK
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 213

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C +KC    N+ +   K    S+Y +     D M EI KNGPV+  F ++EDF  YKSG+
Sbjct: 214 CEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSGI 273

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           Y + TG ++GGHA+++IGWG  ++G  YW++AN WN  WG  GYF+++RG+NECGIE  +
Sbjct: 274 YHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARI 332

Query: 225 VAGLP 229
            AGLP
Sbjct: 333 NAGLP 337


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  192 bits (489), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 118/252 (46%), Positives = 160/252 (63%), Gaps = 18/252 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  D+L CCG  CGDG
Sbjct: 90  WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDMLTCCGGQCGDG 149

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TP+C 
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPY-SIPPCEHHVNGSRPACTGEGDTPRCS 208

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  S+Y ++SD  +I AEIYKNGPVE +FTVY DF  YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQ 268

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H TGD+MGGHA++++GWG  ++G  YW++AN WN  WG  G+FKI RG + CGIE ++VA
Sbjct: 269 HTTGDIMGGHAIRILGWG-EENGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVA 327

Query: 227 GLPSSKNLVKEI 238
           G+P +    ++I
Sbjct: 328 GIPRTDQYWRQI 339


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 106/227 (46%), Positives = 133/227 (58%), Gaps = 22/227 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 114 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYY 172

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
             HG+V+E C PY F S  C+H         C   Y TP C   C  K      +R +  
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 230

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y +S        E    E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++G
Sbjct: 231 YVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVG 284

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WG   +GE YW +AN WNR WG +GYF I RG +ECGIE   VAG P
Sbjct: 285 WG-ELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 109/234 (46%), Positives = 140/234 (59%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           Q  CGSCWAF AV A+SDR CIH     +N+ LS  DLLACC   CG GC GG+   AW 
Sbjct: 108 QSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWD 166

Query: 72  YFVHHGVVT-------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
           Y+  +G+VT         C PY         + G  +P C E  Y TP+CV +C K    
Sbjct: 167 YWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            + + K  + ++Y +      I  EI+  GPVE +  VY DFA+Y  GVYKH TG+++GG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HA++L+GWG  +DG  YW+ AN WN SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 287 HAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  192 bits (487), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 103/229 (44%), Positives = 136/229 (59%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF + E +SDR CI  H    + LS +D+L+CC    G GCDGG+P+SAW+Y
Sbjct: 88  QSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCC-TDGGYGCDGGWPVSAWQY 146

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           FV  GVVT       + C PY             +  C     TP C   C     + + 
Sbjct: 147 FVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYD 206

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           + K Y  +AY +++    I  EI   GPV  +FTVY+DF HYK+G+YKH++G   GGHAV
Sbjct: 207 DDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAV 266

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +++GWG    G  YW++AN WN  WG +GYF+I RGS+ECGIE+ VVAG
Sbjct: 267 RILGWG-QQGGVPYWLVANSWNTDWGENGYFRILRGSDECGIEDGVVAG 314


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 109/234 (46%), Positives = 140/234 (59%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           Q  CGSCWAF AV A+SDR CIH     +N+ LS  DLLACC   CG GC GG+   AW 
Sbjct: 108 QSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWD 166

Query: 72  YFVHHGVVT-------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
           Y+  +G+VT         C PY         + G  +P C E  Y TP+CV +C K    
Sbjct: 167 YWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            + + K  + ++Y +      I  EI+  GPVE +  VY DFA+Y  GVYKH TG+++GG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HA++L+GWG  +DG  YW+ AN WN SWG  G+F+I RGS+ CGIE DV AGLP
Sbjct: 287 HAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 13/230 (5%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG--- 63
           + ++  Q  CGSCWAF A E++SDR CIH    + +++S  DLLACC   CG GCDG   
Sbjct: 103 IRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLACC-HTCGHGCDGRCH 161

Query: 64  --GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRN 119
                I   R  V   V TE+ C PY  S     P C    PTPKC   C K   + +  
Sbjct: 162 CSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCVPNCTHPEPTPKCQHVCRKGYEKSYEE 219

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KH++ + YR+    + I  +IYKNGPVE +F VY DF  YKSGVY+      MG HA+K
Sbjct: 220 DKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIK 279

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWGT +DG  YW++AN WN  WG  GYFKI RG +ECGIEE + AG+P
Sbjct: 280 ILGWGT-EDGVPYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIP 328


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 102/227 (44%), Positives = 139/227 (61%), Gaps = 17/227 (7%)

Query: 23  AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           AFGAVEA+SDR CIH     +  +S  DL++CCG+ CG GC GG+P +AW ++   G+VT
Sbjct: 1   AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVT 59

Query: 81  --EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISA 127
              + +P     +    CSH G +         Y TP CV+KC   +  +   K  +   
Sbjct: 60  GGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANIT 119

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y + +    IM EI  NGPVE +F VYEDF  YKSGVY H  G ++GGHA++++GWG  +
Sbjct: 120 YNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EE 178

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           +G  YW++AN WN  WG DGYFK+ RG NECGIE++V AGLP   ++
Sbjct: 179 NGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPELSSI 225


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 104/243 (42%), Positives = 132/243 (54%), Gaps = 20/243 (8%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDGCD 62
           N + ++++  Q  CGSCWAF A E  SDR CI     L  S+S  DLL CC   CG GC 
Sbjct: 100 NCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCK 159

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRK 109
           GGYP +AW Y    GV T         C PY         TG   P C P  PTP+CV++
Sbjct: 160 GGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPIQPTPQCVKE 218

Query: 110 CVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-K 166
           C  +     +    H++   Y I  + + I  EI  +GPV+ SF V  DF  YKSGVY +
Sbjct: 219 CNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIR 278

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           +      GGH+VK+IGWG  +    YW++AN WN  WG  G F++ RG NECGIE  +VA
Sbjct: 279 NPKLKYEGGHSVKIIGWG-KEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVA 337

Query: 227 GLP 229
           GLP
Sbjct: 338 GLP 340


>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 218

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 83/98 (84%), Positives = 90/98 (91%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QGHCGSCWAFGAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF 
Sbjct: 120 QGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFK 179

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
             GVVTEECDPYFD+TGCSHPGCEP YPTPKC RKCVK
Sbjct: 180 RSGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVK 217


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 131/218 (60%), Gaps = 15/218 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGA EALSDR  I  +  +N+ LS  DL++C       GCDGGYPI+AW Y
Sbjct: 101 QQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDLVSCDS--TDYGCDGGYPINAWHY 158

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               GVVT+ C PY    G S         TP C      K +          +AY++ +
Sbjct: 159 MQSLGVVTDTCYPYTSGNGDSGTCQITGKKTPACATATFYKAK----------TAYQVAN 208

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
           +   I +EI  NGPVE +F+VY+DF  Y SGVY H +G + GGHAVK++GWG  D    Y
Sbjct: 209 NMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGV-DGTTPY 267

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           WI+AN W  SWG  G+F IKRG++ECGIE+ +VAGL +
Sbjct: 268 WIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLAA 305


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  190 bits (482), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 98/231 (42%), Positives = 138/231 (59%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A   +SDR C+     + L V+D  +LACCG  CGDGC GG+P  AW +
Sbjct: 107 QSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDILACCGEFCGDGCSGGWPFQAWEW 166

Query: 73  FVHHGVVTE-------ECDPYFDSTGCSHP-----GCEP--AYPTPKCVRKCVKKN-QLW 117
              +GV T         C PY      +H      G  P  ++PTP+C + C +   + +
Sbjct: 167 VRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPY 226

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  K Y+  +Y + +D ++I  +I KNGPV+ +F VYEDF  YK G+YKH  G   GGHA
Sbjct: 227 KKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           VK+IGWG  D+G DYW++AN W++ WG  G+F++ RG N+C IE+ + AG+
Sbjct: 287 VKIIGWG-KDNGTDYWLIANSWSKDWGESGFFRMVRGENDCEIEDMITAGI 336


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  190 bits (482), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 102/229 (44%), Positives = 138/229 (60%), Gaps = 19/229 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  +  A+SDR CI       + +S  D+++CC + CGDGC+GG+PISA+R+
Sbjct: 113 QANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTW-CGDGCEGGWPISAFRF 171

Query: 73  FVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWR 118
               GVVT         C PY +   C H G E  Y        TP+C R+C+       
Sbjct: 172 HADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSY 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
            S  Y   AY++ +  + I  +I KNGPV  ++TVYEDFAHY+SG+YKH  G   G HAV
Sbjct: 231 PSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAV 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           K+IGWG  + G  YWI+AN W+  WG +G+F++ RGSN+CG EE + AG
Sbjct: 291 KVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  190 bits (482), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 105/232 (45%), Positives = 135/232 (58%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGA EA +DR CI     +   LS  DLL CC   CG GCDGG+   AWR+
Sbjct: 91  QSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLLTCCD-SCGFGCDGGWLDMAWRW 149

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           F   GV T       + C+ Y     C H      P C  +  TP+CV++C +   + + 
Sbjct: 150 FQSTGVTTGGEYGSKDWCNAY-SFPKCEHHAEGKYPPCGESQETPECVKQCQEGYPVEYE 208

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KH+   AY +    + I  E+  NGP+EVSF VYEDF  YKSG+Y+H+ G  +GGHAV
Sbjct: 209 KDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAV 268

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           KL+GWG  +DG +YW +AN WN  WG +GYF+I  G  ECGIE   + G+P 
Sbjct: 269 KLVGWGV-EDGIEYWKIANSWNEDWGENGYFRIVAGKGECGIEVGPIGGIPK 319


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 136/222 (61%), Gaps = 19/222 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR CI+     +   S  DL++CC  +CG GC+GG P  AW Y
Sbjct: 65  QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEY 123

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C    N  ++
Sbjct: 124 WKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFK 182

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y ++   + I AE++KNGPVE +FTVY D   YK+GVYKH  G+ +GGHA+
Sbjct: 183 KDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAI 242

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           K+IGWG  ++ + YW++AN WN  WG +G+FKI RG + CGI
Sbjct: 243 KIIGWGVENNNK-YWLIANSWNSDWGDNGFFKILRGEDHCGI 283


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 105/226 (46%), Positives = 131/226 (57%), Gaps = 16/226 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A E  +DR+CIH       S    DLL+CC   CGDGC GG    AW++
Sbjct: 111 QGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLLSCC-HSCGDGCQGGNLGPAWQF 169

Query: 73  FVHHGVVTEECDPYFDSTGCSHP-------GCEPAYPTPKCVRKCVKKNQLWRNS--KHY 123
           +V  GV +    PY    GC HP         +    TPKC RKC     +   S  + +
Sbjct: 170 WVQRGVSSG--GPYNSRQGC-HPYPVDVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRF 226

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
              AY ++ D E I  EI++NGPV+ SF VY DF  YK+GVY+H+ G + GGHAVK+IGW
Sbjct: 227 GRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGW 286

Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G  ++G  YW+ +N W   WG  G+FKI RG N CGIE DV AGLP
Sbjct: 287 GV-ENGTKYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHAGLP 331


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  189 bits (480), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 104/217 (47%), Positives = 130/217 (59%), Gaps = 20/217 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGC-DGGYPISAWR 71
           QGHCGSCWAF + E LSDR CI      N+ LS  DLL+C     G GC DGG    AWR
Sbjct: 65  QGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLLSC--DKAGRGCSDGGRLSEAWR 122

Query: 72  YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           Y    GVV   C PY   +TG            P+C+ KC  +   ++  K Y +  Y +
Sbjct: 123 YMQKKGVVANRCKPYTSGATGF----------IPECMSKCTGEGHAYQ--KFYGLYLYTV 170

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
           + + + I  EI  NGPVE +FTVY D  HYKSGVY H +G  +GGHAVK++GWG  D+ E
Sbjct: 171 SGENQ-IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDE-E 228

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +YW++AN W   WG  G+FKIKRGS+ECGIE  V+ G
Sbjct: 229 EYWLVANSWGPDWGDQGFFKIKRGSDECGIESRVLTG 265


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  189 bits (479), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/239 (43%), Positives = 147/239 (61%), Gaps = 18/239 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
           N + ++++  Q +CGSCWAFGA E +SDR CI         +S  D+L CC      GC 
Sbjct: 92  NCKSIKMIRDQAYCGSCWAFGAAEVISDRICIQSNGTDQPIISPEDILTCC--TNSHGCQ 149

Query: 63  GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK--N 114
           GG+ + A +++   GVVT      + C PY     CS   C  A  TPKC  +C  K   
Sbjct: 150 GGFVLEAMKFWKSKGVVTGGDFQGDGCIPY-SYGSCSD--CHTAQTTPKCKNECQVKYTK 206

Query: 115 QLWRNSKHYSISAYRINSDP--EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             ++  K+Y  SAYR+++      I +EI +NGPVE ++ VYEDF +YKSGVY++I+G  
Sbjct: 207 NEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRH 266

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           MGGHAVK+IGWG  ++  +YW++AN W   +G +G+FK++RG+NECGIE  VVAG+  S
Sbjct: 267 MGGHAVKIIGWGV-EENVNYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAKS 324


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/216 (44%), Positives = 133/216 (61%), Gaps = 19/216 (8%)

Query: 31  SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 81
           SDR CIH    + +++S  DLL CC   CG GC+GGYP +AW+++   G+VT       +
Sbjct: 1   SDRICIHTKGKVQVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTED 59

Query: 82  ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 134
            C PY+    C H      P C    PTP+C + C +   + +   KH+    Y I+SD 
Sbjct: 60  GCQPYYFPP-CEHHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
             I  EI KNGPVE  F VY DF  YKSGVY+  + +++GGHA++++GWGT +DG  YW+
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWL 177

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           +AN WN  WG  GYFKI+RG++ECGIE D+ AG+P 
Sbjct: 178 VANSWNEDWGDKGYFKIRRGNDECGIENDINAGIPK 213


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 103/231 (44%), Positives = 137/231 (59%), Gaps = 13/231 (5%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           E +  +  QG CGSCWAFGAVE +SDR CI  +       S  DLLACC   CG GC GG
Sbjct: 93  ESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQDLLACCK-ECGHGCGGG 151

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCV--KKNQLWRN 119
           Y   AW+Y+V  G+V+     +  S GC HP    A+    TP C   C   K  + +  
Sbjct: 152 YSSRAWQYWVTDGIVSG--GDFNTSQGC-HPYSVQAFRDSTTPNCSSFCTNPKYQKNYSE 208

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K Y   +YRI  + E I AEI  +GPV+ S+ VY+DF  Y++GVY+H+ G+V G H+VK
Sbjct: 209 DKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVK 268

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  ++G DYW++AN W R WG   G+FK  RG N C IE +++ G P
Sbjct: 269 ILGWG-RENGTDYWLVANSWGRDWGRLGGFFKFLRGENHCDIESNILGGDP 318


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/223 (44%), Positives = 137/223 (61%), Gaps = 19/223 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA++DR CI+     +   S  DL++CC  +CG GC+GG P  AW Y
Sbjct: 66  QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEY 124

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
           + H G+V+       + C PY +   C H  PG    C     TPKC + C     + ++
Sbjct: 125 WKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCEKTCESSYTVPFK 183

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y    Y ++   ++I AE++KNGPVE +FTVY D   YKSGVY+H  G+ +GGHA+
Sbjct: 184 KDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAI 243

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
           K++GWG  ++G  YW++AN WN  WG +G+ KI RG + CGIE
Sbjct: 244 KILGWGV-ENGSKYWLIANSWNSDWGDNGFLKILRGEDHCGIE 285


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 103/235 (43%), Positives = 134/235 (57%), Gaps = 14/235 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR C   G+  L +S   LL+CC   CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKD-CGDGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP SAW Y+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    +Y +    +D   E+Y NGP  V+F VY DF  YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGG 277

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+F I RG+NECGIE    AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPA 331


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 106/243 (43%), Positives = 147/243 (60%), Gaps = 20/243 (8%)

Query: 2   PFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD 59
           PF  S H   +  QG CGSCWA   V  +SDR CIH    +NL L+  DL+ CC   CG+
Sbjct: 102 PFCQSIHS--VRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCK-DCGN 158

Query: 60  GCDGGY-PISAWRYFVHHGVVT-------EECDPY-FDSTGCSHP--GCEPAYPTPKCVR 108
           GC+GG+   +A++Y+V  G+V+       E C PY F+   CS+P  GC      PKC+ 
Sbjct: 159 GCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCKPYPFEP--CSYPFVGCHHEKKNPKCLH 216

Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C+   ++ +R  K +  +AY+I +D   I  EI  NGPV   F V+EDF  Y SGVYKH
Sbjct: 217 HCINGYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKH 276

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + G  +G HA++++GWGT ++G  YW++AN +  +WG  G+FK+ RGSN  GIE  V+AG
Sbjct: 277 VVGKKVGMHAIRIVGWGT-ENGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAG 335

Query: 228 LPS 230
           LP 
Sbjct: 336 LPQ 338


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 136/222 (61%), Gaps = 17/222 (7%)

Query: 23  AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV- 79
           AFGAVEA+SDR CIH    + + +S  DL+ CC   CG GC GG   +AW+Y+   G+V 
Sbjct: 1   AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVS 59

Query: 80  ------TEECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 127
                 T+ C PY       S+  S P C    PTPKC R+C +   + + + K+++ + 
Sbjct: 60  GGLYNTTDGCKPYSLAPCEHSSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNV 119

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y IN   + I  EI++NGPVE  FT Y DF  YKSGVY+H + D++G HA++++GWG S+
Sbjct: 120 YSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SE 178

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           D   YW+LAN WN  WG  GYFK+ RG NEC IE  V AG+P
Sbjct: 179 DNNPYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIP 220


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 103/243 (42%), Positives = 142/243 (58%), Gaps = 21/243 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           +++ + + ++  Q  CGSC AFGA EA+SDR CIH    + +++S  DLL CC   CG G
Sbjct: 35  WSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNISAQDLLTCC-HQCGMG 93

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C GGYP +AW Y+   G+VT       + C PY+    C H      P C    PTPKC+
Sbjct: 94  CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP-CEHHTKGPLPNCTDTKPTPKCL 152

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C K   + +   K+++ + Y ++SD   I  EIYKNGPVE  F+VY DF  YKSGVY+
Sbjct: 153 QVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGPVEADFSVYTDFLAYKSGVYQ 212

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             + ++       L GW         W++AN WN+ WG  GYFKI+RG+NECGIE D+ A
Sbjct: 213 RHSYELWEARHQNL-GWALKR--RSVWLVANSWNQDWGDKGYFKIRRGNNECGIENDINA 269

Query: 227 GLP 229
           G+P
Sbjct: 270 GIP 272


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/237 (43%), Positives = 138/237 (58%), Gaps = 17/237 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           + N + +  +  QGHCGSCWA  + E L DRFCI         LS   L +C       G
Sbjct: 86  WPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLTSCTPGC--SG 143

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC----VKKNQ 115
           C+GG+  +A+ +   +G++ E+C PY     C HPGC   +PTPKC + KC     K  +
Sbjct: 144 CNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKCYPNDTKSTE 201

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
           LW     ++ S+Y + S+  DI  EIY+NGPV  SF VYED + Y+SGVY+H+TG   G 
Sbjct: 202 LW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGL 256

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           HA+K++GWG   DG  YW + N W   WG DG   I+RG +ECGIE DVVAG P  K
Sbjct: 257 HAIKVVGWGIL-DGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKLK 312


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 103/240 (42%), Positives = 137/240 (57%), Gaps = 20/240 (8%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N   +  +  Q +CGSCWA     ALSDR CI       + +S  D ++CC   CG GCD
Sbjct: 106 NCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCE-SCGYGCD 164

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVR 108
           GG+PI A+ ++ + G VT       + C PY     C H G       C     TPKC R
Sbjct: 165 GGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGNDTYYGECPKGAKTPKCRR 223

Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           +C +   + +   K Y   AY +    + I  EI KNGPV  +FTVYEDF++YK G+YKH
Sbjct: 224 RCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             G   GGHA+K+IGWG  +D   YW++AN W+  WG +GYF++ RG NECGIE++VVAG
Sbjct: 284 TAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 132/221 (59%), Gaps = 11/221 (4%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  +   ++DR CI         LS  +L++CC  +CG GCDGGYP  A+ Y
Sbjct: 106 QANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEELVSCCK-ICGYGCDGGYPDKAFIY 164

Query: 73  FVHHGVVTEECDPYFDSTGCSH----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 127
           +   G+ T    PY  + GC         E    TP C R+C+ +        +H+    
Sbjct: 165 WATRGIPTG--GPYGSTKGCKPYSIGSNSEDEAETPLCTRQCINEYPYNLSQDRHFGEKP 222

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +NS+ E IM E+YKNGPV V+F VYEDF +Y  GVY+H  G  +GGHAVKLIGWG  +
Sbjct: 223 YWVNSNEEQIMQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGI-E 281

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           + + YW+++N WN +WG +G+FKI RG N C IE  VVAG+
Sbjct: 282 NSKKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGM 322


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 98/218 (44%), Positives = 131/218 (60%), Gaps = 14/218 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA E LSDRF I     + ++LS   L+ C   L   GC GG+PI+AW Y
Sbjct: 103 QGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQLVDCD--LDNSGCSGGWPINAWNY 160

Query: 73  FVHHGVVTEEC-DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
            V  G++TE+C  PY+         C     T  C  +   K + +     Y + A  + 
Sbjct: 161 MVKTGLLTEQCYGPYY----AKQYTCRLTANTTDCPWQPGVKARFYHAKSAYKLPAKNV- 215

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
              E I  +I  NGPVE  FT+++DF  Y+SG+Y H TG  +GGHA+K++GWGT D+  D
Sbjct: 216 ---EAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGWGTEDN-VD 271

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           YW+ AN W  +WG  GYFKI+RG++ECGIE+ + AGLP
Sbjct: 272 YWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  186 bits (472), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 106/234 (45%), Positives = 133/234 (56%), Gaps = 23/234 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+ DR CI       + LS +D+L+CC   CG GC+GG    AW Y
Sbjct: 116 QSACGSGWAVAAVGAIMDRICIASEGKQQVILSADDILSCCT-ECGYGCEGGDTYKAWNY 174

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 116
           +   G+VT     Y   +GC    +P CE               YPT  C  KC     +
Sbjct: 175 WTTDGIVTGS--NYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTI 232

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            +   KHY    Y +  D   I  EI  +GPVEV+F VYEDF HY SG+YKH+ G+ +G 
Sbjct: 233 SYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGV 292

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWGT ++G DYWI AN WN  WG +G+F+I RG NECGIE +VVAG P
Sbjct: 293 HAVKMLGWGT-ENGVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAGKP 345


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 138/231 (59%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  + EA+SD+ C+       + +S  D+L+CCG  CG GC+   PI A+R+
Sbjct: 109 QSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILSCCGISCGYGCEV-LPIEAYRW 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKK-NQLW 117
                VVT       + C PY      +H       P     +PTPKC + C +K N+ +
Sbjct: 168 MQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCPRGLWPTPKCRKACQRKYNKSY 227

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K+++  +Y + S+   I  EIYKNGPV  +F VY+DF++Y+ G+Y H  G   G HA
Sbjct: 228 NEDKYFATRSYYLPSNERSIREEIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHA 287

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           VK++GWG  ++G DYW++AN WN  WG +GYF+I RGSNECGIE  +V+G+
Sbjct: 288 VKVVGWG-RENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQMVSGV 337


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 133/230 (57%), Gaps = 19/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI  +    + +S  D+L+CCG  CG GC+GG+PI A+ Y
Sbjct: 24  QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
           F   G VT         C PY     C H G       C     TPKCVRKC     + +
Sbjct: 84  FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           +  +     AY   +  +    EI KNGPV  +FTVYEDF++YK G+YKH  G   GGHA
Sbjct: 143 KKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWG  + G  YW++AN W+  WG +GYF+I  GSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILCGSNHCGIEENVVAG 251


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 101/230 (43%), Positives = 133/230 (57%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI       + +S  D ++CC   C  GCDGG+PI A+ +
Sbjct: 116 QANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCE-SCSYGCDGGWPILAFDF 174

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
           + + G VT       + C PY     C H G       C     TPKC R+C +   + +
Sbjct: 175 YTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAY 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K Y   AY +    + I  EI KNGPV  +FTVYEDF++YK G+YKH  G   GGHA
Sbjct: 234 YMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +K+IGWG  +D   YW++AN W+  WG +GYF++ RG NECGIE++VVAG
Sbjct: 294 IKIIGWGVEND-VPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 105/225 (46%), Positives = 132/225 (58%), Gaps = 14/225 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A EA +DR+CIH   + + S    DL++CC   CGDGC GG    AW Y
Sbjct: 120 QGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCC-HSCGDGCQGGVLGPAWDY 178

Query: 73  FVHHGVVTEECDPYFDSTGC-SHPGCEPAYP-----TPKCVRKCVKKNQLWRNSK--HYS 124
           +V  GV +    PY    GC S+P      P      PKC RKC     +   SK   + 
Sbjct: 179 WVQKGVSSG--GPYNSKQGCHSYPFDTCHSPDEDDDAPKCSRKCQSSYSVQDVSKDRRFG 236

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
             AY + +D   IM EI+ NGPV+ +F VY DF  YKSGVY+H+TG + GGHA+K++GWG
Sbjct: 237 RVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWG 296

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
             ++G  YW+ +N W   WG  G+FKI RG N  GIE DV AGLP
Sbjct: 297 V-ENGTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDVHAGLP 340


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 135/235 (57%), Gaps = 14/235 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR C   G+  L +S   LL+CC   CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP +AWRY+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPDAAWRYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                ++    +Y +    +D   E+Y NGP  V+F V+ DF  YK+GVY+H++GD +GG
Sbjct: 220 PL--IEYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGG 277

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+F   RG+NECGIE +  AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLPA 331


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 103/245 (42%), Positives = 138/245 (56%), Gaps = 23/245 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  Q  C SCWA G   A++DR CIH        LS  DL++CC + CG G
Sbjct: 96  WPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVDLVSCCPY-CGYG 154

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
           C+GGYP  AW Y+  HG+V+         C PY     CSH    PG  P     Y TPK
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLEETPGLAPCPRELYATPK 213

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C ++C    ++     K    S+Y +     DIM EI  NGPV   + ++EDF  YKSG+
Sbjct: 214 CEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYYIFEDFTVYKSGI 273

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           Y++ +G +MGGH +  IGWG  ++G  YW+ AN WN  WG +GYF+I+RG+NECGIE  +
Sbjct: 274 YQYTSGSLMGGHGI--IGWGV-ENGVKYWLAANSWNEGWGENGYFRIRRGTNECGIESRI 330

Query: 225 VAGLP 229
            AGLP
Sbjct: 331 NAGLP 335


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 135/235 (57%), Gaps = 14/235 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR+C   G+  L +S   L++CC   CGDGC G
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLMSCCED-CGDGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           G P SAW Y+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GAPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+   ++Y + +  +D   E+Y NGP  V F VY DF  YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGG 277

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+F I RG+NECGIE    AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPA 331


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 112/253 (44%), Positives = 150/253 (59%), Gaps = 32/253 (12%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
           ++++  Q +CGSCWAFGA E +SDR CIH        +S  D+L CCG  CG+GC GG  
Sbjct: 86  IKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAEDILTCCGKSCGNGCQGGQG 145

Query: 67  ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL--WR 118
           + A +++  +G VT      + C PY     CS+  C  +  TP C  KC     +  ++
Sbjct: 146 LEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--CVESKTTPSCQSKCQSTYTVTNYK 202

Query: 119 NSKHYS---------------ISAYRINSDPED---IMAEIYKNGPVEVSFTVYEDFAHY 160
             KHY                 SAYR+++       I  EIY+NGPVEV++TVY+DF HY
Sbjct: 203 GDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHY 262

Query: 161 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           KSGVY H+TG   GGHAVK+IGWGT + G DYW++ N W  S+G  G+FKI+RG+NECGI
Sbjct: 263 KSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGI 321

Query: 221 EEDVVAGLPSSKN 233
           E +VVAG+    N
Sbjct: 322 ESNVVAGMAKVGN 334


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/236 (42%), Positives = 138/236 (58%), Gaps = 19/236 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q  CGSCWAF   E++SDR CI    N +   SV D+L CC   CG GCDGG+P
Sbjct: 109 ISLIRDQADCGSCWAFAVGESISDRVCIATDANKTAEFSVEDILTCCD-ECGFGCDGGFP 167

Query: 67  ISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY------PTPKCVRKCVKK 113
            +AW YFV  GVVT         C PY  S   +HP  E  Y       TP C   C K 
Sbjct: 168 DAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPN-ETFYRNCTGVSTPSCKTSCQKG 226

Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + +++ K     +Y + +    I  +I K+GP+  +F+VYEDF +YK G+Y++  G  
Sbjct: 227 YPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGY 286

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            GGHAV+++GWG  ++ + YWI+AN WN  WG DG+F++ RG N+CGIEE V AGL
Sbjct: 287 EGGHAVRILGWGVENNVK-YWIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAGL 341


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 137/235 (58%), Gaps = 15/235 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR+C   G+  L +S   LL+CC   CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLLSCCKD-CGYGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP +AW Y+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPGTAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    +Y ++ + +D   E+Y NGP  V+F VY DF  YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNHSYGLDGE-DDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGG 276

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+F I RG +ECGIE +  AGLP+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLPA 330


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 106/234 (45%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y
Sbjct: 112 QSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V HG+VT         C PY     C H      P C +  Y TP+C RKC K     +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + KHY   A  +  +   I  EI   GPVE    ++EDF +YKSG+YK+ TG  +G H 
Sbjct: 230 EHDKHYGGIAINVIKNELAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHY 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V++IGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG   S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 102/235 (43%), Positives = 135/235 (57%), Gaps = 15/235 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR C   G+  L +S   LL+CC   CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKD-CGYGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP +AWRY+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPDAAWRYYVSHGLASSYCQPY-PFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    +Y ++ + ED   E+Y NGP  V+F VY DF  YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-EDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGG 276

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+F I RG +ECGIE    AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSPA 330


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 101/231 (43%), Positives = 134/231 (58%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+ WAF AV+A+SDR CI      ++ LS  DLL+CC   CG GC  G+P  AW Y
Sbjct: 112 QSRCGAGWAFAAVQAMSDRICIESKGKKSVELSAVDLLSCC-IECGLGCQMGFPGIAWDY 170

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C E  Y  PKC +KC K  +  + 
Sbjct: 171 WVQEGIVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K+Y   +Y +  + + I  EI  +GPVE SF V+ DF +YKSG+YKH+TG  +G H V
Sbjct: 231 KDKYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVV 290

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++IGWG   +   YW++AN WN  WG  GYF++ RG +ECGIE  V +GLP
Sbjct: 291 RIIGWGVEKE-TPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  182 bits (463), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 103/229 (44%), Positives = 129/229 (56%), Gaps = 22/229 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG+VE ++DR CI          S +DLLACC   CG GCDGG P  A+ Y
Sbjct: 104 QGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADDLLACCT-ACGKGCDGGAPYRAFEY 162

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHY 123
           +V  G+V+       E C PY  S   +         TPKC  KC+  K    +   KHY
Sbjct: 163 WVAKGIVSGGDYNSNEGCQPYEGSAFLNSV-------TPKCSTKCLNSKYTTPYAKDKHY 215

Query: 124 SIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
                Y  + +  +I  EI  NGPV     VYEDF  YKSGVY+H++G+ MGGHAVK+IG
Sbjct: 216 GTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIG 275

Query: 183 WGTSDDGEDYWILANQWNRSWG-ADGYFKIKRGSNECGIEEDVVAGLPS 230
           WGT + G  YW++AN W   W   DG++KI RG N C IE  +  G P 
Sbjct: 276 WGT-EKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTPQ 323


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  182 bits (463), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V HG+VT         C PY     C H      P C +  Y TP+C RKC K     +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + KHY   +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H 
Sbjct: 230 EHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V++IGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG   S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  182 bits (463), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTSCRPY-PFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  182 bits (463), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 101/237 (42%), Positives = 133/237 (56%), Gaps = 25/237 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
           Q  CGSCWAFGAVEA+SDR CIH   +  + +S  DL +CC   F CG GCDGGY    W
Sbjct: 106 QSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVSAEDLNSCCFGLFACGLGCDGGYVAEPW 165

Query: 71  RYFVHHGVVTEECDPYFDSTGCSHPGCEPA----------------YPTPKCVRKCVKKN 114
            Y+   G+VT     Y  S GC     EP                 + TP+CVR C + +
Sbjct: 166 DYWRTDGIVTG--GAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYESS 223

Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VM 173
             +  S  +        ++ + +  EI KNGP+E +FTVY DF  YKSGVY+    D  +
Sbjct: 224 LDYTESLTFGQQVSTFTNEKQ-MQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESV 282

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           GGHA+K++GWG  ++G  YW++AN WN  WG +GYFK  RG + CGIE +  A LP+
Sbjct: 283 GGHAIKVLGWGV-EEGTKYWLIANSWNTDWGDNGYFKFLRGVDHCGIESETAASLPA 338


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  182 bits (461), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V HG+VT         C PY     C H      P C +  Y TP+C RKC K     +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + KHY   +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H 
Sbjct: 230 EHDKHYGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V++IGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE  VVAG   S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  182 bits (461), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/231 (44%), Positives = 138/231 (59%), Gaps = 18/231 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q HCGSCWA  + E +SDR C+     + + LS  D+LACC   CG GC GG+ I AW Y
Sbjct: 112 QSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDILACCPN-CGAGCGGGHTIRAWEY 170

Query: 73  FVHHGVVT-------EECDPY--FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSK 121
           F + GV T       + C PY  +     S+  C + ++PTPKC + C  K ++ + + K
Sbjct: 171 FKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGKCPKDSFPTPKCRKICQYKYSKKYADDK 230

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           +Y+ SAYRI  +   I  EI +NGPV  SF +Y DF  Y+ GVY    G  +GGHA+K+I
Sbjct: 231 YYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKII 290

Query: 182 GWGTSD-DGED--YWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAGL 228
           GWGT   +G D  YW++AN W   WG  +GYF+I RG N C IE+ V+AG+
Sbjct: 291 GWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIAGM 341


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 132/225 (58%), Gaps = 20/225 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAF AVE +SDR CIH         S  DLL+CC   CG  C GGY ++A+ +
Sbjct: 103 QGSCGSCWAFAAVETMSDRICIHSSGAKKFFFSAEDLLSCCT-ACGS-CSGGYMMAAFDF 160

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYS 124
           ++  GVV+       E C PY   T  +H        TP C + C K     + + KHY 
Sbjct: 161 YIKQGVVSGGDLNSNEGCRPY---TADAHDKGV----TPSCTKSCRKGYPTSYSSDKHYG 213

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
              Y +++   +I  EI  NGP+ VSF VY+DF +Y SGVY H++G+  G H VK++GWG
Sbjct: 214 SKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWG 273

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           T  + +DYW++AN W  SWG  G+FKI RG NECGIE +  A LP
Sbjct: 274 TEKE-QDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYAVLP 317


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 103/231 (44%), Positives = 134/231 (58%), Gaps = 21/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA     ALSDR CI       + +S  D+L+CC   CGDGCDGGY I A+++
Sbjct: 113 QADCGSCWAVSTASALSDRICIASKGAKQVYVSATDILSCC-HSCGDGCDGGYVIDAFKF 171

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-W 117
           F   G VT       + C PY     C H G E  Y        TP+CVRKC +  +  +
Sbjct: 172 FAEQGAVTGGDYGAKDCCRPY-PFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEY 230

Query: 118 RNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
              +     AYR+     + I  EI +NGPV  +F V++DF+ Y+ G+Y H+ G   GGH
Sbjct: 231 HEDRVRGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGH 290

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           AVK+IGWGT + G  YWI+AN W+  WG DGYF++ RG N+CGIE +VVAG
Sbjct: 291 AVKIIGWGT-EHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAG 340


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/237 (43%), Positives = 129/237 (54%), Gaps = 21/237 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DL++CC   CG GC GGY   AW  
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDL 166

Query: 73  FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
           +  HG+VT         TGC     P CE              YPTP+C+++C  K   +
Sbjct: 167 WKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEIDY 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K  +  +Y +    + +M EI   GPV     VYED   YKSGVY H+ G  +G H 
Sbjct: 225 EKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHG 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           ++++GWG  +DG  YW++AN WN  WG  GY ++ R  NECGI + V AGLP   N 
Sbjct: 285 IRILGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDLSNF 340


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  +V A+SDR CI  G   ++ LS  DL++CC   CG GCDGGY + +W Y
Sbjct: 112 QSRCASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGYFLPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V HG+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 107/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDYW 171

Query: 74  VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  +  
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 101/241 (41%), Positives = 138/241 (57%), Gaps = 22/241 (9%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCD 62
           N   ++ +  Q +CGSCWA      LSDR CI       + ++  D ++CC   CG GC+
Sbjct: 106 NCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCCD-SCGFGCE 164

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVR 108
           GG+PI A+ Y+ + GVVT         C PY     C H G E  Y        TP+CV+
Sbjct: 165 GGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPCGHHGNETYYGECPKEESTPECVK 223

Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           +C K  KN  +R  K +    Y + +  + I  EI ++GPV  SFTVY+DF++Y  G+YK
Sbjct: 224 QCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKGIYK 282

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H  G   G HA+K+IGWGT +    YWI+AN W+  WG  G+F++ RG+N CGIEEDVVA
Sbjct: 283 HTAGKARGSHAIKIIGWGT-EKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDVVA 341

Query: 227 G 227
           G
Sbjct: 342 G 342


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 125/203 (61%), Gaps = 18/203 (8%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVE++SDR C+H G   N+ +S  DLL+CCGF CG G
Sbjct: 23  WPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLLSCCGFECGMG 82

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
           C+GGYP  AW+Y+   G+V+         C PY     C H      P C      TPKC
Sbjct: 83  CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPP-CEHHVNGSRPSCSGEGGDTPKC 141

Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V+KC       +   K Y  SAY + S PE IM EIYK+GPVE +FTVYEDF  YKSGVY
Sbjct: 142 VQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVY 201

Query: 166 KHITGDVMGGHAVKLIGWGTSDD 188
           +H TG+ +GGHA+K++GWG  ++
Sbjct: 202 QHHTGEAVGGHAIKILGWGIENN 224


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S       +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC I+ ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGLIKS 342


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  179 bits (454), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 79  QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 137

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 138 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 195

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 196 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 255

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 256 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  179 bits (454), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y
Sbjct: 86  QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 144

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL-W 117
           +V HG+VT         C PY     C H      P C +  Y TP+C RKC K  +  +
Sbjct: 145 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPY 203

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + KHY   +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H 
Sbjct: 204 EHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 263

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V++IGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC +E  VVAG   S
Sbjct: 264 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSVESVVVAGRLKS 316


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (454), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 133/238 (55%), Gaps = 19/238 (7%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDG 63
           S+ +  +V Q  CGSCWA  A  A+SDR CI     + + +S  +LL+CC   CG GC+G
Sbjct: 93  SDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDS-CGYGCEG 151

Query: 64  GYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPA-YPTPKCVRK 109
           GYP  AW Y++  G+ T       + C PY     C H        C    Y TP C  K
Sbjct: 152 GYPTMAWSYWIDTGITTGGLYGSKQGCQPY-SLQPCEHHTEGNKVQCSTLDYDTPSCKHK 210

Query: 110 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
           C      +++   +   + R      +I  EI  NGPVE +F VY DF +YKSGVY+H+ 
Sbjct: 211 CDDSALNYKSELTFGSGSVRNFYSVANIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVA 270

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           G+ +GGHAV+++GWG  + G  YW++AN WN  WG  G FKI+RG+NE G E+ +VA 
Sbjct: 271 GEYLGGHAVRILGWG-EESGVPYWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIVAA 327


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 101/232 (43%), Positives = 127/232 (54%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR C+      N  LS  ++  CC   CG GC GGYPI AW+Y
Sbjct: 110 QGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITFCC-HTCGFGCHGGYPIKAWKY 168

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C+PY       D  G +    +P     +C R C     L  N
Sbjct: 169 FSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYN 228

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H ++   Y +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHA
Sbjct: 229 DDHRFTRDFYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  ++G  YW++ N WN  WG  G FKI+RG+NECGI+    AG+P
Sbjct: 287 VKLIGWGV-EEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECGIDNSTTAGVP 337


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AG   S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGRIKS 342


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 109/235 (46%), Positives = 134/235 (57%), Gaps = 26/235 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCG---DGCDGGYPIS 68
           Q  CGSCWAF   EA SDR CI        + LS     ACC    G    GCDGG P S
Sbjct: 150 QSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAGHTAACCSEAEGCFSFGCDGGQPDS 209

Query: 69  AWRYFVHHGVVTE---ECDPYFDSTGCSH----PGCEPAY---PTPKCVRKCVKKNQLWR 118
           AWR+F  HGVV+E    C PY +   CSH     G EP     P+P C   C  +N  ++
Sbjct: 210 AWRWFSEHGVVSELDSGCWPY-NFPECSHHVETKGMEPCKGNSPSPVCSTTC--RNHHFK 266

Query: 119 NS----KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
            S    +H++        + ++I  EI  NGPV  +FTVYEDF +YKSGVYKH+ G  +G
Sbjct: 267 PSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELG 326

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GHAVK+IGWGT D  E YW++ N WN +WG  G FKI  G  ECGI+ +V AG+P
Sbjct: 327 GHAVKIIGWGT-DQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 378


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 104/232 (44%), Positives = 125/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC+GGYPI AW  
Sbjct: 106 QGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAEELTFCC-HKCGFGCNGGYPIRAWER 164

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +P     +C R C     L + 
Sbjct: 165 FRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFN 224

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
           N  HY+  AY +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHA
Sbjct: 225 NDHHYTRDAYYLTYGT--IQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 283 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 333


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/228 (43%), Positives = 134/228 (58%), Gaps = 18/228 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG    AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
           +V  G+ +       + C PY     C  PG +    TPKC  KC        +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           WG  ++G  YW++AN W R WG +G+FKI RG N CGIEE++ AGLP+
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAGLPN 368


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 99/232 (42%), Positives = 128/232 (55%), Gaps = 19/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A+SDR CI       +++S  D+L CC   CG GC GG+ I AW Y
Sbjct: 106 QANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGFGCGGGWSIRAWEY 165

Query: 73  FVHHGVVTE-------ECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
           FV+ GVV+         C PY     C H G       C     TP C +KC     +++
Sbjct: 166 FVYEGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIF 224

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
           R  K     AY +    E I  EI ++GPV  SF VYEDF+ YK+GVYKH  G + G HA
Sbjct: 225 RMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHA 284

Query: 178 VKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           VK++GWG  S     YW++AN W+  WG +GYF+  RG N+C IE+ V AG+
Sbjct: 285 VKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTVAAGI 336


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 93/211 (44%), Positives = 122/211 (57%), Gaps = 16/211 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGA+E++SDRFCIH   ++ LS  DL+ C      +GC+GG P +A++Y  
Sbjct: 92  QAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLITCDN--QDNGCEGGDPYTAYKYVQ 149

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQLWRNSKHYSISA 127
            +GVVT  C PY      + P C PA         TP C  KC   +  ++   H+  + 
Sbjct: 150 KNGVVTSNCQPY------TIPTCPPAQQPCMNFVNTPPCSAKCANSSVNFQQDLHHLKTV 203

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  +   I  EI  NGPVE  F VYEDF  YKSGVY H +G  +GGH +K++G+G S 
Sbjct: 204 YAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVS- 262

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
           +G  YWI  N W  SWG +G F I+ G NEC
Sbjct: 263 NGTPYWICNNSWTTSWGNNGIFWIEAGKNEC 293


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 132/234 (56%), Gaps = 15/234 (6%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           ++ + ++  Q  CGSCWAF AVEA+SDR CIH      L +S  DLL C       GC+G
Sbjct: 95  TQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQDLLTCG---TAGGCNG 151

Query: 64  GYPISAWRYFVH-------HGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQ 115
           G+P  AW  + +       +G + + C  YF      HP  C     TP CV +C + + 
Sbjct: 152 GWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEGCDDHPNKCRNYVSTPACVEQCDEPSL 211

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
            ++  + Y  + Y I  + E I  EI  NGPVE +  VY DFA Y+SG+Y+  T +  GG
Sbjct: 212 YYKAQETYGQTPYEIQGE-EQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGG 270

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVK++GWG  +DG  YW++AN WN  WG +G F+I RG +E GIE  + A LP
Sbjct: 271 HAVKILGWGV-EDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +GP E    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 102/231 (44%), Positives = 130/231 (56%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA   +      F  H    + + LS  +L+ CCG  CG GC GG P SAW Y
Sbjct: 103 QGSCGSCWALELLRLCLIVFVSHSNGKLQVHLSAENLVTCCGS-CGAGCFGGDPGSAWEY 161

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V+       E C PY     C H      P C     T  C ++C K   + + 
Sbjct: 162 WRDVGIVSGGNYGSKEGCQPY-SIAPCEHHIPGSRPPCRGEGHTADCRKQCEKGYSIPYD 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
              HY+   Y    D ++I  EI KNGPVE +F VYED   YK GVYKH+ G  +GGHA+
Sbjct: 221 KDLHYAEFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAI 280

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           K++GWG  ++G  YW++AN WN  WG +G+FKI RGS+ECGIE DV AGLP
Sbjct: 281 KILGWGV-ENGTPYWLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAGLP 330


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 101/242 (41%), Positives = 139/242 (57%), Gaps = 21/242 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG    AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
           +V  G+ +       + C PY     C  PG +    TPKC  KC        +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           WG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  ++  +A 
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377

Query: 243 MF 244
            F
Sbjct: 378 YF 379


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 128/232 (55%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CG+CWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEELAFCC-HKCGSGCHGGYPIKAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY      FD  G +    +PA    +C R C     L ++
Sbjct: 166 FRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
               Y+  AY +N   + I  ++   GP+E S+ VY+DF +YKSGVY K      +GGHA
Sbjct: 226 EDHRYTRDAYYLNY--QIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y +      I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/254 (42%), Positives = 135/254 (53%), Gaps = 42/254 (16%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI---------------HFGMNLSLSVNDLLACC-GFLCG 58
           Q  CGSCWAF + EA +DR CI                    L LS  D  ACC GF CG
Sbjct: 303 QSDCGSCWAFASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCG 362

Query: 59  --DGCDGGYPISAWRYFVHHGVVT----------EECDPY--------FDSTGCSHPGC- 97
              GC+GG P SAW++F   GVVT            C PY         D     +P C 
Sbjct: 363 LSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACP 422

Query: 98  EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 154
           +  YPTP+C+ +C + N     +   K  +  AY + +  E+I  ++ K G V  +F+V+
Sbjct: 423 DGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGIENIQRDMMKYGSVTAAFSVF 481

Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR 213
            DF  Y  GVY H +G  MGGHAVK+IGWGT +  GEDYW++AN WN SWG  G F+I R
Sbjct: 482 SDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRILR 541

Query: 214 GSNECGIEEDVVAG 227
           G NECGIE  +VAG
Sbjct: 542 GVNECGIEGQIVAG 555


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 101/242 (41%), Positives = 139/242 (57%), Gaps = 21/242 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG    AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
           +V  G+ +       + C PY     C  PG +    TPKC  KC        +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y   AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           WG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  ++  +A 
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377

Query: 243 MF 244
            F
Sbjct: 378 YF 379


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/253 (41%), Positives = 136/253 (53%), Gaps = 24/253 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N      +  Q +CGSCWA     A+SDR CI       +  S  D+L CCG  CG G
Sbjct: 99  WKNCSSFHTIRDQANCGSCWAVSTAAAISDRICIATKGKKQVYASDTDILTCCGARCGLG 158

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP-----------GCEPAYPTPK 105
           C GG+PI AW++F + GVV+    PY     CS    HP            C    PTP 
Sbjct: 159 CRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMAPTPP 216

Query: 106 CVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
           C RKC    + ++R  K Y      Y +      I  +I + G V   F VYEDF+HY+S
Sbjct: 217 CKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQS 276

Query: 163 GVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
           G+YKH  G   GG HAVK+IGWG  D+G DYW++AN W+  WG +G+F++ RG N CGIE
Sbjct: 277 GIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIE 335

Query: 222 EDVVAGLPSSKNL 234
           E V AG+   ++L
Sbjct: 336 EQVDAGIVDVESL 348


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 128/215 (59%), Gaps = 18/215 (8%)

Query: 30  LSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 80
           ++DR CI  G   +  LS  DL++CC   CG GC GG+P  AW Y+V  G+VT       
Sbjct: 1   MTDRICIQSGGGQSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENH 59

Query: 81  EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 133
             C PY        T   +P C    Y TP+C +KC K  +  ++  KHY   +Y + S+
Sbjct: 60  TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISN 119

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
            + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG       YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYW 178

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 213


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/236 (42%), Positives = 131/236 (55%), Gaps = 18/236 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q +CGSCWA  A + +SDR CIH      + LS  D+LACCG  CG GCDGGY 
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171

Query: 67  ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VK 112
             AW++    GVVT         C PY      +H G      P++P  TP C   C   
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYG 231

Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + + N K  + + Y + +D   I  EI K GPV  +F +YEDF HY  GVY H  G +
Sbjct: 232 YGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAM 291

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
            GGH++K+IGWG  D G  YW++AN W+  WG D GYF++ RG N C IE  V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V  G+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y +      I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/229 (46%), Positives = 141/229 (61%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171

Query: 74  VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  ++ 
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQ 231

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY   +Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 100/229 (43%), Positives = 137/229 (59%), Gaps = 17/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWR 71
           QG CG+CWA  AV  +SDR CIH     ++ L+  DL+ CC   CG+GC+GG+   ++++
Sbjct: 107 QGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQ 165

Query: 72  YFVHHGVV-------TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSK 121
           Y+V  G+V       T+ C PY     C +P  GC P   TP C   C +  +  +R  K
Sbjct: 166 YWVDVGLVSGAAYNSTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDK 223

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           +Y  +AY++ +D   I  EI  NGPVE  F+VY+D   YK+GVY+H+ G  +G HAV+LI
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283

Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           GWG  + G  YW++AN +   WG  GYFK  RGSN  GIE  V+AGLP 
Sbjct: 284 GWG-KERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 127/232 (54%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW +
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELAFCC-HKCGFGCHGGYPIKAWEW 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C    +L ++
Sbjct: 166 FKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQELDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
              H++  AY +      I  ++   GP+E SF VY+DF +YKSGVY K      +GGHA
Sbjct: 226 EDHHWTRDAYYLTY--TTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGGVP 334


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 100/220 (45%), Positives = 127/220 (57%), Gaps = 20/220 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E+LSDRFCI  +  +++ LS  D+++C       GCDGG   +AW +
Sbjct: 106 QEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMVSCD--YNDMGCDGGNLDNAWWW 163

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----QLWRNSKHYSISA 127
             + G+V + C PY    G            P C   C   N     QL+       IS 
Sbjct: 164 MKNKGIVPDSCMPYVSGGG----------NVPACPSNCNGTNIPISSQLYYAKSFSHISP 213

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           +       DI  EIY NGPV+  F+VY+DF +YKSGVY H TG  +GGHA+K+IGWG  +
Sbjct: 214 WMFWERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGV-E 272

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            G DYW++AN W+  WG DG FKI RG NECGIE+DV AG
Sbjct: 273 GGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAG 312


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 98/231 (42%), Positives = 128/231 (55%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR C+      N  LS  +L  CC   CG+GC+GGYPI AW+Y
Sbjct: 109 QGYCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELTFCC-HTCGNGCNGGYPIKAWKY 167

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C+PY       +  G S    +P     +C R C     L  N
Sbjct: 168 FSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYN 227

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAV 178
             H     Y   +    I  ++   GP+E SF VY+DF  YKSGVY+       +GGHAV
Sbjct: 228 DDHRFTRDYYYLT-YGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAV 286

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KLIGWG  ++G  YW++ N W+  WG +G FKI+RG++ECGI+    AG+P
Sbjct: 287 KLIGWGV-EEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 336


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 98/231 (42%), Positives = 125/231 (54%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWA     A +DR C+      N  LS  ++  CC   CG GC+GGYPI AW+Y
Sbjct: 110 QGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEITFCC-HTCGFGCNGGYPIKAWKY 168

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C+PY       D  G S    +P     +C R C     L  N
Sbjct: 169 FSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYN 228

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAV 178
             H     Y   +    I  ++   GP+E SF VY+DF  YKSGVY+       +GGHAV
Sbjct: 229 DDHRFTRDYYYLT-YGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAV 287

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KLIGWG  ++G  YW++ N WN  WG +G FKI+RG++ECGI+    AG+P
Sbjct: 288 KLIGWGV-EEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECGIDSAATAGVP 337


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  A+ A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAVDLISCCEN-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 96/215 (44%), Positives = 127/215 (59%), Gaps = 18/215 (8%)

Query: 30  LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 80
           ++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y+V  G+VT       
Sbjct: 1   MTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGQAWDYWVTQGIVTGGSKENH 59

Query: 81  EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 133
             C PY        T   +P C    Y TP+C + C K  +  +   KHY   +Y + S+
Sbjct: 60  TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISN 119

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
            + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG  +    YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYW 178

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           ++AN WN  WG  G F+I RG +EC IE  VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 213


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELAFCC-HKCGFGCSGGYPIRAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  +I   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 125/200 (62%), Gaps = 18/200 (9%)

Query: 44  LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 94
           +S N+LLACC   CGDGC+GGYP +AW  F H GVVT       + C PY  +  C H  
Sbjct: 12  VSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-CDHHV 69

Query: 95  ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 149
                 C+    TP+C +KC    N  +++ KHY   +Y ++S   DIM E+   GPVE 
Sbjct: 70  VGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRGPVEA 128

Query: 150 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 209
           +FTVY DF  Y SGVY+H TG  +GGHAVK++G+G  ++G+ YW++AN WN  WG  G+F
Sbjct: 129 AFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWLVANSWNPDWGDQGFF 187

Query: 210 KIKRGSNECGIEEDVVAGLP 229
           KI RG +ECGIE  +VAG P
Sbjct: 188 KILRGVDECGIEGQIVAGEP 207


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 18/236 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q +CGSCWA  A + +SDR CIH      + LS  D+LACCG  CG GCDGGY 
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171

Query: 67  ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VK 112
             AW++    GVVT         C PY      +H G      P++P  TP C   C   
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYG 231

Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + + N K  + + Y + +D   I  EI + GPV  +F +YEDF HY+ GVY H  G +
Sbjct: 232 YGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAM 291

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
            GGH++K+IGWG  D G  YW++AN W+  WG D GYF++ RG N C IE  V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 96/215 (44%), Positives = 126/215 (58%), Gaps = 19/215 (8%)

Query: 31  SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 81
           +DR C +     +   S  DLL+CC  +CG GC+GG P  AW Y+ H G+V+       +
Sbjct: 1   TDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQ 59

Query: 82  ECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 134
            C PY     C H  PG    C     TPKC + C    N L++  K Y    Y +    
Sbjct: 60  GCSPYVIPP-CEHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGE 118

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
           + I AE++KNGPVE +FTVY D   YKSGVYKH+ GD +GGHA+K+IGWG  ++G  YW+
Sbjct: 119 DHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYWL 177

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +AN WN  WG +G+FKI RG + CGIE  +VAG P
Sbjct: 178 IANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIKAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +P     +C R C     L ++
Sbjct: 166 FKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  ++   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  177 bits (448), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 130/235 (55%), Gaps = 24/235 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C SCWA    + +SDR CIH G    + LS  +LL+CC  LCG GC GG+P  AW +
Sbjct: 135 QGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCK-LCGKGCKGGFPGGAWMH 193

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAY-PTPK--CVRKCVKK---------------N 114
           +  HG+VT     Y    GC      P Y P  K     KC K                N
Sbjct: 194 WSKHGIVTG--GSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETCRTSYN 251

Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
           + ++   +Y  S YRI +D   I  EI +NGPV+ +  +YEDF HYK GVY+H+ G  + 
Sbjct: 252 KSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLE 311

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            HAVK+ GWGT + G  YW+ AN W++ WG  G+FKI RGSN   IE+ V+AG+P
Sbjct: 312 YHAVKIFGWGT-EGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIEDHVMAGIP 365


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 101/241 (41%), Positives = 132/241 (54%), Gaps = 16/241 (6%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCG--FLCG 58
           ++  E ++ +  Q  CGSCWA  +   +SDR CI       L +S  D++ CC       
Sbjct: 91  WSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSV 150

Query: 59  DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-------HPGCEPAYPTPKCVRKCV 111
           DGC GG P   +  +   G V+     Y  + GC        +P C+  Y  P C ++C 
Sbjct: 151 DGCHGGIPSFTFTEWKDSGFVSG--GEYNSTNGCMSYPLPRCNPSCKTLYDAPTCKKECD 208

Query: 112 KKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI- 168
           K + L +   KHY+  AYRI S  E  I  EI KNGPV  SFTVY DF HY SGVYK   
Sbjct: 209 KGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDG 268

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
              ++GGHAV++IGWG  +    YW+++N WN  WG  G FKI RG NECGIEE++ AGL
Sbjct: 269 ESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGL 328

Query: 229 P 229
           P
Sbjct: 329 P 329


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 127/217 (58%), Gaps = 20/217 (9%)

Query: 15  QGHCGSCWA-----FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPI 67
           Q  CGSCWA       + E LSDRFCI  G  +N+ LS  DL++C  +    GCDGG   
Sbjct: 22  QEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLSPQDLVSCNWY--NAGCDGGILW 79

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           +AW Y  H G+VT++C PY    G +          P C + C   +    + K+ +   
Sbjct: 80  AAWIYLKHTGIVTDQCLPYSSGNGVA----------PSCPKYCNGTSTPIDSVKYKAKDW 129

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y + S  E IM EI  NGPV+  F+VY+DF  YKSGVY H TG  +GGHA+K++GWG  +
Sbjct: 130 YEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVEN 189

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           + + YW++AN W   WG +G FKIKRG NECGIE DV
Sbjct: 190 NVK-YWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 98/248 (39%), Positives = 133/248 (53%), Gaps = 21/248 (8%)

Query: 3   FTNSEH------VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGF 55
           F  +EH      +  +  Q  C + WA     A+SDR+C +  G  L +S  DL+ACC  
Sbjct: 95  FDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQLRISAADLMACCK- 153

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
            CG GC+GGYP +AW Y+V HG+ + +C PY     C H G +   P        TP+C 
Sbjct: 154 DCGGGCEGGYPDAAWEYYVSHGITSSQCQPY-PFPRCEHRGAQGKKPPCSKYKFVTPQCN 212

Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
             C  K+      K+    +Y +  + ED   E+Y NGP  V F V+ DF  YKSGVY+H
Sbjct: 213 ATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQH 269

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I RG NEC IE    AG
Sbjct: 270 VAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAG 328

Query: 228 LPSSKNLV 235
            P    L 
Sbjct: 329 TPDPSQLA 336


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  176 bits (447), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 99/229 (43%), Positives = 136/229 (59%), Gaps = 17/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWR 71
           QG CG+CWA   V  +SDR CIH     ++ L+  DL+ CC   CG+GC+GG+   ++++
Sbjct: 107 QGLCGACWAVATVSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQ 165

Query: 72  YFVHHGVV-------TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSK 121
           Y+V  G+V       T+ C PY     C +P  GC P   TP C   C +  +  +R  K
Sbjct: 166 YWVDVGLVSGAAYNNTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDK 223

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           +Y  +AY++ +D   I  EI  NGPVE  F+VY+D   YK+GVY+H+ G  +G HAV+LI
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283

Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           GWG  + G  YW++AN +   WG  GYFK  RGSN  GIE  V+AGLP 
Sbjct: 284 GWG-KERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  176 bits (446), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 107/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171

Query: 74  VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  +  
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +IGWG  + G+ YW++AN WN  WG  G F++ RG +EC IE  VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  176 bits (446), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 129/218 (59%), Gaps = 20/218 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC + CG G
Sbjct: 38  WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLVSCC-WTCGFG 96

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
           C+GG+P +AW Y+   G+V+    PY  + GC              +   C+    TPKC
Sbjct: 97  CNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPKC 154

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V+KC    ++ +    H   SAY +++D + I  EIY NGPVE +FTVYEDF  Y++GVY
Sbjct: 155 VKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 214

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
           KH+ G  +GGHA++++GWG  +    YW++AN WN  W
Sbjct: 215 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDW 252


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  176 bits (446), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 110 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 169 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 228

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  +I   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 229 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 287 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score =  176 bits (446), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 99/231 (42%), Positives = 126/231 (54%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR C+  G   N  LS   L  CC + CG GC GG PI AW+Y
Sbjct: 108 QGNCGSCWAFGTTGAFADRLCVATGGGFNEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKY 166

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F  HG+ T       E C PY     +D  G      +P     KC R C   + +    
Sbjct: 167 FKRHGITTGGDYGSNEGCAPYKVPPCYDDQGEFLCQGKPTEHNHKCPRACYGNSTV---E 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 179
             Y + +  +    + I  +I K GPVE SF VY+DF  YKSG+Y+       +GGH+VK
Sbjct: 224 NRYKVKSIYVLDSSKTIEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVK 283

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           LIGWG  +DG  YW+L N W++ WG  G F+I +G NECGIE    AG+PS
Sbjct: 284 LIGWG-EEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGVPS 333


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  176 bits (445), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  +I   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  176 bits (445), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 110 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 169 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 228

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  +I   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 229 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 287 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 100/227 (44%), Positives = 127/227 (55%), Gaps = 17/227 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WAFGAVE++SDR CIH     N SLS  DLL+CC   CG GC  G+   AW +
Sbjct: 73  QSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLLSCCED-CGLGCGAGFHPMAWDF 131

Query: 73  FVHHGVVT----EE---CDPY-FDSTGCSHPGCEPA-----YPTPKCVRKCVKKNQLWRN 119
           +  HG+VT    EE   C  + F   G    G  P      YPTP+C+++C +    +  
Sbjct: 132 WKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEPEVNYEK 191

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  +  +Y +      IM EI  NGPVE SF +Y DF  Y  GVY H  G  +  HA++
Sbjct: 192 DKTRANISYNVYPSDISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIR 251

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           ++GWG  DDG  YW++AN WN  WG  GY +  RG NECGIEE+V A
Sbjct: 252 ILGWG-EDDGVPYWLIANSWNEDWGEKGYVRFLRGHNECGIEEEVTA 297



 Score =  163 bits (412), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 105/294 (35%), Positives = 133/294 (45%), Gaps = 78/294 (26%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DL++CC   CG GC GGY   AW +
Sbjct: 661 QSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDF 719

Query: 73  FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
           +  HG+VT         TGC     P CE              YPTP+C+++C  K   +
Sbjct: 720 WKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEIDY 777

Query: 118 RNSK----------------------------------------HYSIS----------- 126
              K                                        H+SI            
Sbjct: 778 EKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHSIDLLSSRLEKAVL 837

Query: 127 ------AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
                 +Y +    + +M EI   GPV     VYED   YKSGVY H+ G  +G H +++
Sbjct: 838 RSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRI 897

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           +GWG  +DG  YW++AN WN  WG  GY ++ R  NECGI + V AGLP   N 
Sbjct: 898 LGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDLSNF 950


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GG PI AW  
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGNPIKAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 166 FQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  ++   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 87/196 (44%), Positives = 125/196 (63%), Gaps = 14/196 (7%)

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPT 103
           +CGDGC+GGYP  AW ++   G+V+         C PY           S P C     T
Sbjct: 1   MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDT 60

Query: 104 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
           PKC + C    +  ++  KHY  ++Y +++  + IMAEIYKNGPVE +F+VY DF  YKS
Sbjct: 61  PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKS 120

Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
           GVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE 
Sbjct: 121 GVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIES 179

Query: 223 DVVAGLPSSKNLVKEI 238
           +VVAG+P +    ++I
Sbjct: 180 EVVAGIPRTDQYWEKI 195


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 100/242 (41%), Positives = 138/242 (57%), Gaps = 21/242 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWA  A  A++DR+C+             DLL+CC   CG GC GG    AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
           +V  G+ +       + C PY     C  PG +    TPKC  KC        +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
               AY + +D   IM EI+ NGPV+ +F  Y D   YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 IGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
           WG  ++G  YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP   N  ++  +A 
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377

Query: 243 MF 244
            F
Sbjct: 378 YF 379


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 125/212 (58%), Gaps = 16/212 (7%)

Query: 18  CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
           CGSCWA  A    SDR CI  G  +  +LS   L  CC + CG+GCDGG P +AW +F+ 
Sbjct: 1   CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMR 59

Query: 76  HGVVT-------EECDPY-FDSTGCSHPGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHY 123
           HG+VT       + C PY     G     C +    TP C +R C   N  + +R   HY
Sbjct: 60  HGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHY 119

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
             + Y ++   EDIM +IYKNGPV+ +F VY DF +YKSGVY +  G + GGHA+K++GW
Sbjct: 120 VDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGW 179

Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGS 215
           G  DD   YW+ AN W+RSWG +G F+I RG+
Sbjct: 180 GV-DDNTKYWLCANSWSRSWGENGLFRILRGN 210


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 90/234 (38%), Positives = 134/234 (57%), Gaps = 22/234 (9%)

Query: 17  HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG--FLCGDGCDGGYPISAWRY 72
            C + WAF A E++SDR CI+ G   N  LS  +LL+CC   F CG+GC+GG P  AW+Y
Sbjct: 100 ECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQY 159

Query: 73  FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
              HG+ T         C PY       + G  ++P C     PTP C +KC  +     
Sbjct: 160 IQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPI 219

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                +HY +S  ++ +   +I +++  NGP++ +F VY+DF  Y +G+Y H+TG+  G 
Sbjct: 220 DIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGH 279

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +V++IGWG    G  YW+ AN W R WG +G F++ RG+NECG+E + V+G+P
Sbjct: 280 LSVRIIGWGVW-QGVPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMP 332


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171

Query: 74  VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  +  
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIR 291

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +IGWG  + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 100/232 (43%), Positives = 127/232 (54%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA G   A +DR CI      N  +S  +L  CC   CG GC+GG P+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCIATNGDFNELISAEELTFCC-HRCGFGCNGGNPLKAWQY 164

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HGVVT       + C PY       D  G +    +P  P  KC R C         
Sbjct: 165 FKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYK 224

Query: 120 SKHYSI-SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHA 177
             HY   +AY +N D   +  +    GP+E SF VY+DF +Y+SGVY+       +GGHA
Sbjct: 225 KGHYKTKNAYYLNIDT--MQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VK+IGWG  +DG  YW++ N W   WGA+G FKI RG+NECGIE    AG+P
Sbjct: 283 VKMIGWG-EEDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIEGSPTAGVP 333


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAFGAVEA++DR CI  G   S  ++ L L  C   CG GC GG+P  AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171

Query: 74  VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           V  G+VT   EE    C PY F      T   +P C    Y TP+C + C K  +  +  
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY    Y + S+ + I  EI   GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIR 291

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +IGWG  + G+ YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGS WA  AV A+SDR CI  G   ++ LS  DL++CC + CG GCDGG+   +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
           +V  G+VT         C PY     C H        C +  Y TP+C + C K  N  +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           V+LIG G  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 290 VRLIGCGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 97/235 (41%), Positives = 132/235 (56%), Gaps = 15/235 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR C   G+  L +S   L++CC   CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCED-CGDGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP ++W Y+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    +Y ++ + +D   E+Y NGP  V F VY DF  YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG 276

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+    RG+NECGIE    AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 97/236 (41%), Positives = 130/236 (55%), Gaps = 18/236 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q +CGSCWA  A + +SDR CIH      + LS  D+LACCG  CG GCDGGY 
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171

Query: 67  ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYPTPKCVRKCVKK-- 113
             AW++    GVVT         C PY      +H G      P++P     RK   +  
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARKPYCQYG 231

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             + + N K  + + Y + +D   I  EI + GPV  +F +YEDF HY  GVY H  G +
Sbjct: 232 YGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAM 291

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
            GGH++K+IGWG  D G  YW++AN W+  WG D GYF++ RG N C IE  V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/189 (47%), Positives = 119/189 (62%), Gaps = 17/189 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL+CC   CG+GC+GGYP  AW +
Sbjct: 17  QGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLECGNGCNGGYPSGAWEF 76

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
           + + G+V+         C PY  S  C H      P C     TP+C R+C    +  + 
Sbjct: 77  WTNDGLVSGGLYYSHIGCRPYSISP-CEHHVNGSRPKCSGEIETPRCSRRCEAGYSPKYS 135

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY +++Y I SD  +IM EIYKNGPVE +  V++DF  YKSGVY+H TG  +GGHA+
Sbjct: 136 EDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSGVYQHKTGGSIGGHAI 195

Query: 179 KLIGWGTSD 187
           K++GWG  +
Sbjct: 196 KILGWGEEN 204


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 130/235 (55%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y
Sbjct: 79  QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 137

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V HG+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 138 WVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 195

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y + S    I  +I  +G VE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 196 YEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 255

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 256 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 128/232 (55%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWAFG   A +DR C+      N  LS  +L  CC   CG GC+GGYPI AW+Y
Sbjct: 109 QGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELTFCC-HACGHGCNGGYPIKAWKY 167

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       + C+PY       +  G S    +P     +C R C     L  +
Sbjct: 168 FSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYD 227

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
             H ++   Y +      I  ++   GP+E SF VY+DF  YKSGVY+       +GGHA
Sbjct: 228 DDHRFTRDFYYLTYG--SIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHA 285

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  ++G  YW++ N WN  WG +G FKI+RG++EC I+    AG+P
Sbjct: 286 VKLIGWGV-EEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 336


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score =  174 bits (441), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 100/232 (43%), Positives = 131/232 (56%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L  CC   CG+GC+GGYPI AWRY
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNELLSPEELAFCCK-DCGNGCEGGYPIKAWRY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     ++  G +  G +P     +C + C  K       
Sbjct: 166 FRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNTCGGKPMERNHQCPKTCYGKTT--DQK 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
           ++ + S Y INS  + I  +I   GPVE SF VY+DF+ YKSG+Y+         GH+VK
Sbjct: 224 RYKTKSEYVINS-IKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG  ++G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 283 IIGWG-QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIPSS 333


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 91/215 (42%), Positives = 128/215 (59%), Gaps = 20/215 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC + CG G
Sbjct: 36  WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFG 94

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
           C+GG+P +AW Y+   G+V+    PY  + GC              +   C+    TP C
Sbjct: 95  CNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTC 152

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V+KC +  ++ +    H+  SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVY
Sbjct: 153 VKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 212

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 200
           KH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 213 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 123/195 (63%), Gaps = 16/195 (8%)

Query: 18  CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
           CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL CCG  CGDGC+GGYP  AW ++  
Sbjct: 1   CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60

Query: 76  HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKH 122
            G+V+         C PY           S P       TP+C + C    +  ++  KH
Sbjct: 61  KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKH 120

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           +  ++Y +++  ++IMAEIYKNGPVE +FTV+ DF  YKSGVYKH  GD+MGGHA++++G
Sbjct: 121 FGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILG 180

Query: 183 WGTSDDGEDYWILAN 197
           WG  ++G  YW+ AN
Sbjct: 181 WGV-ENGVPYWLAAN 194


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y
Sbjct: 112 QSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V HG+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG   S
Sbjct: 289 AVRLIGWGV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 100/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C + C     L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  ++   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECG +     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 93/193 (48%), Positives = 121/193 (62%), Gaps = 18/193 (9%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           ++ +  QG CGSCWAFGAVEA+SDR CIH    +N+ +S  DLL+CCG  CG GC+GGYP
Sbjct: 29  IQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLLSCCGMECGFGCNGGYP 88

Query: 67  ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAY-PTPKCVRKC-V 111
             AW ++   G+V+         C PY     C H      P C      TPKCV +C  
Sbjct: 89  SGAWNFWTETGLVSGGLFKSHIGCRPYTIPP-CEHHVNGSRPSCTGEEGDTPKCVMQCEA 147

Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
                +   KH+  ++Y ++S+  DI  EIYKNGPVE +FTVYEDF  YKSGVYKH+TGD
Sbjct: 148 GYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGVYKHVTGD 207

Query: 172 VMGGHAVKLIGWG 184
            +GGHA++++GWG
Sbjct: 208 AVGGHAIRILGWG 220


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 86/187 (45%), Positives = 119/187 (63%), Gaps = 16/187 (8%)

Query: 57  CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 103
           CG GC+GGYP +AW+++    +VT       + C PY+    C H      P C    PT
Sbjct: 3   CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61

Query: 104 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
           P+C + C +  Q  +   KH+    Y I+SD   I  EIYKNGPVE  F+VY DF  YKS
Sbjct: 62  PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121

Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
           GVY+  + +++GGHA++++GWGT +DG  YW++AN WN  WG  GYFKI+RG++ECGIE+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIED 180

Query: 223 DVVAGLP 229
           D+ AG+P
Sbjct: 181 DINAGIP 187


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 100/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 107 QGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C + C     L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFK 225

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
              HY+  AY +      I  ++   GP+E SF VY+DF  YKSGVY  +     +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW+L N WN  WG  G FKI+RG+NECG +     G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 99/231 (42%), Positives = 123/231 (53%), Gaps = 21/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  +++DR+C IH    L +S  DLLACCG  CG GC GG P  AW YF
Sbjct: 112 QSSCGSCWAVAAATSMTDRYCTIHGVRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYF 170

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKH 122
              G+ +  C PY     CSH      YP        TP C   C       + +R  K 
Sbjct: 171 SSEGIASGRCQPY-PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKS 229

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           YS+S        ED   E+Y  GP +  F V+ D   YK GVYKH+ G  +G HAV+++G
Sbjct: 230 YSLSG------EEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVG 283

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           WG +  G  YW +AN WN  WG  GYF + RG NECGIE+   AG+P+  N
Sbjct: 284 WG-NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 130/229 (56%), Gaps = 21/229 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  + EA+SD  C+     + + +S  D+L+CCG  CG GC GG+PI A+R+
Sbjct: 110 QSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDILSCCGLDCGYGCQGGWPIEAYRW 169

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKK-NQL 116
               GVVT       + C PY     C      P Y        PTPKC +   +K N+ 
Sbjct: 170 MQRDGVVTGGKYRQRDVCKPY-SFYPCGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKT 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH++  +Y + ++   I  EIYKNGPV  +F VYED++    G+Y H  G   G H
Sbjct: 229 YQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAH 287

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           A K+IGWG  ++G DYW++AN WN  WG DGY++I R ++ C IE  +V
Sbjct: 288 ADKVIGWG-RENGTDYWLIANSWNTDWGEDGYYRIVRETDNCEIERQMV 335


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 135/234 (57%), Gaps = 22/234 (9%)

Query: 17  HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
            C S WAF A E++SDR CI+ G  +N  LS  +LL+CC G L CG+GC GG    AW+Y
Sbjct: 52  ECKSSWAFAAAESMSDRLCINSGGTINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQY 111

Query: 73  FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
           +  HG+ T         C PY  +         ++P C     PTP C +KC  KN    
Sbjct: 112 WGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 171

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                +HY  S  ++ +   +I +++  NGP+E +F VY+DF  Y +G+Y H+TG+  G 
Sbjct: 172 DIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGH 231

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +V+++GWG   +G  YW+LAN W + WG +G F+  RG+NECG+E + V+G+P
Sbjct: 232 LSVRILGWGMY-EGVPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSGMP 284


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 95/247 (38%), Positives = 132/247 (53%), Gaps = 19/247 (7%)

Query: 3   FTNSEH------VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGF 55
           F  +EH      +  +  Q  C + WA     A+SDR+C +  G  L +S  DL+ACC  
Sbjct: 95  FDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQLRISAADLMACCK- 153

Query: 56  LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVR 108
            CG GC+GGYP +AW Y+V HG+ + +C PY         + G   P  +  + TP+C  
Sbjct: 154 DCGGGCEGGYPDAAWEYYVSHGIASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNA 213

Query: 109 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
            C  K       K+    +Y +  + ED   E+Y NGP  V F V+ DF  YK+GVY+H+
Sbjct: 214 TCTDKTIPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHV 270

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            G+ +GG AV+++GWG   +G  YW +AN W+  WG +GYF I RG NEC IE    AG 
Sbjct: 271 AGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGT 329

Query: 229 PSSKNLV 235
           P    L 
Sbjct: 330 PDPSQLT 336


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  173 bits (438), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 106/249 (42%), Positives = 131/249 (52%), Gaps = 33/249 (13%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLC----GDGCD 62
           +E++  QG+CGSCWA  A   +SDR CI  G      +S  DLL+CCG  C      GCD
Sbjct: 87  IELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLLSCCGINCELDGNGGCD 146

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAY-----PT 103
           GGYP  AW+Y    G+VT         C PY     CSH         CE  +      T
Sbjct: 147 GGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-SFPPCSHGNDSGKYSKCENDFFMLTEVT 205

Query: 104 PKCVRKCVKKNQLWRNSKHYSISA----YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
           P C +KC    Q  R      I +    Y++  D E I  EIY NGPV+  FTV++DF +
Sbjct: 206 PSCTKKC--HPQFSRTYDVDKIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLN 263

Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
           YKSGVY+  TG   G HAVK+IGWGT ++G  YW   N WN  WG +G FKI RG N   
Sbjct: 264 YKSGVYQQTTGQRRGKHAVKIIGWGT-ENGVPYWEAINSWNDGWGINGKFKILRGFNHLD 322

Query: 220 IEEDVVAGL 228
           IE +V A +
Sbjct: 323 IEGEVYASI 331


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  173 bits (438), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y
Sbjct: 112 QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V HG+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGEFSYNVIGVESVIQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG   S
Sbjct: 289 AVRLIGWGV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C S WA  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG    +W Y
Sbjct: 112 QSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGVTGYSWDY 170

Query: 73  FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
           +V HG+VT       + TGC     P C+              Y TP+C + C K  N  
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   KHY   +Y +      I  EI   GPVE    +YEDF +YKSG+Y++ TG  + GH
Sbjct: 229 YEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           AV+LIGWG  ++G  YW+ AN WN  WG  GYF+I RG +EC IE  +VAG   S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 95/235 (40%), Positives = 139/235 (59%), Gaps = 20/235 (8%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N + +  +  QG CGSCWAF ++E++SDR CIH         S  DLL+CC   CGD C 
Sbjct: 95  NCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCT-SCGD-CG 152

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-N 114
           GGY +SA  ++++ G+V+       E C PY   T  +H   +    TP C + C    +
Sbjct: 153 GGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQGQ----TPACTKSCRNGYS 205

Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
             +   KHY  + Y ++S  + I  E+  NGP+ V+F V++DF +Y SGVY+H++G+ +G
Sbjct: 206 TSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVG 265

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            H VK++GWG  ++G  YW++AN W  SWG  G+FK+ RG NECGIE    A +P
Sbjct: 266 FHVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAVMP 319


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  172 bits (437), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 88/187 (47%), Positives = 115/187 (61%), Gaps = 16/187 (8%)

Query: 57  CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 103
           C   C+GG+P SAW Y+   G+VT       + C PY     C H        C+   PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPY-QIKSCDHHVNGTKGPCQGEGPT 227

Query: 104 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
           P+C  KC    +  +   KHY++S   I+++PE    EI  NGPVE  FTVYEDF  YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287

Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
           GVY+H TG V+GGHA+K++GWG  ++G  YW++AN WN  WG +G+FKI RGSNECGIE 
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIES 346

Query: 223 DVVAGLP 229
           D+  G+P
Sbjct: 347 DINFGIP 353


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score =  172 bits (437), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 98/232 (42%), Positives = 131/232 (56%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L  CC   CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCC-MDCGKGCGGGYPIKAWKY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     +D  G +  G +P     +C + C  K  +    
Sbjct: 166 FRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCPKTCYGKTTV--QD 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
           ++ + + Y INS  E I  ++   GPVE SF VY+DF+ YKSG+Y+        GGH++K
Sbjct: 224 RYKTKNEYVINS-IETIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG  ++G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PS+
Sbjct: 283 IIGWG-EENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPST 333


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  172 bits (437), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 89/186 (47%), Positives = 112/186 (60%), Gaps = 18/186 (9%)

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 106
           C+GGYPI AW+++V HG+VT         C PY  +       G + P C E   PTPKC
Sbjct: 14  CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73

Query: 107 VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
           V  C   N     +   KH+  +AY +    E I  EI  +GP+EV+FTVYEDF  Y +G
Sbjct: 74  VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133

Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           VY H  G  +GGHAVK++GWG  D+G  YW++AN WN +WG  GYF+I RG NECGIE  
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHS 192

Query: 224 VVAGLP 229
            VAGLP
Sbjct: 193 AVAGLP 198


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/231 (42%), Positives = 122/231 (52%), Gaps = 21/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  +++DR+C IH    L +S  DLLACCG  CG GC GG P  AW YF
Sbjct: 112 QSSCGSCWAVAAATSMTDRYCTIHGVRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYF 170

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKH 122
              G+ +  C PY     CSH      YP        TP C   C       + +R  K 
Sbjct: 171 SSEGIASGRCQPY-PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKS 229

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           YS S        ED   E+Y  GP +  F V+ D   YK GVYKH+ G  +G HAV+++G
Sbjct: 230 YSFSG------EEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVG 283

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           WG +  G  YW +AN WN  WG  GYF + RG NECGIE+   AG+P+  N
Sbjct: 284 WG-NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/232 (42%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 105 QGKCGSCWAFGTSSAFADRLCIATNGEFNELLSAEELTFCC-HKCGFGCHGGYPIKAWER 163

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C     L ++
Sbjct: 164 FQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
              H++  AY +      I  ++   GP+E S+ VY+DF  YKSGVY +      +GGHA
Sbjct: 224 KDHHFTRDAYYLTFGI--IQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHA 281

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 282 VKLIGWG-EEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGIDNSTTGGVP 332


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 98/238 (41%), Positives = 131/238 (55%), Gaps = 17/238 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
           N   +  +  QG+CGSCWAF    A +DR CI  +   N  LS   + +CC + CG GC 
Sbjct: 96  NCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVTSCC-YRCGLGCQ 154

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKN 114
           GGYPI AWRY+  HG+VT       E C PY       +  C   +    KC +KC    
Sbjct: 155 GGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQSEKNHKCQKKCFGNT 214

Query: 115 QL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 171
            + +R  + Y   S Y +  D  ++  +I   GP+E SF VY+DF  YKSGVY K     
Sbjct: 215 SISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT 272

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +GGH+VK IGWG  +    YW++ N WN +WG  GYFKI+RG+NEC +E+   AG+P
Sbjct: 273 YLGGHSVKCIGWGV-ERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAGVP 329


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 125/231 (54%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR C+  G   N  LS   L  CC + CG GC GG PI AW+Y
Sbjct: 108 QGNCGSCWAFGTTGAFADRLCVATGGGFNEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKY 166

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   G+ T       E C PY     +D  G      +P     KC R C   + +    
Sbjct: 167 FKRRGITTGGDYGSNEGCAPYKVPPCYDDQGEFLCQGKPTEHNHKCPRACYGNSTV---E 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 179
             Y + +  +    + I  +I   GPVE SF VY+DF  YKSG+Y+     + +GGH+VK
Sbjct: 224 NRYKVESIYVLDSFKTIEQDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVK 283

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           LIGWG  +DG  YW+L N W++ WG  G F+I +G NECGIE    AG+PS
Sbjct: 284 LIGWG-EEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIPS 333


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 132/231 (57%), Gaps = 29/231 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH    +N  LS +DL++CC  +CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCC-HICGFGCNGGFPGAAWSY 166

Query: 73  FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
           +   G+V       T+ C PY +   C H      P C     TP C  KC     + + 
Sbjct: 167 WTRKGIVSGGPYGSTQGCRPY-EIAPCEHHVNGTRPPCSHG-STPSCQHKCQASYSVEYA 224

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K++   +Y +  +  +I  EI  NGPVE +FTVYED   YKSGVY+H  G  +GGHA+
Sbjct: 225 KDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 284

Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +++GWG   + +  YW++ N WN  WG +         + CGIE  + AGL
Sbjct: 285 RILGWGVWGESKVPYWLIGNSWNTDWGDN---------DHCGIESSISAGL 326


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 123/232 (53%), Gaps = 22/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  C SCWAFG VE  +DR CI   G N + LS  D+L CC   CG  C GGY   AW Y
Sbjct: 91  QSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLECCKD-CGFQCQGGYSAMAWEY 149

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLW 117
               GVVT       E C  Y     CSH G E  YP         PKC   C +   + 
Sbjct: 150 LRRTGVVTGGQYNSTEWCKSY-PFPPCSH-GIEGQYPQCSTKPPVVPKCETTCQEGYPIE 207

Query: 118 RNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
                Y  S  Y++ ++ + I  EI +NGPV+ SF VYEDF  YKSG+Y H+ G  M  H
Sbjct: 208 YEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLH 267

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            VK+IGWG  ++GE YW   N WN  WG +G F+I+ G+NEC IE  V  GL
Sbjct: 268 TVKIIGWG-EENGEAYWKAVNSWNSEWGENGLFRIRLGTNECTIESQVEGGL 318


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 91/231 (39%), Positives = 130/231 (56%), Gaps = 7/231 (3%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           E +  +  +G CGSCWAF AVE +SDR C+          S  ++++CC   CG GC GG
Sbjct: 98  ESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCT-ACGGGCRGG 156

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHY 123
           +    ++Y+V +G+ +     Y    GC       +  TP+C + CV    + W     +
Sbjct: 157 FLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAVSGETPQCQKACVSGYEKSWEKDLRH 214

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
           + SAY++N     I  EI  NGPV     VYEDF  Y +G+Y+H +G  +GGHAVK+IGW
Sbjct: 215 ATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGW 274

Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           G+ +D   YWI AN W   +G DG+F+I RGSN  GIE  +VAG P++  +
Sbjct: 275 GSEND-VPYWIAANSWGTGFGEDGFFRILRGSNCAGIESYIVAGYPNTSEV 324


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/213 (43%), Positives = 124/213 (58%), Gaps = 18/213 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAF   E+L DRF I       LS  DL++C     G  C+GGY  ++W + +
Sbjct: 84  QEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDLISCDSNDLG--CNGGYQENSWTWVL 141

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
             G+ TE C PY   +G            P C  +CV  + L RN+    I+ YR   D 
Sbjct: 142 TTGITTESCWPYRSGSG----------RIPSCPHRCVNGSVLQRNT----INNYR-RLDS 186

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
            ++  E+Y NGP++V++ VYEDF +Y  G+YKH++G+ +GGHAV L+GWG  +DG  YW+
Sbjct: 187 SELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGI-EDGVKYWL 245

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + N W   WG  GYF+I RGSNECGIE    AG
Sbjct: 246 VQNSWGYEWGEQGYFRILRGSNECGIESSAYAG 278


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/235 (40%), Positives = 131/235 (55%), Gaps = 15/235 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  CGSCWA     A+SDR C   G+  L +S   L++CC   CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCED-CGYGCDG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
           GYP ++W Y+V HG+ +  C PY     C H G +   P        TPKC   C  K  
Sbjct: 161 GYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    +Y ++ + +D   E+Y NGP  V F VY DF  YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG 276

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           HAV+++GWG   +G  YW +AN W+  WG +G+    RG+NECGIE    AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 123/221 (55%), Gaps = 11/221 (4%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSC+      A++DR+CIH G     +    D LACC       CDGGY    W+Y
Sbjct: 103 QGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFGATDYLACCTDCFK--CDGGYVGKTWQY 160

Query: 73  FVHHGVVTEECDPYFDSTGC-SHPGCEPAY--PTPKCVRKCVKKNQL-WRNSKHYSISAY 128
           +V  G+ +E   PY    GC S+P        P P C R C     L +     Y  SAY
Sbjct: 161 WVDSGLTSE--GPYKSGQGCNSYPFGSYCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAY 218

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           R+  +   IM EIY+NGPV V F V+ DF  YKSGVY+H+TG   G HAV++IGWG  ++
Sbjct: 219 RVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGV-EN 277

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G  YW++AN W   WG  G+FK  RG N  GIE+ V AGLP
Sbjct: 278 GVKYWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLP 318


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 96/233 (41%), Positives = 127/233 (54%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA G   A +DR C+      N  +S  +L  CC   CG GC+GGYP+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCGFGCNGGYPLKAWQY 164

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HGVVT       + C PY       D  G +    +P     KC +KC   + +   
Sbjct: 165 FKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYK 224

Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
             HY    AY + +        +Y  GP+E SF VY+DF +Y+SGVY+       +GGHA
Sbjct: 225 KNHYKTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           VK+IGWG  ++G  YW++ N W   WG  G FKI RG++ECGIE    AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPS 334


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 131/232 (56%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L  CC   CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCCK-DCGQGCGGGYPIKAWKY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     ++  G +  G +P     +C + C  K  +   +
Sbjct: 166 FRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
           ++ + S Y INS  + I  ++   GPVE SF VY+DF+ YKSG+Y+        G H++K
Sbjct: 224 RYKTKSEYSINS-IKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG  ++G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 283 IIGWG-QENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  171 bits (432), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 137/232 (59%), Gaps = 17/232 (7%)

Query: 11  ILVIQGH--CGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYP 66
           I +I+ H  CGSCWA  A   +SDR CI   G N   LS  D+LACCG  CG GC+GGYP
Sbjct: 104 IGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYP 163

Query: 67  ISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPGC--EPAYPTPKCVRKCVKKNQL 116
           I A+ Y  + GV +         C PY F     ++  C  E A+ TPKC + C  +  +
Sbjct: 164 IQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYGPCPKEGAFDTPKCRKICQFRYPV 223

Query: 117 -WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
            +   K +  +++ +  D E  I  EI+ NGPV  +F V+EDF HYK G+YK   G  +G
Sbjct: 224 PYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIG 283

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            HA+KLIGWGT ++G DYW++AN +N  WG +G F+I RG+N C IE  V+A
Sbjct: 284 VHAIKLIGWGT-ENGTDYWLVANSYNYDWGENGTFRILRGTNHCLIESQVIA 334


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/215 (41%), Positives = 125/215 (58%), Gaps = 20/215 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC + CG G
Sbjct: 34  WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-WTCGFG 92

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
           C+GG+P +AW Y+   G+V+    PY    GC              +   C+    TP C
Sbjct: 93  CNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPAC 150

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V+KC    ++ +    H   SAY + +D + I  EIY NGPVE +FTVYEDF  Y++GVY
Sbjct: 151 VKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 210

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 200
           KH+ G  +GGHA++++GWG  +    YW++AN WN
Sbjct: 211 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 245


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 102/246 (41%), Positives = 135/246 (54%), Gaps = 35/246 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPISA 69
           Q  CGSCWAFG VEA + R CI  G  +N  LS  D+LACC    F    GC GG PI++
Sbjct: 123 QSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAADMLACCNIGHFCLSFGCSGGNPITS 182

Query: 70  WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
           W +   +G+V+             + C PY +   C+H        P  +  Y TP C  
Sbjct: 183 WTFLHTNGIVSGGGFVPEKNMKAADGCWPY-NFPKCAHHQKESDYKPCAKEIYDTPSCSS 241

Query: 109 KC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
            C   K    +   +HY+ S +  R  S    I  EI  NGP   +F+VYEDF  YKSGV
Sbjct: 242 SCPNAKYGTAFDKDRHYTESLFPSRFGST-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGV 300

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           YKH +G  +GGHAV++IGWGT + G DYW++ N WN  WG  G FKI +G  +CGI++ +
Sbjct: 301 YKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDMI 357

Query: 225 VAGLPS 230
           +AG P+
Sbjct: 358 LAGTPA 363


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 105/230 (45%), Positives = 121/230 (52%), Gaps = 32/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
           Q  CGSCWAF A E LSDRF I  +  +N  LS  DL++C     GD GC GGY   AW 
Sbjct: 114 QARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLVSCDK---GDMGCQGGYLDKAWD 170

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y   +G+VTE C PY    G +          P C   CV         K Y  S Y   
Sbjct: 171 YLKTNGIVTESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQL 216

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS---- 186
           +  EDIM EIY NGPVE  F VY  F  YKSGVY H   D+M GGHA+K++GWG      
Sbjct: 217 TTEEDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKR 276

Query: 187 --DDGEDYWILANQWNRSWGADGYFKIKRGSN-----ECGIEEDVVAGLP 229
                  YWI AN W   WG +G+FKI+RG N     ECGIE+ V AG P
Sbjct: 277 FWQKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHP 326


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  170 bits (431), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 97/242 (40%), Positives = 131/242 (54%), Gaps = 21/242 (8%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
           N + +  +  QG+CGSCWA     A +DR C+  +   N  LS  +L  CC   CG GC+
Sbjct: 100 NCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDFNQLLSAEELTFCC-HKCGFGCN 158

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRK 109
           GGYPI AW +F  HG+VT       E C+PY      +D +G +    +P     +C R 
Sbjct: 159 GGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPYDESGNNTCAGKPMEANHRCTRM 218

Query: 110 CVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KH 167
           C     L  +  H Y+  +Y +      I  ++   GPVE SF VY+DF  YKSGVY + 
Sbjct: 219 CYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRS 276

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
                +GGHA KLIGWG  + G  YW++ N WN  WG +G FKI+RG+NECGI+     G
Sbjct: 277 ENASYLGGHAAKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGIDNSTTGG 335

Query: 228 LP 229
           +P
Sbjct: 336 VP 337


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 123/230 (53%), Gaps = 14/230 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
           V +G+ +  C PY     C H G +          + TPKC   C  K+      K+   
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGN 228

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD +GG AVK++GWG 
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGK 288

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
             +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 289 L-NGTPYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 337


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 126/225 (56%), Gaps = 54/225 (24%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCWAFGAVEA+SDR C        + VN                           
Sbjct: 102 QGSCGSCWAFGAVEAISDRIC--------IHVNG-------------------------- 127

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 133
                             S P C     TPKC + C    +  ++  KHY  ++Y +++ 
Sbjct: 128 ------------------SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 169

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
            +DIMAEIYKNGPVE +F+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW
Sbjct: 170 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 228

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
           ++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 229 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 273


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 98/240 (40%), Positives = 129/240 (53%), Gaps = 19/240 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
           N   +  +  Q +CGSCWA  A   +SDR CI     +    S  D+L+CC + CG GCD
Sbjct: 101 NCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDILSCC-WNCGMGCD 159

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVR 108
           GG P +A+ + + +GV T         C PY       H       P  +  +PTPKC +
Sbjct: 160 GGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRK 219

Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C +K N  +++ K Y   AY + ++   IM EI+ NGPV  SF+V+ DFA YK GVY  
Sbjct: 220 MCQLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVS 279

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
                 G HAVK+IGWG  D G  YW++AN WN  WG +GY +  RG N CGIE  VV G
Sbjct: 280 NGIQQNGAHAVKIIGWGVQD-GLKYWLIANSWNNDWGDEGYVRFLRGDNHCGIESRVVTG 338


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 122/192 (63%), Gaps = 16/192 (8%)

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
           C+GGYP  AW ++   G+V+         C PY     C H      P C     TPKC 
Sbjct: 87  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 145

Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F+VY DF  YKSGVY+
Sbjct: 146 KICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 205

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG + CGIE +VVA
Sbjct: 206 HVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 264

Query: 227 GLPSSKNLVKEI 238
           G+P +    ++I
Sbjct: 265 GIPRTDQYWEKI 276


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
           V +G+ +  C PY     C H G +          + TPKC   C  K       K+   
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           + Y +    ED   E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG 
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
             +G  YW LAN W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
           V +G+ +  C PY     C H G +          + TPKC   C  K       K+   
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           + Y +    ED   E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG 
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
             +G  YW LAN W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CGDGC GG+P  AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
           V +G+ +  C PY     C H G +          + TPKC   C  K       K+   
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           + Y +    ED   E+Y NGP    F VY D   YKSGVY+H+ GD +GG AVK++GWG 
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
             +G  YW LAN W+  WG  GY  I RG+NEC IE    AG P +  L 
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  169 bits (429), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 125/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFG   A +DR CI      N  LS  +L  CC   CG GC GGYPI AW  
Sbjct: 105 QGKCGSCWAFGTSSAFADRLCIATDGDFNELLSAEELTFCC-HTCGYGCHGGYPIKAWER 163

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWR 118
           F  HG+VT       E C PY       D  G +    +PA    +C R C   +++ ++
Sbjct: 164 FKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFK 223

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
               ++  AY +      I  ++   GP+E S+ VY+DF  YKSGVY +      +GGHA
Sbjct: 224 EDHRFTRDAYYLTYGT--IQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHA 281

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG  G FKI+RG+NECGI+     G+P
Sbjct: 282 VKLIGWG-EEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  169 bits (428), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 99/238 (41%), Positives = 129/238 (54%), Gaps = 23/238 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR CI  +   N  +S  +L++CC + CG GC+GG+P +AW +
Sbjct: 114 QGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSY-CGFGCEGGFPDAAWVF 172

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCE--PAYPTPKCVRKCVKKNQL- 116
              HG+VT       + C PY     C H      P C   P  PTP C   C   + L 
Sbjct: 173 IKRHGLVTGGDYHSHDGCQPY-PIAPCEHHMEGSKPNCSASPTEPTPACETTCTHGSSLA 231

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGG 175
           ++  +    SAY +    +    EI+KNGP+  +F VYEDF  YKSGVYK H      G 
Sbjct: 232 YQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGR 291

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           HAVK+IGWG   +G  YW++ N W+  WG  G FKI RG NEC  E+ + AGLP  K 
Sbjct: 292 HAVKVIGWG-EQNGLPYWLVQNSWDYDWGDKGLFKIARG-NECDFEKSMTAGLPKYKK 347


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score =  169 bits (428), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 125/230 (54%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAFG   A +DR C+  G   N  LS  D+  CC   CG GC+GGYPI AW+Y
Sbjct: 107 QGNCGSCWAFGTTGAFADRLCVSTGGKFNELLSPEDVAFCCQ-NCGKGCEGGYPIKAWQY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     FD  G +    +P     +C + C     +    
Sbjct: 166 FRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNTCAGKPLERNHQCPKTCYGSTTV---Q 222

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 179
           K Y +    + + P  +  ++ K GP+E SF +++D + YKSG+Y K      + GH++K
Sbjct: 223 KRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +IGWG  ++G  YW+  N W++ WG  G F+I +G NECGIE    AG+P
Sbjct: 283 IIGWG-KENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 331


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  169 bits (428), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 97/238 (40%), Positives = 130/238 (54%), Gaps = 17/238 (7%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
           N   +  +  QG+CGSCWAF    A +DR CI  +   N  LS   + +CC + CG GC 
Sbjct: 96  NCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVTSCC-YRCGLGCQ 154

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKN 114
           GGYPI AWRY+  HG+VT       E C PY       +  C   +    KC +KC    
Sbjct: 155 GGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQSEKNHKCQKKCFGNT 214

Query: 115 QL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 171
            + +R  + Y   S Y +  D  ++  +I   GP+E SF VY+DF  YKSGVY K     
Sbjct: 215 SISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT 272

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +GGH+VK IGWG  +    YW++ N WN +WG  G FKI+RG+NEC +E+   AG+P
Sbjct: 273 YLGGHSVKCIGWGV-ERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAGMP 329


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 135/237 (56%), Gaps = 15/237 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N E +  +  QG CGSCWA  A   +SDR CIH    +N++L+  DL+ CC   CG+GC+
Sbjct: 99  NCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCC-VDCGNGCN 157

Query: 63  GGY-PISAWRYFVHHGVV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK 113
           GG+   ++++Y+V  G+V       T+ C PY     C +P  +     +PKC   C   
Sbjct: 158 GGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPY-PFKPCEYPFNDCHVEISPKCTHHCRDG 216

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
            ++ +   K +   AY +  D   I  EI  NGPVE  F VYED   YKSGVY+H+ G+ 
Sbjct: 217 VDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQ 276

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +G HAV++IGWG  D G  YW++AN +   WG  GYFK  RGSN  GIE  ++ GLP
Sbjct: 277 IGKHAVRIIGWG-RDGGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLP 332


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 135/232 (58%), Gaps = 20/232 (8%)

Query: 17  HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
            C S WAF A E++SDR CI+ G  ++  LS  +LL+CC G L CG+GC GG P+ AW+Y
Sbjct: 109 ECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCTGVLSCGEGCAGGNPLKAWQY 168

Query: 73  FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           +  HG+ T         C PY  +         ++P C     PTP C +KC     +  
Sbjct: 169 WQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTCEKKCKPGYPVDL 228

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              +HY +S  ++ +   +I +++  NGPVE +  +Y+DF  Y +G+Y H+ G+  G  +
Sbjct: 229 DKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLS 288

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V+++GWG   +G  YW+LAN W + WG +G F++ RG NECG+E + ++G+P
Sbjct: 289 VRILGWGMF-EGVPYWLLANSWGKEWGENGTFRVLRGVNECGLEANCISGMP 339


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/235 (38%), Positives = 135/235 (57%), Gaps = 23/235 (9%)

Query: 17  HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
            C S WAF A E++SDR CI+ G  +N  LS  +LL+CC G L CG+GC GG    AW+Y
Sbjct: 96  ECKSSWAFAAAESMSDRLCINSGGMINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQY 155

Query: 73  FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
           +  HG+ T         C PY  +         ++P C     PTP C +KC  KN    
Sbjct: 156 WGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 215

Query: 117 -WRNSKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
                +HY  S+  ++ +   +I +++  NGP+E +F VY+DF  Y +G+Y H+TG+  G
Sbjct: 216 DIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQG 275

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
             +V+++GWG   +G  YW+LAN W + WG +G F+  RG+NECG+E + V+ +P
Sbjct: 276 HLSVRILGWGMY-EGVPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSAMP 329


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 90/235 (38%), Positives = 124/235 (52%), Gaps = 12/235 (5%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
           +  +  Q  C + WA     A+SDR+C +  G  L +S  DL+ACC   CGDGC GG+P 
Sbjct: 106 IREIADQSECRASWAVSTASAISDRYCTVGGGKQLRISAADLMACCK-QCGDGCKGGFPG 164

Query: 68  SAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
            AW Y+V +G+ + +C PY         + G   P  +  + TPKC   C  K+      
Sbjct: 165 FAWLYYVEYGITSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--V 222

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD +GG AV++
Sbjct: 223 KYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRI 282

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           +GWG   +G  YW +AN W+  WG +GY  I RG+NEC IE     G P    L 
Sbjct: 283 VGWGKL-NGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 93/232 (40%), Positives = 128/232 (55%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+  +   N  LS  ++  CC   CG GC+GGYPI AW  
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCCS-SCGYGCNGGYPIKAWES 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F + G+VT       E C+PY      +D+ G +    +P     +C R C     L  N
Sbjct: 169 FNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYN 228

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H ++  +Y +      I  ++ + GP+E SF +Y+DF  YKSGVY +      +GGHA
Sbjct: 229 DDHRFTRDSYYLTY--SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG +G FKI+RG+NECGI+     G+P
Sbjct: 287 VKLIGWG-EEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/233 (40%), Positives = 126/233 (54%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA G   A +DR C+      N  +S  +L  CC   C  GC+GGYP+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCVFGCNGGYPLKAWQY 164

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HGVVT       + C PY       D  G +    +P     KC +KC   + +   
Sbjct: 165 FKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYK 224

Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
             HY    AY + +        +Y  GP+E SF VY+DF +Y+SGVY+       +GGHA
Sbjct: 225 KNHYKTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           VK+IGWG  ++G  YW++ N W   WG  G FKI RG++ECGIE    AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPS 334


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 96/232 (41%), Positives = 127/232 (54%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+  +   N  LS  ++  CC   CG GC+GGYPI AW  
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F   G+VT       E C+PY      +D+ G +    +P     +C R C     L  +
Sbjct: 169 FKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFD 228

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+  +Y +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHA
Sbjct: 229 EDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG +G FKI+RG+NECGI+    AG+P
Sbjct: 287 VKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 95/233 (40%), Positives = 125/233 (53%), Gaps = 21/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA G   A +DR CI      N  +S  +L  CC   CG GC+GG P+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELTFCC-HTCGFGCNGGNPLKAWKY 164

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HGVVT       + C PY       D  G +    +P     KC +KC     +   
Sbjct: 165 FKRHGVVTGGNYNTTDGCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYK 224

Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
             HY    AY +++        +Y  GP+E SF VY+DF  Y+SGVY+       +GGHA
Sbjct: 225 KNHYKTKDAYYLSNTTMQKDTMVY--GPIEASFDVYDDFTSYESGVYQKTENASYLGGHA 282

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           VK+IGWG  ++G  YW++ N W   WG  G FKI RG++ECG+E    AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVESSCTAGVPS 334


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 101/246 (41%), Positives = 134/246 (54%), Gaps = 35/246 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPISA 69
           Q  CGSCWAFG VEA + R CI  G  +N  LS  ++LACC    F    GC GG PI++
Sbjct: 2   QSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPITS 61

Query: 70  WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
           W +   +G+V+             + C PY     C+H        P  +  Y TP C  
Sbjct: 62  WTFLHTNGIVSGGGFVPEKNMKAADGCWPY-SFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120

Query: 109 KC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
            C   K    +   +HY+ S +  R  S    I  EI  NGP   +F+VYEDF  YKSGV
Sbjct: 121 SCPNAKYGTAFDKDRHYTESLFPSRFGST-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGV 179

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           YKH +G  +GGHAV++IGWGT + G DYW++ N WN  WG  G FKI +G  +CGI++ +
Sbjct: 180 YKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDTI 236

Query: 225 VAGLPS 230
           +AG P+
Sbjct: 237 LAGTPA 242


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 91/216 (42%), Positives = 127/216 (58%), Gaps = 19/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E LSDRFCI  G  +++ LS   +++C       GCDGGY  +AW +
Sbjct: 33  QEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSCDS--TDYGCDGGYLNNAWAF 90

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+ +++C PY    G               V  C  K Q   + K Y     +  +
Sbjct: 91  LAGTGIPSDKCAPYTSQNGD--------------VAACPSKCQDGSSVKLYKAKNPQQLN 136

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGED 191
           D   IM ++ +NGPV+ +F+VY DF  YKSGVY H++G ++GGHA+K++GWG  S   + 
Sbjct: 137 DIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKP 196

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           YWI+AN W  SWG +G+F I RGS+ECGIE++V +G
Sbjct: 197 YWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSG 232


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 96/232 (41%), Positives = 127/232 (54%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+  +   N  LS  ++  CC   CG GC+GGYPI AW  
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F   G+VT       E C+PY      +D+ G +    +P     +C R C     L  +
Sbjct: 169 FKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFD 228

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+  +Y +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHA
Sbjct: 229 EDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG +G FKI+RG+NECGI+    AG+P
Sbjct: 287 VKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 103/245 (42%), Positives = 134/245 (54%), Gaps = 31/245 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
           QG CGSCWAF + EAL+DRFCI  G     +LS     +CC  L     GC GG P  AW
Sbjct: 260 QGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAW 319

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC--- 110
           R+F + GVVT          + C PY +   C H      P CE   P  PKC + C   
Sbjct: 320 RWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEA 378

Query: 111 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
               K + +++  H++ SAY +    + I  E+ +NG +  +F VYEDF  YK GVY H+
Sbjct: 379 EYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 437

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           TG  MGGHAVK+IG+G ++DG DYW+  N WN  WG  G FKI+ G  E GI+++   G 
Sbjct: 438 TGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGE 494

Query: 229 PSSKN 233
           P   N
Sbjct: 495 PKVPN 499


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 103/245 (42%), Positives = 134/245 (54%), Gaps = 31/245 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
           QG CGSCWAF + EAL+DRFCI  G     +LS     +CC  L     GC GG P  AW
Sbjct: 260 QGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAW 319

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC--- 110
           R+F + GVVT          + C PY +   C H      P CE   P  PKC + C   
Sbjct: 320 RWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEA 378

Query: 111 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
               K + +++  H++ SAY +    + I  E+ +NG +  +F VYEDF  YK GVY H+
Sbjct: 379 EYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 437

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           TG  MGGHAVK+IG+G ++DG DYW+  N WN  WG  G FKI+ G  E GI+++   G 
Sbjct: 438 TGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGE 494

Query: 229 PSSKN 233
           P   N
Sbjct: 495 PKVPN 499


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 95/222 (42%), Positives = 121/222 (54%), Gaps = 13/222 (5%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR CI      +  LS  ++ AC  F    GC GG P SAW +
Sbjct: 165 QSACGSCWAFGVTEAFNDRLCIKSNGAFTELLSAGEMNACTLFF---GCGGGDPYSAWSW 221

Query: 73  FVHHGVVTEE-CDPYFDSTGCSHP--GCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISA 127
               G+ T E   P   S   + P    +  YPTP CV +C   K     R+ +H+ + +
Sbjct: 222 VHDKGIATGEGSRPKRVSESEAIPVIAYQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLES 281

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
              +    D    I  +GPV  SFTVYEDF  YKSGVYKH +G  +GGHAVK+IGWG   
Sbjct: 282 SPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EK 340

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            G+ YW+  N WN  WG  G FKI  G+  CGI++D++ G P
Sbjct: 341 SGQAYWLAVNSWNEDWGDKGLFKIALGN--CGIDDDLLGGTP 380


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 127/230 (55%), Gaps = 25/230 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S +A  AV  ++DR+C+H       +    D+L+CC   CG GCDGG P + W Y
Sbjct: 153 QGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVLSCC-HRCGFGCDGGVPSAVWHY 211

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKCVKK-NQLWRN 119
           +V +G+ +            SH GC+ +YP            TP+C+R C    N  +  
Sbjct: 212 WVENGITS-------GGAFGSHEGCQ-SYPFDVCKKSGDSNDTPRCLRFCQPGYNVTYPE 263

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY   AY +  D E IM E++  GP + +FT+Y DF  YKSGVY+H  G  +G H+VK
Sbjct: 264 DKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVK 323

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++GWG  +D + YW+ AN W   WG  G+FKI RG +    E +VVAGLP
Sbjct: 324 VMGWGVENDVK-YWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAGLP 372


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  167 bits (422), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 97/234 (41%), Positives = 130/234 (55%), Gaps = 25/234 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+  +   N  LS  ++  CC   CG GC+GGYPI AW  
Sbjct: 110 QGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEITFCC-HKCGYGCNGGYPIKAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C+PY      +D +G +    +P     +C R C     L  +
Sbjct: 169 FKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFD 228

Query: 120 SKH-YSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGG 175
             H ++  +Y   I S  +D+M      GP+E SF VY+DF  YKSGVY +      +GG
Sbjct: 229 DDHRHTRDSYYLTIGSIQKDVMTY----GPIEASFDVYDDFLSYKSGVYVRSENASYLGG 284

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAVKLIGWG  + G  YW++ N WN  WG +G FKI+RG+NECG++    AG+P
Sbjct: 285 HAVKLIGWG-EEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGVDNSTTAGVP 337


>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 206

 Score =  167 bits (422), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 73/86 (84%), Positives = 80/86 (93%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+
Sbjct: 121 ILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWK 180

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGC 97
           YF HHGVVTEECDPYFD  GCSHPGC
Sbjct: 181 YFAHHGVVTEECDPYFDQIGCSHPGC 206


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 14/222 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAF A E +SDR C+  +  +    S  DL+ CC   CG  C GGY   AW+Y
Sbjct: 96  QGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLINCCE-TCGKKCKGGYSYYAWKY 154

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC--VKKNQLWRNSKHYSISA 127
           +   G+V+     Y  S GC  P  +  +    +P+C + C   K    + N +H+    
Sbjct: 155 YTSTGLVSG--GDYNTSRGC-QPYSKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGT 211

Query: 128 YRINSDPEDIMAEIY-KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
           Y I  +   I  EI  + GPV   F VYEDF  Y+ GVY H +G ++G HAVK+IGWGT 
Sbjct: 212 YYILKNVTTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGT- 270

Query: 187 DDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
           ++G  YW++AN W + WGA  G FKI+RG+NEC IE+ ++ G
Sbjct: 271 ENGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIEQSIITG 312


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 96/233 (41%), Positives = 123/233 (52%), Gaps = 31/233 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGS WA  AV A+SDR CI  G   S             CG GCDGG+   +W Y+V
Sbjct: 112 QSQCGSSWAVSAVGAISDRICIQSGGKQSY------------CGSGCDGGFLGPSWDYWV 159

Query: 75  HHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWR 118
             G+VT       + TGC     P C+              Y TP+C + C K  N  + 
Sbjct: 160 LRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYE 217

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y + S    I  +I  +GPVE    +YEDF +YKSG+Y++ TG  + GHAV
Sbjct: 218 QDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAV 277

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +LIGWG  ++G  YW+ AN WN  WG  GYF+I RG NEC IE ++ AGL  S
Sbjct: 278 RLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 329


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  166 bits (421), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 126/232 (54%), Gaps = 18/232 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+      N  LS  ++  CC   CG GC GGYPI AW+ 
Sbjct: 110 QGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEITFCC-HTCGFGCHGGYPIKAWKR 168

Query: 73  FVHHGVVT-------EECDPYF---DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
           F  HG+VT       E C+PY     + G S    +P      C R C     +  N  H
Sbjct: 169 FSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHCYGNQSIDFNDDH 228

Query: 123 -YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKL 180
            Y+   Y +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHAVKL
Sbjct: 229 RYTRDYYYLTYG--SIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKL 286

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           IGWG  +DG  YW++ N WN  WG +G+FKI+RG+NECG++    AG+P + 
Sbjct: 287 IGWG-EEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTAGVPVTN 337


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  166 bits (420), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 116/208 (55%), Gaps = 17/208 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH     N SLS  DLL+CC   CG GC GGYP  AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDY 166

Query: 73  FVHHGVVT--EECDPY----FDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRN 119
           +  HG+VT   + DP     +    C H      P C    YPTP+CV+ C      +  
Sbjct: 167 WRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDCDTPELGYLE 226

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            K  +  +Y I +    IM EI   GPVE  FTVYEDF  YKS VY H  G  M GHA++
Sbjct: 227 DKTRANISYNIYASEISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIR 286

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADG 207
           ++GWG   D   YW++AN WN  WG  G
Sbjct: 287 ILGWGEEGD-VPYWLIANSWNEDWGEKG 313


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  166 bits (419), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 95/232 (40%), Positives = 125/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+  +   N  LS  ++  CC   CG GC+GGYPI AW+ 
Sbjct: 108 QGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEITFCC-HTCGFGCNGGYPIKAWKR 166

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F   G+VT       E C+PY       D  G +    +P     +C R C     L  +
Sbjct: 167 FSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFD 226

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+   Y +      I  ++   GP+E SF VY+DF  YKSGVY K      +GGHA
Sbjct: 227 EDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHA 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG  G+FKI+RG+NECG++    AG+P
Sbjct: 285 VKLIGWG-EEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVDNSTTAGVP 335


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 95/219 (43%), Positives = 124/219 (56%), Gaps = 17/219 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWAF A   L+DRFCI  G  +N+ LS   +++C G    +GC+GG+  + WR+
Sbjct: 145 QKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVSCSG--QNNGCNGGFFDATWRF 202

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
            V  G V+E C PY  S G + P C         V+ C    Q    S  Y   + R   
Sbjct: 203 LVSVGTVSEACVPYV-SFGGAVPACN--------VKSCGVPGQ---KSPFYRAGSARKLE 250

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG-TSDDGED 191
              DIMA++  NGP++V+  VY DF  YKSGVY H++G  +GGHAVK++GWG  S     
Sbjct: 251 GMLDIMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLP 310

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           YWI AN W   WG  GYF I RG  ECGI + V +G P+
Sbjct: 311 YWICANSWGEDWGIKGYFWILRGRGECGIGKMVWSGKPA 349


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/245 (38%), Positives = 133/245 (54%), Gaps = 18/245 (7%)

Query: 1   MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 58
           + +T+   +  +  QG CGSCWA      +SDR CI     MN  LS  D+L+CC  +CG
Sbjct: 97  LRWTSCPTISEIREQGSCGSCWAIATTSVMSDRLCIGSNGVMNFRLSGLDMLSCCA-ICG 155

Query: 59  DGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAYPTPKC 106
             C GGYP +AW Y+   G+V+       + C PY       S   S P C       +C
Sbjct: 156 FACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGGV-RC 214

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
              C    ++ ++  K+++   Y I++D  +I  EI  NGPV+   TVYEDF  YK+GVY
Sbjct: 215 QHLCEPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGVY 274

Query: 166 KHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
            H+ G+ +G HAV+++GWG        YW++AN W   WG +G+F I RG N C IE  +
Sbjct: 275 YHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYI 334

Query: 225 VAGLP 229
           +AGLP
Sbjct: 335 MAGLP 339


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 98/231 (42%), Positives = 128/231 (55%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI       L +S  D+++CC  LCG GCDGG+PI A+ Y
Sbjct: 116 QANCGSCWAVSTASALSDRICIASKGETQLHISSIDIVSCCK-LCGYGCDGGWPIEAFDY 174

Query: 73  FVHHGVVTEE------CDPY---------FDSTGCSHPG-CEPAYPTPKCVRKCVKKNQL 116
           F   G VT E      C PY          D+ G    G C+ +    + V++ V +N  
Sbjct: 175 FSRQGAVTGETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKR-VTRNHT 233

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
            R     +    RI    +      + NGPV   FTVYEDF++YK G+Y HI G   G H
Sbjct: 234 RRTG--LTARRLRITEFCQSHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAH 291

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           A+K+IGWG  ++G  YW++AN W+  WG  G F+I RG NECGIE++VVAG
Sbjct: 292 AIKIIGWGV-ENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 128/225 (56%), Gaps = 21/225 (9%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           V  ++ QG CGSCWAF A E+LSDR CI     +N++LS   L++C       GC+GG P
Sbjct: 95  VHAVLNQGQCGSCWAFAASESLSDRLCIASQGAINVTLSPQALVSC-DIEFNQGCNGGIP 153

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYS 124
             AW Y   HG+ T+ C PY    G +          P C ++C    K QL++  K ++
Sbjct: 154 QMAWEYLELHGIPTDSCFPYTSGNGTA----------PDCQKECSDGSKYQLYKG-KTFT 202

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGW 183
           +   +  S    I A ++  GP+E +  VY+DF  Y SGVY    G  ++GGHA+K++GW
Sbjct: 203 L---KTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGW 259

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           GT S  G DYWI+ N W   WG +G+F I+RG+N CGI+ D  AG
Sbjct: 260 GTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAG 304


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 123/231 (53%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR CI  ++  N  LS  +L  CC  LCG  C GGYPI AW Y
Sbjct: 112 QGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGYPIKAWSY 170

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C PY       +  G +    +P     +C R C    ++  +
Sbjct: 171 FRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYD 230

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 178
             H     Y   +    I  ++   GP+E S  VY+DF  YKSGVY K      +GGHAV
Sbjct: 231 DDHRFTRDYYYLT-YASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAV 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KLIGWG  +DG  YW++ N W+  WG  G FKI+RG+NEC ++  + AG+P
Sbjct: 290 KLIGWG-EEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 18/212 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAF   E + DR  I       +S  DL++C       GC+GGY   AW +  
Sbjct: 83  QASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQDLVSC--DTTDMGCNGGYMDHAWAWTK 140

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
            HG+ TE+C PY   +G            P C  KCV  + + RN    S+S  ++N+  
Sbjct: 141 SHGITTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRNK---SVSYKKLNA-- 185

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
           + +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV  +GWG  +D   YW+
Sbjct: 186 QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGV-EDNTPYWL 244

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             N W  +WG  G+FKI RGSN CGIE    A
Sbjct: 245 CQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 123/231 (53%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR CI  ++  N  LS  +L  CC  LCG  C GGYPI AW Y
Sbjct: 112 QGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGYPIKAWSY 170

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C PY       +  G +    +P     +C R C    ++  +
Sbjct: 171 FRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYD 230

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 178
             H     Y   +    I  ++   GP+E S  VY+DF  YKSGVY K      +GGHAV
Sbjct: 231 DDHRFTRDYYYLTYAS-IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAV 289

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KLIGWG  +DG  YW++ N W+  WG  G FKI+RG+NEC ++  + AG+P
Sbjct: 290 KLIGWG-EEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 134/241 (55%), Gaps = 31/241 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD-GCDGGYPISAWR 71
           Q +CGSCWAFG+ EA++DR CI     ++  LS  D+ +C     GD GC+GG P S + 
Sbjct: 10  QANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL--GDMGCNGGIPSSVYS 67

Query: 72  YFVHHGVVTEECDPYFDSTGC---------------SHPGCEPAYPTPKCVRKCVKKNQL 116
           Y+   G+V  +   Y D +GC                +P C      PKC RKC  +++ 
Sbjct: 68  YWALSGIV--DGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKCESEDKD 125

Query: 117 WRNSKHYSISAYRINSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HI 168
           W  +K      Y +    E        + A+IY+NGP+   F V +DF  YKSGVY+  +
Sbjct: 126 WTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKSGVYEPKL 185

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
               +GGHA+K++G+GT +DG+DYW++AN WN  WG DGYFKI RG N C IE+ V+ G 
Sbjct: 186 LSPPLGGHAIKIMGFGT-EDGKDYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGG 244

Query: 229 P 229
           P
Sbjct: 245 P 245


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 18/212 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAF   E + DR  I       ++  DL++C       GC+GGY   AW +  
Sbjct: 83  QASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQDLVSC--DTTDMGCNGGYMDHAWAWTK 140

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
            HGV TE+C PY   +G            P C  KCV  + + RN    S+S  ++N+  
Sbjct: 141 SHGVTTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRNK---SVSYKKLNA-- 185

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
           + +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV  +GWG  +D   YW+
Sbjct: 186 QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGV-EDNTPYWL 244

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             N W  +WG  G+FKI RGSN CGIE    A
Sbjct: 245 CQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 121/229 (52%), Gaps = 12/229 (5%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S  DLL+CC   CGDGC GG+P  AW Y+
Sbjct: 112 QSACRASWAVSTASAISDRYCTVGGGKQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYY 170

Query: 74  VHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
           V +G+ +  C PY         + G   P  +  + TPKC   C  K+      K+   +
Sbjct: 171 VEYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNA 228

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
            Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD +GG AV+++GWG  
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL 288

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
            +G  YW +AN W+  WG +GY  I RG+NEC IE     G P    L 
Sbjct: 289 -NGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 87/213 (40%), Positives = 116/213 (54%), Gaps = 18/213 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAF   E   +R  I       +S  DL++C       GC+GG P+ +W +  
Sbjct: 83  QEQCGSCWAFAVAETTGNRLNILGCGRGDMSPQDLVSC--DKVDHGCNGGSPLFSWEWVK 140

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
           H G+ TEEC PY    G            P C +KC   + + R +K  S+   +     
Sbjct: 141 HSGITTEECIPYVSGGG----------RVPSCPKKCTNGSAIVR-TKAKSVGLVK----G 185

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
           + +  E+Y  GP E +F+VYEDF  YKSGVY HITG ++GGHAV ++GWG  +DG  YW+
Sbjct: 186 DKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGV-EDGTPYWL 244

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           + N W  +WG  G+FKI RG NECGIE     G
Sbjct: 245 IQNSWGTTWGEQGFFKILRGKNECGIETTCFQG 277


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 128/231 (55%), Gaps = 20/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  +   +SDR CI      +  LS  +LL+CC   CG GC+GGYP   ++Y
Sbjct: 312 QANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELLSCCT-SCGYGCNGGYPQRTFKY 370

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCE--PAYPTPKCVRKCVKKNQLWRNS-KH 122
           +V+ G+ T       + C PY        P C       TPKC + C+    L  N  +H
Sbjct: 371 WVYSGMPTGGPYGSNDTCKPY------PIPPCSNCSETRTPKCSKSCISTYPLSLNEDRH 424

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y  + Y+     + +M +I   GP+    +VYEDF HYK GVY   +G  +GGHAV++IG
Sbjct: 425 YGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIG 484

Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           WG  D+   YW++AN WN ++G DG FKI+RG +ECGIE  V AG    K 
Sbjct: 485 WGEQDN-IPYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAKCKQ 534


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/234 (40%), Positives = 125/234 (53%), Gaps = 21/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGS WA     A SDR C+  +   N  LS  ++  CC   CGDGC GGYPI AW+ 
Sbjct: 47  QGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLSAEEITFCC-HTCGDGCSGGYPIRAWKR 105

Query: 73  FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           +  HG+VT       E C+PY       D  G +    +P     +C R C     L  +
Sbjct: 106 YKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFD 165

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+   Y +      I  ++   GP+E SF VY+DF  YKSG+Y K      +GGH+
Sbjct: 166 EDHRYTRDHYYLTY--RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHS 223

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           VKLIGWG  + G  YW++ N WN  WG  G FKI+RG+NECG++     G+P++
Sbjct: 224 VKLIGWG-EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPAT 276


>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 166

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 73/86 (84%), Positives = 80/86 (93%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           ++ QGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+
Sbjct: 81  ILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWK 140

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGC 97
           YF HHGVVTEECDPYFD  GCSHPGC
Sbjct: 141 YFAHHGVVTEECDPYFDQIGCSHPGC 166


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 99/243 (40%), Positives = 135/243 (55%), Gaps = 22/243 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   ++ +  Q  C S WA  +V A+SDR CI     + + LS  +L++CC   C  G
Sbjct: 94  WKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELVSCCS-KCAVG 152

Query: 61  CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYPT--------PK 105
           C+ GY  SAW Y+V +G+VT E       C PY     C H G   +YP         P 
Sbjct: 153 CNFGYSESAWYYWVENGLVTGESNGNNSGCLPY-PFPKCDH-GSSDSYPMCGYVVYTPPV 210

Query: 106 CVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C   C     + + + KH+  SAY++  +  DI  EI   GPVE S  +Y+DF  YKSGV
Sbjct: 211 CNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGV 270

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           YKH+TG ++   +V++IGWG  ++G  YW+ AN WN  WG +G+FKI RGSNEC IE  V
Sbjct: 271 YKHLTGRLITIQSVRIIGWGI-ENGIPYWLCANSWNEEWGLNGFFKILRGSNECEIEAFV 329

Query: 225 VAG 227
            AG
Sbjct: 330 NAG 332


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  164 bits (414), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 127/234 (54%), Gaps = 21/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGS WA     A +DR C+      N  LS  ++  CC   CG+GC+GGYPI AW+ 
Sbjct: 108 QGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKR 166

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F +HG+VT       E C+PY      +D  G +    +P     KC +KC     +  N
Sbjct: 167 FKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFN 226

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+   Y +      I  ++   GP+E SF VY+DF +YKSG+Y K      +GGH+
Sbjct: 227 KDHRYTRDDYYLTY--RGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHS 284

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           VKLIGWG  + G  YW++ N WN  WG  G FKI+RG+NEC ++     G+P +
Sbjct: 285 VKLIGWG-EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVDNSTTGGVPDT 337


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 88/222 (39%), Positives = 125/222 (56%), Gaps = 23/222 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  +   + DR CI       + +S  D+L+C       GC+GGYP  A+ +
Sbjct: 131 QSNCGSCWAVSSASVIQDRICIASNGEQKVHISAQDILSCATDR-SQGCNGGYPDEAFEH 189

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEP---------AYPTPKCVRKC--VKKNQLWRNSK 121
           +   GVVT        S   ++ GC+P          Y TP+C +KC   +  + ++  K
Sbjct: 190 YAQSGVVT-------GSGNSANQGCKPYPFLPHTTVEYSTPECSKKCENYQYKKAYKQDK 242

Query: 122 HYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           H+ +S Y +  SDP DI  EI  NGPVE +  VY DF  YKSGVY+ +    +GGHAV++
Sbjct: 243 HFGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRI 302

Query: 181 IGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIE 221
           +GWG     +  YW++AN WN  WG DGYF+I+RG++E  IE
Sbjct: 303 VGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRGTDESYIE 344


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 21/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA     A +DR C+    + +  LS  +L  CC   CG GC+GGYPI AW  
Sbjct: 110 QGNCGSCWALATSSAFADRLCVATDADFNEFLSPEELTFCC-HTCGYGCNGGYPIKAWER 168

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F  HG+VT       E C+PY        + G +    +P     +C R C     L  +
Sbjct: 169 FKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFD 228

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+  +Y +      I  ++   GP+E SF VY+DF  YKSGVY +      +GGHA
Sbjct: 229 DDHRYTRDSYYLTYG--SIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHA 286

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VKLIGWG  + G  YW++ N WN  WG  G FKI+RG+NECG++    AG+P
Sbjct: 287 VKLIGWG-EESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVDNSTTAGVP 337


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 102/253 (40%), Positives = 137/253 (54%), Gaps = 49/253 (19%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI------HFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
           QG CGSCWA  AV A++DR CI      HF      S+ D+L+CCG+ CG+GC+GG    
Sbjct: 106 QGGCGSCWAVAAVSAMTDRMCILSKGKEHF----YFSIKDVLSCCGY-CGNGCEGGVLTR 160

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSH---------------PGCE--PAYP-- 102
           AW Y+   G+V+       + C PY     C+H               P C+  P  P  
Sbjct: 161 AWIYYKKIGIVSGGGYKSKQGCQPY-TIPPCNHLVWGEIEQCKNIPMTPKCKNIPVIPEQ 219

Query: 103 ------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 155
                 TP+C +KC K  ++ +   KH   S YR+     +I  EIY+ GPV   FTVYE
Sbjct: 220 CKYIPITPECEKKCNKNYKVCYSKDKHRGKSVYRVKKS--EIFKEIYEYGPVTSYFTVYE 277

Query: 156 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR-G 214
           DF +YK G+Y + +G  +G H+VK+IGWG  + G  YW+ AN +N  WG  G+FKI R G
Sbjct: 278 DFLNYKEGIYNYTSGQKLGLHSVKIIGWG-EERGIKYWLAANSFNTDWGDKGFFKIIREG 336

Query: 215 SNECGIEEDVVAG 227
              CGI ++VVAG
Sbjct: 337 VGSCGISDNVVAG 349


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  163 bits (413), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 93/217 (42%), Positives = 123/217 (56%), Gaps = 19/217 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E+LSDRFCI     +NL LS  D+++C       GC GGY   AW+Y
Sbjct: 98  QAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSC--DTSNFGCFGGYLDQAWQY 155

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               GV ++ C+PY      S  G +P+ PT     + +KK +    S   +  A     
Sbjct: 156 LEQQGVSSDSCEPYK-----SGNGDQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA----- 205

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
             E   + I ++GPVE  FTVY+DF +Y SGVY H+TGD  GGHAVK++GWG     E+Y
Sbjct: 206 --EATKSLIQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWG-KQGLENY 262

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WI+AN W   WG  GYF I++G  + GI+E     +P
Sbjct: 263 WIVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 120/220 (54%), Gaps = 25/220 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC-CGFLCGDGCDGGYPISAWR 71
           Q  CGSCWAF AVE+LSDRFCI     +NL LS  D+L+C     C   C GGY  +AW+
Sbjct: 98  QAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFC---CFGGYLDTAWQ 154

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA--YR 129
           Y    GV ++ C+PY    G            P C  KC     +    K Y   A   +
Sbjct: 155 YLEQQGVGSDSCEPYKSGNG----------DQPSCPSKCSNGQAI----KKYKCKAGSTK 200

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
                E   + I ++GPVE  FT+YEDF +Y SG+Y H+TG  MGGHAVK++GWG     
Sbjct: 201 QAKGAEATKSLIQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGL- 259

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           E+YWI+AN W   WG  GYF I++G  + GI+E     +P
Sbjct: 260 ENYWIVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score =  162 bits (411), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 93/216 (43%), Positives = 125/216 (57%), Gaps = 18/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGA EA SDRF I+ G ++ LS  DL++C       GC+GGY   AW Y  
Sbjct: 96  QQQCGSCWAFGATEAFSDRFAIN-GKDVILSPEDLVSC--DTNDYGCNGGYMDVAWEYLA 152

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
            HG  T+ C PY   +G +          P C  KC   + + R     + ++ R +   
Sbjct: 153 DHGAATDSCFPYSAGSGFA----------PACSDKCADGSAMQRFK--CAPNSVRQSKGV 200

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
             I +EI  +GPVE +FTVY DF +Y+SGVY   T DV GGHA+K++G+G  ++G  YW+
Sbjct: 201 AQIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWL 259

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
            AN W  +WG  G+FKIK+G  ECGIE+ V +  P 
Sbjct: 260 CANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQ 293


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/214 (44%), Positives = 120/214 (56%), Gaps = 20/214 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYF 73
           Q  CGSCWA  A EA+ +RF I       LSV DL++C     GD GC+GG    + ++ 
Sbjct: 83  QASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQDLVSCDK---GDSGCNGGSGPLSSKWL 139

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
           V +GV TEEC PY    G            P C  KC   +Q+ R  K+     Y +   
Sbjct: 140 VSNGVTTEECLPYVSGNG----------RVPACAAKCSNGSQIIR-YKYEKAETYTV--- 185

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
            ++I  E+ KNGPV   FTVY DF +YKSGVY+H +G   GGHAV LIGWG  +DG  YW
Sbjct: 186 -QNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLIGWGV-EDGVPYW 243

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +L N W  +WG  G+FKI RG NECG E+   AG
Sbjct: 244 LLQNSWGPAWGEKGHFKIIRGKNECGCEQGFYAG 277


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 135/238 (56%), Gaps = 26/238 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGA EA+SDR CIH    +S+ ++  DLL CC   CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLTCCDS-CGMGCNGGYPSAAWDF 159

Query: 73  FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY          G   P       TP+C+ +C       ++
Sbjct: 160 WTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYK 219

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  S+Y + SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +GGHA+
Sbjct: 220 ADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAI 279

Query: 179 KLIGWGTSDDGEDYWILAN--QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           K      S  GE+   L      +  WG D       GS+ CGIE ++VAG+P +++ 
Sbjct: 280 K------SWLGEEVCSLLALCHSDTDWG-DMVSLSSAGSDHCGIESEIVAGIPITQSF 330


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
           QG CGSCWAF + EA +DR CI       + LS     +CC  + C   GC+GG P  AW
Sbjct: 297 QGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 356

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
           R+F   GVVT            C PY +   C+H      P C+       TPKC + C 
Sbjct: 357 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 415

Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           ++        +    H + SAY + S  +D+  ++  +GPV  +F VYEDF  YKSGVYK
Sbjct: 416 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 474

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G FKI  G  +CGI+ ++VA
Sbjct: 475 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 531

Query: 227 G 227
           G
Sbjct: 532 G 532


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
           QG CGSCWAF + EA +DR CI       + LS     +CC  + C   GC+GG P  AW
Sbjct: 297 QGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 356

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
           R+F   GVVT            C PY +   C+H      P C+       TPKC + C 
Sbjct: 357 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 415

Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           ++        +    H + SAY + S  +D+  ++  +GPV  +F VYEDF  YKSGVYK
Sbjct: 416 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 474

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G FKI  G  +CGI+ ++VA
Sbjct: 475 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 531

Query: 227 G 227
           G
Sbjct: 532 G 532


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/221 (42%), Positives = 123/221 (55%), Gaps = 19/221 (8%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V ++ DR C   G++   +  S   +++C     GD  CDGG+  
Sbjct: 91  VVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDMACDGGWLP 146

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           S WR+    G  T+EC PY         G   A  T  C  KC   + L    K      
Sbjct: 147 SVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLYKATKAVD 197

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  D   IM  +   GP++ +FTVY DF +Y+SGVY+H  G V GGHAV ++G+GT D
Sbjct: 198 YGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDD 255

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           DG DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 256 DGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
           QG CGSCWAF + EA +DR CI       + LS     +CC  + C   GC+GG P  AW
Sbjct: 300 QGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 359

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
           R+F   GVVT            C PY +   C+H      P C+       TPKC + C 
Sbjct: 360 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 418

Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           ++        +    H + SAY + S  +D+  ++  +GPV  +F VYEDF  YKSGVYK
Sbjct: 419 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 477

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H++G  +GGHA+K+IGWGT ++GE+YW   N WN  WG  G FKI  G  +CGI+ ++VA
Sbjct: 478 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 534

Query: 227 G 227
           G
Sbjct: 535 G 535


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/198 (45%), Positives = 120/198 (60%), Gaps = 19/198 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CI         LS  +L++CC   CG GC+GG+P SAW Y
Sbjct: 117 QSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLY 175

Query: 73  FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
           + + G+VT +       C PY +   C H      P C+    TP C R C    N  + 
Sbjct: 176 WKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYE 234

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
           N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 235 NDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAV 294

Query: 179 KLIGWGTSDDGEDYWILA 196
           +L+GWG  ++   YW++A
Sbjct: 295 RLLGWG-EENNVPYWLIA 311


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/240 (40%), Positives = 128/240 (53%), Gaps = 32/240 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR C+      +  LS  ++ AC       GCDGGYP SAW +
Sbjct: 83  QSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEMNACAPSY---GCDGGYPDSAWSW 139

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
               G+ T             + C PY D   C+H       P C + +Y TP CV +C 
Sbjct: 140 VHDEGIATGGDYVARGNLTKGDGCWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCH 198

Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
             K +   +N +HY + +        +    I  +GPV  S+ VYEDF  YKSGVYKH +
Sbjct: 199 NPKYSTSLKNDRHYMLESSPYQYSVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTS 258

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G  +GGHAVK+IGWG  ++GE YW++ N WN  WG  G FKI  G+  C I++D++ G P
Sbjct: 259 GSYLGGHAVKIIGWG-EENGEAYWLVVNSWNEDWGDHGLFKIALGN--CQIDDDLLGGTP 315


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/237 (40%), Positives = 134/237 (56%), Gaps = 15/237 (6%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N + +  +  QG CGSCWA  A   +SDR CIH     N++++  DL+ CC   CG+GC+
Sbjct: 104 NCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDLMGCCA-DCGNGCE 162

Query: 63  GGY-PISAWRYFVHHGVV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK 113
           GG+   ++++Y+V  G+V       TE C PY     C +P  +     +PKC   C   
Sbjct: 163 GGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPY-PFKPCLYPFTDCHREESPKCKHHCQHG 221

Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
            ++ +   K +   AY +  D   I  EI  NGPVE  F VYED   YKSGVY+H+ G+ 
Sbjct: 222 VDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEH 281

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +G HAV++IGWG  + G  YW+++N +   WG  GYFKI RG N  GIE  V+ GLP
Sbjct: 282 VGKHAVRIIGWG-REGGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLP 337


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFC--IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A  A+SDR C   +  +N  LS  ++L+CC   CG GC GGYP  A+ Y
Sbjct: 116 QSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFGCKGGYPARAFGY 175

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKNQL- 116
              +G+ T       + C PY     C +   EP Y        PTP C R C     + 
Sbjct: 176 AWRYGLSTGGPYGEKDACQPY-AFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIP 234

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K ++   Y I  +  +I  EI   GPV  ++ VY DF +YK GVY H  G+V G H
Sbjct: 235 FEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLH 294

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           AVK+IGWG  +D   YW++AN WN  WG +GYF+I RG++ C IE  +V G+
Sbjct: 295 AVKIIGWGKGND-VPYWLVANSWNTDWGDNGYFRIVRGTDNCEIERQMVGGI 345


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/217 (41%), Positives = 123/217 (56%), Gaps = 19/217 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E+LSDRFCI     +N+ LS  D+++C       GCDGGY   AW+Y
Sbjct: 98  QAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQY 155

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               GV ++ C+PY  ++G +          P C  KC    Q  +  K  + S  + N 
Sbjct: 156 LEKKGVASDSCEPYKSASGTA----------PSCPSKCAN-GQAIKKYKCQAGSTKQANG 204

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
                 + I ++GPVE  FTVY DF +YKSG+Y H++G   GGHAVK++GWG     E+Y
Sbjct: 205 AAA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWG-KQGSENY 262

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WI+AN W  SWG  G+F I++G  + GI++     +P
Sbjct: 263 WIVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score =  161 bits (407), Expect = 3e-37,   Method: Composition-based stats.
 Identities = 94/214 (43%), Positives = 122/214 (57%), Gaps = 22/214 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI-HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWAF A E LSDR  I H      LS  DL++C       GC+GG   +AW Y 
Sbjct: 360 QQQCGSCWAFSAAEVLSDRNAIQHNKAEPVLSPEDLVSCD--RVDQGCNGGNLGTAWTYL 417

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
            + G+VT+ C PY    G            PKC   C K    W  +K+ + SAY +N  
Sbjct: 418 KNTGIVTDACFPYTAGGG----------DAPKCETSC-KDGSSW--TKYKAASAYAVNG- 463

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--GGHAVKLIGWGTSDDGED 191
            E++  EI  +GP++V+F VY+ F  YKSGVY     ++M  GGHAVK++GWGT + G+D
Sbjct: 464 VENMQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGT-EGGKD 522

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           YW++AN WN SWG +GYFKI  G+    I  DVV
Sbjct: 523 YWLVANSWNTSWGDEGYFKIAVGAES--ISLDVV 554


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score =  160 bits (406), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 93/238 (39%), Positives = 128/238 (53%), Gaps = 27/238 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG CWAFG  EA +DR CI      +  LS  ++ AC   L   GC GG+P SAW +
Sbjct: 163 QSACGDCWAFGVTEAFNDRLCIKSNGTFTKLLSAGEMNACAPSLKDPGCRGGFPYSAWSW 222

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSHPGCEPAYPT-PKCVR---KCVKKNQ 115
               G+ T             + C PY D   C+H   +P YP  PK  R   +CV K +
Sbjct: 223 VHDEGIATGGDYVPRDNMTEDDGCWPY-DFPPCAHFFKDPKYPACPKFARVNLRCVSKLR 281

Query: 116 ----LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
               ++ + +++ + +   +   +D    I  +GPV  +F VYEDF  YKSGVYKH +G 
Sbjct: 282 HMMVVYFSDRYFMVESVPYHFSADDAKNAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGS 341

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++G HAVK+IGWG  D GE YW++ N WN  WG  G FKI  G  +CGI+ +++ G P
Sbjct: 342 LLGAHAVKIIGWG-EDGGEAYWLVVNSWNEGWGDHGLFKIALG--DCGIDNELLGGTP 396


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  160 bits (406), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 89/217 (41%), Positives = 123/217 (56%), Gaps = 19/217 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E+LSDRFCI     +N+ LS  D+++C       GCDGGY   AW+Y
Sbjct: 98  QAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQY 155

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               GV ++ C+PY  ++G +          P C  KC    Q  +  K  + S  + N 
Sbjct: 156 LEKKGVASDSCEPYKSASGTA----------PSCPSKC-SNGQAIKKYKCKAGSTKQANG 204

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
                 + I ++GPVE  FTVY DF +YKSG+Y H++G   GGHAVK++GWG     E+Y
Sbjct: 205 AAA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWG-KQGSENY 262

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WI+AN W  SWG  G+F I++G  + GI++     +P
Sbjct: 263 WIVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score =  160 bits (406), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 95/219 (43%), Positives = 126/219 (57%), Gaps = 23/219 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAFGA EALSDRF I  +  +++  S  DL++C       GC+GGY   AW +
Sbjct: 96  QQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVSC--DTNDYGCNGGYMDMAWEF 153

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRI 130
              HGVV + C PY   +G +          P C  KC   +      K YS    + R 
Sbjct: 154 LDQHGVVADSCFPYSAGSGFA----------PACASKCADGSA----EKKYSCVHGSIRQ 199

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
           +   E I +EI  +GPVE +FTVY DF +Y+SGVY   T DV GGHA+K++G+G  ++G 
Sbjct: 200 SQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGV-ENGT 258

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            YW+ AN W  SWG  G+FKIK+G  ECGIE+ V +  P
Sbjct: 259 PYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDP 295


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 127/234 (54%), Gaps = 19/234 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWA  A   ++DR CI     ++   S  ++ ACC   CG+ C GG   +A+ +
Sbjct: 98  QGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCT-ECGNACYGGDEDTAFTH 156

Query: 73  FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +V  G V+       E C PY     C H      P CE   P   C   C ++  + + 
Sbjct: 157 WVTKGFVSGGRHNSNEGCQPY-SVEECEHHIEGPRPPCEGDMPELVCSETCHEEYGKTYE 215

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
               Y + AY +  D   I  EI  NGPV  +F VY+DF  YKSGVY+H TG + G HAV
Sbjct: 216 EDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAV 275

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           ++IGWG  ++G  YW++AN WN  WG +G FKI RGS+EC  E D+ A   SSK
Sbjct: 276 RVIGWG-EEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFEGDMAAATYSSK 328


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 96/232 (41%), Positives = 126/232 (54%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L  CC   CG GC GG P+ AW Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELTFCCK-DCGQGCGGGNPMKAWEY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY      +  G +    +P     +C + C  K  +   +
Sbjct: 166 FRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGENICDEQPMERNHQCPKTCYGKTTV--QN 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 179
           ++ + S Y INS  + I  +I   GPVE SF  Y+D + YKSG+Y K       GGH++K
Sbjct: 224 RYKTKSEYYINS-IKTIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG  +DG  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 283 IIGWG-QEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 99/233 (42%), Positives = 134/233 (57%), Gaps = 23/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     A SDR CI  + G+N  LS   + +CC   CG+GC+GG+P  AW+Y
Sbjct: 108 QSNCGSCWALSTASAFSDRLCITSNMGVNKVLSGEYINSCCNGKCGNGCNGGHPEKAWKY 167

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVR-KCVKKN--QL 116
              +G+ T       E C PY       ++  CS    +    TP+C + +C   N    
Sbjct: 168 IKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENED----TPQCYKDQCTNNNYETP 223

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
             +  +Y+   Y +   PE IM+E++KNGPV  +  VY+DF  YK G+Y++ TG + G H
Sbjct: 224 LVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDH 283

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           AVK++GWG  DDG DYW+ AN W  SWG  G FKI+RG NECGIE  +  GLP
Sbjct: 284 AVKIMGWG-EDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGLP 335


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 126/221 (57%), Gaps = 13/221 (5%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S WAF A E +SDR CI     + + LS  DL+ CC + CG+ C GGY   AW Y
Sbjct: 95  QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHY-CGNQCKGGYTYYAWNY 153

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAY 128
           F+  G+V+     Y  STGC  P  E  Y   TP C   C   K    + + KH+  S Y
Sbjct: 154 FMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIY 210

Query: 129 RINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
            I  +   I  EI   G PV  +F VY DF  Y+ GVY + +G + G  AVK+IGWGT +
Sbjct: 211 YIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGT-E 269

Query: 188 DGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
           +G  YW+ AN W + WGA  G+FKI+RG+NECG EE ++AG
Sbjct: 270 NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 310


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 96/240 (40%), Positives = 127/240 (52%), Gaps = 32/240 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR CI  H      LS  ++ AC       GC+GG+P SAW +
Sbjct: 44  QSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEMNACAP---SHGCNGGFPNSAWSW 100

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
               G+ T             + C PY D   C+H       P C + +Y TP C  +C 
Sbjct: 101 VHDKGIATGGDYVAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCH 159

Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
             K     R+ +H+ + +        D    I  +GPV  SFTVYEDF  YKSGVYKH +
Sbjct: 160 NPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTS 219

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G+ +GGHAVK+IGWG  + G+ YW++ N WN  WG  G FKI  G+  CGI++ ++ G P
Sbjct: 220 GEYLGGHAVKIIGWG-EESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTP 276


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 94/242 (38%), Positives = 134/242 (55%), Gaps = 23/242 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-------GMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
           QG  G CWA GA+EA+SD  CIH        G ++ +S  D L C   LCGDGC+GG P 
Sbjct: 114 QGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVSAEDKLTC---LCGDGCNGGXPN 170

Query: 68  SAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY----PTPKCVRKCVKKNQL 116
             W ++   G+V+         C  +     C H      Y     +PKC   C +  Q 
Sbjct: 171 EGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMTC-EPGQT 229

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KHY  S+Y I+   +DIM  IYKN  VE +F+VY DF  YK   Y+ +TG++ GGH
Sbjct: 230 YKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGH 289

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           A+ ++G    ++   YW++AN WNR WG +G+FKI RG +  GIE +VVA +P ++   +
Sbjct: 290 AICILGCKV-ENSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIPHTEQYWE 348

Query: 237 EI 238
           +I
Sbjct: 349 KI 350


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 85/202 (42%), Positives = 121/202 (59%), Gaps = 20/202 (9%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  QG CGSCWAFGAVEA+SDR CIH     N   S  +L++CC + CG G
Sbjct: 38  WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLVSCC-WTCGFG 96

Query: 61  CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
           C+GG+P +AW Y+   G+V+    PY  + GC              +   C+    TPKC
Sbjct: 97  CNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAPCEHHVNGTRGPCKEGGKTPKC 154

Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
           V+KC    ++ +    H+  SAY +++D + I  EIY NGPVE +FTVYEDF  Y++GVY
Sbjct: 155 VKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 214

Query: 166 KHITGDVMGGHAVKLIGWGTSD 187
           KH+ G  +GGHA++++GWG  +
Sbjct: 215 KHVAGKALGGHAIRILGWGVQN 236


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score =  159 bits (402), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 117/197 (59%), Gaps = 20/197 (10%)

Query: 20  SCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  +  A+SDR CI       + LS  D+LACC + CG GC+GG+P+ AW+YF   G
Sbjct: 1   SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEG 59

Query: 78  VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKH 122
           VVT         C PY +   C   G EP Y        TPKC + C +   + ++  KH
Sbjct: 60  VVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKH 118

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           +  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + GGHAVK+IG
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIG 178

Query: 183 WGTSDDGEDYWILANQW 199
           WG  + G  YW++AN W
Sbjct: 179 WG-KEXGTPYWLIANSW 194


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 96/222 (43%), Positives = 121/222 (54%), Gaps = 41/222 (18%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD------GCDGGYP 66
           Q  CGSC AFGAVEA+S+R CI  G   N+ LS  DL    G + G       GC+  YP
Sbjct: 111 QSRCGSCCAFGAVEAMSERSCIQSGGKQNVELSAVDLE---GIVTGSSKENNTGCEP-YP 166

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
                +F                T   +P C    Y TP+C   C K     R    Y+ 
Sbjct: 167 FPKCEHF----------------TKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQ 205

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
             +R       I  EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG 
Sbjct: 206 DKHRA------IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV 259

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            ++   YW++AN WN  WG +GYF+I RG +EC IE +V AG
Sbjct: 260 -ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
           + N   +  +  Q +CGSCWA  A E +SDR C+     +   ++D  +LACCG  CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164

Query: 61  CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
           C+GG    AW Y    GVVT    +E   C PY      +H G       + ++ TP C 
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y 
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYV 284

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           H  G   G HAVK++GWG  ++G  YW +AN W+  WG DGYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 116/218 (53%), Gaps = 18/218 (8%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           E +  +  Q  CGSCWAF   E + DR  I       +S  DL++C       GC+GGY 
Sbjct: 75  EQILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQDLVSC--DTTDMGCNGGYM 132

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
             AW +   HGV  EEC PY    G            P C  KCV  + + R +K  S +
Sbjct: 133 DKAWAWTKSHGVTNEECMPYQSGGG----------RVPACPAKCVNGSTIVR-TKSQSFT 181

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
            +  +     +  E+Y+NGP+ V+FTVY DF +YKSGVY H TG V GGHAV  IGWG  
Sbjct: 182 HFTAS----QMQQELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVE 237

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           D+   YW+  N W  +WG  G+FKI RGSN CGIE  V
Sbjct: 238 DN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQV 274


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/215 (41%), Positives = 116/215 (53%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCWAF   E + DR  +       ++  DL++C  F   DGCDGG+   AW +  
Sbjct: 83  QGECGSCWAFSIAETIGDRLGVLGCSRGDIAPEDLVSCDIF--DDGCDGGFIDMAWDWCQ 140

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
            +G+ TEEC PY    G   P          C   C   + ++R      I +YR   D 
Sbjct: 141 ENGLTTEECIPYKAGEGVPSP----------CPETCEDGSAIYRTP----IESYRY-IDA 185

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
           +DI  EIY+ GPV + F VY DF  YKSGVY H  G + GGHAV ++GWG  D+   YW+
Sbjct: 186 DDIQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDE-VPYWL 244

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           + N W   WG +G+FKI RGS+ C  E +V AG P
Sbjct: 245 VQNSWGTDWGENGFFKILRGSDHCECESNVTAGYP 279


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/242 (37%), Positives = 130/242 (53%), Gaps = 30/242 (12%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
           + ++  QG+C S +A     A+SDR CIH    +   LS   +L+CC +LCGDGC GG  
Sbjct: 69  IGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQILSCC-YLCGDGCSGGQH 127

Query: 67  ISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
             +W ++  HG+V+       E C PY         T   +        TP+C  +C   
Sbjct: 128 FESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETAVENACSNKTLFTPECKVQCYNP 187

Query: 114 NQLWRNSK------HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           +   R  K      HY + AY         M EIY+NGP+  SF +Y+DF +Y+SGVY +
Sbjct: 188 DYGTRYVKDNHQGTHYRVPAYTA-------MKEIYENGPITASFYMYQDFVNYQSGVYAY 240

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            +G  +   AVK++GWG  ++G  YW+ AN +N  WG +G+ KI RG+NEC IEE + AG
Sbjct: 241 NSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWGDNGFVKILRGANECYIEEFMYAG 299

Query: 228 LP 229
           LP
Sbjct: 300 LP 301


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 91/221 (41%), Positives = 126/221 (57%), Gaps = 20/221 (9%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V +L DR C   G++   ++ S   +++C     GD  CDGG+  
Sbjct: 91  VVDQGSCGSCWAFSSVASLGDRRCFA-GLDKKAVTYSPQYVVSCDH---GDMACDGGWLQ 146

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           S WR+    G  T EC PY   T  +   C    PT     KC    +L   S   +  A
Sbjct: 147 SVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL---STVKAKKA 194

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
                D + IM  +   GP++ +FTVY DF +Y+ GVY+H++G V GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEMVGYGTDE 254

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
              DYWI+ N W   WG DGYF+I R +NECGIEE V+ G+
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
           + N   +  +  Q +CGSCWA  A E +SDR C+     +   ++D  +LACCG  CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164

Query: 61  CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
           C+GG    AW Y    GVVT    +E   C PY      +H G       + ++ TP C 
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y 
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYV 284

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           H  G   G HAVK++GWG  ++G  YW +AN W+  WG +GYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
           + N   +  +  Q +CGSCWA  A E +SDR C+     +   ++D  +LACCG  CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164

Query: 61  CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
           C+GG    AW Y    GVVT    +E   C PY      +H G       + ++ TP C 
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +   YEDF+ Y+ G+Y 
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGIYV 284

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           H  G   G HAVK++GWG  ++G  YW +AN W+  WG DGYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 132/204 (64%), Gaps = 16/204 (7%)

Query: 40  MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 92
           +N+ +S  D+L CCG  CGDGC+GG+P  AW ++   G+V+         C PY     C
Sbjct: 2   VNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPC 60

Query: 93  SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 145
            H      P C     TPKC + C    +  ++  KH+  S+Y + ++ ++IMAEIYKNG
Sbjct: 61  EHHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNG 120

Query: 146 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 205
           PVE +F+VY DF  YKSGVY+H++G++MGGHA++++GWG  ++G  YW++ N WN  WG 
Sbjct: 121 PVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGD 179

Query: 206 DGYFKIKRGSNECGIEEDVVAGLP 229
           +G+FKI RG + CGIE ++VAG+P
Sbjct: 180 NGFFKILRGQDHCGIESEIVAGMP 203


>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
          Length = 247

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 94/217 (43%), Positives = 117/217 (53%), Gaps = 22/217 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGA E LSDR CI      ++ LS  DL+AC G+    GC+GG    AW Y
Sbjct: 49  QAQCGSCWAFGASETLSDRICIASDKKTDVILSPEDLVACDGW--NMGCNGGILPWAWSY 106

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G V + C PY    G            P C +KC      +   K    S  +  S
Sbjct: 107 LTNTGAVEDSCFPYSSDKGA----------VPTCAKKCQNDKDSFTKYKCKKNSVVQA-S 155

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
             + I AEI KNGP+E  FTVYEDF +Y+SGVY H TG+ +GGHAVK++G+     G+ Y
Sbjct: 156 GVDKIKAEISKNGPMETGFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGY 210

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           WI AN W+  WG  G+F I  G  ECGI+    A  P
Sbjct: 211 WICANSWSEKWGEKGFFNI--GFGECGIDSAAYACTP 245


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  156 bits (395), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 86/216 (39%), Positives = 118/216 (54%), Gaps = 17/216 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA  A E +SDR C+     +   ++D  +LACCG  CG GC+GG    AW Y
Sbjct: 117 QSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGSECGRGCNGGMDHKAWEY 176

Query: 73  FVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCVRKC-VKKNQLWR 118
               GVVT    +E   C PY      +H G       + ++ TP C + C     + + 
Sbjct: 177 VKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYE 236

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             K Y  S Y ++ D + I  E+ KNGPV+ +F  YEDF+ Y  G+Y H  G   G HAV
Sbjct: 237 KDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAV 296

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           K++GWG  ++G  YW +AN W+  WG +GYF+I RG
Sbjct: 297 KVVGWGV-ENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  156 bits (395), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 85/183 (46%), Positives = 111/183 (60%), Gaps = 17/183 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C    Y TP+C +KC K  +  + 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY   +Y + S+ + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290

Query: 179 KLI 181
           ++I
Sbjct: 291 RII 293


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score =  156 bits (395), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 86/201 (42%), Positives = 121/201 (60%), Gaps = 18/201 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF A E LSDRFCI  +  +++ LS   +L C       GCDGGY  +AW +
Sbjct: 103 QQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCDS--TDYGCDGGYLNNAWAF 160

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+ +++CDPY  ++G    G  P   T     K  K       +K  S++     S
Sbjct: 161 LAGTGIPSDKCDPY--TSGNGDVGSCPTSCTDGSAIKLYK-------AKSSSVAQL---S 208

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED- 191
             +DI  +I  NGPV+ +F+VY+DF  YKSGVY+H++G + GGHA+K++GWG + DG+D 
Sbjct: 209 SIDDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDT 268

Query: 192 -YWILANQWNRSWGADGYFKI 211
            YWI+AN WN +WG +G+F I
Sbjct: 269 PYWIVANSWNTNWGQEGFFWI 289


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  156 bits (394), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 91/196 (46%), Positives = 115/196 (58%), Gaps = 18/196 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 19  QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
             HG+V+E C PY F S  C+H         C   Y TP C   C  KK  L +   + S
Sbjct: 78  AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 135

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
                I S  E    E+  NGP EVSF+VY DF  Y  GVYKH+TG  +GGHAV+++GWG
Sbjct: 136 C----ILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG 191

Query: 185 TSDDGEDYWILANQWN 200
              +GE YW +AN WN
Sbjct: 192 EL-NGEPYWKIANSWN 206


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  156 bits (394), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 94/241 (39%), Positives = 127/241 (52%), Gaps = 27/241 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG---DGCDGGYPISA 69
           Q  C SCWA   VEA + R CI  G   N  LS  +++ACC         GC GG  ++A
Sbjct: 82  QSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMIACCNSTHSWQPRGCKGGMILNA 141

Query: 70  WRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCV--K 112
           W +   HG+ TE        C PY +   C+H        P  +  Y TP C+ +C   K
Sbjct: 142 WSFLKTHGIATEGSMSAADGCWPY-NFPKCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEK 200

Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
                   +H++  +  +    ++I  EI  NGP   +F+VYEDF  YKSGVYKH  G +
Sbjct: 201 YGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTL 260

Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           MG H+V++IGWGT + G DYW++ N WN  WG  G FKI +G  +CGI +D V G P + 
Sbjct: 261 MGIHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGI-DDAVLGSPPAM 316

Query: 233 N 233
           N
Sbjct: 317 N 317


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 125/231 (54%), Gaps = 14/231 (6%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
           +  QG CGSC       A++DR+CIH       +    DLL+CC    G    GG P   
Sbjct: 81  IRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLLSCCYECGGGCTGGGIPGPI 140

Query: 70  WRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN-- 119
           W Y+V  GV +       + C PY     C  P  E  YP  P C  +C     +  +  
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLR 199

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            + +   AY I +D   IM +I+ NGPV+  F  YED  +Y  GVY+H +G + GGHAVK
Sbjct: 200 DRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVK 259

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           LIGWG  +DG  YW++AN W R WG DG+FK+ RG N CGIEE+V AGLPS
Sbjct: 260 LIGWGV-EDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPS 309


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 99/249 (39%), Positives = 131/249 (52%), Gaps = 43/249 (17%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF   EA +DR CI    N +  LS  ++ AC       GC GG  + AW++
Sbjct: 162 QSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPGNVAACSK---TSGCHGGSSLDAWQW 218

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKCV 111
               GVVT             + C PY D   C+H       P C +  Y  P C   C 
Sbjct: 219 LHTTGVVTGGDYSAEKDMTESDGCWPY-DIPPCAHYTNSTLYPKCPKTKYDFPTCQESCP 277

Query: 112 KK--NQLWRNSKHY----SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
            K  +      +H+    S+SA R     + I  EI  NGPV  S+ VY+DF  YKSGVY
Sbjct: 278 NKKYDTPMEKDRHFVEEESLSALR---SIDAIKKEIMTNGPVSASYLVYDDFLTYKSGVY 334

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           K  + + +GGHAVK+IGW     GEDYW++ N WN++WG +G FKI  G  +CGIE++V+
Sbjct: 335 KRTSHNALGGHAVKIIGW-----GEDYWLVVNSWNKNWGDNGMFKI--GCGQCGIEDNVL 387

Query: 226 AGLPSSKNL 234
           AG P + +L
Sbjct: 388 AGTPMTSSL 396


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 20/221 (9%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V ++ DR C+  G++   +  S   +++C     GD  CDGG+  
Sbjct: 91  VVDQGGCGSCWAFSSVASVGDRRCVA-GLDKKAVRYSPQYVVSCDR---GDMACDGGWLP 146

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           S WR+ V  G  T+EC PY         G   A  T  C  KC   ++L     + +  A
Sbjct: 147 SVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL---PIYKATKA 194

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
                D + IM  +   GP++ +FTVY DF +Y+ GVY+H+ G   GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGYGTDE 254

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
              DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)

Query: 93  SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 151
           S P C     TPKC + C    +  ++  KHY  ++Y +++  +DIMAEIYKNGPVE +F
Sbjct: 8   SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67

Query: 152 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 211
           +VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI
Sbjct: 68  SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 126

Query: 212 KRGSNECGIEEDVVAGLPSSKNLVKEI 238
            RG + CGIE +VVAG+P +    ++I
Sbjct: 127 LRGQDHCGIESEVVAGIPRTDQYWEKI 153


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 19  QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
             HG+V+E C PY F S  C+H         C   Y TP C   C  K      +R +  
Sbjct: 78  AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 135

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y +S        E    E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++G
Sbjct: 136 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVG 189

Query: 183 WGTSDDGEDYWILANQWN 200
           WG   +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 122/240 (50%), Gaps = 61/240 (25%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAFGAVEA+SDR CIH   ++S+ V+  DLL CCG +CGDGC+GGYP  AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161

Query: 73  FVHHGVVTE-------ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
           +   G+V+         C PY     C H      P C     TPKC + C    +  ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             KHY  ++Y +++  +DIMAEIYKN                                  
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKN---------------------------------- 246

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
                     G  YW++AN WN  WG +G+FKI RG + CGIE +VVAG+P +    ++I
Sbjct: 247 ----------GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 296


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/241 (39%), Positives = 132/241 (54%), Gaps = 33/241 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
           QG CGSCWAF + EA +DR CI       + LS     +CC  + C   GC+GG P  AW
Sbjct: 191 QGDCGSCWAFASTEAFNDRLCIRSQGKGVMPLSTQHTTSCCNAIHCASFGCNGGQPGMAW 250

Query: 71  RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEP---AYPTPKCVRKCV 111
           R+F   GVVT            C PY +   C+H      P C+       TPKC + C 
Sbjct: 251 RWFERKGVVTGGDFDTLGKGTTCWPY-EIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCE 309

Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           +         +    H + S+Y + S  + +  ++  +G V  +F VYEDF +YKSGVYK
Sbjct: 310 EAAYSEHVLPFDKDVHKASSSYSLRSR-DAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYK 368

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+ G  +GGHA+K+IGWGT +DGE+YW   N WN  WG  G+FKI+ G  +CG++ ++VA
Sbjct: 369 HVYGGPLGGHAIKIIGWGT-EDGEEYWHAVNSWNTYWGDSGHFKIEMG--QCGVDNEMVA 425

Query: 227 G 227
           G
Sbjct: 426 G 426


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 19  QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
             HG+V+E C PY F S  C+H         C   Y TP C   C  K      +R +  
Sbjct: 78  AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 135

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y +S        E    E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++G
Sbjct: 136 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVG 189

Query: 183 WGTSDDGEDYWILANQWN 200
           WG   +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 92/236 (38%), Positives = 128/236 (54%), Gaps = 15/236 (6%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + I+  Q  C S WA  +  ++SDR CI     M + LS  +L++C     G  C  G+ 
Sbjct: 100 INIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG--CQIGFS 157

Query: 67  ISAWRYFVHHGVVTEE---CDPYF-----DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL- 116
             +W Y++ +G+VT +   C PY        +  S+P C    Y  P C + C     + 
Sbjct: 158 EFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIP 217

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KHY    Y +  +  DI  EI  NGPVE    V+ DF +YKSGVY+HITG ++  H
Sbjct: 218 YKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIH 277

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +V++IGWG  +D   YW+ AN WN  WG +GYFKI RGSNEC IE  V AG   +K
Sbjct: 278 SVRIIGWGIENDIP-YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDNK 332


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 93/240 (38%), Positives = 121/240 (50%), Gaps = 32/240 (13%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPY-----------FDSTGCSHPGCEPAYPTPKCVRKCVK 112
           G+  +AWRY    GV+ E+C PY            +S      GC+PAY         V 
Sbjct: 256 GHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKIQRHNSRSLKANGCQPAYG--------VN 307

Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--- 169
           ++ L+     YS+S         DIMAEIY +GPV+ +  +Y DF  Y  G+Y+      
Sbjct: 308 RDSLYTVGPAYSLSR------EADIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANR 361

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G   G H+VKL+GWG   DG  YWI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 362 GAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWP 421


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 123/238 (51%), Gaps = 15/238 (6%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           + ++  ++ QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 153 ASYISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNILSCTRR--QQGCNG 210

Query: 64  GYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSK 121
           G+  +AWRY    GVV E C PY      C  P    +     C     V +++L+    
Sbjct: 211 GHLDAAWRYLHKQGVVDESCYPYVGYRDACKIPHNSRSLRNNGCRSYSGVDRDELYTVGP 270

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAV 178
            YS++      +  DIMAEI+ +GPV+ + TVY DF  Y  G+Y+H     G  +G H+V
Sbjct: 271 AYSLN------NETDIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSV 324

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           KLIGWG   DG  YWI  N W   WG  G F+I RGSNECGIEE V+A  P+  N  K
Sbjct: 325 KLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEYVLAAWPNVYNYFK 382


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  CGSCWA  A  A+SDR+C   G+ +L +S  DL++CC  +CG GC+GGYP  AW Y+
Sbjct: 19  QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYY 77

Query: 74  VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
             HG+V+E C PY F S  C+H         C   Y TP C   C  K      +R +  
Sbjct: 78  AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 135

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y +S        E    E+  NGP EVSF+VY DF  Y  GVYKH+ G  +GGHAV+++G
Sbjct: 136 YVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVG 189

Query: 183 WGTSDDGEDYWILANQWN 200
           WG   +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 92/232 (39%), Positives = 130/232 (56%), Gaps = 27/232 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S +A  AV  ++DR+CIH       S    D+L+CC   CG GCDGG P + W Y
Sbjct: 157 QGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDVLSCC-HRCGFGCDGGVPSAVWHY 215

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP----TPK----------CVRKCVKK-NQLW 117
           +V +G+ +            SH GC+ +YP     P+          C+R+C    N  +
Sbjct: 216 WVENGITS-------GGAYESHEGCQ-SYPFGVCKPQEIFAPHVDLICLRQCQPGYNTTY 267

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              KH+   AY +  D + I+ E++  GPV+ SFTVY DF  YKSGVY+H  G  +G H+
Sbjct: 268 LEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHS 327

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           VK++GWG  ++G  +W+ AN W   WG +G+FKI RG +   +E +VVAGLP
Sbjct: 328 VKIVGWGV-ENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESNVVAGLP 378


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 83/215 (38%), Positives = 122/215 (56%), Gaps = 19/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCW+F   ++ S R+C  +   +  S + L+AC       GC GG  ++AWRY  
Sbjct: 88  QGKCGSCWSFAVSKSFSHRYCRKYNKPVLFSQSHLVACDRR--NSGCGGGIEVNAWRYID 145

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISAYRINS 132
             G+  + C PY         G    Y    C +KC  +++ +  + ++++S++ Y   +
Sbjct: 146 LRGLPLDSCQPY--------DGNITKY---NCSKKCTNESETYEAQFTEYWSVARY---A 191

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
             E++   I   GPV  S  VY D  +YKSG+Y H  G+ +G HAV++IGWGT  +G DY
Sbjct: 192 SIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTK-NGIDY 250

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           WI++N WN +WG +G F IKRG NEC IE+ V AG
Sbjct: 251 WIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAG 285


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  153 bits (386), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 85/236 (36%), Positives = 119/236 (50%), Gaps = 19/236 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
           + ++  Q   G CWA  + E ++DR CI       + +S  D+L+CCG  CG GC  G P
Sbjct: 110 IGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVYVSETDILSCCGQRCGSGCTSGVP 169

Query: 67  ISAWRYFVHHGV-------VTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCV 111
             A+ Y +  GV           C PY     C +    P Y        PTP C + C 
Sbjct: 170 RQAFNYAIRKGVCSGGPYGTKGVCKPY-PFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQ 228

Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
               +  N      S   + +  E I  EI+ NGP+  ++TVYEDFA+YK+G+Y    G 
Sbjct: 229 SDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVATYTVYEDFAYYKNGIYMTGLGR 288

Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             G HAVK+IGWG  ++G  YW++AN WN  WG +G+F++ RG+N C IE     G
Sbjct: 289 ATGAHAVKIIGWG-EENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATGG 343


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score =  153 bits (386), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 117/235 (49%), Gaps = 32/235 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ W        SDRF I       + LS  ++L+C       GCDGG+  +AWRY
Sbjct: 207 QGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQNILSCTRR--QQGCDGGHLDAAWRY 264

Query: 73  FVHHGVVTEECDPYFDST-----------GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK 121
              +GV+   C PY                    GC+PA+         V ++  +    
Sbjct: 265 MHKNGVLDANCYPYIQQRDTCKVQRHRGRSLKAYGCQPAHG--------VNRDNFYTVGP 316

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAV 178
            YS+S         DIMAEIY +GPV+ + TVY DF  Y SGVY+H     G   G H+V
Sbjct: 317 AYSLSR------EADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSV 370

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           KL+GWG   +G  YWI AN W   WG  GYF+I RGSNECGIEE V+A  P   N
Sbjct: 371 KLVGWGEEHNGVKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPHVYN 425


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score =  153 bits (386), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 95/243 (39%), Positives = 120/243 (49%), Gaps = 31/243 (12%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           G+  +AWRY    GVV E C PY           +S      GC PAY         V +
Sbjct: 256 GHLDAAWRYLHKKGVVDETCYPYTQRRDSCKIRHNSRSLKANGCRPAYG--------VNR 307

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
           + L+     YS+          DIMAEIY +GPV+ +  VY DF  Y  GVY+      G
Sbjct: 308 DSLYTVGPAYSLKG------ETDIMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRG 361

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
              G H+VK++GWG   DG  YWI AN W   WG  GYF+I RGSNECGIEE V+A  P+
Sbjct: 362 APTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPN 421

Query: 231 SKN 233
             N
Sbjct: 422 VYN 424


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/249 (35%), Positives = 124/249 (49%), Gaps = 19/249 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
           + N   +  +  Q  CGSCWA  A   +SDR C+     L   LS  D+L+CCG +CGDG
Sbjct: 104 WKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMCGDG 163

Query: 61  CDGGYPISAWRYFVHHGVVTE-------ECDPY-FDSTGCSHPGC-----EPAYPTPKCV 107
           C+GGY   AW +    GVVT         C PY F   G  H        + ++ TP C 
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223

Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
             C     + +   K +  S Y +++D + I  E+ KNGPV+ +F  YEDF+ YK G+Y 
Sbjct: 224 PYCQFGYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYV 283

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           H+ G   G HAVKLIGWG  ++G  YW +AN W+  WG   +      S    +   +V 
Sbjct: 284 HVKGRERGAHAVKLIGWGV-ENGTKYWTVANSWHDDWGGKRFLPYSTWSESLRVR--IVC 340

Query: 227 GLPSSKNLV 235
                +NL+
Sbjct: 341 RFRRIQNLI 349


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/230 (42%), Positives = 126/230 (54%), Gaps = 22/230 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S WAF A E +SDR CI     + + LS  DL+ CC + CG+ C GGY   AW Y
Sbjct: 95  QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHY-CGNQCKGGYTYYAWNY 153

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAY 128
           F+  G+V+     Y  STGC  P  E  Y   TP C   C   K    + + KH+  S Y
Sbjct: 154 FMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIY 210

Query: 129 RINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYK---------SGVYKHITGDVMGGHAV 178
            I  +   I  EI   G PV  +F VY DF  Y+          GVY + +G + G  AV
Sbjct: 211 YIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAV 270

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
           K+IGWGT ++G  YW+ AN W + WGA  G+FKI+RG+NECG EE ++AG
Sbjct: 271 KIIGWGT-ENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 319


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 119/229 (51%), Gaps = 12/229 (5%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I      N+ LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 123
           G+  +AWRY    GVV E C PY          C+  +        C K   + R+S + 
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYTQH----RDTCKIRHSRSLKANGCQKPVNVDRDSLYT 311

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKL 180
              AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    +     G H+VKL
Sbjct: 312 VGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKL 370

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +GWG   +GE YWI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 371 VGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWP 419


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 122/221 (55%), Gaps = 20/221 (9%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V ++ DR C   G++   +  S   +++C     GD  CDGG+  
Sbjct: 91  VVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDMACDGGWLP 146

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           S WR+    G  T+EC PY         G   A  T  C  KC   + L     + +  A
Sbjct: 147 SVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL---PIYKATKA 194

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
                D + IM  +   GP++ +FTVY DF +Y+ GVY+H  G V GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDE 254

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
              DYWI+ N W   WG DGYF+I R +NECGIEE V+ G 
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 38/236 (16%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS----LSVNDLLACCGFLCGDGCDGGYPISAW 70
           Q  CGSC+AF + +    R  +    NL+     S  D++ C  +    GCDGG+P    
Sbjct: 210 QEQCGSCYAFSSSDMFGSR--VRIPSNLTQVPVYSPQDIVDCSAY--SQGCDGGFPFLVG 265

Query: 71  RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYR 129
           +Y + +G+  E CDPY              +   KC  +C V + Q   +S +Y +  Y 
Sbjct: 266 KYAMDYGLTVESCDPY------------QGHDLGKCSNQCPVNRQQRLHSSNYYFVGGYY 313

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-------------- 175
            NS    +M EIY+NGP+ + F VY D  +YK GVYKH+T + +                
Sbjct: 314 GNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEV 373

Query: 176 --HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
             HAV ++GWG  ++G  YW + N W+ +WG +GYFKI RGS+ECG+E D  AG+P
Sbjct: 374 VNHAVLMVGWGV-ENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 133/247 (53%), Gaps = 20/247 (8%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCD 62
           N   +  +  QG CG+CWAF A EA+SDR CIH     +   S  +LL+CC   C  GC 
Sbjct: 63  NCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCD-SCEKGCL 121

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRK 109
           G     AW ++V HG+V+       E C PY     C H        C    PTP C R 
Sbjct: 122 GCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYH-LPPCEHHRAGPRRNCTKYGPTPSCARV 180

Query: 110 CVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
           C    ++ + +  H+    Y +    E I+  EI+ NGPVE +   YEDF  Y+SG+Y H
Sbjct: 181 CQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGIYHH 240

Query: 168 ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           I G  +  HAVK+IGWGT       YW++AN +N  WG  G+FKIKRG NECGIE  + A
Sbjct: 241 IEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENKITA 300

Query: 227 GLPSSKN 233
           G+P+ KN
Sbjct: 301 GIPAYKN 307


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/209 (43%), Positives = 120/209 (57%), Gaps = 19/209 (9%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
           N   +  +  Q  CGSCWAFGAVE++SDR CIH    +++ LS  +LL+CC   CG GC+
Sbjct: 104 NCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLLSCCS-RCGFGCN 162

Query: 63  GGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCE-PAYPTPKCVR 108
           GG P  AW Y+   G+VT         C PY        ST  +H  CE   Y TP+C +
Sbjct: 163 GGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQ 222

Query: 109 KCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
            C     + + N K+Y  S+Y + SD   IM EI  NGPVE +F VY+DF +YK+GVYK+
Sbjct: 223 TCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGVYKY 282

Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILA 196
           +TG ++GGHA++ I W      E Y IL 
Sbjct: 283 VTGSLLGGHAIR-ITWLGCIHIESYTILV 310


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 98/232 (42%), Positives = 130/232 (56%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L A C   CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEEL-AFCCKDCGKGCGGGYPIKAWKY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     ++  G +  G +P     +C + C  K  +   +
Sbjct: 166 FRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
           ++ + S Y INS  + I  +I   GPVE SF VY+D + YKSG+Y+        GGH++K
Sbjct: 224 RYKTKSEYVINSI-KTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG   +G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 283 IIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/197 (42%), Positives = 108/197 (54%), Gaps = 21/197 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA  A  A+SDR CIH    M   L+  D L+CC + CG GC GGYP  AW Y
Sbjct: 85  QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 143

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
           ++  G+VT         C P+   T C H G            YP P C R C    N+ 
Sbjct: 144 WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPKPPCARACQTGYNKT 202

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K Y  S+Y +      IM EI KNGPVEV+F +++DF  Y+SG+Y H+ G  +G H
Sbjct: 203 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 262

Query: 177 AVKLIGWGTSDDGEDYW 193
           AV++IGWG  ++G +YW
Sbjct: 263 AVRMIGWGV-ENGVNYW 278


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 93/239 (38%), Positives = 121/239 (50%), Gaps = 31/239 (12%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 200 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 257

Query: 64  GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           G+  +AWRY    GVV E C PY           +S      GC P+             
Sbjct: 258 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGCRPS------------- 304

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
             + R+S +    AY +N +  DIMAEIY +GPV+ +  VY DF  Y SGVY+      G
Sbjct: 305 ANVDRDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRG 363

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 364 APTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 93/239 (38%), Positives = 121/239 (50%), Gaps = 31/239 (12%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 200 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 257

Query: 64  GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           G+  +AWRY    GVV E C PY           +S      GC P+             
Sbjct: 258 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGCRPS------------- 304

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
             + R+S +    AY +N +  DIMAEIY +GPV+ +  VY DF  Y SGVY+      G
Sbjct: 305 ANVDRDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRG 363

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              G H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 364 APTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 131/232 (56%), Gaps = 20/232 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCW+F    A +DR C+  G   N  LS  +L A C   CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEEL-AFCCKDCGKGCGGGYPIKAWKY 165

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
           F   GV T       E C PY     ++  G +  G +P     +C + C  K  +   +
Sbjct: 166 FRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
           ++ + S Y +NS  + I  ++   GPVE SF VY+DF+ YKSG+Y+        GGH++K
Sbjct: 224 RYKTKSEYVMNSI-KTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIK 282

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +IGWG   +G  YW+  N W++ WG  G FKI +G NECGIE  V AG+PSS
Sbjct: 283 IIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 128/230 (55%), Gaps = 15/230 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           Q  C + WA     A+SDR+C +  G  L +S  DL+ACC   CG GC+GGYP +AW Y+
Sbjct: 113 QSACRASWAVATASAISDRYCTVGNGKQLRISAADLMACCT-GCGGGCEGGYPDAAWEYY 171

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
           V +G+ + +C PY     C H G +   P        TP C   C  K+      K+   
Sbjct: 172 VSNGITSSQCQPY-PFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTDKSVPL--IKYRGN 228

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
            +Y +  + ED   E+Y NGP  V F V+ DF  YKSGVY+H+ G+ +GG AV+++GWG 
Sbjct: 229 HSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGK 287

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
             +G  YW +AN W+  WG +GYF I RG+NEC IE    AG P +  L 
Sbjct: 288 M-NGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 81/183 (44%), Positives = 104/183 (56%), Gaps = 19/183 (10%)

Query: 20  SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWAFGA EA+SDR CI       +++S +D+L+CCG  CG+GC+GGYPI AW+Y+V  G
Sbjct: 1   SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60

Query: 78  VVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKKNQL-WRNSK 121
           + T         C PY     C H        P     Y TP C  KC+   +  + + K
Sbjct: 61  ICTGGSYESQSGCKPY-PIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDK 119

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           HY  SAY +      I  EI  NGPVE ++TVYEDF  Y  GVY H  G  +GGHAV+++
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179

Query: 182 GWG 184
           GWG
Sbjct: 180 GWG 182


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 88/217 (40%), Positives = 123/217 (56%), Gaps = 20/217 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSC++F + E +SDRFCI  +  +N+ LS  DL+ C  +    GC+GG P   + Y
Sbjct: 22  QGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSWY--SFGCNGGIPGLVFDY 79

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V++ C PY    G +H  C P +    C      K + +++ KH++   Y +  
Sbjct: 80  IHKDGLVSDACFPYLSYDGNTHVKC-PDF----CYNN---KTKSFKSDKHFADKVYHVGE 131

Query: 133 DPED-------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
             ED       I  EI  +GPV   F VY DF  YKSGVY+H TG   G HAVK+IGWGT
Sbjct: 132 FLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGT 191

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
            ++G DYW++AN W  ++G  G+FKI RG     +EE
Sbjct: 192 -ENGVDYWLIANSWGTTFGLQGFFKIVRGGKFIHLEE 227


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 103/165 (62%), Gaps = 1/165 (0%)

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 125
           +S   Y  + G +  E  P       +   C+    TP CV+KC +  ++ +    H+  
Sbjct: 18  VSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 77

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           SAY I +D + I  EIY NGPVE +FTVYEDF  Y++GVYKH+ G  +GGHA++++GWG 
Sbjct: 78  SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 137

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
            +    YW++AN WN  WG+DG+FKI RGS+ECGIE  + AGLP+
Sbjct: 138 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 182


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 87/220 (39%), Positives = 120/220 (54%), Gaps = 26/220 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
           Q  CGSCWAF +   LSDRFCIH    +N  LS  DL++C    F    GC GG    + 
Sbjct: 145 QQLCGSCWAFASSAFLSDRFCIHSEGQINEDLSPQDLVSCSYENF----GCSGGQLTESV 200

Query: 71  RYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
            + ++ G+V+E+C PY +  T C         P  K    C +K+ L             
Sbjct: 201 DFLIYEGIVSEKCKPYMNQDTYCKFKCQNDKQPYTKYF--CEQKSML------------- 245

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
           I SD E+I  E+  NGP+ V  +VYED  +YK GVY++ TG+ +GGHA+K+IGWG ++ G
Sbjct: 246 ILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKG 305

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           E +W   NQW + WG  GY  IK G  E G++  V+  +P
Sbjct: 306 ELFWKCQNQWGKDWGMGGYINIKAG--ELGMDTMVLGCMP 343


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 89/228 (39%), Positives = 124/228 (54%), Gaps = 15/228 (6%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
           L+ + H    WA  +  ++SDR CI     M + LS  +L++C     G  C  G+   +
Sbjct: 20  LLPREHYTELWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG--CQIGFSEFS 77

Query: 70  WRYFVHHGVVTEE---CDPYF-----DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
           W Y++ +G+VT +   C PY        +  S+P C    Y  P C + C     + ++ 
Sbjct: 78  WDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKA 137

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            KHY    Y +  +  DI  EI  NGPVE    V+ DF +YKSGVY+HITG ++  H+V+
Sbjct: 138 DKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVR 197

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +IGWG  +D   YW+ AN WN  WG +GYFKI RGSNEC IE  V AG
Sbjct: 198 IIGWGIEND-IPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAG 244


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 15/231 (6%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I      N+ LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
           G+  +AWRY    GVV E C PY       H          + +R   C K   + R+S 
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCKIRHNSRSLRANGCQKPVNVDRDSL 310

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAV 178
           +    AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    +     G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSV 369

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KL+GWG   +GE YWI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWP 420


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  149 bits (377), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 89/223 (39%), Positives = 120/223 (53%), Gaps = 13/223 (5%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSC+A      ++DR+CIH G            L+CC       CDGGY    + Y
Sbjct: 69  QGSCGSCYAVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCYK--CDGGYVHKTFDY 126

Query: 73  FVHHGVVTEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQLW--RNSKHYSIS 126
           +V +G+ +    PY    GC  +P     +      KC R+C     L   ++ KH + S
Sbjct: 127 WVKYGLTSG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASS 184

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
                 D   + AEIY+NGP+  SF VY DF  Y+SGVY+H+TG   G HAV++IGWG  
Sbjct: 185 YILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGV- 243

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           ++G  YW+ AN WN  WG +G+FKI RG N  G+E+   AGLP
Sbjct: 244 ENGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLP 286


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 121/238 (50%), Gaps = 30/238 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
           QG+C S WA       +DR CI      +  LS  +L++C     GDG    CDGG    
Sbjct: 87  QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 141

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
           AW   ++ G+VT       E C PY  +  C H G      C     T    C +KCV K
Sbjct: 142 AWELTMNKGIVTGGNFDSNEGCQPY-KNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 200

Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           N    + +  H +   Y  + ++ + I  EI  +GPV     VYE+F  YK G+YK  TG
Sbjct: 201 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTG 260

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +++G H VKLIGWG   DG +YW+  N WN +WG DG FKI RG N C IE  V+AG+
Sbjct: 261 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLAC-CGFLCGDGCDGGYPI 67
           +V QG CGSCWAF +V    DR CI  G++   +  S   +++C  G +    C+GG+  
Sbjct: 92  VVDQGGCGSCWAFSSVATFGDRRCIA-GLDKKPVKYSPQYVVSCDHGNM---ACNGGWLP 147

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           +AW++    G  T+EC PY   +      C    PT     KC   +     +   S   
Sbjct: 148 NAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----KCADGSSKVHLTTATSYKD 198

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  D   +M  +   GP++V+F VY DF +Y+SGVY+H  G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           DG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 257 DGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 117/221 (52%), Gaps = 18/221 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
            E +  QG CGSCWA  A E +  R  I       +S  DL++C       GC+GGY   
Sbjct: 69  AEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDLVSCESN--NMGCNGGYADR 126

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
            W +    G+ TE+C PY   +G            P C  KC   + + R+   +  S  
Sbjct: 127 VWNWIQKKGITTEQCIPYVSGSG----------RVPTCPSKCKNGSNIVRS---FVSSWG 173

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
             NS  + +M E+  NGPV   F V+EDF +Y+SGVY+H TG   G H V L+GWGT ++
Sbjct: 174 SFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGT-EN 230

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G  YW+L N W   WG  G+F+I+RG+N+C I+E   +GLP
Sbjct: 231 GVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGLP 271


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 88/267 (32%), Positives = 131/267 (49%), Gaps = 56/267 (20%)

Query: 18  CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG--FLCGDG------------- 60
           C S WAF A E++SDR CI+ G  +N  LS  +LL+CC   F CG+G             
Sbjct: 106 CKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCTGVFSCGEGDSEHWQFRNSKFR 165

Query: 61  -----------------------CDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 90
                                  C GG    AW+Y+  HG+ T         C PY  S 
Sbjct: 166 KPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISP 225

Query: 91  ------GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 142
                   + PGC      TP C +KC     +     +HY +S  ++ +   +I +++ 
Sbjct: 226 CDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVM 285

Query: 143 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 202
            NGP+  +  VY+DF  Y +G+Y H+TG+  G  +V+++GWG   +G  YW+LAN W + 
Sbjct: 286 LNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGKQ 344

Query: 203 WGADGYFKIKRGSNECGIEEDVVAGLP 229
           WG +G F++ RG NECG+E + V+G+P
Sbjct: 345 WGENGTFRVLRGVNECGLEANCVSGMP 371


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/197 (43%), Positives = 108/197 (54%), Gaps = 18/197 (9%)

Query: 20  SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  A E +SDR C+         LS  D+LACCG  CG GC+GGY   AW Y  + G
Sbjct: 1   SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYGCNGGYSARAWLYARNSG 60

Query: 78  VVT----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKH 122
           V +    +E   C PY      +      +  C +  Y TP C + C     + +   K 
Sbjct: 61  VCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKI 120

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y+  AYR++SD   I AEI+  GPV+ SF  YEDFAHYKSG+Y H  G   GGHAVK+IG
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIG 180

Query: 183 WGTSDDGEDYWILANQW 199
           WG  ++G   WI+AN W
Sbjct: 181 WGV-ENGTKXWIVANSW 196


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 120/238 (50%), Gaps = 30/238 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
           QG+C S WA       +DR CI      +  LS  +L++C     GDG    CDGG    
Sbjct: 87  QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 141

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
           AW   ++ G+VT       E C PY  +  C H G      C     T    C +KCV K
Sbjct: 142 AWELTMNKGIVTGGNFDSNEGCQPY-KNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 200

Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           N    + +  H +   Y  + ++ + I  EI   GPV     VYE+F  YK G+YK  TG
Sbjct: 201 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTG 260

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +++G H VKLIGWG   DG +YW+  N WN +WG DG FKI RG N C IE  V+AG+
Sbjct: 261 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 123/223 (55%), Gaps = 26/223 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAF   E ++DRFCI     +N  +S   +++C      +GC+GG   +A+++
Sbjct: 145 QEQCGSCWAFSISEMVADRFCIGTRGKINTIMSPQWMVSCD--TADNGCNGGEFPTAFQF 202

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----WRNSKHYSISA 127
               G+V++ C PY    G            P C   C     +      +NS+++ ++ 
Sbjct: 203 VETTGLVSDGCVPYQSGNGF----------VPPCPNSCANGEDINVRYRTKNSRNFDVN- 251

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
                D + + A I  NGPV   F VY DF +Y+SG YKH+ G ++GGHA+K++GWG + 
Sbjct: 252 -----DMKSVQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQ 305

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
               YWI+AN W+  WG +GYF I RG+NEC IEE++   +P+
Sbjct: 306 SNVPYWIVANSWSDEWGMNGYFWILRGTNECSIEENMWETIPA 348


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 121/244 (49%), Gaps = 31/244 (12%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I       + LS  ++L+C       GCDG
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKETVQLSAQNILSCTRRQ--QGCDG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           G+  +AWRY    GVV E C PY           +S      GCE    TP  V      
Sbjct: 256 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLRANGCE----TPVNVD----- 306

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV- 172
               R++ +    AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    +  
Sbjct: 307 ----RDTFYTVGPAYSLNREA-DIMAEIFNSGPVQATMRVNRDFFSYSRGVYRQTAANRE 361

Query: 173 --MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
              G H+VKL+GWG   +GE YWI AN W   WG  GYF+I RGSNECGIEE V+A  P 
Sbjct: 362 APTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEKGYFRILRGSNECGIEEYVLASWPY 421

Query: 231 SKNL 234
             N 
Sbjct: 422 VYNF 425


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V    DR C+  G++   +  S   +++C     GD  C+GG+  
Sbjct: 92  VVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDMACNGGWLP 147

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           + W++    G  T+EC PY   +      C    PT     KC   +     +   S   
Sbjct: 148 NVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLATATSYKD 198

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  D   +M  +  +GP++V+F VY DF +Y+SGVY+H  G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           DG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 257 DGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 92/239 (38%), Positives = 121/239 (50%), Gaps = 24/239 (10%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CGS W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 200 SRYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRRQ--QGCEG 257

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCEPAY---PTPKCVRKCVKKNQLW 117
           G+  +AWRY    GV+ E C PY  S G     H G   A+   P P      V ++ L+
Sbjct: 258 GHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSLKAHGCRPAPG-----VDRDSLY 312

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMG 174
                YS+S         DI AEI+ +GPV+ +  VY DF  Y  G+Y+      G   G
Sbjct: 313 TVGPAYSLSR------EADIKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTG 366

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
            H+VKL+GWG   +G+ YWI AN W   WG  GYF+I RGSNECGIE+ V+A  P   N
Sbjct: 367 FHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYVYN 425


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 93/255 (36%), Positives = 125/255 (49%), Gaps = 47/255 (18%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR CI      +  LS  ++ AC       GCDGG P  AW +
Sbjct: 165 QSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAPSF---GCDGGIPSLAWSW 221

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
             + G+ T             + C PY D   C+H       P C + +Y TP C  +C 
Sbjct: 222 VHNKGIATGGDYLAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCH 280

Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV---------------EVSFTVY 154
             K     R+ +H+ + +        D    I  +GPV                 SF VY
Sbjct: 281 NPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVY 340

Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           EDF  Y+SGVYKH +G  +GGHAVK+IGWG  + G+ YW++ N WN  WG +G FKI  G
Sbjct: 341 EDFLAYRSGVYKHTSGKELGGHAVKIIGWG-EETGQAYWLVVNSWNEDWGDNGLFKIALG 399

Query: 215 SNECGIEEDVVAGLP 229
           +  C I++D++ G P
Sbjct: 400 N--CEIDDDLLGGTP 412


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 124/240 (51%), Gaps = 27/240 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WA     A++DR CI    N++   S   L++CC   CG+GC GGY  +AWRY
Sbjct: 118 QGNCAADWAISVTSAMNDRICIASQGNITALYSPQKLVSCCED-CGNGCSGGYTAAAWRY 176

Query: 73  FVHHGVVT-------EECDPYF-----DSTGCSHP----------GCEPAYPTPKCVRKC 110
            +  G+VT       E C P+       ST  + P          G +PA  TPKC   C
Sbjct: 177 ILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA-TTPKCDLSC 235

Query: 111 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
                  +       +      D       + K+GP  V+  VYEDF  YKSGVY H+TG
Sbjct: 236 YNARHEGKYLDDIIKAKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTG 295

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           D +G  +V++IGWG  + G+ +W+LAN W  SWG  G+FKI+R  NEC IE    AG+P+
Sbjct: 296 DYLGLLSVRMIGWGL-EGGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPN 354


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 112/206 (54%), Gaps = 15/206 (7%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
           + N   +  +  Q  CGSCWA  A  A+SDR+C   G+ +L +S  DLL+CC   CG GC
Sbjct: 7   WPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRGGVRDLRISAGDLLSCCN-ACGLGC 65

Query: 62  DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKN 114
           +GG P  AW Y+V  G+V+E C PY     C+H         C   Y TP C   C    
Sbjct: 66  NGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNSTHYTPCSVEYDTPFCNITCTNTI 124

Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
              +     S S     S  ED   E++  GP EV+FTVYEDF  Y  GVYKH +G+ +G
Sbjct: 125 PPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAFTVYEDFVAYSDGVYKHFSGNALG 180

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWN 200
           GHAV+L+GWG   +G  YW +AN WN
Sbjct: 181 GHAVRLVGWGNL-NGTPYWKIANSWN 205


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 116/221 (52%), Gaps = 18/221 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
            E +  Q  CGSCWA  A E +  R  I       +S  DL++C       GC+GGY   
Sbjct: 71  AEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQDLVSCESN--NMGCEGGYADR 128

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
            W +    G+ TE+C PY   +G            P C  KC   + + R+   +  S  
Sbjct: 129 VWNWIQKKGITTEQCLPYVSGSG----------RVPTCPSKCKNGSNIVRS---FVSSWG 175

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
             NS  + +M E+  NGPV   F V+EDF +YKSG+Y+H TG   G H V L+GWGT ++
Sbjct: 176 SFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGT-EN 232

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G  YW+L N W   WG  G+F+I+RG+N+C I+E   +GLP
Sbjct: 233 GVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGLP 273


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 90/239 (37%), Positives = 119/239 (49%), Gaps = 31/239 (12%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           G+  +AWRY    GVV E C PY           +S      GC+  Y            
Sbjct: 256 GHLDAAWRYLHKKGVVDESCYPYTQQRDTCKIRHNSRSLRANGCQTPYNVD--------- 306

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
               R++ +    AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    + M
Sbjct: 307 ----RDTFYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRM 361

Query: 174 ---GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              G H+VKL+GWG   +GE YWI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 362 APTGFHSVKLVGWGEEHNGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 116/230 (50%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS W        SDRF I       + LS  ++L+C       GCDGG+  +AWR+
Sbjct: 206 QGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQNILSCTRRQ--QGCDGGHLDAAWRF 263

Query: 73  FVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
               GVV + C PY           +S      GC P+               + R+S +
Sbjct: 264 LHKKGVVDDSCYPYTQQRDTCKIRHNSRSLKANGCRPS-------------PNVDRDSFY 310

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVK 179
               AY +N +  DIMAEIY +GPV+ +  VY DF  Y  G+Y+      G   G H+VK
Sbjct: 311 TVGPAYTLNRE-GDIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVK 369

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           L+GWG   +G+ YWI AN W   WG  GYF+I RGSNECGIEE V+A  P
Sbjct: 370 LVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 419


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 89/235 (37%), Positives = 124/235 (52%), Gaps = 24/235 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WA     A++DR CI    N++   S   +L+CC   CGDGC+GGY  +AW+Y
Sbjct: 109 QGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKMLSCCDD-CGDGCNGGYSGAAWQY 167

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP----------TPKCVRKCVKKNQ 115
           ++  G+VT       E C P+     C+H   +   P          TP+C   C   N 
Sbjct: 168 WMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNY 226

Query: 116 LWRNSKHYSISAYRINSDPEDIMA-EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
                K  S    RI+     ++  E+ K+GP      VYEDF  YKSG+Y+H+TG ++G
Sbjct: 227 SKPFLKDIS-KGIRIDWHCSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLG 285

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
              VK+IGWG    G  YW+ AN W  SWG  G+FKI+RG NEC  E+  ++G P
Sbjct: 286 QITVKVIGWGVY-RGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRP 339


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 126/234 (53%), Gaps = 26/234 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD---------G 63
           Q  CGSCWAFGAVEA++DR CI  G   +  LS  DL++CC    G             G
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCEDCGGGCKGGFPGQAWDMG 171

Query: 64  GYPISAWRYFV--HHGVVTEECDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQ 115
               S WR+    H G     C PY F      T   +P C    Y TP+C + C K  +
Sbjct: 172 KTRDSHWRFRKKNHTG-----CQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYK 226

Query: 116 L-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
             +   K +   +  + ++ +    +I   GPVE +F VYEDF + KSG+ +H+TG ++G
Sbjct: 227 TPFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVG 286

Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GH +++IGWG  + G  YW++AN WN  WG +G F++ RG +EC IE  VVAGL
Sbjct: 287 GHPIRIIGWGV-EKGNPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 124/235 (52%), Gaps = 23/235 (9%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGG 64
           E +  +  Q +CGSCWA  A   ++DR CI      +  ++D  +LAC           G
Sbjct: 294 EWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQILAC-----------G 342

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVKKNQL-W 117
              S + Y+   G+ T    PY D + C      P         TP C   C     +  
Sbjct: 343 MIPSPFNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPI 400

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + K Y+   Y ++S+  +IM EIY +GPV   F VYEDF +Y SG+Y+  T   MGGHA
Sbjct: 401 SDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHA 460

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +++IGWG  ++G  YW++AN WN ++G  G+F+I+RG+NEC IE +V  G+P  +
Sbjct: 461 IRIIGWG-EENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKLR 514



 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 11/131 (8%)

Query: 60  GCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 113
           GC  G   +A+ Y+   G+VT      + C   +  + C+   C P    PKC R C   
Sbjct: 69  GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTM--CRPYMLAPKCQRTCQAS 126

Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
             L  +  K+Y  S Y +N D  DIM EIY+ GPV   F VY DF +Y SG +  I G+ 
Sbjct: 127 YNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGNK 184

Query: 173 MGGHAVKLIGW 183
                  L  W
Sbjct: 185 RCEEEENLTSW 195


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 84/197 (42%), Positives = 109/197 (55%), Gaps = 21/197 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWA   V A+SDR CIH    M   LS  DL++CC + CG+GC GG P +AW Y
Sbjct: 85  QSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDY 143

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCEPA--------YPTPKCVRKC-VKKNQL 116
           +  +G+VT         C PY     C HPG            YPTP C   C    ++ 
Sbjct: 144 WWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQLNPCPGYIYPTPSCYPYCQAGYDKT 202

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           +   K Y  ++Y ++     IM EI KNGPVE  F VY DFA YKSG+Y H++G   G H
Sbjct: 203 YEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 262

Query: 177 AVKLIGWGTSDDGEDYW 193
           A+++IGWG  ++G +YW
Sbjct: 263 AIRIIGWGV-ENGVNYW 278


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 106/206 (51%), Gaps = 20/206 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWA     ALSDR CI       +++S  D+L CC + CG GC GG+PI AW Y
Sbjct: 116 QANCGSCWAVSTAAALSDRICISTNGTKQVNISATDILTCC-YKCGYGCQGGWPIEAWEY 174

Query: 73  FVHHGVVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVK--KNQLW 117
               G VT      + C        C H G E  Y        TPKC   C    KN  +
Sbjct: 175 VAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNS-Y 233

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + K     AY + +  + I  EI KNGPV  +FTVY DF++YK G+YKH  G   G HA
Sbjct: 234 SDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHA 293

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSW 203
           VK+IGWG   D   YWI+ N W+  W
Sbjct: 294 VKVIGWGEEGD-VPYWIVKNSWHNDW 318


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
           +V QG CGSCWAF +V    DR C+  G++   +  S   +++C     GD  C+GG+  
Sbjct: 92  VVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDMACNGGWLP 147

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
           + W++    G  T+EC PY   +      C    PT     KC   +     +   S   
Sbjct: 148 NVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLATATSYKD 198

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y +  D   +M  +  +GP++V+F V+ DF +Y+SGVY+H  G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           DG DYWI+ N W   WG DGYF++ RG N+C IEE   AG 
Sbjct: 257 DGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297


>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
          Length = 351

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 118/220 (53%), Gaps = 26/220 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
           Q +CG+CWAF     L+DR CI  +  +N  LS  D++ C    F    GC+GGY ++A 
Sbjct: 140 QANCGACWAFTGSGMLADRICILTNGTINEELSPQDMVDCSHDNF----GCEGGYLMNAL 195

Query: 71  RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY-SISAYR 129
            Y ++ GV  E C PY D T              KC   C  K + +   KHY      R
Sbjct: 196 DYLMNEGVTKESCTPYKDKTN-------------KCQYTCQNKTEEFH--KHYCKPGTLR 240

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
           + ++ E I  ++ +NGP+ V  TVYEDF +Y +G YK + G+++GGHAVKL+GW T+  G
Sbjct: 241 VLTNEEQIKRDLMQNGPLMVGLTVYEDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKG 300

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           +  W++ NQWN  WG  G+  I    NE GI+   V   P
Sbjct: 301 QTSWLIQNQWNDDWGEQGFGYIL--ENEVGIDSIGVGCTP 338


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 123/227 (54%), Gaps = 17/227 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +V QG CGS WA       SDRF I      N+ LS   LL+C       GC GG+ 
Sbjct: 204 ISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLSPQTLLSC-NVRAQQGCHGGHI 262

Query: 67  ISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRK-CVKKNQLWRNSKHYS 124
             AW +   HG+V E+C PY  S T C      P  P    ++  C+    + R +  Y 
Sbjct: 263 DVAWNFARGHGLVDEKCFPYKASVTRC------PFRPRGNLIQDGCMP--LVKRRTSRYK 314

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKLI 181
           +      S  +DIM +I ++GPV+   TVY+DF HY+ GVY+   H   ++ G H+V++I
Sbjct: 315 LGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRII 374

Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GWG  D G+ YW++AN W R WG +GYF+I RGSNE  IE  VV GL
Sbjct: 375 GWG-EDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/238 (37%), Positives = 120/238 (50%), Gaps = 15/238 (6%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
           G+  +AWRY    GVV E C PY       H          + +R   C     + R++ 
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPY-----TQHRDTCKIRHNSRSLRANGCQTPVNVDRDTL 310

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAV 178
           +    AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    +     G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSV 369

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
           KL+GWG   +GE YWI AN W   WG  GYF+I RGSNECGIEE V+A  P   N  K
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWPYVYNYYK 427


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/232 (38%), Positives = 123/232 (53%), Gaps = 29/232 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CG+ WAF      +DR  IH    ++  LSV +L++C       GC+GG   SAWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLISC-DTRNQHGCNGGNIDSAWRY 300

Query: 73  FVHHGVVTEECDPYF-----DSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWRNS 120
              HGVV+  C P F     + +G +H      Y       P P  + K    N+L+R +
Sbjct: 301 LKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYGKNYTNGPCPNALEK---SNRLYRCA 357

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAV 178
            HY     R++S   +IM EI   GPV+    VYEDF  YK G+Y+H    G     H+V
Sbjct: 358 SHY-----RVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRHSQKAGSKWKTHSV 412

Query: 179 KLIGWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           KL+GWG   D     + +WI AN W +SWG +GYF+I RG NEC IE+ ++A
Sbjct: 413 KLLGWGALADKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILA 464


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 95/249 (38%), Positives = 122/249 (48%), Gaps = 36/249 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CG CWAF   EA SDR CI  G  + + LS  D+   C     DGCDGG  I+ W Y
Sbjct: 47  QSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQDV---CFNANVDGCDGGQIITPWTY 103

Query: 73  FVHHGVVTEE------------CDPYFDSTGCSHPGCE-------------PAYPTPKCV 107
               G VT              C  +F +  C H G               P+  +P+  
Sbjct: 104 VAKAGAVTGGQYNGTGPFGAGLCADWF-APHCHHHGPRGDDPYPAEGDAGCPSEKSPEGP 162

Query: 108 RKC----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
           + C       +  +   KH      +  S    IMA I + GPVE +FTVYEDF +Y  G
Sbjct: 163 KACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGG 222

Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           +Y H+TG+  GGHAVK +GWG  ++G  YW +AN WN  WG  GYF+I RGSNE GIE+ 
Sbjct: 223 IYHHVTGEEAGGHAVKFVGWGV-ENGTKYWKVANSWNPYWGEAGYFRILRGSNEGGIEDQ 281

Query: 224 VVAGLPSSK 232
           V      +K
Sbjct: 282 VTGSHADAK 290


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 68/137 (49%), Positives = 93/137 (67%), Gaps = 13/137 (9%)

Query: 97  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 156
           CE  Y T             ++  KHY  ++Y ++   ++IMAEIYKNGPVE +FTV+ D
Sbjct: 2   CEAGYSTS------------YKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 49

Query: 157 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 216
           F  YKSGVYKH  GDVMGGHA++++GWG  ++G  YW++AN WN  WG +G+FKI RG N
Sbjct: 50  FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGEN 108

Query: 217 ECGIEEDVVAGLPSSKN 233
            CGIE ++VAG+P ++ 
Sbjct: 109 HCGIESEIVAGIPRTQQ 125


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/264 (35%), Positives = 127/264 (48%), Gaps = 51/264 (19%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGD-GCDGGYPISA 69
           Q  CGSCWA   VEA + R CI  G   N  LS  ++LACC  +  C   GC GG   +A
Sbjct: 82  QSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMLACCNSVHSCNSHGCQGGIARAA 141

Query: 70  WRYFVHHGVVT-------------EECDPY------FDSTGCSHPGC------------- 97
           W +   HG+VT             + C PY       D     +  C             
Sbjct: 142 WSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVRVPPLGERHQ 201

Query: 98  --------EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGP 146
                   +  Y TP C+ +C   K        +H++  A   +    ++I  EI  NGP
Sbjct: 202 RGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNIKKEIMTNGP 261

Query: 147 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 206
              SF+ YEDF+ YKSGVYKH +G  +G H+V++IGWGT + G DYW++ N WN  WG  
Sbjct: 262 TSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDH 320

Query: 207 GYFKIKRGSNECGIEEDVVAGLPS 230
           G FKI +G  +CGI++ V   LP+
Sbjct: 321 GTFKIAQG--DCGIDDAVQGSLPA 342


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 95/276 (34%), Positives = 128/276 (46%), Gaps = 68/276 (24%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR CI      +  LS  ++ AC       GC+GG+P SAW +
Sbjct: 560 QSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAP---SHGCNGGFPNSAWSW 616

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSH-------PGC--------------- 97
               G+ T             + C PY D   C+H       P C               
Sbjct: 617 VHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYPECPKVSCSGESPPATAE 675

Query: 98  -------EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV- 147
                  + +Y TP C  +C   K     R+ +H+ + +        D    I  +GPV 
Sbjct: 676 TATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVG 735

Query: 148 --------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
                           SF+VYEDF  YKSGVYKH +G+ +GGHAVK+IGWG  + G+ YW
Sbjct: 736 PIYFCDPNVNFDQVSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWG-EESGQAYW 794

Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           I+ N WN  WG  G FKI  G+  CGI+++++ G P
Sbjct: 795 IVVNSWNEDWGDHGLFKIALGN--CGIDDNLLGGTP 828


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)

Query: 69  AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 114
           AW+YF   GVVT         C PY +   C   G EP Y        TPKC + C +  
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59

Query: 115 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
            + ++  KH+  SAYR+ ++ + I  +I KNGPV   F VYEDFAHYKSG+YKH  G + 
Sbjct: 60  LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GGHAVK+IGWG  + G  YW++AN W+  WG  G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/193 (44%), Positives = 111/193 (57%), Gaps = 22/193 (11%)

Query: 20  SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWAFGAVEA+SDR CI       ++LS  DLL+CC   CG GC+GG P+SAW+++V  G
Sbjct: 1   SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEG 59

Query: 78  VVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNS 120
           +VT         C PY     C H        P     +PTPKC + C      + ++  
Sbjct: 60  IVTGSNHSTNAGCKPY-PFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKED 118

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
           K++  SAY + +  E I  EI   GPVEV+F VYEDF +Y  G+Y H  G + GGHAVK+
Sbjct: 119 KYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKM 178

Query: 181 IGWGTSDDGEDYW 193
           IGWG  D+G  YW
Sbjct: 179 IGWGI-DNGVPYW 190


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/231 (37%), Positives = 119/231 (51%), Gaps = 15/231 (6%)

Query: 6   SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
           S ++  +  QG CG+ W        SDRF I       + LS  ++L+C       GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
           G+  +AWRY    GVV E C PY       H          + +R   C     + R++ 
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCKIRHNSRSLRANGCQTPVNVDRDTL 310

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAV 178
           +    AY +N +  DIMAEI+ +GPV+ +  V  DF  Y  GVY+    +   + G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSV 369

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           KL+GWG   +GE YWI AN W   WG  GYF+I RGSNECGIE+ V+A  P
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLASWP 420


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/241 (39%), Positives = 117/241 (48%), Gaps = 30/241 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
           Q  CGS     AVE  SDR CI      N  LS  D L+CC  L   CGDG  CDG +P 
Sbjct: 113 QSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 172

Query: 68  SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
              +++  HG+ T                 CD  + +   S P   P Y TP C   C  
Sbjct: 173 DILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTTSVPC--PGYHTPPCEDHCTS 230

Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
            N  W    +  KH+  + Y +     DI  EI  NGPV  SF +YEDF  YKSG+Y H 
Sbjct: 231 -NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIYVHT 289

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            GD  GG   K+IGWG  D+G  YW+  +QW   +G +G+ +I RG NE  IE  V+A L
Sbjct: 290 AGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIEHQVLAAL 348

Query: 229 P 229
           P
Sbjct: 349 P 349


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 115/209 (55%), Gaps = 21/209 (10%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
           + N   +  +  Q  C SCWA  +  A++DR CIH        LS  D+++CC + CG G
Sbjct: 73  WPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 131

Query: 61  CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEPA----YPTPK 105
           C+GG P  +W Y+   GVVT         C PY     CSH    PG  P     YPTPK
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 190

Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           C +KC    N+ +   K    S+Y +     DIM EI KNGPV+  F ++EDF  YKSG+
Sbjct: 191 CEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGI 250

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
           Y + TG ++GGHA+++IGWG  ++G +YW
Sbjct: 251 YHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/230 (41%), Positives = 120/230 (52%), Gaps = 26/230 (11%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
           ++ QG+C S WAF  V   SDR  I       ++LS   LL+C       GC GG+   A
Sbjct: 196 ILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQHLLSC-NTRGQRGCSGGHIDRA 254

Query: 70  WRYFVHHGVVTEECDPYF----DSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
           W +    GVV+ +C PY     D  G C  PG  P+     C     + N+L     H+S
Sbjct: 255 WWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPS----DCPTGRERNNEL-----HHS 305

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHA---- 177
              YRI ++  +I  EI +NGPV+ SF V EDF  Y SGVY+H    + D    HA    
Sbjct: 306 TPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWH 365

Query: 178 -VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            VKL+GWG  ++G  YW+ AN W   WG DGYFKI RG NEC IE  VVA
Sbjct: 366 SVKLLGWGV-ENGIKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVA 414


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 225 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 283

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 284 LRKRGLVSHACYPLFKDQHATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 340

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YRI+S+  +IM EI +NGPV+    V+EDF HYKSG+Y+H+            +  
Sbjct: 341 --PPYRISSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRT 398

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL+GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 399 HAVKLLGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 88/224 (39%), Positives = 124/224 (55%), Gaps = 20/224 (8%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           V  ++ Q  CGSCWAF + EALSDR CI     +N++LS   L+A C  +   GC+GG P
Sbjct: 109 VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQALVA-CDDIGNQGCNGGVP 167

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 125
             AW Y    G+ T EC PY    G              C R+C   + + +  +K +S+
Sbjct: 168 QLAWEYMEWKGLPTFECYPYTAGNGTDG----------TCQRQCADGSAMTYYRAKPFSM 217

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWG 184
           +     +    I  EI   GPV  +  VY+DF  Y SGVY +  T +++GGHA++++GWG
Sbjct: 218 TTC---NSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGHAIEIVGWG 274

Query: 185 TSDDGE-DYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVA 226
           T    + DYWI+ N W+ +WG  DGYF I+RG+N CGI+ D  A
Sbjct: 275 TDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTNMCGIDHDASA 318


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score =  143 bits (361), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S   +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/215 (38%), Positives = 113/215 (52%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWA    EA+ D   I      ++SV DL++C        C+GG    A  Y V
Sbjct: 83  QASCGSCWAHSVAEAMGDAQNIAGCPRGAMSVQDLVSCDK--TDSACNGGDMKKAQEYLV 140

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
             G+ TE C  Y   +G            P C  KC   +Q+ R    Y + +++ + +P
Sbjct: 141 KTGITTEACVKYVSGSG----------RVPACPSKCDNGSQIIR----YKLQSWK-SVEP 185

Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
            +IM  + + GP+   F VY DF +Y+SGVY+H +G   GGHAV L GWG  ++G  YW+
Sbjct: 186 SEIMQALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGV-ENGLPYWL 244

Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           + N W  +WG  G+FKI RGSN C IE  V  G+P
Sbjct: 245 VQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVP 279


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 94/242 (38%), Positives = 118/242 (48%), Gaps = 38/242 (15%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD----GCDGGYPIS 68
           QG+C S WA       +DR CI      +  LS  +L++C     GD    GCDGG    
Sbjct: 87  QGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNLMSC-----GDDEKLGCDGGSAYK 141

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
           AW + +  G+VT       E C PY  +  C H G      C     T    C  KCV K
Sbjct: 142 AWEFTMGKGIVTGGPYDSNEGCQPY-KNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNK 200

Query: 114 N-------QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
           N        L++ S  Y  S     ++ + I  EI   GPV     VYE+F  YK GVYK
Sbjct: 201 NYKVKYEDDLYKTSVVYMTSW----TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYK 256

Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
              G+++G H VKLIGWG  + G +YW+  N WN +WG DG FKI RG N C IE  V+A
Sbjct: 257 STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMA 316

Query: 227 GL 228
           GL
Sbjct: 317 GL 318


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 129 QKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 187

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 188 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFEKSNRIYQCS--- 244

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  
Sbjct: 245 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRT 302

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 303 HAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S   +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 84/200 (42%), Positives = 111/200 (55%), Gaps = 22/200 (11%)

Query: 20  SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA     A+SDR CI       + +S  D+++CC + CG GC+GG+PI AW+Y V  G
Sbjct: 1   SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEG 59

Query: 78  VVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKH 122
           VVT      +EC   ++   C + G EP Y        TP C ++C    KN    + K 
Sbjct: 60  VVTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMD-KR 118

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y  SAY + +    I  +I +NGPV   F VYEDF +YKSG+Y+H  G   GGHAVK+IG
Sbjct: 119 YGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIG 178

Query: 183 WG---TSDDGEDYWILANQW 199
           WG   T +    YWI+AN W
Sbjct: 179 WGEEXTENGTIPYWIIANSW 198


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 93/243 (38%), Positives = 120/243 (49%), Gaps = 28/243 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
           Q  CGS     AVE  SDR CI  +   N  LS  D L+CC  L   CGDG  CDG +P 
Sbjct: 114 QSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 173

Query: 68  SAWRYFVHHGVVTEE-------CDPYFD-------STGCSHPGCEPAYPTPKCVRKCVKK 113
              +++  HG+ T         C PY         + G +   C P Y TP C   C   
Sbjct: 174 DILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPC-PGYHTPTCEEHCTS- 231

Query: 114 NQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
           N  W    +  KH+  + Y +     DI  EI  NGPV  SF +Y+DF  YK+G+Y H  
Sbjct: 232 NITWPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTA 291

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GD  GG   K+IGWG  D+G  YW+  +QW   +G +G+ +  RG NE  IE  V+A LP
Sbjct: 292 GDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350

Query: 230 SSK 232
            S+
Sbjct: 351 DSE 353


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 120/237 (50%), Gaps = 30/237 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 234 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCP-KNRHGCNSGSIDRAWWF 292

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F +   ++ GC  A         + T  C     K N++++ S   
Sbjct: 293 LRKRGLVSHACYPLFKNQNATNHGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 349

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---------MG 174
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+HIT            + 
Sbjct: 350 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQ 407

Query: 175 GHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 408 THAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
          Length = 345

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 88/216 (40%), Positives = 123/216 (56%), Gaps = 24/216 (11%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
           ++ Q +CGSCWA  AV  L +RFCI  G  +N+  S  D+++C   L    C+GGY  S+
Sbjct: 137 ILDQANCGSCWAHAAVTMLQNRFCIKSGGSINMQFSRQDMVSCD--LGNAACNGGYLSSS 194

Query: 70  WRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY--SISA 127
            +Y    GVV+E+C  Y  + G S          P+C  +C  K+  +   K Y    ++
Sbjct: 195 VQYLQTEGVVSEQCLAYASADGNS---------VPRCNYRCDDKSLEY---KKYGCKYNS 242

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--GGHAVKLIGWGT 185
            +I +  EDI  EIY NGPV V F VY+DF+ Y +G+Y+ +T D +  GGHAV L GWG 
Sbjct: 243 MKILTTYEDIKEEIYTNGPVMVGFVVYDDFSSYSTGIYE-VTPDSVEEGGHAVTLNGWGY 301

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
            D+G  YWI  NQW  +WG  G+F+I  G  E GI+
Sbjct: 302 -DNGRLYWIGQNQWQNTWGESGFFRIYAG--EAGID 334


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/194 (42%), Positives = 109/194 (56%), Gaps = 22/194 (11%)

Query: 20  SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  +  A+SDR CI       + +S  D+++CC + CG GC GG+ I AW YF   G
Sbjct: 1   SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQG 59

Query: 78  VVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKH 122
           VVT         C PY +   C +   EP Y        TP+C R+C +   + + + KH
Sbjct: 60  VVTGGNYNTKGSCRPY-EIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKH 118

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y  +AY++    E I  EI +NGPV   FTVYEDFAHYK G+YKH +G   GGHAVK+IG
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIG 178

Query: 183 WGTSDDGED---YW 193
           WG+   G +   YW
Sbjct: 179 WGSEQKGSEKIPYW 192


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/239 (38%), Positives = 124/239 (51%), Gaps = 12/239 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  C + WA      +SDR+C   G+  L +S   LL+CC   CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
           G+P  AWRY+V +G+ +  C PY         + G   P  +  + TPKC   C  K+  
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+H+ GD +GG 
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGT 278

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 89/251 (35%), Positives = 120/251 (47%), Gaps = 29/251 (11%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCI---HFGMNLSLSVNDLLACCGFLC----GD----- 59
           ++ QG CGSCWAF     L+ R CI     G    L+   L++C   +C    GD     
Sbjct: 111 ILQQGSCGSCWAFATTGVLAQRMCIKSEQIGQGYELAPQALVSCTDQICYTKAGDRCSSP 170

Query: 60  --------GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 111
                   GCDGGYP  A+R+    G+  E C  Y    G     C         V +C 
Sbjct: 171 SSTCYCSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECT 227

Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HIT 169
             +    N        Y  +SD E I  +I ++GPV  S+ V+EDF  Y SGVY      
Sbjct: 228 ATSNATVNGDR---CYYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDG 284

Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            D +G HAV ++GWG  +D   YW++ N W   +G DGYFKI RG+NEC IE  +V  L 
Sbjct: 285 SDSIGWHAVIIVGWGV-EDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSLV 343

Query: 230 SSKNLVKEITS 240
           +++ +V   TS
Sbjct: 344 NTEGVVFASTS 354


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 87/231 (37%), Positives = 118/231 (51%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CG+ WAF      +DR  IH    ++  LS  +L++C       GC+GG    AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLISC-DTRNQHGCNGGSIDGAWRY 300

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
              HGVV+  C P F +           Y + +         C     K N+L+R + HY
Sbjct: 301 LKTHGVVSYACYPSFWNKHLGPSAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHY 360

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
                R++S   DIM EI   GPV+    VYEDF  YK G+Y+H    G     H+VKL+
Sbjct: 361 -----RVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLL 415

Query: 182 GWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GWG   D     + +WI AN W +SWG +GYF+I RG NEC IE+ ++A L
Sbjct: 416 GWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 118/230 (51%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  I     M  SLS  +LL+C       GC GG    AW Y
Sbjct: 241 QGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWY 299

Query: 73  FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISA 127
               GVV+E C P+   ++ G S P    +    +  R+      NQ + +++ Y S  A
Sbjct: 300 LRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPA 359

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
           YR+ S  +DIM E+Y+NGPV+    V+EDF  YKSG+Y+               G H+VK
Sbjct: 360 YRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVK 419

Query: 180 LIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           + GWG     DG+   YW+ AN W R WG DGYF+I RG NEC IE  +V
Sbjct: 420 ITGWGEERGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 469


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/217 (38%), Positives = 103/217 (47%), Gaps = 18/217 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   AW +
Sbjct: 152 QQTCGACWAFSATYVLAHRLCIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYAWSF 209

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G   + C PY         G  PA        KC    Q   +   Y     R  S
Sbjct: 210 LERTGTTVDSCIPYASGRATFSSGTCPA--------KCKVSTQ---SMTMYKAKNSRYIS 258

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I A I   G V+  FT+Y DF  Y+SGVYKH++   +GGHAV LIGWG  + G +Y
Sbjct: 259 GVNNIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGV-ESGTNY 317

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           W+  N W  +WG  GYFKI +G  ECGIE  V AG P
Sbjct: 318 WLAVNSWGSNWGMSGYFKIAQG--ECGIENQVYAGEP 352


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 88/220 (40%), Positives = 113/220 (51%), Gaps = 22/220 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYPISAWR 71
           QGHCGSCWAF A  A  DR C+  G++ S  V         C +L   GC GG   S W 
Sbjct: 98  QGHCGSCWAFSATSAFGDRRCMQ-GLD-SAGVPYSQQYTISCDYL-DLGCAGGLSFSVWT 154

Query: 72  YFVHHGVVTEECDPYFDSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           +   HG  T EC PY D+    S P          C   C   +++ R  K      Y  
Sbjct: 155 FLTEHGTTTLECVPYTDANKDISSP----------CPDACADGSEI-RLVKADGCLDYSG 203

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
           N     IM  +  +GPV+ S  VY DF +Y+SGVY+H+ G  +  HAV++IG+G +DD +
Sbjct: 204 NVTA--IMQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAVEIIGYGAADDED 261

Query: 191 D--YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
              YWI+ N     WG +GYF I RGSNEC IE  V +GL
Sbjct: 262 STPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 116/230 (50%), Gaps = 20/230 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M  SLS  +LL+C       GC GG    AW Y
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLLSC-DTRNQRGCSGGRLDGAWWY 280

Query: 73  FVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISA 127
               GVVT+EC P+   DS   + P    +  T +  R+   +    Q   N  + S  A
Sbjct: 281 LRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQTHANDIYQSTPA 340

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
           YR+    ++IM E+ +NGPV+    V+EDF  YKSG+Y+H              G H+VK
Sbjct: 341 YRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVK 400

Query: 180 LIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           + GWG     DG+   YW  AN W R+WG DG+F+I RG NEC +E  VV
Sbjct: 401 ITGWGEEQLPDGQVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVV 450


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 116/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCS-KNRPGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     +  GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATSNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSANKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/239 (38%), Positives = 125/239 (52%), Gaps = 12/239 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  C + WA      +SDR+C   G+  L +S   LL+CC   CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
           G+P  AWRY+V +G+ +  C PY         + G   P  +  + TPKC   C  K+  
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD++GG 
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQ 278

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           AV+++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 59/113 (52%), Positives = 89/113 (78%), Gaps = 1/113 (0%)

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
           ++  KH+  S+Y + ++ ++IMAEIYKNGPVE +F+VY DF  YKSGVY+H++G++MGGH
Sbjct: 6   YKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGH 65

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           A++++GWG  ++G  YW++ N WN  WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 66  AIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 117/241 (48%), Gaps = 30/241 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
           Q  CGS     A E  SDR CI      N  LS  D L+CC  L   CGDG  CDG +P 
Sbjct: 116 QSDCGSAAHLVAAEIASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 175

Query: 68  SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
              +++  HG+ T                 CD  + +   S P   P Y TP C  +C  
Sbjct: 176 DILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTTSVPC--PGYHTPVCEERCTS 233

Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
            N  W    +  KH+  + Y +     DI  EI +NGPV  SF +Y+DF  YKSG+Y H 
Sbjct: 234 -NITWPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHT 292

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            GD  GG   K+IGWG  D+G  YW+  +QW   +G +G+ +I RG NE  IE  V+A  
Sbjct: 293 AGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIEHQVLAAQ 351

Query: 229 P 229
           P
Sbjct: 352 P 352


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 72/139 (51%), Positives = 89/139 (64%), Gaps = 3/139 (2%)

Query: 94  HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 151
           +P C+  Y  P C ++C K + L +   KHY+  AYRI S  E  I  EI KNGPV  SF
Sbjct: 121 NPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF 180

Query: 152 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
           TVY DF HY SGVYK      ++GGHAV++IGWG  +    YW+++N WN  WG  G FK
Sbjct: 181 TVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFK 240

Query: 211 IKRGSNECGIEEDVVAGLP 229
           I RG NECGIEE++ AGLP
Sbjct: 241 IWRGKNECGIEEEITAGLP 259


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 92/241 (38%), Positives = 115/241 (47%), Gaps = 30/241 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
           Q  CGS     AVE  SDR CI      N  LS  D L+CC  L   CGDG  CDG +P 
Sbjct: 116 QSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 175

Query: 68  SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
              +++  HG+ T                 CD  + +   S P   P Y TP C   C  
Sbjct: 176 DILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTSVPC--PGYHTPTCEEHCTS 233

Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
            N  W    +  KH+  + Y +     DI  EI  NGPV  SF +Y+DF  YKSG+Y H 
Sbjct: 234 -NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVHT 292

Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
            GD  GG   K+IGWG  D G  YW+  +QW   +G +G+ +  RG NE  IE  V+A L
Sbjct: 293 AGDQEGGMDTKIIGWGV-DSGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAAL 351

Query: 229 P 229
           P
Sbjct: 352 P 352


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 120/231 (51%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISA-Y 128
               G+V+  C P F     ++ GC  A  +    ++   K   N + ++++ Y  S  Y
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRDATKPCPNNVEKSNRIYQCSPPY 355

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKL 180
           R++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  HAVKL
Sbjct: 356 RVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 415

Query: 181 IGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ V+A 
Sbjct: 416 TGWGTLRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 466


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 119/234 (50%), Gaps = 29/234 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q +C + WAF      +DR    +  NLS    +L++CC      GC+ G    AW +  
Sbjct: 237 QKNCAASWAFSTASVAADRIXGRYTANLS--PQNLISCCA-KNRHGCNSGSIDRAWWFLR 293

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSI 125
             G+V+  C P F     ++ GC  A         + T  C     K N++++ S     
Sbjct: 294 KRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS----- 348

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
             YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T           +  HA
Sbjct: 349 PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHA 408

Query: 178 VKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +KL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 409 IKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 88/226 (38%), Positives = 115/226 (50%), Gaps = 17/226 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  I     M  SLS  +LL+C       GC GG    AW Y
Sbjct: 256 QGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWY 314

Query: 73  FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISA 127
               GVV+E C P+   ++ G S P    +    +  R+      NQ + +++ Y S  A
Sbjct: 315 LRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPA 374

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
           YR+ S  +DIM E+Y+NGPV+    V+EDF  YKSG+Y+H              G H+VK
Sbjct: 375 YRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVK 434

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           + G G       YW+ AN W R WG DGYF+I RG NEC IE  +V
Sbjct: 435 ITG-GRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 479


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 87/251 (34%), Positives = 124/251 (49%), Gaps = 36/251 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+HIT              
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A     
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467

Query: 232 KNLVKEITSAD 242
                ++TSAD
Sbjct: 468 ----GQLTSAD 474


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNDGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+T              
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+HIT           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W  SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKLIIAA 466


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 120/235 (51%), Gaps = 28/235 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I       ++LS  +L++CC      GC GG    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLISCC-LKHRYGCSGGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFD----STGCSHP----GCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
               G+V+  C P F     + GC+      G    + T  C     K N++++ S    
Sbjct: 296 LRKRGLVSHACYPLFKDQNSTNGCAMASRSDGRGKRHATTPCPNNIEKSNRIYQCS---- 351

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGH 176
              YR++S+   IM EI KNGPV+    V+EDF +YK+G+Y+H+T  +        +  H
Sbjct: 352 -PPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTH 410

Query: 177 AVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           AVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 AVKLTGWGTLRGAKGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 86/231 (37%), Positives = 116/231 (50%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CG+ WAF      +DR  IH    ++  LSV +L++C       GC GG    AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLISC-DTKNQHGCGGGNIEGAWRY 300

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
              HGVV+  C P F       P     Y + +         C       N+L+R + HY
Sbjct: 301 LKTHGVVSYACYPSFWKHSLDSPSENHCYVSSEYGKNHTNGPCPNALEDSNRLYRCASHY 360

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
                RI+S   DIM EI   GPV+    VYEDF  YK G+Y+H    G     H+VKL+
Sbjct: 361 -----RISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLL 415

Query: 182 GWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GWG+    +   + +WI AN W + WG +GYF+I RG NEC IE+ ++  L
Sbjct: 416 GWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 116/231 (50%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CG+ WAF      +DR  IH    ++  LSV +L++C       GC+GG    AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISC-DTGNQRGCNGGSIDGAWRY 300

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
              HGVV+  C P F       P     Y + +         C       N+L+R   HY
Sbjct: 301 LTTHGVVSYACYPSFWKHHLDSPSENQCYVSSEYGKNHTNGPCPNALEDSNRLYRCGSHY 360

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
                R++S   DIM EI   GPV+    VYEDF  YK G+Y+H    G     H+VKL+
Sbjct: 361 -----RVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLL 415

Query: 182 GWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           GWG+    +   + +WI AN W + WG +GYF+I RG NEC IE+ ++  L
Sbjct: 416 GWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 116/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC  G    AW Y
Sbjct: 186 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCSSGSIDRAWWY 244

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P+      ++  C  A         + T  C     K N++++ S   
Sbjct: 245 LRKRGLVSHACYPFLKDQNTTNNACAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 301

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI  NGPV+    V+EDF HYKSG+Y+H+T           +  
Sbjct: 302 --PPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQT 359

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI+AN W  SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 360 HAVKLTGWGTLRGAQGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 87/251 (34%), Positives = 124/251 (49%), Gaps = 36/251 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+HIT              
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A     
Sbjct: 411 HAVKLTGWGTLRGAHGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467

Query: 232 KNLVKEITSAD 242
                ++TSAD
Sbjct: 468 ----GQLTSAD 474


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 15/171 (8%)

Query: 72  YFVHHGVVT-------EECDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
           Y V  G+VT         C PY F      T   +P C    Y TP+C +KC K  +  +
Sbjct: 9   YLVKRGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPY 68

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
              K+Y    Y + S+ + I  EI  NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA
Sbjct: 69  EQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHA 128

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           +++IGWG  +    YW++AN WN  WG  G F+I RG +EC IE +VVAGL
Sbjct: 129 IRIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 178


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 113/210 (53%), Gaps = 21/210 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGS WA     A +DR C+  +   N  LS  ++  CC   CG+GC+GGYPI AW+ 
Sbjct: 50  QGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKR 108

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
           F +HG+VT       E C+PY      +D  G +    +P  P  KC +KC     +  N
Sbjct: 109 FKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFN 168

Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
             H Y+   Y +      I  ++   GP+E SF VY+DF +YKSG+Y K      +GGH+
Sbjct: 169 KDHRYTRDDYYLTY--RGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASYLGGHS 226

Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADG 207
           VKLIGWG  + G  YW++ N WN  WG  G
Sbjct: 227 VKLIGWG-EEYGVLYWLMVNSWNADWGDKG 255


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 114/222 (51%), Gaps = 28/222 (12%)

Query: 3   FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
           + N   +  +  Q + GSCWA  A E +SDR C+     +   ++D  +LACCG  CG G
Sbjct: 105 WKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164

Query: 61  CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGCEP-----------AYP 102
           C+GG    AW Y    GVVT    +E   C PY       HP CE            ++ 
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYH-----LHP-CEITGKFWSCPRDHSFR 218

Query: 103 TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
           TP C + C     + +   K Y  S Y ++ D + I  E+ KNGPV+ +FT YEDF+ Y+
Sbjct: 219 TPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYR 278

Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
            G+Y H  G   G HAVK++GWG  ++G  YW +AN W+  W
Sbjct: 279 KGIYVHSYGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDW 319


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I      +  LS  +L++CC      GC GG    AW Y
Sbjct: 229 QKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQNLISCC-VKNRHGCKGGSIDRAWWY 287

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC+ A         + T  C     K N++++ S   
Sbjct: 288 LRKRGLVSHACYPLFKDQIFNNNGCDMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 344

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYKSG+Y+HI            +  
Sbjct: 345 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRT 402

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWG         E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 403 HAVKLTGWGVLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score =  139 bits (351), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 108/199 (54%), Gaps = 19/199 (9%)

Query: 48  DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 94
           +L  CC   CG GC GGYPI AW+ F +HG+VT       E C+PY      +D  G + 
Sbjct: 1   ELTFCC-HTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNT 59

Query: 95  PGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 153
              +P     +C R C    +L  +  H Y+   Y +      I  ++   GP+E SF V
Sbjct: 60  CAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDV 117

Query: 154 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 212
           Y DF  YKSG+Y+       +GGHAVKLIGWG    G  YW++ N WN  WG +G FKI+
Sbjct: 118 YSDFPSYKSGIYERTENATYLGGHAVKLIGWG-EQYGIPYWLMVNSWNEDWGDNGLFKIR 176

Query: 213 RGSNECGIEEDVVAGLPSS 231
           RG+NECG++    AG+P +
Sbjct: 177 RGTNECGVDNSTTAGVPVT 195


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 241 QKNCAASWAFSTASVAADRIAIQSNGRFTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 299

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++  C  A         + T  C     K N++++ S   
Sbjct: 300 LRKRGLVSHACYPLFKDQNATNNDCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 356

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V++DF HYK G+Y+H+T           +  
Sbjct: 357 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRT 414

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HA+KL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 415 HAIKLAGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 86/243 (35%), Positives = 118/243 (48%), Gaps = 43/243 (17%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WAF      SDR  I     M   LS  +L++C     G GC GG    AW Y
Sbjct: 247 QGNCAASWAFSTAAVASDRISIQSMGHMTPRLSPQNLISCDTRNQG-GCAGGRIDGAWWY 305

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----------------Q 115
               GVVTE+C PY           +P + TP  V +C+ ++                 Q
Sbjct: 306 LRRRGVVTEDCYPY-----------QPPHQTPAEVGRCMMQSRSVGRGKRQATQRCPNTQ 354

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
            + N  + S   YR++S+ ++IM EI  NGPV+    V+EDF  YK+G+YKH        
Sbjct: 355 NYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKP 414

Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
                 G H+V++ GWG   +       YWI AN W ++WG +GYF+I RG NEC IE  
Sbjct: 415 PQYRKHGTHSVRITGWGEDRNVDGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIETF 474

Query: 224 VVA 226
           V+ 
Sbjct: 475 VIG 477


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 96/249 (38%), Positives = 122/249 (48%), Gaps = 36/249 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--C-GDGCDGGYPISA 69
           Q  C SCWA   V+A S R CI  G   N  LS  +LLACC     C   GC GG    A
Sbjct: 106 QSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGELLACCNLAHSCEARGCKGGVARDA 165

Query: 70  WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
           W +   HG+ T             + C PY +   C+H        P  + +Y TP C+ 
Sbjct: 166 WVFLNKHGIATGGDFVPKSSMEAVDGCWPY-NFPRCAHYQKKSKYGPCPKKSYETPSCLD 224

Query: 109 KCV--KKNQLWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
           +C   K        +H++  A  Y  N     I  EI K+GP   SF  YEDF  YKSGV
Sbjct: 225 RCPNEKYGTPLDKDRHFTARAVPYWFNGI-RSIKKEIMKHGPTSASFFTYEDFFSYKSGV 283

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           YK+ +G  +  H V+LIGWGT + G DYW+  N WN  W   G FKI +G  +CGI  D+
Sbjct: 284 YKYTSGAYVEFHTVELIGWGT-EKGVDYWLAKNDWNEEWADLGTFKIAQG--DCGI-NDL 339

Query: 225 VAGLPSSKN 233
           V G P++ N
Sbjct: 340 VLGAPAALN 348


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 120/233 (51%), Gaps = 24/233 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M  +LS  +LL+C       GC+GG    AW +
Sbjct: 274 QGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC-NTRHQQGCNGGRIDGAWWF 332

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA-----YPTPKCVRKCVKKNQLWR---NSKHYS 124
               GVVT+EC P F +   +H    PA       T +  R+ + +    R   N  + S
Sbjct: 333 LRRRGVVTDECYP-FSNQETNHSPNAPACMMHSRSTGRGKRQAIARCPNPRSHANEIYQS 391

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGH 176
             AYR++S+ ++IM E+ +NGPV+    V+EDF  Y++G+Y+H              G H
Sbjct: 392 TPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTH 451

Query: 177 AVKLIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           +VK+ GWG     DG  + YWI AN W + WG  GYF+I RG NEC IE  VV
Sbjct: 452 SVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVV 504


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P       ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLSKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +W+ AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/226 (36%), Positives = 114/226 (50%), Gaps = 17/226 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH-FGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C + WAF      SDR  I   G++ + LS  DL++C        C GG+P   WR+
Sbjct: 220 QGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDLMSCLNGGRRVVCQGGHPDRGWRF 279

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
            +++G V+EEC PY      ++  C  P    P    +C          KH+S   YR+ 
Sbjct: 280 LLNYGGVSEECYPYEGVHSSANATCRIPRRRDPIEDARCPTGRT---EQKHFSTPPYRVP 336

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAVKLIGW 183
           ++ EDIM EIY NGPV+    V EDF  Y+SGVY+H              G H+V+++GW
Sbjct: 337 ANEEDIMQEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGW 396

Query: 184 GTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           G          YW+ AN W   WG +GYF+I RG +E  IE  V+A
Sbjct: 397 GVDRSQYRPIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLA 442


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/119 (54%), Positives = 85/119 (71%), Gaps = 1/119 (0%)

Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
           N  + N K Y    YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++
Sbjct: 3   NVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALL 62

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GGHAV+L+GWG  ++   YW++AN WN  WG +GYFKI RG NECGIE DV AG+P  K
Sbjct: 63  GGHAVRLLGWGEENN-VPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 120


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 115/216 (53%), Gaps = 19/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CGSCWAF A+    DR C   G++   +S S   L++C   L   GCDGG     W 
Sbjct: 99  QGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           +    G  T EC  Y D       G   A P P          QL++   +  +S     
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
           S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++++G+GT+DDG 
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 86/251 (34%), Positives = 123/251 (49%), Gaps = 36/251 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+      AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RRGCNSESVDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+HIT              
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A     
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467

Query: 232 KNLVKEITSAD 242
                ++TSAD
Sbjct: 468 ----GQLTSAD 474


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 73/198 (36%), Positives = 110/198 (55%), Gaps = 19/198 (9%)

Query: 20  SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----- 72
           SCWA  + EA+SD  C+     + + +S +D+L+CCG  CG GC GG+ I A+++     
Sbjct: 1   SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60

Query: 73  --FVHHGVVTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSK 121
             +         C P   S    +   +P Y        PTPKC + C +K  + ++  K
Sbjct: 61  CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDK 120

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           H++  AY + ++   I  EIYKNGPV  +F VY+DF++YK G+Y H  G   G HAVK++
Sbjct: 121 HFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 180

Query: 182 GWGTSDDGEDYWILANQW 199
           GWG  ++  DYW++AN W
Sbjct: 181 GWG-RENATDYWLIANSW 197


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 113/232 (48%), Gaps = 24/232 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  IH     +  LS   L++C       GC GG    AW Y
Sbjct: 244 QRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLISC-DTRNQYGCKGGSITGAWSY 302

Query: 73  FVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA- 127
              +G+V+  C P F      T C       A    + ++ C  +   W  S H      
Sbjct: 303 LKKYGLVSHACYPLFWNNLHQTSCEMSSVFDAEGKRQAIQPCPNR---WEPSNHIYQCGL 359

Query: 128 -YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAV 178
            YRI+S   DIM EI +NGPV+    VY+DF  YKSG+YKHI               H++
Sbjct: 360 PYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSI 419

Query: 179 KLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           K++GWGT  D E     +WI AN W  SWG +GYF+I RG NEC IE+ V+A
Sbjct: 420 KIVGWGTLRDAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIEKTVIA 471


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 225 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 283

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 284 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 339

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 340 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 397

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 398 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC GG    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNLISCCARK-RHGCGGGSVDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNATN-GCAMASRSDGRGKRHATTPCPNHIEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+   IM EI +NGPV+    V+EDF  YK+G+Y+H+T           +  
Sbjct: 352 --PPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYFKI RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 116/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCT-KNRHGCNSGSVDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N +++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNANNNGCAMASRSDGRGKRHATKPCPNNIEKSNVIYQCS--- 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+            +  
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRT 410

Query: 176 HAVKLIGWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWG         E +W+ AN W +SWG DGYF+I RG NE  IE+ ++A 
Sbjct: 411 HAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 86/242 (35%), Positives = 116/242 (47%), Gaps = 43/242 (17%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WAF      SDR  I     M   LS  +L++C     G GC GG    AW Y
Sbjct: 28  QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRLDGAWWY 86

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
               GVVTE+C PY            P   TP  + +C+ +++                 
Sbjct: 87  LRRRGVVTEDCYPY-----------RPPQQTPAELSRCMMQSRSVGRGKRQATQRCPNTN 135

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
            ++N  + S   YR+++  ++IM EI  NGPV+    V+EDF  Y SG+YKH        
Sbjct: 136 NYQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKP 195

Query: 174 ------GGHAVKLIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
                 G H+VK+ GWG     DG    YWI AN W ++WG +GYF+I RG NEC IE  
Sbjct: 196 PHYRKHGTHSVKITGWGEERNFDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAF 255

Query: 224 VV 225
           V+
Sbjct: 256 VI 257


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 117/224 (52%), Gaps = 21/224 (9%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGY 65
           V  ++ QG CG CWAF A+    DR C+  G++   +  S   L++C       GCDGG 
Sbjct: 93  VTPVMDQGSCGGCWAFSAIGVFGDRRCVA-GIDKEGVPYSQQYLISCS--TENHGCDGGD 149

Query: 66  PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
               W +    G  T EC  Y D          P      C   C   +Q+    + Y  
Sbjct: 150 FWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI----QLYKA 196

Query: 126 SAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGW 183
             Y +++ + + IM  +   GPV+    VY D ++Y+SGVYKH  G + +G HA++++G+
Sbjct: 197 HGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGY 256

Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           GT+DDG DYWI+ N W   WG +GYF+I RG NEC IE+++ A 
Sbjct: 257 GTTDDGTDYWIIRNSWGADWGENGYFRIVRGVNECRIEDEIYAA 300


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 111/224 (49%), Gaps = 21/224 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAF      SDR CI      N+ +S   L+ C        C GGY   +W++
Sbjct: 108 QGQCGSCWAFATTGVFSDRLCITTNNVSNVVISPEFLIEC--DKTSFACQGGYGYYSWKF 165

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCVKKNQLWRNSKHYSISAYR 129
           F++ G+  E C PY   +          Y      +C   C   + L     + + SAY 
Sbjct: 166 FMNTGIPLESCVPYTKDS--------LVYGNTTNAQCRSTCTDGSPL---KLYKAASAYY 214

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDD 188
           I S   +   EI  NGPVE  F VY DF  YKSG+Y+   G   +GGHAVK++GW +  +
Sbjct: 215 IYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDSN 274

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSN--ECGIEEDVVAGLPS 230
           G  YWI  NQW  SWG  GYF I RG++   C  +  ++AG  S
Sbjct: 275 GTPYWIAQNQWGTSWGMGGYFYIYRGNSTLNCKFDNYMIAGTVS 318


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 86/227 (37%), Positives = 115/227 (50%), Gaps = 14/227 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WA       SDRF I       ++LS   LL+C        C+GGY   AW Y
Sbjct: 99  QGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSY 157

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V E+C PY      ++  C            C     + R SK+    AYR+ +
Sbjct: 158 IRKIGLVDEQCFPY----SATNEKCRIPRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGN 213

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVKLIGWGT--SD 187
           +  DIM EI  +GPV+ +  VY DF  YK G+Y+H    T D  G H+V+++GWG   S 
Sbjct: 214 ET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 272

Query: 188 DG-EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           +G + YW +AN W   WG +GYF+I RGSNEC IE  V+      +N
Sbjct: 273 EGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEVEN 319


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 80/234 (34%), Positives = 115/234 (49%), Gaps = 29/234 (12%)

Query: 17  HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y  
Sbjct: 237 NCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWYLR 295

Query: 75  HHGVVTEECDPYFDSTGCSHPGCE---------PAYPTPKCVRKCVKKNQLWRNSKHYSI 125
             G+V+  C P F     S+  C            + T  C     K N++++ S     
Sbjct: 296 KRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRHATRPCPNNIEKSNRIYQCS----- 350

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
             YR++S+  +IM EI +NGPV+    V+EDF HYK+G+Y+H+            +  HA
Sbjct: 351 PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHA 410

Query: 178 VKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           VKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 411 VKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++ GC  A         + T  C     K N++++ S   
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S   +IM EI +NGPV+    V EDF HYK+G+Y+H+T           +  
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CG CWAF A+    DR C   G++   +S S   L++C   L   GCDGG     W 
Sbjct: 99  QGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           +    G  T EC  Y D       G   A P P          QL++   +  +S     
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
           S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++++G+GT+DDG 
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CG CWAF A+    DR C   G++   +S S   L++C   L   GCDGG     W 
Sbjct: 99  QGSCGGCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           +    G  T EC  Y D       G   A P P          QL++   +  +S     
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
           S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++++G+GT+DDG 
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CG CWAF A+    DR C   G++   +S S   L++C   L   GCDGG     W 
Sbjct: 65  QGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 121

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           +    G  T EC  Y D       G   A P P          QL++   +  +S     
Sbjct: 122 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 170

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
           S P  IM  +   GP++    VY D ++Y+SGVYKH  G + +G HA++++G+GT+DDG 
Sbjct: 171 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 229

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           DYWI+ N W   WG +GYF+I RG NEC IE+++ A
Sbjct: 230 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 265


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 102/172 (59%), Gaps = 17/172 (9%)

Query: 19  GSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHH 76
           GSCWAFGA EA+SDR CIH    +S+ ++  DLLACC   CG GC+GGYP +AW ++   
Sbjct: 1   GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDS-CGMGCNGGYPSAAWDFWTDV 59

Query: 77  GVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 122
           G+V+         C PY          G   P       TP+C+ +C       ++  KH
Sbjct: 60  GLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKH 119

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
           Y  S+Y + SD E I +EIYKNGPVE +FTVYEDF  YK+GVY+H+TG  +G
Sbjct: 120 YGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 80/178 (44%), Positives = 103/178 (57%), Gaps = 16/178 (8%)

Query: 25  GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
           GAVEA+SDR CIH     N SLS  DLL+CC   CG GCDGG+P  AW ++  HG+VT  
Sbjct: 1   GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGG 59

Query: 81  --EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
             EE   C PY        S G   P     YPTPKCV+ C      ++  K  + ++Y 
Sbjct: 60  SKEEPAGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYN 119

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           ++     IM EI  NGPVE +F V+EDF  YKSG+Y H  G  +GGHA++++GWG  +
Sbjct: 120 VHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/243 (34%), Positives = 117/243 (48%), Gaps = 43/243 (17%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WAF      SDR  I     M   LS  +L++C     G GC GG    AW +
Sbjct: 225 QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCTGGRIDGAWWF 283

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
               GVVTE+C PY            P   TP  + +C+ +++                 
Sbjct: 284 LRRRGVVTEDCYPY-----------RPPQQTPAELGRCMMQSRSVGRGKRQATQRCPNTN 332

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
            ++N  + S   YR++++ ++IM EI  NGPV+    V+EDF  YKSG+YKH        
Sbjct: 333 NYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKP 392

Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
                 G H+VK+ GWG   +       YWI AN W ++WG +GYF+I RG NEC IE  
Sbjct: 393 PQYRKHGTHSVKITGWGEERNVDGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAF 452

Query: 224 VVA 226
           V+ 
Sbjct: 453 VIG 455


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/258 (36%), Positives = 122/258 (47%), Gaps = 21/258 (8%)

Query: 3   FTNSEHVEILVI----QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL 56
           F  SEH   LV     QG CGS WAF      SDRF I       + L+   +LAC    
Sbjct: 191 FDASEHWTGLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQMLACVRR- 249

Query: 57  CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
              GC GG+  +AW+Y    GVV EEC PY  +        +    T  C    VK N  
Sbjct: 250 -QQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDDTLITANCELP-VKVN-- 305

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----D 171
            R   +    A+ +N++  DIMAEI   G V+    VY DF  Y+SG+Y+H        +
Sbjct: 306 -RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEE 363

Query: 172 VMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
               H+V+LIGWG    G D   YWI  N W + WG +G F+I RGSNEC IE  V+A  
Sbjct: 364 RSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGENGRFRILRGSNECDIESYVLASN 423

Query: 229 PSSKNLVKEITSADMFED 246
           P     V+ I      ++
Sbjct: 424 PYVHEHVQAIRKVGELQE 441


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 112/227 (49%), Gaps = 14/227 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WA       SDRF I       ++LS   LL+C        C+GGY   AW Y
Sbjct: 225 QGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSY 283

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V E+C PY      ++  C            C     + R SK+    AYR+ +
Sbjct: 284 IRKIGLVDEQCFPY----SATNEKCRIPRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGN 339

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY DF  YK G+Y+H    T D  G H+V+++GWG     
Sbjct: 340 E-TDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 398

Query: 190 E---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           E    YW +AN W   WG +GYF+I RGSNEC IE  V+      +N
Sbjct: 399 EGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEVEN 445


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 121/240 (50%), Gaps = 14/240 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQ 115
           G+P  AWRY+V +G+ +  C PY     C H G +          + TP+C   C  K  
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPY-PFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTI 219

Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
                K+    AY +    E+   E+Y NGP      VY D   YKSGVY+++ G  MG 
Sbjct: 220 PL--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
            AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 278 TAVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 115/234 (49%), Gaps = 17/234 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WA       SDRF I       + L+   +++C       GC GG+  +AW Y
Sbjct: 205 QGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIVSCVRR--SQGCSGGHLDTAWSY 262

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G V EEC PY  +    H  C+           C    ++ R + +    A+ +N+
Sbjct: 263 LRKVGTVNEECYPYISA----HNVCKIRPSDTLITANCELPMKVDRTNMYKMGPAFSLNN 318

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-----MGGHAVKLIGWGTSD 187
           +  DIM EI K+GPV+    V+ DF  YKSG+Y+H           G H+V+LIGWG   
Sbjct: 319 E-TDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEER 377

Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
            G +   YWI  N W   WG +G F+I RGSNEC IE  V+A LP     VK++
Sbjct: 378 HGYEVTKYWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLPYVHQQVKDL 431


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 116/231 (50%), Gaps = 19/231 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WAF      +DR  I    +    LS+ +LLAC       GC+GG+   AW Y
Sbjct: 205 QGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLLAC-NNRGQQGCNGGHLDRAWNY 263

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA----YPTPKCV------RKCVKKNQLWRNSKH 122
               GVV EEC PY          C+        T KC       RK  + ++  R    
Sbjct: 264 MRRFGVVNEECYPYISGRTGQVEKCKVPRRGNLATMKCQLVNAAERKSDRSDKPPRKGLF 323

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVK 179
            S  AYRI    +DIM EI ++GPV+ +  V+ DF  Y+ GVY++    +    G H+V+
Sbjct: 324 RSPPAYRIAPFEDDIMNEILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVR 383

Query: 180 LIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           ++GWG      +   YW++AN W R WG DGYF+I RG NE  IE+ V+A 
Sbjct: 384 IVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 90/239 (37%), Positives = 120/239 (50%), Gaps = 12/239 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
           G+P  AWRY+V +G+ +  C PY         + G   P     + TP+C   C  K   
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
               K+    AY +    E+   E+Y NGP      VY D   YKSGVY+++ G  MG  
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           AVK++GWG   +G  YW +AN W+  WG DGY  I RG+NEC IE    AG P +  L 
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/242 (34%), Positives = 118/242 (48%), Gaps = 17/242 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WA       SDRF I       + L+   +++C       GC GG+  +AW Y
Sbjct: 206 QGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQIISCVRR--SQGCSGGHLDTAWNY 263

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G V +EC PY  +       C+           C    ++ R + +    A+ +N+
Sbjct: 264 VRKVGTVNDECYPYISAQN----ACKIRPSDTLITANCDLPTKVDRTNMYKMGPAFSLNN 319

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT----GDVMGG-HAVKLIGWGTSD 187
           +  DIM EI K+GPV+    V+ DF  YKSG+Y+H      GD   G H+V+LIGWG   
Sbjct: 320 E-TDIMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEER 378

Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
           +G +   YW+  N W R WG +G F+I RG NEC IE  V+A LP     VK +      
Sbjct: 379 NGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLASLPYVHQQVKPMRQVGEL 438

Query: 245 ED 246
           ++
Sbjct: 439 QE 440


>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
          Length = 301

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 112/213 (52%), Gaps = 20/213 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN---DLLACCGFLCGDG-CDGGYPISAW 70
           Q  CGSCWAF AV   +DR C  +G++ S  V+     +  C F  GDG C+GG+  + W
Sbjct: 97  QASCGSCWAFSAVATFADRRCA-YGLD-SKQVHYSEQYVVSCDF--GDGACNGGWLSNVW 152

Query: 71  RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           ++    GV   +C  YF                  C+  C   + +      + I+    
Sbjct: 153 KFLTKTGVPKLDCLKYFSGMTGDRE---------SCITHCTDGSPVELYQASHVIN---Y 200

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
             D + +M  +  +GP++V+F VY DF +Y SGVY+H+ G + GGHAV+++G+G  + G 
Sbjct: 201 GMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESGL 260

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
            YWI+ N W   WG  GYF+I R  NECGIEE 
Sbjct: 261 KYWIIRNSWGPDWGEGGYFRIIRRVNECGIEEQ 293


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++  C  A         + T  C     K N++++ S   
Sbjct: 295 LRKRGLVSHACYPLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSFEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YRI+S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+            +  
Sbjct: 352 --PPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
          Length = 464

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 86/236 (36%), Positives = 121/236 (51%), Gaps = 31/236 (13%)

Query: 8   HVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 65
           +V ++  QG CGSC+AF ++  L  R  +     + ++LS  D+++C  +    GC+GG+
Sbjct: 244 YVPVVKNQGSCGSCYAFSSMGMLESRLRVATKNQVQVNLSPQDIVSCSAY--SQGCEGGF 301

Query: 66  P-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
           P + A +Y   HGVV EEC PY   TG     C  A    KC R  V        +K+  
Sbjct: 302 PYLIAGKYAQDHGVVAEECYPY---TG-RDSACSAA---KKCQRSYV--------AKYRY 346

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----------MG 174
           +  Y    + E +   + ++GP+ VSF VY DF HY  GVY    G            + 
Sbjct: 347 VGGYYGACNEELMKMSLVESGPLSVSFEVYSDFMHYAGGVYHRTDGLFNKINEFNPFELT 406

Query: 175 GHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            HAV L+G+GT S   E YWI+ N W   WG DG+F+I+RG +ECGIE   V   P
Sbjct: 407 NHAVLLVGYGTDSQTKEKYWIVKNSWGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 118/221 (53%), Gaps = 22/221 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSCWAF + E ++DR CI          S  +LL CC   C   C GGY   AW Y
Sbjct: 99  QGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLTCCED-CRLECVGGYTAKAWDY 157

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC--VKKNQLWRNSKHYSISA 127
           +++ G+V+     Y  S GC  P  + ++      KCV+ C   K +  + + KHY  S 
Sbjct: 158 YINEGIVSG--GDYNSSEGC-QPYSKASFQYAVASKCVKACQNDKYDVKYDDDKHYGDSF 214

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y + ++   I  EI  NGPV  +F V+ED  +YKSG+             V ++ WGT +
Sbjct: 215 YTLETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGIQL---------SNVSILRWGT-E 264

Query: 188 DGEDYWILANQWNRSWG-ADGYFKIKRGSNECGIEEDVVAG 227
           +G  YW++AN W   WG   G+ KIKRG+NEC IE+++ AG
Sbjct: 265 EGVPYWLIANSWGTWWGDLGGFIKIKRGTNECAIEQEMAAG 305


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 60/128 (46%), Positives = 83/128 (64%), Gaps = 2/128 (1%)

Query: 103 TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
           TPKC++ C     + +   K Y   +Y +      I  EI  NGPVE +FTVYED   YK
Sbjct: 41  TPKCIKHCQASYTVAYEQDKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYK 100

Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
            GVY+H+TG ++GGHA++++GWG  +D   YW++AN WN  WG +G+FKI RGS+ CGIE
Sbjct: 101 DGVYQHVTGKMLGGHAIRILGWGVEND-VPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159

Query: 222 EDVVAGLP 229
             + AG+P
Sbjct: 160 SQISAGIP 167


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKK-RHGCNSGSIDRAWWF 294

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++  C  A         + T  C     K N++++ S   
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+            +  
Sbjct: 352 --PPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRS 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 83/243 (34%), Positives = 117/243 (48%), Gaps = 43/243 (17%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WAF      SDR  I     M   LS  +L++C     G GC GG    AW Y
Sbjct: 222 QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRIDGAWWY 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
               GVVTE C PY           +P    P  V +C+ +++                 
Sbjct: 281 LRRRGVVTENCYPY-----------QPPQQAPAEVGRCMMQSRAVGRGKRQATQRCPNTY 329

Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
            + N  + S   Y+++S+ ++IM EI +NGPV+    V+EDF  YK+G+YKH        
Sbjct: 330 NYHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKP 389

Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
                 G H+V++ GWG   D       YWI AN W ++WG +G+F+I RG+NEC IE  
Sbjct: 390 PQYRKHGTHSVRITGWGEDKDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAF 449

Query: 224 VVA 226
           V+ 
Sbjct: 450 VIG 452


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++  C  A         + T  C     K N++++ S   
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+            +  
Sbjct: 352 --PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C + WAF      +DR  I        +LS  +L++CC      GC+ G    AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
               G+V+  C P F     ++  C  A         + T  C     K N++++ S   
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
               YR++S+  +IM EI +NGPV+    V+EDF +YK+G+Y+H+            +  
Sbjct: 352 --PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRT 409

Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           HAVKL GWGT        E +WI AN W +SWG +GYF+I RG NE  IE+ ++A 
Sbjct: 410 HAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 112/236 (47%), Gaps = 26/236 (11%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           ++ ++ QG CGS WA       SDR  I     +N  LS   LL+C       GC GGY 
Sbjct: 211 IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLLSC-NIRGQRGCSGGYL 269

Query: 67  ISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
             AW +    G V+  C PY     + T      C  AY + +C  + V  +       +
Sbjct: 270 DRAWYHLRRAGAVSRACYPYHSGLDEDTIMQKLRCRVAYGSSQCPERGVTSD------LY 323

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---------GDVM 173
            S   YRI +   DIM EIY+NGPV+ +F V  DF  Y  GVY+++           D  
Sbjct: 324 LSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQA 383

Query: 174 GGHAVKLIGWGTSD----DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           G H+VK++GWG       +   YW+  N W R+WG  G F+I RG NEC IE  V+
Sbjct: 384 GWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFVL 439


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 81/201 (40%), Positives = 108/201 (53%), Gaps = 24/201 (11%)

Query: 20  SCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  A E +SDR CI       LS+S +D+ ACCG +CG+GC+GGYPI AWR++V  G
Sbjct: 1   SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60

Query: 78  VVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKN--QLWRN 119
            VT     Y D TGC    +P CE           P+  YPT +      K +    +  
Sbjct: 61  YVTG--GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHK 118

Query: 120 SKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
             H+ +I     + +   I   I  +G +    TV+EDF HY  GVY H  G  +GGHAV
Sbjct: 119 DLHFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAV 178

Query: 179 KLIGWGTSDDGEDYWILANQW 199
           K++GWG  D+G  YW++AN W
Sbjct: 179 KMLGWGV-DNGTPYWLIANSW 198


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score =  133 bits (334), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 86/241 (35%), Positives = 116/241 (48%), Gaps = 38/241 (15%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M+ +LS  +LL+C       GC GG    AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQNLLSC-NTHNQHGCRGGRLDGAWWF 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---------------TPKCVRKCVKKNQLW 117
               G+V+  C P+ +     H G  PA P               T  C       N ++
Sbjct: 281 LRRRGLVSNNCYPFSEG---DHNGAAPAAPCMMHSRHMGRGKRQATAHCPNSRTHANHIY 337

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 172
           +     +   YR++S  +DIM E+ +NGPV+    V+EDF  YKSG+YKH    +     
Sbjct: 338 Q-----ATPPYRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPER 392

Query: 173 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
               G H+VK+ GWG     DG+   YW  AN W  +WG +GYF+I RG+NEC IE  VV
Sbjct: 393 YRQHGTHSVKITGWGEEIQPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVV 452

Query: 226 A 226
            
Sbjct: 453 G 453


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  133 bits (334), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 121/239 (50%), Gaps = 12/239 (5%)

Query: 5   NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
           N   +  +  Q  C + WA     A+SDR+C +  G  L +S   LL+CC   CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160

Query: 64  GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
           G+P  AW Y+V +G+ +  C PY         + G   P  +  + TPKC   C  K+  
Sbjct: 161 GFPGFAWLYYVEYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIP 220

Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
               K+   + Y +    ED   E+Y NGP    F VY D   YKSGVY+++ GD +GG 
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
           AV+++GWG   +G  YW +AN W+  WG +GY  I  G+NEC IE     G P    L 
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFPDPSQLT 336


>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
 gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
          Length = 257

 Score =  133 bits (334), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 78/221 (35%), Positives = 109/221 (49%), Gaps = 16/221 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN-DLLACCGFLCGDGCDGGYPI 67
           +  ++ Q  CGSCWAF A E LSDR CI       + ++   L  C      GC+GG P 
Sbjct: 45  IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALVSCDIFGNQGCNGGIPQ 104

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
            AW Y   HG+ T  C PY    G              CV+     N+ +   +   ++ 
Sbjct: 105 LAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKNSCVDNEQYTLYRAKPLT- 153

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-DVMGGHAVKLIGWGTS 186
            +  +  E I  +I K GP++ +  VY DF  Y SGVY    G  ++GGHA+K++GWG  
Sbjct: 154 LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSLLGGHAIKIVGWGFD 213

Query: 187 D-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
               ++YWI+AN W  SWG DG+F I    ++CGI  D  A
Sbjct: 214 QASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACA 252


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 124/232 (53%), Gaps = 28/232 (12%)

Query: 8   HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
           +V  +  Q +CGSC+AF ++  L  R  I    +    LS  ++++C  +    GC+GG+
Sbjct: 351 YVSPVRNQANCGSCYAFASLGMLESRIRIKTNNSQVPVLSPQEIVSCSEY--SQGCEGGF 408

Query: 66  P-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
           P +   +Y    G+V EEC PY             AY +P   +KC +    +  S+++ 
Sbjct: 409 PYLIGGKYAQDFGLVEEECFPY------------QAYDSPCTPKKCSR----YYTSEYHY 452

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAV 178
           +  +    +   +  E+ +NGP+ V+F VY+DF HY++G+Y H           +  HAV
Sbjct: 453 VGGFYGGCNEALMKHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTNHAV 512

Query: 179 KLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            L+G+GT +  GEDYWI+ N W  SWG +GYF+I RG++EC IE   VA  P
Sbjct: 513 LLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATP 564


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 113/234 (48%), Gaps = 30/234 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL-------CGDGCDGGY 65
           Q  CGSCWA      L+DR CI    N+   LS   L+ C G         C +GC GG+
Sbjct: 66  QQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCDGSCVSDGVSGCNNGCKGGF 125

Query: 66  PISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
              A    ++ G+V++EC  Y  S   S P  C+   P                N+  Y 
Sbjct: 126 VGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPI--------------SNTTIYK 171

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
            ++ R     +D   EI  NGPV  +F +Y DF  +K  VY   +   +  HAV+++GWG
Sbjct: 172 ATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWG 231

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV------AGLPSSK 232
           T+ DG DYWI AN W   WG  GYFKI+RGS+E   EE  +      A +P+S+
Sbjct: 232 TTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASVPTSQ 285


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 121/216 (56%), Gaps = 23/216 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CGSCWAF + E ++DR CI     +    S  +LL CC         GGY  +AW Y
Sbjct: 99  QGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGC-KGGYIKNAWDY 157

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
           +++ G+ +     Y  S GC  P  E ++   +   +CVK               Y + +
Sbjct: 158 YINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECVK--------------FYTLET 199

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
           +   I  EI  NGPV   + V+EDFA +KSGVY + +G  +G H+VK+IGWGT ++G  Y
Sbjct: 200 NVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGT-EEGIPY 258

Query: 193 WILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
           W++AN W   WG   G+FK++RG+NEC IE+++ AG
Sbjct: 259 WLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAG 294


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 111/224 (49%), Gaps = 30/224 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH--------FGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           QG+CG+CWAF A  A  DR C+         +    ++S +DL          GC GG  
Sbjct: 124 QGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTVSCDDLDL--------GCAGGTS 175

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
            + W +   HG  T EC  Y D+       C PA        + VK +     S + +  
Sbjct: 176 FNVWTFLTEHGTTTLECVRYTDADKDLSSPC-PALCDDGSEIQLVKADGCLDYSGNVTA- 233

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
                     IM  +  +GPV+   +VY DF +Y+ GVYKH+ G  +  HAV++IG+GT+
Sbjct: 234 ----------IMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVEIIGYGTT 283

Query: 187 DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           DD E   YWI+ N    +WG +GYF I RGSNEC IE  V +GL
Sbjct: 284 DDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSGL 327


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 115/222 (51%), Gaps = 15/222 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S WA       +DR  +      N++LS    L+C       GC+GGY   AW Y
Sbjct: 100 QGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFLSCNQHR-QKGCEGGYLDRAWWY 158

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRIN 131
               GVV+EEC PY   T      C          R+C   +    NS+ Y  + +YR++
Sbjct: 159 IRKFGVVSEECYPYISGTTRKPEICYMQKSKHANGRQCPSGHP---NSRVYRTTPSYRVS 215

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWG---T 185
           S  +DIM+EI  NGPV+ +F V+ DF  + +GVYKH+     ++ G H+V+L+GWG   +
Sbjct: 216 SREQDIMSEILTNGPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYS 273

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +     YWI AN W  +WG +G F+I RG N C IE  V+  
Sbjct: 274 TGIPVKYWIAANSWGTNWGENGTFRILRGENHCEIESFVIGA 315


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 82/211 (38%), Positives = 106/211 (50%), Gaps = 27/211 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFG  EA +DR CI      +  LS  ++ AC  F    GC GG P SAW +
Sbjct: 2   QSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLFF---GCGGGDPYSAWSW 58

Query: 73  FVHHGVVT-------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
               G+ T             + C PY D   C+H   +  YP  KC +     +     
Sbjct: 59  VHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYP--KCPKVSCSGDD---- 111

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
            +H+ + +   +    D    I  +GPV  SFTVYEDF  Y+SGVYKH +G  +GGHAVK
Sbjct: 112 -RHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVK 170

Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
           +IGWG    G+ YW+  N WN  WG  G F+
Sbjct: 171 IIGWGEK-SGQAYWLAVNSWNEDWGDHGLFR 200


>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 305

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/214 (36%), Positives = 117/214 (54%), Gaps = 19/214 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
           Q  C  C+AF  + ALS R CI       +SLSV  +++C     G+ GC GG   S+W 
Sbjct: 101 QKECSCCYAFATIGALSTRRCIAKLDSQAVSLSVQHMVSCDN---GEAGCLGGEFESSWA 157

Query: 72  YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           +    GVV  +C PY    TG S           +C   C +   L  ++ HY  ++   
Sbjct: 158 FLETEGVVKSDCLPYTSGETGNSG----------ECPMMC-QDGTLVEDAFHYKAASASP 206

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
            ++  +IM  +  +GPV+  F V+EDF +Y  G+Y  + G  +GGHAV ++G+G+ +D  
Sbjct: 207 LNNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGSSLGGHAVLIVGYGSMND-H 265

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           DYWI+ N W   WG +GYF+I RG+NECGIE++ 
Sbjct: 266 DYWIVRNSWGPDWGENGYFRILRGTNECGIEKNA 299


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/226 (37%), Positives = 109/226 (48%), Gaps = 31/226 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
           QG C +CWA  AV   +DR CI  G  ++  LS+  L +CC    G    +GC  G    
Sbjct: 168 QGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYLTSCCNRANGCPKSNGCMFGSVPE 227

Query: 69  AWRYFVHHGVVT-------EE------CDPYFDSTGCSH-PGCEPAYPT-------PKCV 107
              +  +HG+VT       EE      C PY     C+H PG E  YP        P C 
Sbjct: 228 GLNFMKNHGLVTGGEYKPPEELGNDDGCWPY-PFPKCNHVPGLESKYPRCAQVRDLPACA 286

Query: 108 RKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
             C  K      +   H + S  R+   PE I  EI+ NGPV    T+YEDF  YKSGVY
Sbjct: 287 TTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRFYKSGVY 346

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 211
            H TG ++  H +KLIGWG  + G++YW+  N WN  WG  G  K+
Sbjct: 347 VHKTGQMLAAHTLKLIGWGV-ESGQEYWLAVNAWNEEWGDHGMIKL 391


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 112/239 (46%), Gaps = 40/239 (16%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG C S W+       +DR  I     +N+ LS   LL+C       GC+GGY   AW Y
Sbjct: 204 QGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQLLSCNQHR-QRGCEGGYLDRAWWY 262

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH---------- 122
               GVV+E C PY +S     PG            +C      +R   H          
Sbjct: 263 IRKLGVVSELCYPY-ESGATQQPG------------ECRIPKSAYRTGAHIDCPSGAADP 309

Query: 123 --YSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGD 171
             Y ++  YR++S  +DIM EI  NGPV+ +F VYEDF  Y  GVY+H+           
Sbjct: 310 SVYRMTPPYRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERK 369

Query: 172 VMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           V G H+V++IGWG   ++     YW+ AN W   WG DG F+I RG N C IE  V+  
Sbjct: 370 VQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGA 428


>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
 gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
          Length = 569

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 79/233 (33%), Positives = 117/233 (50%), Gaps = 32/233 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSC+AF AV A+  R  I    N+   L+V D+++C  +     C GG P +  R+
Sbjct: 343 QMACGSCYAFAAVTAIESRIRIQSRNNVREPLAVQDIVSCSPY--AQKCHGGIPYAVGRH 400

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
                +V E C PY  S   +            C  KC     + + +K+  +S Y   S
Sbjct: 401 LRDFNLVPESCFPYKGSENVA------------CSSKCKNPEYIVKVTKYRYVSDYYGGS 448

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH-----------ITGDVMG----GHA 177
           +  ++M EIY++GP+  S+ +Y DF +Y  G+YKH           I  ++ G     H+
Sbjct: 449 NYANMMKEIYEHGPISASYLIYPDFKYYSKGIYKHSGKGYPMKTDRINREMNGWEPTTHS 508

Query: 178 VKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V + GWG     GE YW + N W+ SWG +G F+IKRG++EC IE + VA  P
Sbjct: 509 VVITGWGEDPKTGEKYWNVLNSWSESWGENGRFRIKRGNDECAIEAEGVAFYP 561


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/179 (41%), Positives = 101/179 (56%), Gaps = 7/179 (3%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QGHCGSCWA  + E L DRFCIH   +    LS  D+ +C       GC+GG+  +A+ Y
Sbjct: 92  QGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDITSCDSR--SHGCNGGWTETAFEY 149

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRIN 131
               GV TEEC PY     C HPGC  ++ TP C ++C   +    +S ++Y+  +Y I 
Sbjct: 150 AKKAGVPTEECVPYLMGK-CHHPGCS-SWQTPTCKKECSSLSNYNYSSNRYYASKSYSIQ 207

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
            + E I  E+ +NGPV   FT Y+D A Y  GVY H+ G   G HA+K++GWG   + E
Sbjct: 208 RNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIVGWGVWRESE 266



 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 28/46 (60%)

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
             ++G  YWI+ N W   +G DG   IKRG NECGIE DV  G+P 
Sbjct: 319 NKEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364


>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
 gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
          Length = 804

 Score =  130 bits (328), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 86/234 (36%), Positives = 127/234 (54%), Gaps = 19/234 (8%)

Query: 3   FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
           FT   H  I +I QG CG C+A  AVE ++ R C+    +  +S+ DL+ C    +L   
Sbjct: 64  FTYRGHRCIQIIDQGSCGCCYAAAAVEMVTARRCLQLNDSRLVSLEDLVTCDHTKYLNIQ 123

Query: 58  GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
            +GC GG P+++ ++    G+V + C+ Y++ T   +P     YPT  C   C  K    
Sbjct: 124 NNGCRGGNPLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKHITGDVMGG- 175
           R  K+ +   YR+ S  + +M +IY+NGP+ VS  +  DF +  K G+Y       +GG 
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGG 232

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAV ++GWG  ++G  YW  AN +  +WG  GYFKIKRGSNE  IE    + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 113/224 (50%), Gaps = 11/224 (4%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHF-GMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
           +  ++ QG CGS WA       SDRF I   G    +    +L  C      GC GG+  
Sbjct: 200 ISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQVLLSCNIRRQQGCRGGHID 259

Query: 68  SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
            AW +   HG+V EEC PY  +T        P  P    +    +     R S+ Y +  
Sbjct: 260 VAWNFARGHGLVDEECFPYKAATTSC-----PFRPKANLIEDGCRPPVRQRTSR-YKVGP 313

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWG 184
               +   DIM +I ++GPV    TV++DF HY  G+Y+    GD  + G H+V+++GWG
Sbjct: 314 PGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWG 373

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
             D G+ YW++AN W   WG +GYF+I RGSNE GIE  VV  L
Sbjct: 374 -EDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416


>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
          Length = 804

 Score =  130 bits (328), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 86/234 (36%), Positives = 128/234 (54%), Gaps = 19/234 (8%)

Query: 3   FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
           FT   H  I +I QG CG C+A  AVE ++ R C+ F  +  +S+ DL+ C    +L   
Sbjct: 64  FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNIQ 123

Query: 58  GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
            +GC GG  +++ ++    G+V + C+ Y++ T   +P     YPT  C   C  K+   
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKHPKD 175

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKH-ITGDVMGG 175
           R  K+ +   YR+ S  + +M +IY+NGP+ VS  +  DF +  K G+Y       + GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGG 232

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAV ++GWG  ++G  YW  AN +  +WG  GYFKIKRGSNE  IE    + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 74/177 (41%), Positives = 98/177 (55%), Gaps = 19/177 (10%)

Query: 20  SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  +  A+SDR CI       + +S  D+++CC + CG GCDGG+PI AW++F   G
Sbjct: 1   SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSY-CGYGCDGGWPIKAWQFFAREG 59

Query: 78  VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKK-NQLWRNSKH 122
           VVT         C PY + T C H G EP Y        TP+C RKC       ++  K 
Sbjct: 60  VVTGGNYGRQGCCRPY-EITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKR 118

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
           Y   AY++ +  + I  EI  +GPV   +TVYEDF++Y  G+YKH  G   GGHAVK
Sbjct: 119 YGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 116/232 (50%), Gaps = 22/232 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M  +LS  +LL+C       GC GG    AW +
Sbjct: 221 QGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC-DTHNQKGCRGGRLDGAWWF 279

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCV---KKNQLWRNSKHYSI 125
               G+V+  C P+     D+T  + P    +    +  R+       ++   N  + + 
Sbjct: 280 LRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKRQATAHCPNSRAHANHIYQAT 339

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR++SD +DIM E+ +NGPV+    V+EDF  YKSG+YKH    +         G H+
Sbjct: 340 PPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHS 399

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           VK+ GWG     DG+   YW  AN W  +WG  G+F+I RG+NEC IE  VV
Sbjct: 400 VKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVV 451


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 80/177 (45%), Positives = 97/177 (54%), Gaps = 20/177 (11%)

Query: 25  GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 82
           GAVEA+SDR CIH     N SLS  DLL+CC   CG GC GGYP  AW Y+  HG+VT  
Sbjct: 1   GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCEN-CGFGCRGGYPAVAWDYWKTHGIVTGG 59

Query: 83  CDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
                D +GC     P CE              YPTP+CV++C   +  +   K  +  +
Sbjct: 60  SKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMS 117

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
           Y I +    IM EI   GPVE  FT+YEDF  Y SGVY H  G  M GHAV+++GWG
Sbjct: 118 YNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWG 174


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 80/233 (34%), Positives = 111/233 (47%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M  SLS  +LL+C       GC+GG    AW +
Sbjct: 77  QGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQNLLSC-NTRHQQGCNGGRLDRAWSF 135

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV-------KKNQLWRNSKHYSI 125
               G+V+++C P       + P    + P  +  R+           +  + N  + S 
Sbjct: 136 LRRRGLVSDKCYPLASQNSIAEPCRMYSRPMGRGKRQATGPCPNNFHHSNDYSNDIYQST 195

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
             YR++S+ +DIM EI +NGPV+    V+EDF  YK G+Y+H              G H+
Sbjct: 196 PPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHS 255

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG     +G    +W  AN W  +WG  G F+I RG NEC IE  VV 
Sbjct: 256 VKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVG 308


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 114/223 (51%), Gaps = 22/223 (9%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDG--GYP 66
           ++  G C S WA   VEA   R C++ G++      S   +L+C      +GC    G  
Sbjct: 92  VIDMGTCSSSWAHSPVEAFGHRRCMN-GVDQEATRYSAQYILSCA---TTNGCLAFPGQG 147

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
           + +W +    G+  E C  Y D     +   E +YP P     C   + L      Y   
Sbjct: 148 VVSWDFIATTGIPLESCVKYTD-----YDKTESSYPCPSL---CNDNSSL----VLYKSD 195

Query: 127 AYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
            Y  +  +PE +   I   GP++  FTVYEDFA+Y  G+Y H+ G   G  +V+++G+GT
Sbjct: 196 GYEGVGFNPEKLRRAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGT 255

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
           SD+G+DYWI+ N W  +WG DGYF+I RG NEC IEE V   +
Sbjct: 256 SDEGQDYWIVKNYWGSNWGEDGYFRIVRGQNECQIEEAVYGAI 298


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/237 (37%), Positives = 112/237 (47%), Gaps = 30/237 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQNLLSCDTHH-QQGCQGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-----------TPKCVRKCVKKNQLWRNSK 121
               GVV++ C P+   +G       PA P             +  R+C   +    N  
Sbjct: 282 LRRRGVVSDHCYPF---SGHEQAEAGPATPCMMHSRAMGRGKRQATRRCPNSHDD-ANEI 337

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------M 173
           +    AYR+ SD ++IM E+ +NGPV+    VYEDF  YKSG+Y H    +         
Sbjct: 338 YQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRH 397

Query: 174 GGHAVKLIGWGTS--DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           G H+VK+ GWG     DG    YW  AN W  SWG  GYF+I RGSNEC IE  V+ 
Sbjct: 398 GTHSVKITGWGEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLG 454


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 115/221 (52%), Gaps = 15/221 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  SDRF I      ++ LS   LL+C       GC GGY   AW +
Sbjct: 223 QGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQHLLSC-NNRGQQGCRGGYLDRAWLF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V +EC P+   TG  +  C     +   V  C K     R   +    AYR+ +
Sbjct: 282 MRKFGLVDKECYPW---TG-RNDQCRLRKRSNLNVAGCRKPPNPLRQELYKVGPAYRLGN 337

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  YK+GVY+H     +   G H++++IGWG     
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSY 396

Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                 YW++AN W R WG +G F+I+RG+NEC IE  V+A
Sbjct: 397 RGPPLKYWLVANSWGRHWGENGLFRIQRGTNECEIESYVLA 437


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 114/237 (48%), Gaps = 30/237 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C   L   GC GG+   AW +
Sbjct: 224 QGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQNLLSC-DTLHQQGCRGGHLDGAWWF 282

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-----------TPKCVRKCVKKNQLWRNSK 121
               GVV++ C P+   +G       PA P             +  R+C   +    N  
Sbjct: 283 LRRRGVVSDHCYPF---SGREQAEAGPAPPCMMHSRAMGRGKRQATRRCPNSHTD-ANDI 338

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-------- 173
           +    AYR+ SD ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         
Sbjct: 339 YQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRH 398

Query: 174 GGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           G H+VK+ GWG  T  DG    YW  AN W  SWG  G+F+I RGSNEC IE  V+ 
Sbjct: 399 GTHSVKITGWGEETLPDGRTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLG 455


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/176 (42%), Positives = 103/176 (58%), Gaps = 18/176 (10%)

Query: 25  GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
           GAVEA++DR CIH    +   +S  DLL+CC   CG GC GG+P  AW +++ +G+VT  
Sbjct: 1   GAVEAMTDRLCIHSNATIKKHISATDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59

Query: 81  -----EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
                  C  Y     CSH      P C +  + TP CV  C K +  +   K ++ S+Y
Sbjct: 60  SKENPSGCRSY-PFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKPDIDYAADKTHAKSSY 118

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
            + S+   IM EI +NGPVE +F VYEDF  YKSG+Y H  G ++GGHA++++GWG
Sbjct: 119 NVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWG 174


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 117/239 (48%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG+  SAW +
Sbjct: 102 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWF 160

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P F   G +  G     P P+C+          R+   +   +Q+  N
Sbjct: 161 LRRRGVVSDHCYP-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHAN 214

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S  ++IM E+ +NGPV+    V+EDF  Y++G+Y H    +       
Sbjct: 215 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYR 274

Query: 173 -MGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG     DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 275 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 333


>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 183

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/185 (41%), Positives = 106/185 (57%), Gaps = 24/185 (12%)

Query: 26  AVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
           AV ++SDR CIH   N   + LS  DLL+CC   CG GC GG+   AW Y+  +G+VT  
Sbjct: 1   AVTSMSDRVCIHSNQNKTNVQLSARDLLSCC-TSCGFGCVGGWIGDAWDYWRDNGIVTGG 59

Query: 81  -----EECDPY-------FDSTGCS---HPGCEPAYPTPKCVRKCVKKNQ-LWRNSKHYS 124
                  C PY         S G     +P  +  YPTP CV KC +     +   K ++
Sbjct: 60  DYQDKSTCLPYPFPPSHHLVSKGTPFEIYP--QTLYPTPPCVSKCQEGYPGEYEKDKIFA 117

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
           +S+Y+I+ +  +I  EI  NGPVE    VY DF +YK+GVY+H TG+++GGHA++L+GWG
Sbjct: 118 LSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWG 177

Query: 185 TSDDG 189
            + DG
Sbjct: 178 KTKDG 182


>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
 gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
          Length = 218

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/235 (32%), Positives = 115/235 (48%), Gaps = 33/235 (14%)

Query: 7   EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
           + + ++  +  CG CWAF   E +SDRFC+     +N  LS   L++C       GC  G
Sbjct: 2   KQLSLIRDEQQCG-CWAFVVAEVVSDRFCVSSKTKVNEVLSPQYLISCDS--NNGGCSYG 58

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
           Y  +A+++  + G+VTE C P+    G            P C +KC+  N          
Sbjct: 59  YFDTAFQFVENQGIVTENCFPFVSGEGNY---------IPPCPKKCLAYNPF-------- 101

Query: 125 ISAYRINSD----PEDIMA---EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
            + +++N+     P+DI      I   G +  S  +Y DF  Y+ GVY+H+ G+ M  H+
Sbjct: 102 -TLFKVNNSRAFLPQDIQGMQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHS 160

Query: 178 VKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           V+++GWG +   +    YWI  N W   WG  G+F I RGSNEC IE DV    P
Sbjct: 161 VRIVGWGITSPQQGSIPYWICGNNWTEEWGMQGWFWILRGSNECNIELDVWETTP 215


>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
          Length = 150

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/181 (41%), Positives = 93/181 (51%), Gaps = 38/181 (20%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +CGSCWAFGA E +SDR CI         +S  D+L CCG  CG GCDG         
Sbjct: 5   QTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDGC-------- 56

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 131
                                     P   TPKC   C  K N  +   K++  SAY + 
Sbjct: 57  --------------------------PKAVTPKCALSCQSKYNTEYAKDKNFGSSAYYVG 90

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
            +   I  EI  NGPVE SFTVYEDF  YK GVY++  G+V+GGHA+K+IGWGT ++G D
Sbjct: 91  RNFSVIQTEIMTNGPVEASFTVYEDFYIYKKGVYQYTAGEVLGGHAIKIIGWGT-ENGTD 149

Query: 192 Y 192
           Y
Sbjct: 150 Y 150


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 120/236 (50%), Gaps = 26/236 (11%)

Query: 4   TNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGC 61
           +N  +V  +  QG CGSC+AF ++     R  +     +   +S  D+++C  +    GC
Sbjct: 245 SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQDVVSCSEY--AQGC 302

Query: 62  DGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
            GG+P + A +Y    G+V E C PY    G   P  E      KC R           +
Sbjct: 303 AGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPCKETK---SKCRRHST--------T 348

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDV-----MG 174
            +Y +  +    +   +M E+ KNGP+ +SF VY DF HYK G+Y+H   GD      + 
Sbjct: 349 NYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQIT 408

Query: 175 GHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            HAV L+G+GT    G+DYWI+ N W   WG +G+F+I RG +EC IE + VA  P
Sbjct: 409 NHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGVDECSIENEAVAVTP 464


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 14/143 (9%)

Query: 97  CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI-MAEIYKNGPVEVSFTVYE 155
           CEP Y               ++  KHY  S+Y ++            KNGPVE +FTVY 
Sbjct: 5   CEPGYSPS------------YKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAAFTVYS 52

Query: 156 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 215
           DF  YKSGVY+H+ GD+MGGHAV+++GWG  ++G  YW++ N WN  WG +G+FKI RG 
Sbjct: 53  DFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQ 111

Query: 216 NECGIEEDVVAGLPSSKNLVKEI 238
           + CGIE ++VAG+P +    K I
Sbjct: 112 DHCGIESEIVAGIPCTDQYWKRI 134


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CG CWAF A+     R C   G++   +  S   L++C       GC GG     W 
Sbjct: 99  QGSCGGCWAFSAIGMFGSRRC-AVGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWS 155

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RI 130
           +    G  T EC  Y D        C    PT      C   +Q+    + Y    Y ++
Sbjct: 156 FLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQV 202

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDG 189
           +     IM  +   GPV+    VY D  +Y  GVY+H  G +  G HA++++G+GT+DDG
Sbjct: 203 SKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDG 262

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            DYW + N W   WG DGYF+I RG NEC IE+++ A 
Sbjct: 263 TDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
          Length = 463

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 118/231 (51%), Gaps = 33/231 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I      +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCDGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C+ K   +R   S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CMLKEDCFRYYTSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIG 182
              +   +  E+  NGP+ V+F VY DF HY+ G+Y H TG         +  HAV L+G
Sbjct: 353 GGCNEALMKLELVHNGPMAVAFEVYNDFLHYQEGIYHH-TGLTDPFNPFELTNHAVLLVG 411

Query: 183 WGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           +GT    G DYWI+ N W  +WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 412 YGTDPATGMDYWIVKNSWGTAWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/223 (39%), Positives = 119/223 (53%), Gaps = 22/223 (9%)

Query: 23  AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           A  +   ++DR CI +       LS  +L +CC   CG GC+GG+P+ A++Y+   GV T
Sbjct: 124 AMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKYWNEIGVPT 182

Query: 81  EECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINS 132
               PY   +GC      P      A  TP C  KC+   K +L ++ ++Y  S Y I S
Sbjct: 183 G--GPYGSKSGCKPFSIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYGESYYLITS 239

Query: 133 DPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAVKLIGWGTS 186
             +    I  EI  +GPV  +  ++E F +YKSGVY   K      +G HAVKLIGWG  
Sbjct: 240 SNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWG-E 298

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE-DVVAGL 228
                YW++ N WN ++G  G FKI+RG+NECGIE   V AGL
Sbjct: 299 QKRIPYWLVVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGL 341


>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain 1; AltName: Full=Dipeptidyl peptidase I
           heavy chain 1; Contains: RecName: Full=Dipeptidyl
           peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
           peptidase I heavy chain 2; Contains: RecName:
           Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
           Full=Dipeptidyl peptidase I heavy chain 3; Contains:
           RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
           AltName: Full=Dipeptidyl peptidase I heavy chain 4;
           Contains: RecName: Full=Dipeptidyl peptidase 1 light
           chain; AltName: Full=Dipeptidyl peptidase I light chain;
           Flags: Precursor
 gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
          Length = 435

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 28/228 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF +   L  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 225 QASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 282

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY    G   P C+P      C R        + +S++Y +  +   
Sbjct: 283 YAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGA 326

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H           +  HAV L+G+GT
Sbjct: 327 CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 386

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 387 DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 115/232 (49%), Gaps = 16/232 (6%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +V QG CGS WA       SDR  I     +N SLS   LL+C       GC+GGY 
Sbjct: 212 INPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 270

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
             AW Y    GVV + C PY  S     PG           R+ ++     ++S  + ++
Sbjct: 271 DRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTDRRGLRCPSGSQDSTAFKMT 329

Query: 127 A-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHA 177
             Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY+H         +    G H+
Sbjct: 330 PPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHS 389

Query: 178 VKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           V+++GWG   ++     YW+ AN W   WG DGYFKI RG N C IE  V+ 
Sbjct: 390 VRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIG 441


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QG CG CWAF A+     R C   G++   +  S   L++C       GC GG     W 
Sbjct: 99  QGSCGGCWAFSAIGMFGSRRC-AVGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWS 155

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RI 130
           +    G  T EC  Y D        C    PT      C   +Q+    + Y    Y ++
Sbjct: 156 FLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQL 202

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDG 189
           +     IM  +   GPV+    VY D  +Y  GVY+H  G +  G HA++++G+GT+DDG
Sbjct: 203 SKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDG 262

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
            DYW + N W   WG DGYF+I RG NEC IE+++ A 
Sbjct: 263 TDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
          Length = 807

 Score =  128 bits (321), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 23/247 (9%)

Query: 3   FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
           FT  +H  + +I QG CG C+A   VE ++ R C+ F  +  +S+ DL+ C    +L   
Sbjct: 64  FTYRDHKCVQIINQGSCGCCYAAATVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNVQ 123

Query: 58  GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
            +GC GG  +++ ++    G+V + C+ Y++ T   +P     YPT  C   C  K    
Sbjct: 124 NNGCRGGNALASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA-HYKSGVYKHITG---DVM 173
           R  K+ +   YR+ S  + +M +IY+NGP+ VS  +  DF    K  +Y  ++G    + 
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPPKDKKSIY--VSGPNTKLS 230

Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
           GGHAV ++GWG  ++G  YW  AN +  +WG  GYF+IKRGSNE  IE    A LP + N
Sbjct: 231 GGHAVMIVGWG-EENGVPYWDCANTYGTNWGDHGYFRIKRGSNELKIETWPGAALPIASN 289

Query: 234 LVKEITS 240
              E  S
Sbjct: 290 SQPETPS 296


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 115/214 (53%), Gaps = 19/214 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
           Q  C  C+AF  + ALS R CI       +SLSV  +++C     G+ GC GG   S+W 
Sbjct: 101 QKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQHMVSCDS---GEAGCQGGEFESSWA 157

Query: 72  YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           +    G V  +C PY    TG S           +C   C     +     + + SA R+
Sbjct: 158 FLETEGAVKSDCLPYTSGETGKSG----------ECPTTCQDGTPVESAFHYKAASASRL 207

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
            S+  +IM  +  +GPV+  F V+EDF +Y  G+Y  + G  +GGHAV ++G+G+ ++  
Sbjct: 208 -SNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGTSLGGHAVLIVGYGSMNN-H 265

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           DYWI+ N W   WG +GYF+I RG+NECGIE++ 
Sbjct: 266 DYWIVRNSWGSDWGENGYFRILRGTNECGIEKNA 299


>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
          Length = 459

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 28/228 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF +   L  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 249 QASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 306

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY    G   P C+P      C R        + +S++Y +  +   
Sbjct: 307 YAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGA 350

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ ++GP+ V+F VY+DF HY+ G+Y H           +  HAV L+G+GT
Sbjct: 351 CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 410

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 411 DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 458


>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 315

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 81/219 (36%), Positives = 112/219 (51%), Gaps = 28/219 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           QGHCGSCWAF +  A  D  C+  G++   +  S   L++C   L   GC GG       
Sbjct: 101 QGHCGSCWAFASSRAFGDTRCMQ-GLDPVPVLYSPQYLVSCS--LQNMGCTGGTMEDVGD 157

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           +    G+ T+ C PY D         E A+  P C   CV  + + R  +   +   R +
Sbjct: 158 FLRDTGIATDTCVPYVD---------EDAHWEP-CPVSCVDGSPI-RTVQ--LMDFVRYD 204

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE- 190
            + E +M  I  NGP+  S  +YEDF +Y+SG+Y  I G   G HA++L+G+GT   G+ 
Sbjct: 205 GNLEAMMEAIAMNGPIHASMMIYEDFMYYQSGIYHFIYGSGCGMHAIELVGYGTDISGDS 264

Query: 191 --------DYWILANQWNRSWGADGYFKIKRGSNECGIE 221
                   DYWI  N W   WG +GYF+I RG+NECGIE
Sbjct: 265 EAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIE 303


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 110/226 (48%), Gaps = 19/226 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
           Q  C +CW   +   L+DR CI  G      LSV    +CC    G     GC GG  + 
Sbjct: 60  QSACHNCWTVSSTGMLNDRVCIKSGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLE 119

Query: 69  AWRYFVHHGVVT-EECDP---YFDSTGC---SHPGCEPA-YPTPKCVRKCVKK--NQLWR 118
              +  +HG+VT +E  P      + GC     P C+ A Y +P C  KC  K      +
Sbjct: 120 GLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSSPACQTKCTNKAYKTSLQ 179

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
              H + S  R+ + P++I  EI+ NGPV    ++YED   YK+GVY H TG   G H +
Sbjct: 180 QDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTL 239

Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           K+IGWG  + G+DYW+  N WN  WG  G  K+  G    GIE  V
Sbjct: 240 KIIGWGV-ESGQDYWLAVNSWNEEWGDHGMIKLAVG--RTGIENSV 282


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 115/233 (49%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWF 250

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSI 125
               GVV++ C P+     D  G +      + P  +  R+   +   NQ+  N  +   
Sbjct: 251 LRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVT 310

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHA 177
            AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+
Sbjct: 311 PAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHS 370

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/237 (35%), Positives = 123/237 (51%), Gaps = 16/237 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA       SDR+ I       + LS   LL+C       GC GG+   AW +
Sbjct: 210 QGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLLSCNKGQ--RGCQGGHLSRAWTF 267

Query: 73  FVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
               G+V + C P+  + T C  P   P +     +      + L R+  +    AY+I 
Sbjct: 268 IRKFGLVDDYCYPWTGTPTKCKIPK-RPNFDALSSICPPSLGSNL-RSELYRVGPAYKIQ 325

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----MGGHAVKLIGWGTSD 187
            D +DIM EI ++GPV+ +  VY+DF  YKSGVY     +      G H+VK++GWG   
Sbjct: 326 -DEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEET 384

Query: 188 D--GE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 240
           +  G+   YW+ AN W + WG +G+FKI+RG+NEC IEE V+A    + +  +EI +
Sbjct: 385 NIYGQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWAETNDPSREIIT 441


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/215 (37%), Positives = 116/215 (53%), Gaps = 22/215 (10%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +V QG+CGSCWAF +V+  +D  C   G++   +S SV  +L C       GC+GG P++
Sbjct: 157 VVDQGNCGSCWAFSSVQTFADHRC-RSGLDATGVSYSVQYVLDC--DRKDHGCNGGEPVN 213

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           A+ +  + G V   C  Y          C       KC      +N +       + S  
Sbjct: 214 AFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGSAVENVV-------ATSGS 261

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G  +GGHAV++IG+G +D 
Sbjct: 262 KSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDS 317

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           G DYW + N W   WG DGYF+I RG +ECGIE +
Sbjct: 318 GLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEHE 352


>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
          Length = 463

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
 gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
          Length = 158

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 13/169 (7%)

Query: 63  GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
           GG+ ++ WR+    G  +E+C PY  S G + P C         ++ C    +    S  
Sbjct: 1   GGFLVATWRFLAAVGTASEQCVPYV-SFGGAVPACN--------IKSCAVSGE---KSPF 48

Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
           Y + + R      D+MA++  NGP++ +  VY+DF  YKSGVY H++G ++G HA+K++G
Sbjct: 49  YKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVG 108

Query: 183 WGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
           WG  S     YWI AN W   WG DGYF I RG  ECG+ + V +G P+
Sbjct: 109 WGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPA 157


>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
          Length = 462

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 115/228 (50%), Gaps = 28/228 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF ++  L  R  I    + +  LS   +++C  +    GCDGG+P + A +
Sbjct: 252 QASCGSCYAFASMAMLEARIRILTNNSKTPVLSTQQIVSCSEY--SQGCDGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY    G   P C P      C R  V        S ++ +  +   
Sbjct: 310 YVQDFGVVEENCFPYL---GHDSP-CSPK----NCTRYYV--------SDYHYVGGFYGA 353

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ +NGP+ V+F VY DF HY+ GVY H           +  HAV L+G+GT
Sbjct: 354 CNEALMKLELVENGPMAVAFEVYNDFIHYQKGVYHHTGLRDSFNPFEITNHAVLLVGYGT 413

Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            +  GE YWI+ N W   WG DGYF+I RG++ECGIE   V+  P  K
Sbjct: 414 DEKTGEHYWIVKNSWGSYWGEDGYFRILRGTDECGIESIAVSATPIPK 461


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 116/215 (53%), Gaps = 22/215 (10%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +V QG CGSCWAF +++  +D  C   G++   +S SV  +L C       GC+GG P++
Sbjct: 160 VVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDCD--RKDHGCNGGEPVN 216

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           A+ +  + G V   C  Y          C       KC      +N +       + S  
Sbjct: 217 AFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV-------ATSGA 264

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
           +  S  + ++A    +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+++G+G +D 
Sbjct: 265 KSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDS 320

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           G DYW + N W   WG DGYF+I RG +ECGIE++
Sbjct: 321 GLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEQE 355


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 115/233 (49%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSI 125
               GVV++ C P+     D  G +      + P  +  R+   +   NQ+  N  +   
Sbjct: 282 LRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVT 341

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHA 177
            AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +         G H+
Sbjct: 342 PAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 117/239 (48%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG+  SAW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWF 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P F   G +  G     P P+C+          R+   +   +Q+  N
Sbjct: 281 LRRRGVVSDHCYP-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHAN 334

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S  ++IM E+ +NGPV+    V+EDF  Y++G+Y H    +       
Sbjct: 335 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYR 394

Query: 173 -MGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG     DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 395 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 453


>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
 gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
          Length = 804

 Score =  127 bits (318), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 85/234 (36%), Positives = 126/234 (53%), Gaps = 19/234 (8%)

Query: 3   FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
           FT   H  I +I QG CG C+A  AVE ++ R C+    +  +S+ DL+ C    +L   
Sbjct: 64  FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQLNDSRLVSLEDLVTCDHTKYLNIQ 123

Query: 58  GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
            +GC GG  +++ ++    G+V + C+ Y++ T   +P     YPT  C   C  K    
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKHITGDVMGG- 175
           R  K+ +   YR+ S  + +M +IY+NGP+ VS  +  DF +  K G+Y       +GG 
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGG 232

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAV ++GWG  ++G  YW  AN +  +WG  GYFKIKRGSNE  IE    + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 111/215 (51%), Gaps = 22/215 (10%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
           +V QG CGSCWAF +++  +D  C   G++   +S SV  +L C       GC+GG P  
Sbjct: 93  VVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDC--DRKDHGCNGGEPTK 149

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           A+ +    G V   C  Y          C      PK          ++  S   S SA 
Sbjct: 150 AFDFLHSTGTVLTSCVDYTAGADNVVKFC------PKTCDDGSAVENVFAASGSKSGSAI 203

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
            +          +  +GPV  +F V +DF +YKSGVY+H  G  +GGHAV+++G+G +D 
Sbjct: 204 DV----------LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDS 253

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           G DYW + N W   WG DGYF+I RGS+ECGIE++
Sbjct: 254 GLDYWTVRNSWGPDWGEDGYFRIVRGSDECGIEQE 288


>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 104/213 (48%), Gaps = 29/213 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCW F  V AL   F + +G   +LS   L+ C G     GC+GG P  A+ Y  
Sbjct: 153 QGKCGSCWTFSTVGALESHFLLKYGQFRNLSEQQLVDCAGNYDNHGCNGGLPSHAFEYLK 212

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS--AYRINS 132
            +G + EE                 +YP       C  K    + S+   +   A  ++ 
Sbjct: 213 DNGGIAEET----------------SYPYVAVTNTCALK----KGSQSVGVKGGAVNVSL 252

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-----KHITGDVMGGHAVKLIGWGTSD 187
             +D+   IY +GPV ++F V  DF  Y++GVY     K+   DV   HAV  +G+GT +
Sbjct: 253 SEDDLKQAIYSHGPVSIAFQVASDFRDYRAGVYTSKVCKNGPQDV--NHAVLAVGFGTDE 310

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           +  DYWI+ N W   WG  GYFK++RG N CG+
Sbjct: 311 NKVDYWIIKNSWGAVWGDQGYFKMERGVNMCGV 343


>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           P15]
          Length = 627

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 86/234 (36%), Positives = 127/234 (54%), Gaps = 19/234 (8%)

Query: 3   FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFL--C 57
           FT   H  I +I QG CG C+A  AVE ++ R C+ F  +  +S+ DL+ C    +L   
Sbjct: 64  FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNIQ 123

Query: 58  GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
            +GC GG  +++ ++    G+V + C+ Y++ T   +P     YPT  C   C  K    
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKH-ITGDVMGG 175
           R  K+ +   YR+ S  + +M +IY+NGP+ VS  +  DF +  K G+Y       + GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGG 232

Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           HAV ++GWG  ++G  YW  AN +  +WG  GYFKIKRGSNE  IE    + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 80/233 (34%), Positives = 110/233 (47%), Gaps = 31/233 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC----GFLCGDGCDGGYPIS 68
           QG CG+CWA    E L+DR CI     +   LS   + +CC    G L   GC+GG  + 
Sbjct: 56  QGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGYVTSCCNPAHGCLHAKGCNGGRLVE 115

Query: 69  AWRYFVHHGVVT-------------EECDPY-------FDSTGCSHPGCEPA--YPTPKC 106
           A  +   HGVVT             + C PY         + G  +P C+     P P C
Sbjct: 116 AMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDVVQQPVPPC 175

Query: 107 VRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
              C  K   +      H + S  ++ +D + I  EI+ NGPV  +F +Y+DF +YKSGV
Sbjct: 176 RTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVFSAFEMYKDFRYYKSGV 235

Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 217
           Y   T +V   H +K+IGWG +D   +YW+  N WN  WG  G  K+  G N 
Sbjct: 236 YVPTTKEVDCLHVIKIIGWG-ADSVREYWLAMNAWNEEWGDHGLIKMAFGKNR 287


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 114/233 (48%), Gaps = 16/233 (6%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +  QG CGS WA       SDR  I     +N SLS   LL+C       GC+GGY 
Sbjct: 272 IHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 330

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
             AW Y    GVV + C PY  S     PG           R+ ++     ++S  + ++
Sbjct: 331 DRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMT 389

Query: 127 A-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHA 177
             Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY+H         +    G H+
Sbjct: 390 PPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHS 449

Query: 178 VKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           V+++GWG   ++     YW+ AN W   WG DGYFKI RG N C IE  V+  
Sbjct: 450 VRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 72/179 (40%), Positives = 102/179 (56%), Gaps = 18/179 (10%)

Query: 25  GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
           GAVEA++DR CIH    +   +S  DLL+CC   CG GC GG+P  AW +++ +G+VT  
Sbjct: 1   GAVEAMTDRLCIHSNATIKKHISSTDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59

Query: 81  -----EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
                  C  Y     C+H       P  E  +PTP C + C      +   K  + S+Y
Sbjct: 60  SKENPSGCRSY-PFPKCNHHGKGPDAPCPEKIFPTPACNKTCDTPEVNYILDKTKAKSSY 118

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
            + +  + IM EI +NGPVE +F VYEDF HY+SGVY H  G ++GGHA++++GWG  +
Sbjct: 119 NVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEEN 177


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 17/228 (7%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +  QG CG+ WA  + +  SDRF I       + LS   LL+C       GC GG+ 
Sbjct: 214 ISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSAQHLLSC-NNRGQQGCSGGHL 272

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
             AW +    G+V E C P+  ST      C     T      C       R   +    
Sbjct: 273 DRAWMFMRRFGLVDENCYPWKASTE----TCRLRKRTDLRSAGCAPPPNPLRTELYKVGP 328

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH-ITGDVMGG--HAVKLIGW 183
           AYR+ ++  DIM EI  +GPV+ +  VY+DF  Y+SGVYKH +T ++     H+V++IGW
Sbjct: 329 AYRLANE-TDIMQEILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGW 387

Query: 184 G------TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           G      + +    YW++AN W + WG +G F+I++G+NEC IE  V+
Sbjct: 388 GEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQKGTNECEIESFVL 435


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 79/224 (35%), Positives = 108/224 (48%), Gaps = 18/224 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA  A    SDRF +      ++ LS   LL+C        C GGY   AW Y
Sbjct: 222 QGWCGASWAISATRVASDRFALMSKGADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLY 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V E+C P+  +       C+    T      C       R   +    AYR+ +
Sbjct: 281 MRKFGLVDEDCYPWEGTNA----QCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGN 336

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  Y+SG+YKH         G H+V++IGWG     
Sbjct: 337 E-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSA 395

Query: 190 E-------DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                    YW++ N W + WG  G F+I+RG+NEC IE  VVA
Sbjct: 396 HRHHNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 115/239 (48%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH   ++S  LS  +LL+C       GC GG    AW +
Sbjct: 184 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 242

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
               GVV++ C P+      S  G + A P P C+       +  R             N
Sbjct: 243 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 296

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +       
Sbjct: 297 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 356

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 357 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 415


>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|403355865|gb|EJY77523.1| Cathepsin B [Oxytricha trifallax]
          Length = 299

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/202 (37%), Positives = 102/202 (50%), Gaps = 21/202 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCW+F A   L DR C+H    +N+ LS  D+++C       GC GG+      Y
Sbjct: 96  QQSCGSCWSFAATSMLQDRLCLHSNGAVNVQLSQQDMVSC--DFDNAGCSGGWLSHTINY 153

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY--SISAYRI 130
            V HGVVT +C  Y    G             +C  +C   N  +   K Y    ++ ++
Sbjct: 154 LVVHGVVTSQCLAYASVDGAGR----------ECSFRCDDANTEY---KKYGCKFNSLKM 200

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDG 189
            +  E++M EIY NGPV V F VY DF  Y  G Y+   +  + GGHAV + GWG  + G
Sbjct: 201 TTSKEEMMEEIYLNGPVMVGFIVYSDFMSYGGGYYEVSPSASISGGHAVIVHGWGY-NGG 259

Query: 190 EDYWILANQWNRSWGADGYFKI 211
             YWI  NQW  +WG+ GYF I
Sbjct: 260 RLYWIAQNQWGTTWGSSGYFNI 281


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 79/239 (33%), Positives = 113/239 (47%), Gaps = 35/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C + WAF      SDR  I     M   LS  +L++C      DGC GG    AW +
Sbjct: 220 QGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISC-DTRHQDGCAGGRIDGAWWF 278

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-------------VKKNQLWRN 119
               GVVT++C P+        P  + A    +C+ +                 +  + N
Sbjct: 279 MRRRGVVTQDCYPF-------SPPEQSAVEVARCMMQSRAVGRGKRQATAHCPNSHSYHN 331

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM------ 173
             + S   YR++++  +IM EI  NGPV+    V+EDF  YKSG+++H   +        
Sbjct: 332 DIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYR 391

Query: 174 --GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
               H+V++ GWG   D       YWI AN W ++WG DGYF+I RG NEC IE  V+ 
Sbjct: 392 KHATHSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 119/230 (51%), Gaps = 13/230 (5%)

Query: 8   HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
           ++   + QG CG+ WA   V+  +DRF I     +S  LS   LL+C   L   GC GG+
Sbjct: 208 YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLLSC-NNLNQQGCQGGH 266

Query: 66  PISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
              AW +    G++TEEC P+    + C+ P  +      +C  +    N     ++ + 
Sbjct: 267 LTRAWNWIRKFGLITEECYPWQGRMSTCAVPK-KKKETMAQCPSRVRSNNDRTTKTRLHR 325

Query: 125 IS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKL 180
           +   YR+ ++ E IM EI  +GPV+    V  DF  YKSGVYK     +G   G H+V++
Sbjct: 326 VGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRI 384

Query: 181 IGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           +GWG    G     YWI +N W   WG +GYF+I +G +EC IE+ V+A 
Sbjct: 385 VGWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/217 (36%), Positives = 106/217 (48%), Gaps = 30/217 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
           QG+C S WA       +DR CI      +  LS  +L++C     GDG    CDGG    
Sbjct: 49  QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 103

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
           AW   ++ G+VT       E C PY +   C H G      C     T    C +KCV K
Sbjct: 104 AWELTMNKGIVTGGNFDSNEGCQPYKNRP-CDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 162

Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           N    + +  H +   Y  + ++ + I  EI   GPV     VYE+F  YK G+YK  TG
Sbjct: 163 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTG 222

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
           +++G H VKLIGWG   DG +YW+  N WN +WG DG
Sbjct: 223 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDG 259


>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
           gigas]
          Length = 464

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 84/219 (38%), Positives = 106/219 (48%), Gaps = 17/219 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q +C S WAF  V+  +DR  I     L+  LS   L++C       GC GG    AW +
Sbjct: 213 QKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLVSCNTGRGQRGCRGGSTEKAWWF 272

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G++TEEC PY  S G     C     T      C   N        Y    YR+  
Sbjct: 273 VKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLYVTPPYRVRQ 322

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIGWG----TSD 187
           D EDI AEIY+NGPV+ +F V  DF  Y+SGVY+H   D+     +V++IGWG       
Sbjct: 323 DEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKTNKKG 382

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
               YWI  N W   WG  G F+I RG N  GIEE+V+A
Sbjct: 383 KKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421


>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
          Length = 463

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
          Length = 463

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
 gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 102/215 (47%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   +W +
Sbjct: 123 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 180

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G   + C PY    G    G  P     +C    +  ++       Y     R  +
Sbjct: 181 LENTGTPLDTCIPYASGRGTFSSGTCPT----QCKIASMSMSK-------YKAKNTRYIT 229

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV LIG+G  + G +Y
Sbjct: 230 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 288

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           W+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 289 WLAANSWGANWGMSGYFKIAQG--EGGIENQVYAG 321


>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
          Length = 463

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 120/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLSVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 118/239 (49%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH   ++S  LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCHGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+      S  G + A P P C+          R+   +   + +  N
Sbjct: 177 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 230

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +       
Sbjct: 231 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 290

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 291 RHGTHSVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 349


>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
          Length = 259

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 114/222 (51%), Gaps = 24/222 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSC+AF A   +SDR CI     +NL LS  +L++C       GC GG+  +   Y
Sbjct: 53  QGSCGSCYAFAASGMMSDRLCIKSNGQINLVLSPQELVSC--DYQNYGCSGGWMTNTLYY 110

Query: 73  FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
            + +G+ +E C PY  F+S             T  C  +C   N  +   K    ++ +I
Sbjct: 111 LMSYGIPSETCLPYDMFNSE------------TKACSGRCDSPNYEYTRHKCKKGTS-KI 157

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
            SDPE IM +I +NGP  V+F  +EDF ++  G+YK+ +G  + GHA KL GWG    G 
Sbjct: 158 MSDPETIMRDIMENGPSIVAFQAFEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGR 217

Query: 191 DYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLP 229
            YWI  NQ+   WG     G++KI  G  E G    V + +P
Sbjct: 218 LYWIGQNQFGLGWGGRGDYGFYKIYDG--EVGFGSAVWSCIP 257


>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
 gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
          Length = 455

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 111/228 (48%), Gaps = 28/228 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSC++F  +  L  R  I          S   +++C  +    GCDGG+P    +Y
Sbjct: 245 QAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVSCSQY--SQGCDGGFPYLIGKY 302

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V E+C PY   TG   P   PA        KC K    +  S ++ +  +    
Sbjct: 303 IQDFGIVEEDCFPY---TGSDSPCNLPA--------KCTK----YYASDYHYVGGFYGGC 347

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWGT 185
               +M E+ KNGP+ V+  VY DF +YK G+Y H TG         +  HAV L+G+G 
Sbjct: 348 SESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHH-TGLRDANNPFELTNHAVLLVGYGQ 406

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               GE YWI+ N W   WG +G+F+I+RG++EC IE   VA  P  K
Sbjct: 407 CHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATPIPK 454


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 112/221 (50%), Gaps = 15/221 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  SDRF I       + LS   LL+C       GC GGY   AW +
Sbjct: 223 QGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLLSC-NNRGQQGCKGGYLDRAWLF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V EEC P+   TG  +  C     +      C       R   +    AYR+ +
Sbjct: 282 MRKFGLVDEECYPW---TG-RNDQCRLRKRSNLKTAGCQNPPNSLRTELYKVGPAYRLGN 337

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  Y+SGVY+H     +   G H+V++IGWG     
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSY 396

Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                 YW++AN W  +WG +G F+I++G+NEC IE  V+A
Sbjct: 397 RGPPLKYWLVANSWGHNWGENGLFRIQKGTNECEIESYVLA 437


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 111/221 (50%), Gaps = 14/221 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF-GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
           QG CG+ WA    +  SDRF +   G +  L     L  C      GCDGGY   AW + 
Sbjct: 217 QGWCGASWAISTAQVASDRFAVMSKGTDSVLLSAQHLLSCNKKGQRGCDGGYLDRAWLFM 276

Query: 74  VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
              G+V E+C P+       +  C+    T      C       R   +    AYR+ ++
Sbjct: 277 RKFGLVDEQCYPWKGV----YEQCKLQKRTNLEAAGCRAPANPLRKELYKVGPAYRLGNE 332

Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWG---TSD 187
             DIM EI  +GPV+ +  VY+DF  Y+SG+Y H     +   G H+V++IGWG   ++D
Sbjct: 333 -TDIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTD 391

Query: 188 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            G    YW++ N W + WG +G F+I+RG NEC IE  VVA
Sbjct: 392 SGLPIKYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432


>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
          Length = 463

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I      +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCDGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY   TG   P C+P      CVR        + +S+++ +  +   
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP-CKPK---EDCVR--------YYSSEYHYVGGFYGG 354

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+  +GP+ V+F VY DF HY+ G+Y H           +  HAV L+G+GT
Sbjct: 355 CNEALMKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 414

Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 415 DPVSGMDYWIVKNSWGIGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 83/215 (38%), Positives = 104/215 (48%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   +W +
Sbjct: 160 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 217

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G   + C PY    G    G         C  +C  K      SK+ + +   I S
Sbjct: 218 LENTGTPLDSCIPYASGRGTFSSGT--------CPTQC--KIASMSMSKYKAKNTVYI-S 266

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I   I   G V+  FTVY D   YKSGVYKHI   V+GGHAV LIG+G  + G +Y
Sbjct: 267 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVALIGFGV-EGGSNY 325

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           W+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 326 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 358


>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
          Length = 446

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 111/225 (49%), Gaps = 31/225 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYP-ISAW 70
           QG CGSC+AF ++     R  +    N  + V    D++ CC +    GCDGG+P +   
Sbjct: 241 QGGCGSCYAFSSMAMNEARIRV-MSNNTQMPVFSPQDIVDCCQY--SQGCDGGFPYLVGG 297

Query: 71  RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           +Y    G+V E CDPY                     RKC   +   R +  Y       
Sbjct: 298 KYAEDFGLVDESCDPYVGED-----------------RKCKSTSCSRRYATRYRYVGGYY 340

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDV----MGGHAVKLIGWG 184
            +  E  M    + GP+ VSF VY+DF HYKSGVY+H  +T       +  HAV L+G+G
Sbjct: 341 GACNEQEMKLALQRGPLSVSFMVYDDFMHYKSGVYRHSGLTDKYNPFEITNHAVLLVGYG 400

Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            +D+G  YWI+ N W + WG +GYF+I RG++EC IE   V   P
Sbjct: 401 -ADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 15/221 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  +DRF I      +  LS   LL+C       GC GGY   AW +
Sbjct: 281 QGWCGASWAISTADVATDRFSIMSKGAEDAELSAQHLLSC-NNRGQQGCRGGYLDRAWLF 339

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V ++C P+    G     C+           C K     R   +    AYR+ +
Sbjct: 340 MRKFGLVDKDCYPWTGKNG----QCKLRKRNNLQAAGCRKPPNPLRTELYKVGPAYRLGN 395

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  YK+G+Y+H     +   G H+V++IGWG     
Sbjct: 396 E-TDIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSY 454

Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                 YW++ N W  +WG +G FKI+RG+NEC IE  V+A
Sbjct: 455 RGPPLKYWLVVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/238 (35%), Positives = 118/238 (49%), Gaps = 34/238 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH   ++S  LS  +LL+C       GC GG    AW +
Sbjct: 290 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 348

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+      S  G + A P P C+          R+   +   + +  N
Sbjct: 349 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 402

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +       
Sbjct: 403 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 462

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+
Sbjct: 463 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVL 520


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 105/217 (48%), Gaps = 18/217 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   +W +
Sbjct: 108 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 165

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G   + C PY    G    G         C  +C  K      SK+ + +   I S
Sbjct: 166 LENTGTPLDTCIPYASGGGTFSSGT--------CPTQC--KIASMSMSKYKAKNTVYI-S 214

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV LIG+G  + G +Y
Sbjct: 215 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGV-EGGSNY 273

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           W+ AN W  +WG  GYFKI +G  E GIE  V AG P
Sbjct: 274 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 81/242 (33%), Positives = 113/242 (46%), Gaps = 17/242 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS WA       SDRF I       + L+   LLAC        C GG+  +AW+Y
Sbjct: 316 QGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLLACVRR--QQACSGGHLDTAWQY 373

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               GVV +EC PY  +       C+           C     + R + +    AY +N+
Sbjct: 374 LRRVGVVNDECYPYIAAKN----QCKINDGDTLVSANCELPANVNRTAMYRMGPAYSLNN 429

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----DVMGGHAVKLIGWGTSD 187
           +  DIM EI + G V+    VY DF  Y++G+Y+H        +    H+V+LIGWG   
Sbjct: 430 E-TDIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEER 488

Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
            G D   YWI  N W   WG +G F+I RG+NEC IE  V+A  P     V+ + +    
Sbjct: 489 VGYDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNPYVHQHVQTVRNVGDL 548

Query: 245 ED 246
           ++
Sbjct: 549 QE 550


>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
           leucogenys]
          Length = 463

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 119/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY+ G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYEKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 115/235 (48%), Gaps = 22/235 (9%)

Query: 9   VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
           +  +  QG CGS WA       SDR  I     +N SLS   LL+C       GC+GGY 
Sbjct: 216 IHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 274

Query: 67  ISAWRYFVHHGVVTEECDPYFDSTGCSHPGC---EPAYPTPKCVRKCVKKNQLWRNSKHY 123
             AW Y    GVV + C PY          C   +  Y   + +R C   +Q   +S  +
Sbjct: 275 DRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTNRQGLR-CPSGDQ---DSTAF 330

Query: 124 SISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMG 174
            ++  Y+++S  EDI  E+  NGPV+ +F V+EDF  Y  GVY+H         +    G
Sbjct: 331 KMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEG 390

Query: 175 GHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            H+V+++GWG   ++     YW+ AN W   WG DGYFKI RG N C IE  V+ 
Sbjct: 391 YHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIG 445


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 111/223 (49%), Gaps = 16/223 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA   V+  SDRF I       + LS   L++C       GC GGY   AW +
Sbjct: 254 QGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLISC-NNRGQRGCKGGYLDRAWLF 312

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRIN 131
               GVV E+C P+          C            C ++N     ++ Y +  AYR+ 
Sbjct: 313 MRKFGVVDEDCYPWLSG---RSDKCRIPRRGKLSDAGCQRRNSYNLRNEMYKVGPAYRLG 369

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH---ITGDVMGGHAVKLIGWGTSDD 188
           ++  DIM EI  +GPV+ +  V+ DF HY+SG+Y H         G H+V+++GWG    
Sbjct: 370 NE-TDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPS 428

Query: 189 GED-----YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             +     +W +AN W R WG DGYF+I RG+NEC IE  V+ 
Sbjct: 429 PYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFVLG 471


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 76/104 (73%), Gaps = 1/104 (0%)

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           S+Y +     DIM EI KNGPV+  F ++EDF  YKSG+Y + TG ++GGHA+++IGWG 
Sbjct: 10  SSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV 69

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            ++G  YW++AN WN  WG  GYF+++RG+NECGIE  + AGLP
Sbjct: 70  -ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/191 (38%), Positives = 103/191 (53%), Gaps = 24/191 (12%)

Query: 54  GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCE 98
           G +C DGC  G P +AW +   +G+ TE        C PY +   C H        P  E
Sbjct: 152 GHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPY-NFPKCGHHQQDSKYQPCPE 210

Query: 99  PAYPTPKCVRKCVKKN--QLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGPVEVSFTVY 154
             Y TP C+ +C  KN        +H++   S Y++    ++I  EI  NGP   +F++Y
Sbjct: 211 KNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT-DNIKKEIMTNGPTSAAFSMY 269

Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
           +DF  Y+SGVYKH +G +MG H V++IGWGT   G DYW++ N WN  WG  G FKI +G
Sbjct: 270 DDFLSYESGVYKHTSGTLMGEHGVEIIGWGTK-QGVDYWLVMNSWNEGWGVHGTFKIAQG 328

Query: 215 SNECGIEEDVV 225
             +CGI +  +
Sbjct: 329 --DCGINDMAI 337


>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
          Length = 463

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 120/230 (52%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W  SWG DGYF+I RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIESIAVAATPIPK 462


>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 515

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 74/167 (44%), Positives = 89/167 (53%), Gaps = 18/167 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA+SDR CIH G      LS  DLL+CC + CG GCDGG+P  AW Y
Sbjct: 103 QSSCGSCWAFGAVEAMSDRLCIHSGAKYQKGLSAVDLLSCC-WKCGYGCDGGFPAQAWNY 161

Query: 73  FVHHGVVT-------EECDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWR 118
           +   G+VT         C  Y       D  G  HP C    Y TP+C +KC      + 
Sbjct: 162 WSTDGIVTGGSKENPSGCRSYPFPSCSHDERG-RHPLCPSEIYHTPRCTKKCDTDKLHYS 220

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
                + S+Y +     +IM EI  NGPVE  F VYEDF  Y+ G+Y
Sbjct: 221 AELTKANSSYNVLDSDREIMMEIMNNGPVEAVFDVYEDFLQYEKGIY 267


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 93/166 (56%), Gaps = 17/166 (10%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSCWAFGAVEA++DR CI  G   S  LS  DL++CC   CGDGC GG+P  AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170

Query: 73  FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
           +V  G+VT         C PY        T   +P C    Y TP+C + C K  +  + 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230

Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
             KHY   +Y + S+ + I  EI   GPVE +F VYEDF +YKSG+
Sbjct: 231 QDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGI 276


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/229 (36%), Positives = 118/229 (51%), Gaps = 32/229 (13%)

Query: 8   HVEILVIQGHCGSCWAFGAVEALSDRFCIH-FGM-NLSLSVNDLLACCGFLCGD-GCDGG 64
           ++  +  Q  CGS WA      + DRF I  FG  N+ +S   LL+C   L G  GC+GG
Sbjct: 198 YISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLSC--HLKGQRGCNGG 255

Query: 65  YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
               A+ +   HG+V+E+C PY                        V + ++  + + Y 
Sbjct: 256 NLDIAFDFVKTHGLVSEQCFPY---------------------EGAVTQCRIGNDCRRYR 294

Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVM--GGHAVKLI 181
           +      S  EDIM +I  +GP     TVY+DF HY+ G+Y+H   GD +  G H+V+++
Sbjct: 295 VGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIV 354

Query: 182 GWGTSDDGED-YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           GWG  +D ED YWI+AN W  SWG  GYF+I RG +  GIE  V+  LP
Sbjct: 355 GWG--EDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 102/215 (47%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   +W +
Sbjct: 23  QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 80

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G   + C PY    G         + +  C  +C   +    +   Y     R  +
Sbjct: 81  LENTGTPLDTCIPYASGRG--------TFSSGTCPTQCKIASM---SMSKYKAKNTRYIT 129

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV LIG+G  + G +Y
Sbjct: 130 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 188

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           W+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 189 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 221


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 57/93 (61%), Positives = 71/93 (76%), Gaps = 1/93 (1%)

Query: 138 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 197
           MAEI K GPVE +FTVY DF  YKSGVY+H TG+ +GGHA+K++GWG ++DG DYW++AN
Sbjct: 1   MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWG-NEDGHDYWLVAN 59

Query: 198 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
            WN  WG  G+FKI RG +ECGIE  + AG P 
Sbjct: 60  SWNEDWGDQGFFKILRGVDECGIESQITAGSPK 92


>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
 gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
          Length = 528

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/231 (31%), Positives = 113/231 (48%), Gaps = 32/231 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYPISAWR 71
           QG CGSC++F     +  R  + F  N    V    ++++C  +    GCDGG+     +
Sbjct: 315 QGQCGSCYSFSTTAMMEARKRV-FTQNKEQPVYSPENIISCSFY--SQGCDGGFAYLISK 371

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC-VRKCVKKNQLWRNSKHYSISAYRI 130
           +    G++ E+CDPY   TG  H          KC + +     Q W N ++     Y  
Sbjct: 372 WGEDFGIIAEQCDPY---TGTPH----------KCNLNQACSTRQYWTNYRY--TGGYYG 416

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG----------GHAVKL 180
               E++  ++ K GP+ VS  VY D  +Y SG+Y+H++   +            H V +
Sbjct: 417 AVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSGIYRHVSSSKLTSPVPNPFELTNHVVLI 476

Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
           +GWG ++ GE YWI+ N W  S+G DGYF I RG +EC IE +  + +P+ 
Sbjct: 477 VGWGENEKGEKYWIVKNSWGTSFGMDGYFLIARGVDECAIESENASAIPTQ 527


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 113/229 (49%), Gaps = 20/229 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+CG+ +AF      +DR  IH G  L   LS   L++C       GC+GG+   AW  
Sbjct: 233 QGNCGASYAFSTSTVAADRLSIHSGGELKDMLSAQYLISCTTDHHQKGCEGGHVDRAWWQ 292

Query: 73  FVHHGVVTEECDPYFDSTGCSHPG--CEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YR 129
               G V+++C PY  S   + PG      Y  PK   +C     +   SK Y  S  YR
Sbjct: 293 LRRVGTVSKDCYPY-TSGDTNDPGKCLMSKYKLPKKNIECPVGQGI--TSKLYQASPPYR 349

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG---------HAVKL 180
           I +   +IM EI  NGPV+    V +DF  Y+ GVYKH                 H+V++
Sbjct: 350 IAAKEREIMNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRI 409

Query: 181 IGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           IGWGT   G+D   YW+ AN W R WG  G+F+I RGS+E  IE  VV 
Sbjct: 410 IGWGTDYTGDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH  G M   LS  +LLAC       GC GG    AW +
Sbjct: 78  QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLACDTH-HQQGCRGGRLDGAWWF 136

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+   +  N    N+  Y ++
Sbjct: 137 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 196

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 197 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 256

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 257 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 309


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 18/224 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA       SDRF +      ++ LS   LL+C        C GGY   AW Y
Sbjct: 222 QGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLY 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V E+C P+  +    +  C+    T      C       R   +    AYR+ +
Sbjct: 281 MRKFGLVDEDCYPWEGT----NVQCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGN 336

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  Y+SG+YKH         G H+V++IGWG     
Sbjct: 337 E-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSA 395

Query: 190 E-------DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                    YW++ N W + WG  G F+I+RG+NEC IE  VVA
Sbjct: 396 HRYRNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 105/215 (48%), Gaps = 18/215 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CG+CWAF A   L+ R CI      N+ LS    + C        C GGY   +W +
Sbjct: 108 QQTCGACWAFSANYVLAHRLCIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 165

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
             + G   + C PY    G         + +  C  +C  K      SK+ + +   I S
Sbjct: 166 LENTGTPLDTCIPYASGRG--------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-S 214

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              +I   I   G V+  FTVY D   YKSGVYKH+   V+GGHAV LIG+G  + G +Y
Sbjct: 215 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 273

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
           W+ AN W  +WG  GYFKI +G  E GIE  V AG
Sbjct: 274 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 306


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 19/183 (10%)

Query: 23  AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
           A  AV A+SDR CI  G   ++ LS  DL++CC   CG GCDGG+P  AW Y+V HG+VT
Sbjct: 42  AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVT 100

Query: 81  -------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSI 125
                    C PY     C H      P C +  Y TP+C RKC K     + + KHY  
Sbjct: 101 GGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGG 159

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
            +  +  +   I  EI   GPVE    ++EDF +YKSG+Y++ TG  +G H V++IGWG 
Sbjct: 160 ISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI 219

Query: 186 SDD 188
            ++
Sbjct: 220 ENE 222


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 67/152 (44%), Positives = 91/152 (59%), Gaps = 11/152 (7%)

Query: 82  ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 137
           EC  + D+ G     C+   P+P C   C  +N  ++ S    +H++        + ++I
Sbjct: 229 ECSHHVDTKGME--PCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEI 284

Query: 138 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 197
             EI  NGPV  +FTVYEDF +YKSGVYKH+ G  +GGHAVK+IGWG  D  E YW++ N
Sbjct: 285 KREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGI-DQNEQYWLVMN 343

Query: 198 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
            WN +WG  G FKI  G  ECGI+ +V AG+P
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 373


>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
          Length = 469

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 118/236 (50%), Gaps = 29/236 (12%)

Query: 8   HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
           +V  +  Q  CGSC++F ++  L  R  I    + +  LS   +++C  +    GCDGG+
Sbjct: 251 YVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSTQQIVSCSEY--SQGCDGGF 308

Query: 66  P-ISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 123
           P + A +Y    GVV E+C PY    T C         P  +C R        +  S + 
Sbjct: 309 PYLIAGKYTQDFGVVEEDCFPYTARDTQC--------VPKKECPR--------YYASDYQ 352

Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHA 177
            +  +    +   +  E+ ++GP+ V+F VY DF HY+ GVY H           +  HA
Sbjct: 353 YVGGFYGGCNEALMKLELVRHGPMAVAFEVYNDFLHYREGVYHHTGLRDPFNPFELTNHA 412

Query: 178 VKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           V L+G+GT    G DYWI+ N W  +WG DGYF+I+RGS+EC IE   VA  P  +
Sbjct: 413 VLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFRIRRGSDECAIESIAVAATPIPR 468


>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
          Length = 463

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
          Length = 455

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 109/229 (47%), Gaps = 30/229 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGSC+ F  +  L  R  I    + S  LS   +++C  +    GCDGG+P    +Y
Sbjct: 245 QGSCGSCYCFATMGMLEARLRILTNNSQSPVLSPQQVVSCSEY--SQGCDGGFPYLTGKY 302

Query: 73  FVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
               G+V E C PY   DS       C   Y                  +++  +  +  
Sbjct: 303 VQDFGIVDESCFPYMGKDSPCGISQSCRRGYA-----------------AEYKYVGGFYG 345

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDV----MGGHAVKLIGWG 184
                 +M E+ KNGP+ V+  VY DF  YK G+Y H  +T  V    +  HAV L+G+G
Sbjct: 346 GCSEAAMMVELVKNGPMAVALEVYSDFMSYKGGIYHHTGLTDHVNPFELTNHAVLLVGYG 405

Query: 185 TSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
                G+ YWI+ N W  SWG DGYF+I+RGS+EC IE   VA  P  K
Sbjct: 406 RCHMTGQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPK 454


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 78/221 (35%), Positives = 111/221 (50%), Gaps = 15/221 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  SDR+ I         LS   LL+C       GC GGY   AW +
Sbjct: 223 QGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQQLLSC-NNRGQQGCRGGYLDRAWLF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
               G+V +EC P+          C+    +      C K +   R   +    AYR+ +
Sbjct: 282 MRKFGLVDKECYPWSGKND----QCKLRKRSTLKAAGCRKPSHPLRTELYKVGPAYRLGN 337

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
           +  DIM EI  +GPV+ +  VY+DF  YKSG+Y+H     +   G H+V++IGWG     
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSY 396

Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
                 YW++AN W  +WG +G FKI++G+NEC IE  V+A
Sbjct: 397 RGPPLKYWLVANSWGYNWGDNGLFKIQKGTNECEIESYVLA 437


>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
 gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
 gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
 gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
 gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
 gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
 gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
          Length = 463

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 107/215 (49%), Gaps = 23/215 (10%)

Query: 15  QGH-CGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLAC--CGFLCGDGCDGGYPISA 69
           QG  C SCWA  A   L+DR C+  G  +   LS  +L+ C   G L   GC GG   + 
Sbjct: 53  QGQKCSSCWAMTATGVLADRLCVASGGKVKKVLSPQELIDCDRNGNL---GCGGGRLDTP 109

Query: 70  WRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
             YF  +GVVTE+C+ Y             A     C   C         +K++S   YR
Sbjct: 110 LAYFRDNGVVTEKCESY------------KATQASSCSNTCDDGTSFSNTTKYHSKDCYR 157

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDD 188
           ++S  E   A+IY NGP+   F +Y D  +YKSGVY K  +      HA ++IGWG  +D
Sbjct: 158 LSS-IEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGV-ED 215

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
           G  YW+ AN W   WG  G FKI+ G+NE G E +
Sbjct: 216 GVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFEAN 250


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 85/243 (34%), Positives = 118/243 (48%), Gaps = 17/243 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  +DRF I     M  +LS   LL+C   L   GC GG+  SAW +
Sbjct: 242 QGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNW 300

Query: 73  FVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
            +  G+VTEEC P+   +T C+               +  K + L R    Y ++     
Sbjct: 301 VMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLRRVGLMYRVAT---- 356

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDD 188
              E IM EI   G V+    V ++F  Y+SGVYK    D+    G H V+++GWG    
Sbjct: 357 --EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQ 414

Query: 189 G---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFE 245
                 YWI++N W   WG  GYF+I +G+NEC IE+ VVA +P   N    I+     E
Sbjct: 415 NGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMPDIDNFCN-ISDQSFRE 473

Query: 246 DAS 248
           +AS
Sbjct: 474 NAS 476


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 115/228 (50%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF ++  L  R  I   ++    LS   +++C  +    GC+GG+P + A +
Sbjct: 247 QASCGSCYAFSSMGMLESRIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGK 304

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y   +G+V E   PY   TG   P          C  K     Q +  ++++ +  +   
Sbjct: 305 YVSDYGIVEESDLPY---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGG 349

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+   GP+ V+F VY+DF HY+SGVY H           +  HAV L+G+GT
Sbjct: 350 CNEAYMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGT 409

Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               GE YWI+ N W  SWG  GYF+I+RG++EC IE   V+  P  K
Sbjct: 410 DQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEPIIK 457


>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
          Length = 455

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 119/228 (52%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC++F ++  +  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 244 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 301

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E+C PY   TG   P C       K    C +    + +S+++ +  +   
Sbjct: 302 YAQDFGLVEEDCFPY---TGTDSP-C-------KLKEGCFR----YYSSEYHYVGGFYGG 346

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+   GP+ V+F VY DF HY+ GVY H           +  HAV L+G+GT
Sbjct: 347 CNEALMKLELVHRGPMAVAFEVYNDFLHYRQGVYHHTGLRDPFNPFELTNHAVLLVGYGT 406

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            +  G DYWI+ N W  SWG DGYF+I+RG++EC IE   +A  P  K
Sbjct: 407 DAASGLDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIALAATPIPK 454


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 57/102 (55%), Positives = 71/102 (69%), Gaps = 1/102 (0%)

Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
           SAY +      I  EI  NGPV   FT+YED   YKSGVY+H  G ++GGHA+K+IGWGT
Sbjct: 113 SAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 172

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
             +G  YW++AN W   WG +G+FKI+RG NECGIE +VVAG
Sbjct: 173 -QNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 213



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 20  SCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDG 63
           SCWAFGA E +SDR CI         +S  D++ CCG  CG GCDG
Sbjct: 66  SCWAFGAAEVISDRICIATKGARQPIISPMDMVDCCGKYCGYGCDG 111


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 111/239 (46%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCQGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
               GVV++ C P+       H   E A P P+C+       +  R             N
Sbjct: 177 LRRRGVVSDHCYPF-----SGHERNE-AGPAPRCMMHSRAMGRGKRQATARCPNSYVHAN 230

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
             +    AYR+ S+ +DIM E+ +NGPV+    V+EDF  Y+SG+Y H            
Sbjct: 231 DIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYR 290

Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 291 RHGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLG 349


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 115/230 (50%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC+AF ++  L  R  I   ++    LS   +++C  +    GCDGG+P + A +
Sbjct: 247 QGSCGSCYAFASMGMLESRIQIQSQLSQKPILSPQQVVSCSNY--SQGCDGGFPYLIAGK 304

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSISAYR 129
           Y    G+V E   PY    G   P              C  K+  Q +  ++++ +  + 
Sbjct: 305 YLNDFGIVEESDFPYI---GSDSP--------------CTLKDSYQRYYTAEYHYVGGFY 347

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+   GP+ V+F VY+DF HY+SGVY H           +  HAV L+G+
Sbjct: 348 GGCNEAYMKLELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGY 407

Query: 184 GTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT    GE YWI+ N W  SWG  G+F+I+RGS+EC IE   V+  P  K
Sbjct: 408 GTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANPIIK 457


>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
          Length = 463

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 120/228 (52%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARLRILTNNSQTPVLSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY   T    P C       K  + C +    + +S+++ +  +   
Sbjct: 310 YAQDFGLVEEACFPY---TATDSP-C-------KVKKDCFR----YYSSEYHYVGGFYGG 354

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+  +GPV VSF VY+DF HY  G+Y H           +  HAV L+G+GT
Sbjct: 355 CNEALMKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGT 414

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            S  G DYWI+ N W+ +WG DGYF+I+RG++ECGIE   +   P  K
Sbjct: 415 DSASGLDYWIVKNSWSATWGEDGYFRIRRGTDECGIESIALTATPIPK 462


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 80/231 (34%), Positives = 116/231 (50%), Gaps = 23/231 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q +CGSCWA  +   +SDR C+     + +S++ + A    + GDGC+GG    A+  F+
Sbjct: 95  QSNCGSCWAVSSAGVMSDRICVATNGKVKVSISGI-ATASCVGGDGCNGGLEEVAFEKFI 153

Query: 75  HHGVVT-------EECDPYFDSTGCSH-------PGCE--PAYPTPKCVRKCVKK-NQLW 117
            +G  T       + C PY     C+H       P C+  P Y    C  +C K  ++ +
Sbjct: 154 ENGFPTGSEVDKHQGCQPY-PFKHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKY 212

Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGH 176
               +Y    Y   SD   I  EI  NGPV VSFTVYE F +Y  G+Y+   G+ + G H
Sbjct: 213 EEDLYYGKEQYGF-SDEAPIQREIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYH 271

Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK-IKRGSNECGIEEDVVA 226
           AV+++GWG  ++G  YW +AN WN  WG +        G +E  IE+  VA
Sbjct: 272 AVRVVGWGV-ENGTKYWKIANSWNEQWGRERLLPHTPAGVDESDIEDGGVA 321


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 112/226 (49%), Gaps = 16/226 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CGS W+       SDR  I     +N +LS   LL+C       GC+GGY   AW Y
Sbjct: 204 QGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWY 262

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRIN 131
               GVV + C PY  S     PG           R+ ++     ++S  + ++  Y+++
Sbjct: 263 IRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVS 321

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHAVKLIGW 183
           S  EDI  E+  NGPV+ +F V+EDF  Y  GVY+H         +    G H+V+++GW
Sbjct: 322 SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGW 381

Query: 184 G---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           G   ++     YW+ AN W   WG DGYFK+ RG N C IE  V+ 
Sbjct: 382 GVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIG 427


>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
 gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
          Length = 314

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 106/223 (47%), Gaps = 24/223 (10%)

Query: 12  LVIQGHCGSCWAFGAVEALSDRFCIHFGMNL---SLSVNDLLACCGFLCGDGCDGGYPIS 68
           ++ Q  CGSCWAF + E LSDR CI         +LS   L+AC      DGC GG P  
Sbjct: 105 ILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVAC-DVYGNDGCSGGIPQL 163

Query: 69  AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSIS 126
           AW Y    G+ T+ C PY    G  +           C R C       L+R +K +++ 
Sbjct: 164 AWEYMELKGLPTDSCVPYTAGNGTVY----------SCQRSCSDSEDYSLYR-AKPFTL- 211

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-DVMGGHAVKLIGWGT 185
             +  S  + I   I   GP+  +  VYEDF  Y SGVY    G  ++GGHA+K++GWG 
Sbjct: 212 --KTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHAIKIVGWGF 269

Query: 186 SDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
               + +YWI+AN W   WG  G+F I      C I  D  A 
Sbjct: 270 DQTSQLNYWIVANSWGADWGQQGFFFISM--ETCSISSDASAA 310


>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
 gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 119/232 (51%), Gaps = 31/232 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K L
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKLL 464


>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
 gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
 gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
 gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
          Length = 464

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 119/232 (51%), Gaps = 31/232 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K L
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKLL 464


>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
 gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
          Length = 461

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 116/228 (50%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF ++  L  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 250 QASCGSCYAFASMGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 307

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY  +         P  P   C R        + +S ++ +  +   
Sbjct: 308 YAQDFGLVEEACFPYMGAD-------FPCKPKKDCFR--------YYSSDYHYVGGFYGG 352

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+  +GP+ V+F VY+DF HY++G+Y H           +  HAV L+G+GT
Sbjct: 353 CNEALMKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTGLRDPFNPFELTNHAVLLVGYGT 412

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            +  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 DTASGMDYWIVKNSWGAGWGENGYFRIRRGTDECAIESIAVAATPVPK 460


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG+   AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCNTHH-QQGCRGGHLDGAWWF 281

Query: 73  FVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G   P    +  T +  R+      N    N+  Y ++
Sbjct: 282 LRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQVT 341

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
            AYR+ S+  +IM E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+
Sbjct: 342 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 402 VKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 171 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWF 229

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+             A PTP+C+          R+   +    Q+  N
Sbjct: 230 LRRRGVVSDNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSN 283

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H            
Sbjct: 284 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 343

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 344 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 402


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 87/264 (32%), Positives = 125/264 (47%), Gaps = 37/264 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
           Q  C +CWA  +V   +DR CI  G  ++  LS+  L +CC    G    DGC  G    
Sbjct: 62  QAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYLTSCCNHANGCPKSDGCRRGSVAE 121

Query: 69  AWRYFVHHGVVT-------------EECDPYFDSTGCSH-PGCEPAYPTPKCVRK----- 109
              +  +HG+VT             + C PY     C+H PG +  YP  +C  K     
Sbjct: 122 GLIFMKNHGIVTGGEYKPPKKLGNDDGCWPY-PFPKCNHVPGMKVKYP--RCGSKVGRLA 178

Query: 110 ----CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
               C   +       H + S  R+   PE I  EI+ NGPV    T++EDF  YKSGVY
Sbjct: 179 APSHCDGLHCRRAGDVHRAKSWGRLPISPEKIKQEIFDNGPVAAIMTIHEDFRLYKSGVY 238

Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
           ++ TG ++G H +KLIGWG  + G++YW+  N WN  WG  G  K+  G N   ++E+  
Sbjct: 239 EYKTGAMVGAHTLKLIGWGV-EAGQEYWLAVNSWNEEWGDQGKIKLAVGKN--ALDEESR 295

Query: 226 AGLPSSKNLVKEITSADMFEDASA 249
             +P  +  V E+    M  ++ A
Sbjct: 296 QQVP--RRAVNELDEDAMMAESGA 317


>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 305

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 75/214 (35%), Positives = 106/214 (49%), Gaps = 19/214 (8%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG---CDGGYPISAWR 71
           Q  C  C+AF  + ALS R CI     L  SV  L A     C  G   C GG   ++W 
Sbjct: 101 QSDCSCCYAFATLGALSTRRCI---AKLDASVVPLSAQHMVSCDHGEAGCQGGGFNTSWA 157

Query: 72  YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
           +    G +  +C PY    TG S           +C   C +   L  ++ HY   +   
Sbjct: 158 FLETEGAIMRDCLPYVSGETGLSG----------ECPTTC-QDGTLLNDTIHYKAVSASH 206

Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
             +  +IM  +   GPV+  F V+EDF +Y  G+Y    G  +GGHAV ++G+G+ ++  
Sbjct: 207 LKNYNEIMTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNN-H 265

Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
           DYWI+ N W   WG +GYF+I RG+NECGIE + 
Sbjct: 266 DYWIVRNSWGSDWGENGYFRILRGTNECGIENNA 299


>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
          Length = 311

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 99/203 (48%), Gaps = 16/203 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
           Q  CGSCWAF     L  R+C+         LS  +L++C  F     GCDGGY    + 
Sbjct: 107 QAQCGSCWAFATTNVLEYRYCMATKGKKYPELSPQNLISC--FNSASWGCDGGYIDQTFL 164

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GV TE+C PY    G              C  KC     L+ N  +    + +  
Sbjct: 165 YLEMMGVNTEQCMPYKSGDGN----------MTACPSKCANGENLYMNKYYCRPGSTQYM 214

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
              +     ++  GP+   F V+EDF +Y  G+Y  ++GD +G HAVKL+G+G  ++  +
Sbjct: 215 RGEQQFKNYLFNKGPMVAVFDVFEDFINYGGGIYNKVSGDKLGKHAVKLLGYGV-ENSTN 273

Query: 192 YWILANQWNRSWGADGYFKIKRG 214
           Y+I  NQW + WG DGYF+IK G
Sbjct: 274 YYIGVNQWGKDWGEDGYFRIKAG 296


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 109/222 (49%), Gaps = 18/222 (8%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLC---GDGCDGGYPISAW 70
           I  +CGSCW+F +V ++SDR  +         V+DL       C    +GC GG+P++A+
Sbjct: 66  IPQYCGSCWSFASVSSVSDR--LKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAF 123

Query: 71  RYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISA 127
           +Y   HGV  E C  Y   +  C+              R C  +   +  +N   Y +  
Sbjct: 124 KYMHDHGVPEEGCMRYMAKNMECTDI---------NICRDCDSEKGCFAVKNYTKYYVDE 174

Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
           Y   +  +++M EIY  GP+  S  V +D   YK G+Y+  TG     HA+ ++GWG  +
Sbjct: 175 YGSVAGEKNMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWG-EE 233

Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           DG+ YWI  N W   WG  G+F+I RG N  GIE D    +P
Sbjct: 234 DGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275



 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 92/206 (44%), Gaps = 16/206 (7%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
           I  +CGSCWA     ALSDR  +        + LS  +++ C        CDGG     +
Sbjct: 350 IPQYCGSCWAQAPTSALSDRINLMRKGKWPTVELSAQEVINCSN---AGTCDGGSDADVF 406

Query: 71  RYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
            Y  + G+  + C  Y   D        C    P   C           ++ K Y +S Y
Sbjct: 407 EYAFNEGIPDQTCQVYEAIDKECNDMARCMDCPPGEDCYPV--------KDYKRYKVSEY 458

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
                  +I AEI+  GPV  S  V E+F  Y+ G++    G ++G HAV++ GWG ++D
Sbjct: 459 GEVKGEMEIKAEIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETED 518

Query: 189 GEDYWILANQWNRSWGADGYFKIKRG 214
           G  YWI  N W   WG  G+F++  G
Sbjct: 519 GTKYWIARNSWGPYWGEHGWFRMIVG 544


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+   +  N    N+  Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 236

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 237 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
          Length = 411

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 79/211 (37%), Positives = 106/211 (50%), Gaps = 28/211 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           Q  CGSCWAFGAV  +     I     +SLS   L+ C   +  +GCDGGY   A +Y  
Sbjct: 215 QQRCGSCWAFGAVGVVESMNAIAKNPLVSLSEQQLVDCD--MNDNGCDGGYRPYALQYIR 272

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
           H+G+V EE  PY    G     C+      +   K VK   + RN               
Sbjct: 273 HNGIVPEELYPY---AGKELDSCKLNTTVQRVYVKTVK--YIRRN--------------- 312

Query: 135 EDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----MGGHAVKLIGWGTSDDG 189
           E  MA+ ++  GP+ V   V +D  HY+SGV+     D      G HA+ ++G+G S +G
Sbjct: 313 ESAMADFVFYKGPLSVGINVTKDLFHYQSGVFTPSKEDCEQNPQGTHALAVVGYG-SQNG 371

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           EDYWI+ N W + WG DG+F  KRG+N CGI
Sbjct: 372 EDYWIIKNSWGKRWGMDGFFLYKRGANSCGI 402


>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
          Length = 316

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 105 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 162

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 163 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 205

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 206 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 265

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 266 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 315


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWF 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+             A PTP+C+          R+   +    Q+  N
Sbjct: 281 LRRRGVVSDNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSN 334

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H            
Sbjct: 335 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 394

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W   WG  G+F+I RG+NEC IE  V+ 
Sbjct: 395 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453


>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
          Length = 463

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSKY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    GVV E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGVVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY+ G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G  YWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
          Length = 202

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 103/202 (50%), Gaps = 22/202 (10%)

Query: 20  SCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
           SCWA  A   ++DR C+     +   +S  D+L+CCG  CG GC GG  I AW++ + +G
Sbjct: 1   SCWAVSAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNG 60

Query: 78  VVT-------EECDPY-FDSTGCSHPGC------EPAYPTPKCVRKCVKK--NQLWRNSK 121
           V T         C PY F   G              +Y TP+C + C +      +   +
Sbjct: 61  VCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDR 120

Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
           +Y+ SAY + +D + IM EI + GPV  ++  Y DF  YK GVY+H  G+  GGH++K++
Sbjct: 121 YYAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIM 180

Query: 182 GWGTSDDGED----YWILANQW 199
           GWG           YW++AN W
Sbjct: 181 GWGNYKHPNGTVIPYWLVANSW 202


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/217 (36%), Positives = 103/217 (47%), Gaps = 30/217 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD----GCDGGYPIS 68
           QG+C S WA       SDR CI      +  LS  +LL+C     GD    GCDGG    
Sbjct: 49  QGNCASSWAVAVASTFSDRLCIASNGQFTDNLSAQNLLSC-----GDEEKMGCDGGSAFK 103

Query: 69  AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
           AW   +  G+VT       E C PY     C+H G      C     T    C  KCV K
Sbjct: 104 AWELTMSKGIVTGGNFDSNEGCQPY-KIRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNK 162

Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
           N    + +  H +   Y  + ++ + I  EI   GPV     VYE+F  YK G+YK   G
Sbjct: 163 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAG 222

Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
           +++G H VKLIGWG   DG +YW+  N WN +WG +G
Sbjct: 223 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGTNG 259


>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
 gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC++F ++  +  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E+C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+   GP+ V+F VY+DF HY+ GVY H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT +  G DYWI+ N W  SWG +GYF+I+RG++EC IE   +A  P  K
Sbjct: 413 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 82/239 (34%), Positives = 112/239 (46%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 210 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQRGCHGGRLDGAWWF 268

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
               GVV++ C P+           + A P P+C+       +  R             N
Sbjct: 269 LRRRGVVSDHCYPFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHAHAN 322

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+SG+Y H    +       
Sbjct: 323 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 382

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 383 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 441


>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
           C
          Length = 441

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 228 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 285

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 286 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 328

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 329 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 388

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 389 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 438


>gi|1582221|prf||2118248A prepro-cathepsin C
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 235 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 292

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 293 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 335

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 336 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 395

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 396 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 445


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 104/221 (47%), Gaps = 15/221 (6%)

Query: 22  WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGV 78
           WA+     L+DR CI  +   N  LS  +L+ C G      G   G  +  W Y   HG+
Sbjct: 115 WAYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGL 172

Query: 79  VTEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINS 132
           V+     Y  + GC      P    P       C  +C   N +     H  +S Y    
Sbjct: 173 VS--GGKYNTNDGCQPSKIPPIGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIK 230

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGED 191
             EDI  E+   GPV V F VY+DF  YKSGVY      + +  H  KLIGWG  ++G D
Sbjct: 231 SNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVD 289

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           YW+L N W   WG +G FKIKRG+NE  +E+ V AG P  K
Sbjct: 290 YWLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPEIK 330


>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
          Length = 458

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC++F ++  +  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 247 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 304

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E+C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 305 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 347

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+   GP+ V+F VY+DF HY+ GVY H           +  HAV L+G+
Sbjct: 348 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 407

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT +  G DYWI+ N W  SWG +GYF+I+RG++EC IE   +A  P  K
Sbjct: 408 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 457


>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
 gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
 gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
 gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
 gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
 gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
 gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
          Length = 463

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
 gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
          Length = 463

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC++F ++  +  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E+C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+   GP+ V+F VY+DF HY+ GVY H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT +  G DYWI+ N W  SWG +GYF+I+RG++EC IE   +A  P  K
Sbjct: 413 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 114/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
               GVV++ C P+      S    + A PTP C+          R+      N    N+
Sbjct: 177 LRRRGVVSDHCYPF------SGRERDEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 230

Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +       
Sbjct: 231 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 290

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 291 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
          Length = 462

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 121/228 (53%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 251 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 308

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY   TG   P C       K  + C++    + +S+++ +  +   
Sbjct: 309 YAQDFGLVEESCFPY---TGTDAP-C-------KMKKDCIR----YYSSEYHYVGGFYGG 353

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+  +GP+ V+F VY+DF HY+ G+Y+H           +  HAV L+G+GT
Sbjct: 354 CNEALMKLELVHHGPMAVAFEVYDDFLHYQKGIYQHTGLRDPFNPFELTNHAVLLVGYGT 413

Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W  SWG DG+F+I+RG +EC IE   +A  P  K
Sbjct: 414 DLASGMDYWIVKNSWGTSWGEDGFFRIRRGIDECSIESIAMAATPIPK 461


>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
          Length = 463

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 119/229 (51%), Gaps = 29/229 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC++F +V  L  R  I      +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASVGMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY    G   P C       K  + CV+    +  S+++ +  +   
Sbjct: 310 YAQDFGLVEESCFPY---KGIDVP-C-------KVKKDCVR----YYTSEYHYVGGFYGG 354

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
            +   +  E+ ++GP+ V+F VY+DF HY  G+Y H TG         +  HAV L+G+G
Sbjct: 355 CNEALMKLELVQHGPMAVAFEVYDDFLHYHKGIY-HRTGLRDPFNPFELTNHAVLLVGYG 413

Query: 185 TSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           T    G DYWI+ N W   WG DGYF+I RG++EC IE   +A  P  K
Sbjct: 414 TDPVSGRDYWIVKNSWGTGWGEDGYFRILRGTDECAIESIAMAATPIPK 462


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/239 (34%), Positives = 114/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDKHN-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV-----KKNQLWRNSKH----- 122
               GVV++ C P+             A P P+C+         K+  + R   H     
Sbjct: 282 LRRRGVVSDHCYPFSGQER------NEAGPEPRCMMHSRAMGRGKRQAIARCPNHHVHAN 335

Query: 123 --YSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             Y ++ AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +       
Sbjct: 336 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYR 395

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLG 454


>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
          Length = 356

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 119/228 (52%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 145 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 202

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY   TG   P C       K  + C +    + +S +Y +  +   
Sbjct: 203 YAQDFGVVEEGCFPY---TGTDSP-C-------KLKKDCFR----YYSSDYYYVGGFYGG 247

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   I  E+  +GP+ V+F VY DF HY  G+Y H           +  HAV L+G+GT
Sbjct: 248 CNEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGT 307

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            S  G+DYWI+ N W  SWG DGYF+I+RG++EC IE   +A  P  K
Sbjct: 308 DSASGQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 355


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 236

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 237 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWF 250

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVT 310

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
            AYR+ S+  +IM E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+
Sbjct: 311 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 370

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 371 VKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVT 341

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
            AYR+ S+  +IM E+ +NGPV+    V+EDF  YK G+Y H   ++         G H+
Sbjct: 342 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 402 VKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/239 (33%), Positives = 114/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+           + A P P+C+          R+   +   + +  N
Sbjct: 282 LRRRGVVSDHCYPFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHVHAN 335

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +    AYR+ ++ ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +       
Sbjct: 336 DIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYR 395

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454


>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 32/214 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCW F  V  L   F I +  + +LS   L+ C G     GC+GG P  A++Y  
Sbjct: 153 QGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAGAYDNYGCNGGLPSHAFQYIS 212

Query: 75  HH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN-S 132
            + G+ TE   PYF                    R C     + ++ K   +    +N +
Sbjct: 213 DNGGIATEAAYPYFAKD-----------------RPCT----IQQSQKSVGVVGGSVNLT 251

Query: 133 DPEDIMA-EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-----HAVKLIGWGTS 186
             ED +A  I+++GPV +++ V +DF  Y SGVY   T D   G     HAV  +G+GT 
Sbjct: 252 KSEDELAIAIFQHGPVSIAYEVIDDFMDYHSGVY--TTKDCKNGPDDVNHAVVAVGFGT- 308

Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
           ++G DYW++ N W+  WG +GYFKI+RG N CGI
Sbjct: 309 ENGVDYWLVKNSWSTKWGDNGYFKIQRGVNMCGI 342


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 114/238 (47%), Gaps = 32/238 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPYF----DSTGCSHP--------GCEPAYPTPKCVRKCVKKNQLWRNS 120
               GVV++ C P+     D  G + P        G      T +C    V  N +++ +
Sbjct: 282 LRRRGVVSDHCYPFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVT 341

Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------- 172
                 AYR+ S+ ++IM E+ +NGPV+    V+EDF  Y+ G+Y H    +        
Sbjct: 342 -----PAYRLGSNEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRR 396

Query: 173 MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
            G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 397 HGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLG 454


>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
          Length = 460

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 117/230 (50%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 249 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 306

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    GVV E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 307 YAQDFGVVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 349

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY  G+Y H           +  HAV L+G+
Sbjct: 350 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGY 409

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G  YWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 410 GTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459


>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
          Length = 466

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 116/228 (50%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I      S  LS  ++++C  +    GC+GG+P + A +
Sbjct: 255 QASCGSCYSFASMGMLEARIRILTNNTQSPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 312

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    G+V E C PY   TG   P C       K    C++    +  S+++ +  +   
Sbjct: 313 YAQDFGLVEEACFPY---TGTDSP-C-------KMKEDCIR----YYTSEYHYVGGFYGG 357

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+  +GP+ V+F VY+DF HY  G+Y H           +  HAV L+G+GT
Sbjct: 358 CNEALMKLELVHHGPMAVAFEVYDDFLHYNQGIYHHTGLKDPFNPFELTNHAVLLVGYGT 417

Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W  SWG  GYF+I+RG++EC IE   +A  P  K
Sbjct: 418 DPKTGLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIESIAMAATPIPK 465


>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
          Length = 463

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F +V  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 252 QESCGSCYSFASVGMLEARIRILTNNSQTPILSPQEIVSCSQY--AQGCNGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E+C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY  G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT    G DYWI+ N W  SWG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDPATGVDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
          Length = 478

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 119/228 (52%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GC+GG+P + A +
Sbjct: 267 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 324

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY   TG   P C       K  + C +    + +S +Y +  +   
Sbjct: 325 YAQDFGVVEEGCFPY---TGTDSP-C-------KLKKDCFR----YYSSDYYYVGGFYGG 369

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   I  E+  +GP+ V+F VY DF HY  G+Y H           +  HAV L+G+GT
Sbjct: 370 CNEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGT 429

Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
            S  G+DYWI+ N W  SWG DGYF+I+RG++EC IE   +A  P  K
Sbjct: 430 DSASGQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 477


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 104/221 (47%), Gaps = 15/221 (6%)

Query: 22  WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGV 78
           WA+     L+DR CI  +   N  LS  +L+ C G      G   G  +  W Y   HG+
Sbjct: 115 WAYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGL 172

Query: 79  VTEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINS 132
           V+     Y  + GC      P    P       C  +C   N +     H  +S Y    
Sbjct: 173 VS--GGKYNTNDGCQPSKIPPIGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIK 230

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGED 191
             EDI  E+   GPV V F VY+DF  YKSGVY      + +  H  KLIGWG  ++G D
Sbjct: 231 SNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVD 289

Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           YW+L N W   WG +G FKIKRG+NE  +E+ V AG P  K
Sbjct: 290 YWLLVNFWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPEIK 330


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 112/227 (49%), Gaps = 31/227 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           QG CGSC+AF ++  L  R  +  +      LS  ++++C  +    GC+GG+P + A +
Sbjct: 258 QGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQEIVSCGKY--SQGCEGGFPYLIAGK 315

Query: 72  YFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
           Y    GVV EEC PY   DS+      C   Y T                  +  +  + 
Sbjct: 316 YAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYAT-----------------NYRYVGGFY 358

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              + E +  E+ KNGP+ V+F VY DF HYK GVY+H           +  HAV L+G+
Sbjct: 359 GGCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGY 418

Query: 184 GTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G   + G  +W + N W   WG +G+F+I+RG++EC IE   VA  P
Sbjct: 419 GRDPETGAKFWTVKNSWGEKWGEEGFFRIRRGTDECAIESIAVAADP 465


>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
          Length = 463

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 116/230 (50%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  +  R  I      +  LS  ++++C  +    GC GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCAGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CTVKEGCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HY+ G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT    G DYWI+ N W  SWG DGYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 470

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 112/221 (50%), Gaps = 27/221 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC+AF ++  L  R  I   ++    LS   +++C  +    GC+GG+P + A +
Sbjct: 247 QASCGSCYAFSSMGMLESRIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGK 304

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y   +G+V E   PY   TG   P          C  K     Q +  ++++ +  +   
Sbjct: 305 YVSDYGIVEESDLPY---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGG 349

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+   GP+ V+F VY+DF HY+SGVY H           +  HAV L+G+GT
Sbjct: 350 CNEAYMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGT 409

Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
               GE YWI+ N W  SWG  GYF+I+RG++EC IE   V
Sbjct: 410 DQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAV 450


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIH---FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
           Q + G+ WAF     LSDR  I    F + + LS   L++C  F   +G  G      W 
Sbjct: 308 QENEGTSWAFSTTSVLSDRLAIQSKNFTV-VELSPQHLVSC--FSSHEG-RGERLDRTWW 363

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV+  C P   S      G             C   N +  N  + +   YR++
Sbjct: 364 YLRKKGVVSTVCYPESRSKSTQGIGSCGLVAHSSGAHICPNGNVISSNEIYKTSPVYRVS 423

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGW 183
           S+ E+IM EI++NGPV+    V  DF  YKSGVY     D +          H+VK+IGW
Sbjct: 424 SNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGW 483

Query: 184 G---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G   +  +   YWI+ N W  +WG  GYF+I++G NECGIEE ++A  P
Sbjct: 484 GEKKSKTNSGKYWIVQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWP 532


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+   +  N    N+  Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 341

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/239 (34%), Positives = 113/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG+   AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGHLDGAWWF 250

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCV---KKNQLWRN 119
               GVV++ C P+           + A P P+C+          R+       +++  N
Sbjct: 251 LRRRGVVSDHCYPFSGRER------DEAGPAPRCMMHSRAMGRGKRQATAHCPNSRVHTN 304

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
             +    AYR+ S  ++IM E+ +NGPV+    V+EDF  Y+ GVY H            
Sbjct: 305 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYR 364

Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 365 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/239 (34%), Positives = 113/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG+   AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGHLDGAWWF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCV---KKNQLWRN 119
               GVV++ C P+           + A P P+C+          R+       +++  N
Sbjct: 282 LRRRGVVSDHCYPFSGRER------DEAGPAPRCMMHSRAMGRGKRQATAHCPNSRVHTN 335

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
             +    AYR+ S  ++IM E+ +NGPV+    V+EDF  Y+ GVY H            
Sbjct: 336 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYR 395

Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG+NEC IE  V+ 
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 88/157 (56%), Gaps = 14/157 (8%)

Query: 83  CDPYFDSTGCSH-------PGCEPA-YPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 132
           C PY D   C+H       P C    YPTP CV +C   K     R+ +H+ + +   + 
Sbjct: 3   CWPY-DFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61

Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
              D    I  +GPV  SFTVYEDF  Y+SGVYKH +G  +GGHAVK+IGWG    G+ Y
Sbjct: 62  SVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK-SGQAY 120

Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           W+  N WN  WG  G FKI  G+  CGI++D++ G P
Sbjct: 121 WLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTP 155


>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
          Length = 463

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 117/230 (50%), Gaps = 31/230 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C       GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQH--AQGCEGGFPYLIAGK 309

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
           Y    G+V E C PY   TG   P              C  K   +R  +S+++ +  + 
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352

Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
              +   +  E+  +GP+ V+F VY+DF HYK G+Y H           +  HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412

Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
           GT S  G DYWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/233 (34%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSHVNNNDIYQVT 236

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ +++M E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 237 PVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349


>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
           niloticus]
          Length = 455

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 30/229 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSC++F  +  L  R  I    +   +LS   +++C  +    GCDGG+P    +Y
Sbjct: 245 QESCGSCYSFATMGMLEARIRILTNNSDAPTLSPQQVVSCSEY--SQGCDGGFPYLIGKY 302

Query: 73  FVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
               G+V E C PY   +T C  P                +K Q    +++  +  +   
Sbjct: 303 TQDFGIVDESCFPYVGQNTPCGVP----------------QKCQRIYAAEYNYVGGFYGG 346

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
                +M E+ KNGP+ V+F VY DF +YK G+Y H TG         +  HAV L+G+G
Sbjct: 347 CSEAAMMLELVKNGPMAVAFEVYPDFMNYKEGIYHH-TGLADPFNPFELTNHAVLLVGYG 405

Query: 185 T-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
                G++YWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 406 RCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 454


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 113/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
               GVV++ C P+           + A PTP C+          R+      N    N+
Sbjct: 251 LRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 304

Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +       
Sbjct: 305 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 364

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 365 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 218 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 276

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 277 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSQAMGRGKRQATAHCPNSYVNNNDIYQVT 336

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 337 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 396

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 397 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 449


>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
          Length = 379

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/219 (33%), Positives = 108/219 (49%), Gaps = 28/219 (12%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
           QG CGSCWAF  V A+  +  I  G+ +SLS  +++ C G    +GC GGY   A R+  
Sbjct: 182 QGQCGSCWAFATVAAIEAQHAIKKGILVSLSEQEMVDCDGR--NNGCSGGYRPYAMRFVK 239

Query: 75  HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
            +G+ TE+  PY   +   H  C                  L +N     I  YR+ S  
Sbjct: 240 ENGLETEKSYPY---SALKHDQC-----------------MLHQNDTKVYIDDYRMLSTS 279

Query: 135 EDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYKHITGD----VMGGHAVKLIGWGTSDDG 189
           E+ +A+ +   GPV     V +    Y+SG++     D     MG HA+ ++G+G  +  
Sbjct: 280 EENIADWVGTKGPVTFGMNVVKAMYSYRSGIFNPSAEDCAEKSMGAHALTIVGYG-GEGT 338

Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
             YWI+ N W  SWG+DGYF++ RG N CG+   VVA +
Sbjct: 339 SAYWIVKNSWGTSWGSDGYFRLARGVNSCGLANTVVAPI 377


>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
           niloticus]
          Length = 461

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 30/229 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           Q  CGSC++F  +  L  R  I    +   +LS   +++C  +    GCDGG+P    +Y
Sbjct: 251 QESCGSCYSFATMGMLEARIRILTNNSDAPTLSPQQVVSCSEY--SQGCDGGFPYLIGKY 308

Query: 73  FVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
               G+V E C PY   +T C  P                +K Q    +++  +  +   
Sbjct: 309 TQDFGIVDESCFPYVGQNTPCGVP----------------QKCQRIYAAEYNYVGGFYGG 352

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
                +M E+ KNGP+ V+F VY DF +YK G+Y H TG         +  HAV L+G+G
Sbjct: 353 CSEAAMMLELVKNGPMAVAFEVYPDFMNYKEGIYHH-TGLADPFNPFELTNHAVLLVGYG 411

Query: 185 T-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
                G++YWI+ N W   WG +GYF+I+RG++EC IE   VA  P  K
Sbjct: 412 RCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 460


>gi|302848309|ref|XP_002955687.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
           nagariensis]
 gi|300259096|gb|EFJ43327.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
           nagariensis]
          Length = 846

 Score =  120 bits (302), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 71/216 (32%), Positives = 105/216 (48%), Gaps = 13/216 (6%)

Query: 17  HCGSCWAFGAVEALSDRFCIHF---GMNLSLSVNDLLACCGFL-CGDGCDGGYPISAWRY 72
           +CG CW  G++  + DR  I       ++ LS   LL C  F   G GCDGG  +  + Y
Sbjct: 566 YCGGCWVHGSLSMIQDRLKIKKRAKSPDVMLSRQTLLNCAAFEGYGHGCDGGDTVDVFSY 625

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---W---RNSKHYSIS 126
               G+  E C  Y  +     PG        +C+  C+  N +   W   R  K+Y  +
Sbjct: 626 MAEFGLPDEGCMTYNATDHTKFPGVSHCPVEGQCL-NCMPINGVDTCWPIERPVKYYLNA 684

Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA-HYKSGVYKHITGDVMGGHAVKLIGWGT 185
              ++   E +M+EIY  GP+       +DF  HYK G+YK  +GD    H V+++GWG 
Sbjct: 685 WGNLDKSVEAMMSEIYHRGPITCGIACPDDFTWHYKGGIYKDTSGDTELDHDVEVVGWGV 744

Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
            +DG  YW++ N W   WG  G+F+++RG N   IE
Sbjct: 745 -EDGVKYWVVRNSWGTYWGEMGFFRVERGVNALQIE 779


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 113/239 (47%), Gaps = 34/239 (14%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
               GVV++ C P+           + A PTP C+          R+      N    N+
Sbjct: 282 LRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 335

Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             Y ++  YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +       
Sbjct: 336 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 395

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 310

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 311 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 370

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 310

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 311 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 370

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423


>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
           Peptide, 462 aa]
          Length = 462

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASIGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY  +         P  P   C+R        + +S++Y +  +   
Sbjct: 309 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 353

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ K+GP+ V+F V++DF HY SG+Y H           +  HAV L+G+G 
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 413

Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W   WG  GYF+I+RG++EC IE   +A +P  K
Sbjct: 414 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 341

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 228 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 286

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 287 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 346

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 347 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 406

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 407 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 459


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281

Query: 73  FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
               GVV++ C P+     D  G + P    +    +  R+      N    N+  Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 341

Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
             YR+ S+ ++IM E+ +NGPV+    V+EDF  YK G+Y H    +         G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401

Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
           VK+ GWG  T  DG    YW  AN W  +WG  G+F+I RG NEC IE  V+ 
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 120/247 (48%), Gaps = 47/247 (19%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C S +A  A  A SDR CI      N  +S   +++CC +LCG GCDGG    +W Y
Sbjct: 83  QGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQIISCC-YLCGHGCDGGSLFESWDY 141

Query: 73  FVHHGVVT-------EECDPYFDSTGCSHPGCE------PAYP--------TPKCVRKCV 111
           +  HG V+       + C PY      + P C+      P +         TP C +KC 
Sbjct: 142 YRRHGFVSGGDYNSNQGCQPY------TIPPCKLMNEKPPGHSCTTYHREETPICEKKCY 195

Query: 112 KKNQLWR------NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
             N            K+Y +S Y         M +I+ NGP+   F +Y D   YKSGVY
Sbjct: 196 NPNYYTSFRTDIYKGKYYKLSPYMA-------MKDIFDNGPITTQFYMYRDLVDYKSGVY 248

Query: 166 KHITG---DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
           ++      D    H+VK+ GWG  ++G  YW++AN +   WG +G FKI RG++ C  +E
Sbjct: 249 QYDEQSDFDFFTVHSVKIFGWG-EENGVPYWLVANSFGTDWGYNGTFKISRGNDGCFFQE 307

Query: 223 DVVAGLP 229
            + AGLP
Sbjct: 308 KMYAGLP 314


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 83/239 (34%), Positives = 112/239 (46%), Gaps = 33/239 (13%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG+C   WAF      SDR  IH    M   LS  +LL+C       GC GG    AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QKGCRGGRLDGAWWF 280

Query: 73  FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
               GVV++ C P+           + A PTP+C+          R+   +   +Q+  N
Sbjct: 281 LRRRGVVSDNCYPF-----SGREQNDEASPTPRCMMHSRAMGRGKRQATSRCPNSQVDSN 335

Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
             +     YR+ SD ++IM E+ +NGPV+    V+EDF  Y+ G+Y H            
Sbjct: 336 DIYQVTPVYRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 395

Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
             G H+VK+ GWG  T  DG    YW  AN W   WG  G+F+I RG NEC IE  V+ 
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLG 454


>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 487

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 83/243 (34%), Positives = 116/243 (47%), Gaps = 17/243 (6%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
           QG CG+ WA    +  +DRF I     M  +LS   LL+C   L   GC GG+  SAW +
Sbjct: 242 QGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNW 300

Query: 73  FVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
            +  G+VTEEC P+   +T C+               +  K + L R    Y ++     
Sbjct: 301 VMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLRRVGLMYRVAT---- 356

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKLIGWGTSDD 188
              E IM EI   G V+    V ++F  Y+SGVY+      G   G H V+++GWG    
Sbjct: 357 --EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQ 414

Query: 189 G---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFE 245
                 YWI++N W   WG  GYF+I +G+NEC IE+ VVA +    N    I+     E
Sbjct: 415 NGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMADIGNFC-SISDKSFRE 473

Query: 246 DAS 248
           +AS
Sbjct: 474 NAS 476


>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
           Cysteine Protease Of The Papain Family
          Length = 438

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 227 QESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 284

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY  +         P  P   C+R        + +S++Y +  +   
Sbjct: 285 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 329

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ K+GP+ V+F V++DF HY SG+Y H           +  HAV L+G+G 
Sbjct: 330 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 389

Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W   WG  GYF+I+RG++EC IE   +A +P  K
Sbjct: 390 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 437


>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
 gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
 gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
          Length = 462

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY  +         P  P   C+R        + +S++Y +  +   
Sbjct: 309 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 353

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ K+GP+ V+F V++DF HY SG+Y H           +  HAV L+G+G 
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 413

Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
               G DYWI+ N W   WG  GYF+I+RG++EC IE   +A +P  K
Sbjct: 414 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461


>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
          Length = 483

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 114/228 (50%), Gaps = 27/228 (11%)

Query: 15  QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
           Q  CGSC++F ++  L  R  I    + +  LS  ++++C  +    GCDGG+P + A +
Sbjct: 272 QESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSMY--AQGCDGGFPYLIAGK 329

Query: 72  YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
           Y    GVV E C PY  +         P  P   C+R        +  S +Y +  +   
Sbjct: 330 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYTSGYYYVGGFYGG 374

Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
            +   +  E+ ++GP+ V+F V +DF HY SG+Y H           +  HAV L+G+G 
Sbjct: 375 CNEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGR 434

Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
             D G DYW + N W   WG  GYF+I+RG++EC IE   VA +P  K
Sbjct: 435 DPDTGTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAAIPIPK 482


>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
          Length = 562

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 107/221 (48%), Gaps = 16/221 (7%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLC---GDGCDGGYPISAW 70
           I  +CGSCW+F +V ++SDR  +         V+DL       C    +GC GG+P++A+
Sbjct: 66  IPQYCGSCWSFASVSSVSDR--LKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAF 123

Query: 71  RYFVHHGVVTEECDPYF-DSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           +Y   HGV  E C  Y   +  C+    C    P   C           +N   Y +  Y
Sbjct: 124 KYMHDHGVPEEGCMRYMAKNMECTDINICRDCDPDKGCFAV--------KNYTKYYVDEY 175

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
              +  +++M EIY  GP+  +    E+   YK G+Y+  TG     H++ ++GWG  +D
Sbjct: 176 GSVAGEKNMMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWG-EED 234

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
           G+ YWI  N W   WG  G+F+I RG N  GIE D    +P
Sbjct: 235 GQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275



 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 16/213 (7%)

Query: 14  IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
           I  +CGSCWA     ALSDR  +        + LSV +++ C G      C+GG+    +
Sbjct: 350 IPQYCGSCWAQAPTSALSDRINLMRKGKWPTVELSVQEIINCSG---KGSCEGGWQSGVY 406

Query: 71  RYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
           +Y  H G+  + C  Y   D        C    P  +C           ++ K Y +S Y
Sbjct: 407 QYAYHQGIPDQTCQVYEAIDKECNDMARCMDCPPGKECGPV--------KDYKRYKVSEY 458

Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
              S   +I AEI+  GPV     V ++F  Y+ G++K    + +G H+V++ GWG ++D
Sbjct: 459 GYASGEAEIKAEIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETED 518

Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
           G  YWI  N W   WG  G+F+I  G    G++
Sbjct: 519 GTKYWIGRNSWGTYWGEHGWFRIIIGEKGLGLD 551


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.138    0.462 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,607,587,380
Number of Sequences: 23463169
Number of extensions: 209917486
Number of successful extensions: 411675
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5533
Number of HSP's successfully gapped in prelim test: 1609
Number of HSP's that attempted gapping in prelim test: 393082
Number of HSP's gapped (non-prelim): 8625
length of query: 249
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 110
effective length of database: 9,097,814,876
effective search space: 1000759636360
effective search space used: 1000759636360
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)