BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy15348
         (298 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/245 (31%), Positives = 113/245 (46%), Gaps = 30/245 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           +C  + A++ A+ ++  +C  S   V+    +  A     C +L         CS G   
Sbjct: 77  NCKSSYAISVASAVSDRICIHSNGTVK---PKLSAQQILSCCYLCGDG-----CSGGQHF 128

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
            +W +  + GLV+GG + SN GCQP +  PC H   T  E  C       P+C  +C N 
Sbjct: 129 ESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTE-TAVENACSNKTLFTPECKVQCYNP 187

Query: 147 NYGRGFFQDKYQINGLGLYFD-PHF---------GPFWPAFWRSFCTKYTRPLFQTNGRV 196
           +YG  + +D +Q    G ++  P +         GP   +F+        +        V
Sbjct: 188 DYGTRYVKDNHQ----GTHYRVPAYTAMKEIYENGPITASFYM------YQDFVNYQSGV 237

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           YA + S + V    VKI+GWGEENG PYW   ++F   +GD G +KILRG NE  IE  +
Sbjct: 238 YAYN-SGKYVTTQAVKILGWGEENGTPYWLAANSFNTYWGDNGFVKILRGANECYIEEFM 296

Query: 257 NGALP 261
              LP
Sbjct: 297 YAGLP 301


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 98/192 (51%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G+S++ W +    GLV+GG +++  GC+P S  PC H++   S PEC     P PKC
Sbjct: 40  CSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEHSS-QGSLPEC-VGTLPTPKC 97

Query: 140 HTRCTNDNYGRGFFQDKY------QINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C  + Y R +  DKY       ING            GP    F     T Y   L 
Sbjct: 98  KRQC-REGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPVEAEF-----TAYADFLS 151

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY    S +I+    ++I+GWG E+  PYW + +++ E +GD G  K+LRG NE 
Sbjct: 152 YKSG-VYQ-HHSRDIIGRHAIRILGWGSEDNNPYWLLANSWNEDWGDHGYFKMLRGVNEC 209

Query: 251 IIESLVNGALPK 262
            IES VN  +PK
Sbjct: 210 DIESFVNAGIPK 221


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 116/249 (46%), Gaps = 30/249 (12%)

Query: 26  LSCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
           ++ I   A      ++  +C  SK  + VE ++   ++    RC +          C  G
Sbjct: 38  ITFINKHAFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGF---------GCRGG 87

Query: 84  ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC 143
           I    W +    G+VTGG++ ++TGCQP  FP CNH + + S P C++   P P+CH  C
Sbjct: 88  IPGMAWDYWKYEGIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETC 147

Query: 144 TNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
             D+YG+ + +DK      Y +    +         GP    F+                
Sbjct: 148 -QDDYGKPYKKDKFYGKSSYNVASEEISIMKEILLNGPVEGGFYV------YEDFLNYKS 200

Query: 195 RVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            VY     + +  +A ++I+GWG ++N  PYW   +++  Q+GD+G  KILRG NE  IE
Sbjct: 201 GVYKHITGSYLGGHA-IRIIGWGIQQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIE 259

Query: 254 SLVNGALPK 262
           S+V   LP 
Sbjct: 260 SMVTAGLPN 268


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 16/194 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG  + +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
             +C  D   + + +DK Y  +  G+  D           GP   AF      +      
Sbjct: 234 EKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 287

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY V    ++     VK+VGWG ENG PYWT  +++   +G+ G  +ILRG +E 
Sbjct: 288 NYDGGVY-VHTGGKLGGGHAVKLVGWGIENGIPYWTCANSWNTDWGEDGFFRILRGVDEC 346

Query: 251 IIESLVNGALPKDN 264
            IES V G +PK N
Sbjct: 347 GIESGVVGGVPKLN 360


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
           H +C          +  YGR  +      N +      H GP   AF     T ++  L 
Sbjct: 218 HQKCQKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y     AEI  +A V+I+GWG E   PYW I +++ E +G+KG  +ILRG++E 
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329

Query: 251 IIESLVNGALP 261
            IES V G LP
Sbjct: 330 GIESEVTGGLP 340


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
           H +C          +  YGR  +      N +      H GP   AF     T ++  L 
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y     AEI  +A V+I+GWG E   PYW I +++ E +G+KG  +ILRG++E 
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329

Query: 251 IIESLVNGALP 261
            IES V G LP
Sbjct: 330 GIESEVTGGLP 340


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
           H +C          +  YGR  +      N +      H GP   AF     T ++  L 
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y     AEI  +A V+I+GWG E   PYW I +++ E +G+KG  +ILRG++E 
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329

Query: 251 IIESLVNGALP 261
            IES V G LP
Sbjct: 330 GIESEVTGGLP 340


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 86/195 (44%), Gaps = 23/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT-PQPK 138
           C  G   + W ++ + GLVTGG +HS+ GCQP    PC H +   S+P C    T P P 
Sbjct: 161 CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEH-HMEGSKPNCSASPTEPTPA 219

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR 187
           C T CT   +G      K +  G   Y  P             GP   AF      K   
Sbjct: 220 CETTCT---HGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAF------KVYE 270

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
             F     VY     +       VK++GWGE+NG PYW + +++   +GDKG  KI RG 
Sbjct: 271 DFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLVQNSWDYDWGDKGLFKIARG- 329

Query: 248 NEAIIESLVNGALPK 262
           NE   E  +   LPK
Sbjct: 330 NECDFEKSMTAGLPK 344


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 96/193 (49%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
           H +C          +  YGR  +      N +      H GP   AF     T ++  L 
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEVAF-----TVHSDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y     AEI  +A V+I+GWG E   PYW I +++ E +G+KG  ++LRG++E 
Sbjct: 272 YKSG-IYKYMTGAEIGEHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRMLRGKDEC 329

Query: 251 IIESLVNGALPKD 263
            IES V   LP+D
Sbjct: 330 GIESAVTSGLPRD 342


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 97/193 (50%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +  + G+VTG +  ++TGCQP  FP C H N T   P C       PKC
Sbjct: 159 CLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH-NTTGKYPACGQKIYETPKC 217

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C          + +YG+  +      + +      H GP       SF T Y+  L 
Sbjct: 218 QKKCQKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH-GPV-----GSFFTVYSDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y      EI  + TV+IVGWG E G PYW I +++ E +G+KG  +ILRG++E 
Sbjct: 272 YKSG-IYKHMKGTEIGVH-TVRIVGWGVEKGTPYWLIANSWNEGWGEKGYFRILRGKDEC 329

Query: 251 IIESLVNGALPKD 263
            IESLV G LP++
Sbjct: 330 DIESLVIGGLPRN 342


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 93/194 (47%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 14  CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73

Query: 140 HTRCTNDN-YGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
              CT++N Y  G+ QDK+          ++  +      H GP   AF     T Y   
Sbjct: 74  VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAH-GPIEVAF-----TVY-ED 126

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            +Q    VY  +A   +  +A VKI+GWG +NG PYW + +++   +G+KG  +I+RG N
Sbjct: 127 FYQYTTGVYVHTAGKSLGGHA-VKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIRGLN 185

Query: 249 EAIIESLVNGALPK 262
           E  IE      LP 
Sbjct: 186 ECGIEHSAVAGLPD 199


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 70/195 (35%), Positives = 96/195 (49%), Gaps = 22/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   +   +  K GLVTG     N  CQ  SFPPC H   +T  P CK    P P+C
Sbjct: 165 CNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTKYPPCKG-EVPTPEC 223

Query: 140 HTRCTNDN-YGRGFFQDKYQ-INGLGLYFDP--------HFGPFWPAF--WRSFCTKYTR 187
             +C +D+   R + +D Y+      +  DP        + GP   AF  +  F T Y  
Sbjct: 224 KKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEVAFTVYEDFVT-YKS 282

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            ++Q          + E +    VK++GWG EN  PYW IV+++ E +GD+GT KILRG 
Sbjct: 283 GVYQ--------HVTGEQLGGHAVKMIGWGVENDTPYWLIVNSWNETWGDQGTFKILRGS 334

Query: 248 NEAIIESLVNGALPK 262
           NE  IE  V  ALP+
Sbjct: 335 NECGIEDEVVTALPQ 349


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 232

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C +D   + + +DK+           +  +      H GP   AF      +     
Sbjct: 233 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 285

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY V    ++     VK++GWG ++G PYWT+ +++   +G+ G  +ILRG +E
Sbjct: 286 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 344

Query: 250 AIIESLVNGALPKDN 264
             IES V G +PK N
Sbjct: 345 CGIESGVVGGIPKLN 359


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 164 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 223

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C +D   + + +DK+           +  +      H GP   AF      +     
Sbjct: 224 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 276

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY V    ++     VK++GWG ++G PYWT+ +++   +G+ G  +ILRG +E
Sbjct: 277 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 335

Query: 250 AIIESLVNGALPKDN 264
             IES V G +PK N
Sbjct: 336 CGIESGVVGGIPKLN 350


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C +D   + + +DK+           +  +      H GP   AF      +     
Sbjct: 234 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 286

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY V    ++     VK++GWG ++G PYWT+ +++   +G+ G  +ILRG +E
Sbjct: 287 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 345

Query: 250 AIIESLVNGALPKDN 264
             IES V G +PK N
Sbjct: 346 CGIESGVVGGIPKLN 360


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 16/194 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG    +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 189 CNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 248

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
             RC  +   + + +DK Y  +  G+  D           GP   AF      +      
Sbjct: 249 EKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 302

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY V    ++     VK++GWG E+G PYWT+ +++   +G+ G  +ILRG +E 
Sbjct: 303 NYDGGVY-VHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSWNTDWGEDGFFRILRGVDEC 361

Query: 251 IIESLVNGALPKDN 264
            IES V G +PK N
Sbjct: 362 GIESGVVGGIPKLN 375


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 16/194 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  K G+VTG  + +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 175 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 234

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
             +C  D   + + +DK Y  +  G+  D           GP   AF      +      
Sbjct: 235 EKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 288

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY V    ++     VK++GWG E+G PYWT  +++   +G+ G  +ILRG +E 
Sbjct: 289 NYDGGVY-VHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSWNTDWGEDGFFRILRGVDEC 347

Query: 251 IIESLVNGALPKDN 264
            IES V G +PK N
Sbjct: 348 GIESGVVGGIPKLN 361


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++ K G  TGG++ +  GC+P S  PC      T+ P C T     P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +CTN NY   +  DK+          ++  +      H GP   AF     T Y    
Sbjct: 210 VNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAH-GPVEAAF-----TVY-EDF 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q    VY  +   E+  +A ++I+GWG +NG PYW + +++   +G+ G  +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGEELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321

Query: 250 AIIESLVNGALPK 262
             IE  V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 93/193 (48%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++ K G  TGG++ S  GC+P S  PC      T+ P+C       P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSC 209

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +CTN+NY   +  DK+          ++  +      H GP   AF     T Y    
Sbjct: 210 VNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAH-GPVEAAF-----TVY-EDF 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q    VY  +   E+  +A ++I+GWG +NG PYW + +++   +G+ G  +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321

Query: 250 AIIESLVNGALPK 262
             IE  V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 86/193 (44%), Gaps = 34/193 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--CKTLATPQP 137
           C+ G S + W +  KRGLVTGG + SN GCQP   PPCNH       P   C    +  P
Sbjct: 156 CNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETP 215

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
           +C   C N NY + F +D  +    G+  D H            C+   R   + +G   
Sbjct: 216 QCTLNCYNPNYSKPFLKDISK----GIRIDWH------------CSGMIRNELKKHGPAT 259

Query: 198 AV----------------SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
           A+                  + +++   TVK++GWG   G  YW   +++G  +GDKG  
Sbjct: 260 AIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYWLAANSWGTSWGDKGFF 319

Query: 242 KILRGRNEAIIES 254
           KI RG NE + E 
Sbjct: 320 KIRRGYNECLFED 332


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 90/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213

Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
              CT N  Y   + QDK+             QI    L      GP   AF     T Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPIEVAF-----TVY 264

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
               +Q    VY  +A A +  +A VKI+GWG +NG PYW + +++   +G+KG  +I+R
Sbjct: 265 -EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNINWGEKGYFRIIR 322

Query: 246 GRNEAIIESLVNGALPK 262
           G NE  IE      +P 
Sbjct: 323 GLNECGIEHSAVAGIPD 339


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 90/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213

Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
              CT N  Y   + QDK+             QI    L      GP   AF     T Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPIEVAF-----TVY 264

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
               +Q    VY  +A A +  +A VKI+GWG +NG PYW + +++   +G+KG  +I+R
Sbjct: 265 -EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNINWGEKGYFRIIR 322

Query: 246 GRNEAIIESLVNGALPK 262
           G NE  IE      +P 
Sbjct: 323 GLNECGIEHSAVAGIPD 339


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 66/193 (34%), Positives = 96/193 (49%), Gaps = 19/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +  + GLVTGG + ++ GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    YG+ +  DK      Y I+             GP    F     T Y   L 
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADF-----TVYADFLS 265

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY    S +++    ++I+GWG ENG PYW + +++ E +GD G  KILRG++E 
Sbjct: 266 YKSG-VYQ-HQSGDVLGGHAIRILGWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDEC 323

Query: 251 IIESLVNGALPKD 263
            IE  +N  +PK+
Sbjct: 324 GIEDDINAGIPKN 336


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 20/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C+ G   + W++  K+GLV+GG + S+ GCQP +  PC +HAN T   P C       PK
Sbjct: 151 CNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGT--RPPCSG-GGRTPK 207

Query: 139 CHTRCTNDNYGRGFFQDK-YQINGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPL 189
           CHT C N++Y   + +DK +  +   +  DP        + GP   AF     + Y+  L
Sbjct: 208 CHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAF-----SVYSDFL 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G    V  S  ++    ++I+GWG ENG PYW + +++   +GD GT KIL+G + 
Sbjct: 263 NYKSGVYRHVKGS--LLGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGTFKILKGSDH 320

Query: 250 AIIESLVNGALPK 262
             IE  +   LP+
Sbjct: 321 CGIEGSIVAGLPQ 333


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 94/192 (48%), Gaps = 17/192 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +   RG+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 192 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 251

Query: 140 HTRCTNDNYGRGFFQDKY---QINGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
             +C + NYG+ +  DKY   Q+  +    +         GP   +F       YT  L+
Sbjct: 252 VKKC-DKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASF-----EVYTDFLY 305

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
            T G    V+ S  +     VK++GWG + G PYW   +++   +G+ G  +ILRG NE 
Sbjct: 306 YTGGIYKHVAGS--MGGGHAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVNEC 363

Query: 251 IIESLVNGALPK 262
            IES +   +PK
Sbjct: 364 GIESGIIAGIPK 375


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 93/194 (47%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +   RG+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 58  CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 117

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
             +C + NYG+ +  DKY   G  +Y                GP   +F       YT  
Sbjct: 118 VKKC-DKNYGKSYKADKYY--GQSVYNVESNVESIQKEIMTLGPVEASF-----EVYTDF 169

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L+ T G    V+ S  +     VK++GWG + G PYW   +++   +G+ G  +ILRG N
Sbjct: 170 LYYTGGIYKHVAGS--MGGGHAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVN 227

Query: 249 EAIIESLVNGALPK 262
           E  IES +   +PK
Sbjct: 228 ECGIESGIIAGIPK 241


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 88/193 (45%), Gaps = 17/193 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG++ +  GC+P S  PC         P C     P PKC
Sbjct: 154 CEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKC 213

Query: 140 HTRCTN-DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
              CT+ +NY   + QDK      Y +              GP   AF     T Y    
Sbjct: 214 VDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF-----TVY-EDF 267

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q    VY  +A A +  +A VKI+GWG +NG PYW + +++   +G+KG  +I+RG NE
Sbjct: 268 YQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNE 326

Query: 250 AIIESLVNGALPK 262
             IE      +P 
Sbjct: 327 CGIEHSAVAGIPD 339


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 92/197 (46%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 155 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKC 214

Query: 140 HTRCTNDN-YGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
              CT++N Y   + QDK+             QI    L      GP   AF     T Y
Sbjct: 215 VEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPVEVAF-----TVY 265

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
               +Q    VY  ++ A +  +A VKI+GWG +NG PYW + +++   +G+KG  +I+R
Sbjct: 266 -EDFYQYTTGVYVHTSGASLGGHA-VKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIR 323

Query: 246 GRNEAIIESLVNGALPK 262
           G NE  IE      +P 
Sbjct: 324 GLNECGIEHSAVAGIPD 340


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 93/200 (46%), Gaps = 33/200 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +  + GLVTGG + +N GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
              C    YG+ +  DK            HFG     +  S   K  +     NG V A 
Sbjct: 212 VHLCRK-GYGKDYQDDK------------HFGK--KVYSISSDEKQIQTEIFKNGPVEAD 256

Query: 199 ---------------VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
                             S +++    ++I+GWG ENG PYW   +++ E +GD G  KI
Sbjct: 257 FIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGTENGTPYWLAANSWNEDWGDHGYFKI 316

Query: 244 LRGRNEAIIESLVNGALPKD 263
           LRG++E  IE  +N  +PK+
Sbjct: 317 LRGKDECGIEEDINAGIPKN 336


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 92/194 (47%), Gaps = 20/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+VTGG+  ++TGCQP  FP C H +     PEC  +   +PKC
Sbjct: 159 CQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEH-HTKGRYPECGEIIYMKPKC 217

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           H +C    Y   + +DKY            + +      H GP   +F      +     
Sbjct: 218 HQKCQK-GYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMH-GPVEASF------RVHSDF 269

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                 +Y      +I ++  V+I+GWG E   PYW I +++ E +G+KG  ++LRG++E
Sbjct: 270 LNYKSGIYKHMTGIDIGSH-VVRIIGWGVEKETPYWLIANSWNEDWGEKGYFRMLRGKDE 328

Query: 250 AIIESLVNGALPKD 263
             IES V   LP+D
Sbjct: 329 CGIESAVTSGLPRD 342


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 91/193 (47%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++ K G  TGG++ +  GC+P S  PC       + P C       P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPAC 209

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +CTN NY   +  DK+          +++ +      H GP   AF     T Y    
Sbjct: 210 VNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH-GPVEAAF-----TVY-EDF 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q    VY  +   E+  +A ++I+GWG +NG PYW + +++   +G+ G  +I+RG NE
Sbjct: 263 YQYKTGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321

Query: 250 AIIESLVNGALPK 262
             IE  V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/191 (34%), Positives = 93/191 (48%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +  + GLVTGG ++S+ GCQP   P C+H      +P C       PKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKC 222

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
             +C   NY   +  DK      Y ++ +          GP   AF     T Y   L  
Sbjct: 223 SKKC-EANYNVTYKDDKHYGKNSYSVDSVEKIMTEIMTNGPVEAAF-----TVYEDFLSY 276

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G VY      E+  +A VKI+GWGE+NG PYW + +++   +G++G   ILRG++E  
Sbjct: 277 KSG-VYQHRTGQELGGHA-VKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDECG 334

Query: 252 IESLVNGALPK 262
           IES +   LPK
Sbjct: 335 IESQIVAGLPK 345


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/193 (34%), Positives = 92/193 (47%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 150 CNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCSGEGGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+   G   Y  P             GP   AF     + Y+  
Sbjct: 209 SKIC-EPGYSPSYKEDKH--FGCDTYSVPSDEKEIMVEIYKNGPVEAAF-----SVYSDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    V+   E+V    V+I+GWG ENG PYW + +++   +GD G  KILRGR+
Sbjct: 261 LLYKSGVYQHVTG--EMVGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGRD 318

Query: 249 EAIIESLVNGALP 261
              IES +   +P
Sbjct: 319 HCGIESEIVAGIP 331


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 91/196 (46%), Gaps = 23/196 (11%)

Query: 77  IWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           ++ C+ G     W +  + G+VTGG + +  GCQP   PPC        + E     + Q
Sbjct: 150 VFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQ 203

Query: 137 P-----KCHTRCTNDN---YGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAFWRSFCTKY 185
           P     KC  +C  D+   Y +  ++  D Y +    +  D   +GP   +F        
Sbjct: 204 PTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASF------DV 257

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
                     VY  + +A  +    VK++GWG E G PYW +V+++GEQ+GDKG  KILR
Sbjct: 258 YDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILR 317

Query: 246 GRNEAIIESLVNGALP 261
           G +E  IES     +P
Sbjct: 318 GTDECGIESSCTAGVP 333


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 92/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  SS W +  K GLV+GG ++S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
            +RC    Y   + QDK      Y + G            GP   AF     T Y   + 
Sbjct: 207 ISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAF-----TVYEDFVM 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    VS S  ++    +K++GWGEE+G PYW   +++   +GD G  KILRG N  
Sbjct: 261 YKSGVYQHVSGS--VLGGHAIKVLGWGEEDGIPYWLCANSWNTDWGDNGFFKILRGSNHC 318

Query: 251 IIESLVNGALPK 262
            IES +   +PK
Sbjct: 319 GIESEIVAGIPK 330


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +   +GLV+GG + S++GCQP    PC H    T +P  +   TP  KC
Sbjct: 150 CNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPCAEGGRTP--KC 207

Query: 140 HTRCTNDNYGRGFFQD-KYQINGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPLF 190
           H  C N+NY   + +D  +  +   +  DP          GP   AF     + Y+  + 
Sbjct: 208 HRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAF-----SVYSDFMN 262

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V  S  ++    ++I+GWG E G PYW + +++   +GDKGT KILRG +  
Sbjct: 263 DKSGVYRHVKGS--LLGGHAIRILGWGVEKGTPYWLVANSWNTDWGDKGTFKILRGSDHC 320

Query: 251 IIESLVNGALPK 262
            IE  V   LP+
Sbjct: 321 GIEGSVVTGLPR 332


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 84/193 (43%), Gaps = 19/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 138
           C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T    + P 
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
           C  +C N NY   F  D Y+  G      P+         GP    F+        R L 
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK--GKYYKLSPYMAMKDIFDNGPITTQFYM------YRDLV 241

Query: 191 QTNGRVYAVSASAEIVAYA--TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
                VY     ++   +   +VKI GWGEENG PYW + ++FG  +G  GT KI RG +
Sbjct: 242 DYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGND 301

Query: 249 EAIIESLVNGALP 261
               +  +   LP
Sbjct: 302 GCFFQEKMYAGLP 314


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 92/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK      Y +  N   +  +   +GP   AF       Y   L 
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+ S  IV    ++I+GWG E G+PYW I +++ E +G+KG  +++RGR+E 
Sbjct: 272 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDEC 329

Query: 251 IIESLVNGALPK 262
            IES V   L K
Sbjct: 330 SIESHVVAGLIK 341


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 89/193 (46%), Gaps = 23/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
           C+ G     W +  + G+VTGG + +  GCQP   PPC        + E     + QP  
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206

Query: 138 ---KCHTRCTNDN---YGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRP 188
              KC  +C  D+   Y +  ++  D Y +    +  D   +GP   +F           
Sbjct: 207 RNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASF------DVYDD 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
                  VY  + +A  +    VK++GWG E G PYW +V+++GEQ+GDKG  KILRG +
Sbjct: 261 FMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTD 320

Query: 249 EAIIESLVNGALP 261
           E  IES     +P
Sbjct: 321 ECGIESSCTAGVP 333


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLG-------LYFDPHFGPFWPAFWRSFCTKYT 186
             +C +  Y + + QDK      Y + G         +   P  G F+   +  F    +
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFY--MFEDFLVYKS 271

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                T GR         +V    ++++GWG ENG  YW I +++ E +G+KG  ++ RG
Sbjct: 272 GIYHYTTGR---------LVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRG 322

Query: 247 RNEAIIESLVNGALP 261
            NE  IE+ +N  LP
Sbjct: 323 NNECGIEARINAGLP 337


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 19/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  ++ W +  + GLV+ G + +  GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    YG+ +  DK      Y I+             GP    F     T Y   L 
Sbjct: 212 VHLCRK-GYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADF-----TVYADFLS 265

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY    S +++    ++I+GWG ENG PYW + +++ E +GD G  KILRG++E 
Sbjct: 266 YKSG-VYQ-HHSGDVLGGHAIRILGWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDEC 323

Query: 251 IIESLVNGALPKD 263
            IE  +N  +PKD
Sbjct: 324 GIEDDINAGIPKD 336


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTRMCYGNQELDFKEDHHWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KILRG 
Sbjct: 263 PNYKSG--VYMKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKILRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 90/193 (46%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++ K G  TGG++ +  GC+P S  PC       + P+C       P C
Sbjct: 150 CDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPAC 209

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +CTN  Y   +  DK+          ++  +      H GP   AF     T Y    
Sbjct: 210 VNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAH-GPVEAAF-----TVY-EDF 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q    VY  +   E+  +A ++I+GWG +NG PYW + +++   +G+ G  +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321

Query: 250 AIIESLVNGALPK 262
             IE  V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLV+GG + S+ GC+P S  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEH-HVNGSRPPCTGEGGDTPQC 206

Query: 140 HTRCTNDNYGRGFFQDK------YQING------LGLYFDPHFGPFWPAFWRSFCTKYTR 187
             +C    Y  G+ QDK      Y ++       L +Y +   GP   AF     T Y  
Sbjct: 207 TKKC-EAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKN---GPVEGAF-----TVYED 257

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L    G    V+ SA  V    +K++GWGEENG PYW   +++   +GD G  KILRG 
Sbjct: 258 FLLYKTGVYQHVTGSA--VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGFFKILRGS 315

Query: 248 NEAIIESLVNGALPK 262
           +   IES +   +PK
Sbjct: 316 DHCGIESEIVAGIPK 330


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 96/194 (49%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W +  K GLVTGG + SN GC+P S PPC H +   + P C+      PKC
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEH-HVNGTRPPCQGEGD-TPKC 204

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
            T+C  D Y   + +DKY   G   Y  P             GP   AF     + Y   
Sbjct: 205 QTKCI-DGYTPAYEKDKY--FGKKTYSVPSKQEQIMTELYKNGPVEAAF-----SVYEDF 256

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G VY    + +++    +KI+GWG+EN  PYW   +++   +G++G  KILRG +
Sbjct: 257 LLYKSG-VYQ-HLTGDMLGGHAIKILGWGKENNTPYWLAANSWNTDWGNQGFFKILRGGD 314

Query: 249 EAIIESLVNGALPK 262
           E  IES V   +P+
Sbjct: 315 ECGIESEVVAGIPQ 328


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 18/195 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  K G+VTG       GC+P  FPPC H +  T    CK    P PKC
Sbjct: 190 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 249

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C +    + + +DK+           +  +      H GP   AF      +     
Sbjct: 250 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH-GPVEVAF------EVYEDF 302

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G +Y V    +I     VK++GWG E G PYW + +++   +G+ G  +I+RG +E
Sbjct: 303 LMYDGGIY-VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNTDWGEDGFFRIIRGIDE 361

Query: 250 AIIESLVNGALPKDN 264
             IES V G LPK N
Sbjct: 362 CGIESSVVGGLPKLN 376


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 18/195 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  K G+VTG       GC+P  FPPC H +  T    CK    P PKC
Sbjct: 149 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C +    + + +DK+           +  +      H GP   AF      +     
Sbjct: 209 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH-GPVEVAF------EVYEDF 261

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G +Y V    +I     VK++GWG E G PYW + +++   +G+ G  +I+RG +E
Sbjct: 262 LMYDGGIY-VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNTDWGEDGFFRIIRGIDE 320

Query: 250 AIIESLVNGALPKDN 264
             IES V G LPK N
Sbjct: 321 CGIESSVVGGLPKLN 335


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 27/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
           C+ G     W +  + G+VTGG +++  GCQP   PPC        + E     + QP  
Sbjct: 153 CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPC------VRDDEGHNSCSGQPTE 206

Query: 138 ---KCHTRCTND---NYGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYT 186
              KC  +C  D   NY +  ++  D Y ++   +  D   +GP   +F  +  F T Y 
Sbjct: 207 RNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMVYGPIEASFDVYDDF-TSYE 265

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             ++Q        + +A  +    VK++GWG E G PYW +V+++GEQ+GDKG  KILRG
Sbjct: 266 SGVYQK-------TENASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRG 318

Query: 247 RNEAIIESLVNGALP 261
            +E  +ES     +P
Sbjct: 319 TDECGVESSCTAGVP 333


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 96/198 (48%), Gaps = 23/198 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCKGEGGETPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+   G   Y  P             GP   AF     + YT  
Sbjct: 209 SKTC-EPGYSPSYKEDKHY--GYSSYGVPSSEQEIMAEIYKNGPVEGAF-----SVYTDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G VY      E+  +A ++I+GWG ENG PYW   +++   +GD G  KILRG++
Sbjct: 261 LVYKSG-VYQHVTGEEVGGHA-IRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQD 318

Query: 249 EAIIESLVNGALPK-DNY 265
              IES +   +P+ D Y
Sbjct: 319 HCGIESEIVAGIPRTDQY 336


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 97/201 (48%), Gaps = 35/201 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +    G+VTGG + ++ GCQP  FPPC H +     P C T   P P+C
Sbjct: 153 CNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPQC 210

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
              C    Y + + +DK Y      L  D               T+    +F+ NG V A
Sbjct: 211 VRDCRK-GYEKSYSEDKHYAKKVYTLSADE--------------TQIKTEIFK-NGPVEA 254

Query: 199 -VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             +  A+ V+Y +               ++I+GWG ENG PYW + +++ E +GDKG  K
Sbjct: 255 DFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGTENGVPYWLVANSWNEDWGDKGYFK 314

Query: 243 ILRGRNEAIIESLVNGALPKD 263
           ILRG +E  IE  +N  +PK+
Sbjct: 315 ILRGNDECGIEDDINAGIPKE 335


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W++  + G+VTG  ++   GCQP  FPPC H +     P C+      PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEH-HVVGPRPSCEG-DVETPKC 218

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
            T C          +  YG+  ++       +      H GP    F  +  F   Y   
Sbjct: 219 KTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEH-GPVEVDFEVYADF-PNYKSG 276

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEENG PYW I +++   +GD G  KI+RGRN
Sbjct: 277 VYQ--------HVSGGLLGGHAVRLLGWGEENGVPYWLIANSWNSDWGDNGYFKIIRGRN 328

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 329 ECGIESDVNAGIPK 342


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 91/195 (46%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYF-------DPHFGPFWPAFWRSFCTKYT 186
             +C +  Y + + QDK      Y +      F        P  G F+   +  F    +
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFY--MFEDFLVYKS 271

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                T GR         +V    ++++GWG ENG  YW I +++ E +G+KG  ++ RG
Sbjct: 272 GIYHYTTGR---------LVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRG 322

Query: 247 RNEAIIESLVNGALP 261
            NE  IE+ +N  LP
Sbjct: 323 NNECGIEARINAGLP 337


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/192 (34%), Positives = 96/192 (50%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W     +GLVTGG + S+ GCQP     C+H      +P CK   +P PKC
Sbjct: 73  CNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKP-CKG-DSPTPKC 130

Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PLF 190
             +C    N +Y   + F Q  Y +              GP   AF     T Y   P +
Sbjct: 131 ERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAF-----TVYADFPTY 185

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           ++   VY  ++ + +  +A +KI+GWGEENG PYW + +++   +GD+G  KI RG +E 
Sbjct: 186 KSG--VYQHTSGSALGGHA-IKILGWGEENGTPYWLVANSWNSDWGDEGFFKIKRGNDEC 242

Query: 251 IIESLVNGALPK 262
            IES + G LPK
Sbjct: 243 GIESGIVGGLPK 254


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 67/192 (34%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 96  CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 154

Query: 140 HTRCTNDNYGRGFFQDKY--------QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK+        Q N   +  D   +GP   AF       Y   L 
Sbjct: 155 KQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAF-----DVYEDFLN 208

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+ S  IV    ++I+GWG E   PYW I +++ E +G+KG  +I+RGR+E 
Sbjct: 209 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDEC 266

Query: 251 IIESLVNGALPK 262
            IES V   L K
Sbjct: 267 SIESNVVAGLIK 278


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/195 (32%), Positives = 93/195 (47%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +  K GLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGDTPQC 206

Query: 140 HTRCT---------NDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
            ++C          + +YG+       D+ +I     Y     GP   AF     T Y  
Sbjct: 207 LSQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQ----YEIYKNGPVEGAF-----TVYED 257

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            +   +G    VS SA  V    +K++GWGEENG PYW   +++   +GD G  K LRG 
Sbjct: 258 FVLYKSGVYQHVSGSA--VGGHAIKVLGWGEENGVPYWLCANSWNTDWGDNGFFKFLRGS 315

Query: 248 NEAIIESLVNGALPK 262
           +   IES +   +PK
Sbjct: 316 DHCGIESEIVAGIPK 330


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W++  + G+VTG  +++  GCQP  FPPC H +     P C       PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEH-HVVGPRPSCGG-DVETPKC 218

Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
            T C          +  YG+  ++       +      H GP    F  +  F   Y   
Sbjct: 219 KTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDH-GPVEVDFEVYADF-PNYKSG 276

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEENG PYW I +++   +GD G  KI+RGRN
Sbjct: 277 VYQ--------HVSGGLLGGHAVRLLGWGEENGVPYWLIANSWNSDWGDNGYFKIIRGRN 328

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 329 ECGIESDVNAGIPK 342


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 90/192 (46%), Gaps = 21/192 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 135
           C+ G     W +  + G+VTGG +++  GCQP   PPC       N  + +P       P
Sbjct: 153 CNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQP-----TEP 207

Query: 136 QPKCHTRCTND---NYGRGFFQDK--YQINGLGLYFDP-HFGPFWPAFWRSFCTKYTRPL 189
             KC   C  D   +Y +G ++ K  Y +N   +  D   +GP   +F            
Sbjct: 208 NHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDTMQKDTIAYGPIEASF------DVYDDF 261

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                 VY  +  A+ +    VK++GWGEE+G PYW +V+++GEQ+G  G  KILRG NE
Sbjct: 262 VNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPYWLMVNSWGEQWGANGMFKILRGTNE 321

Query: 250 AIIESLVNGALP 261
             IE      +P
Sbjct: 322 CGIEGSPTAGVP 333


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 90/190 (47%), Gaps = 15/190 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ--P 137
           C  G  S+ W ++ ++G+ TGG +  +T C+P  FPPC+H      +P      TPQ   
Sbjct: 158 CKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQCVK 217

Query: 138 KCHTRCTNDNYGRGF------FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           +C++  T + Y +        +  K  +  +      H GP   +F      K       
Sbjct: 218 ECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAH-GPVQASF------KVAADFLT 270

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY  +   +     +VKI+GWG+E   PYW I +++ E +G+KG  ++LRGRNE  
Sbjct: 271 YKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECG 330

Query: 252 IESLVNGALP 261
           IE+ +   LP
Sbjct: 331 IEAQIVAGLP 340


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 75/250 (30%), Positives = 112/250 (44%), Gaps = 33/250 (13%)

Query: 27  SCIEARAVATATPLAFAVC--RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           SC    A      ++  VC   + K++VE ++   ++     C            C+ G 
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGM---------GCNGGY 154

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
            S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC  +C 
Sbjct: 155 PSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKCVKQC- 212

Query: 145 NDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTN 193
            + Y   +  DK+   G   Y  P             GP   A    F      PL+++ 
Sbjct: 213 EEGYSPAYGTDKHF--GTTSYGVPTSEKEIMAEIYKNGPVEGA----FLVYADFPLYKSG 266

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
             VY      E+  +A +KI+GWG ENG PYW   +++   +GD G  KILRG++   IE
Sbjct: 267 --VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIE 323

Query: 254 SLVNGALPKD 263
           S +   +PK+
Sbjct: 324 SEIVAGVPKN 333


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 94/195 (48%), Gaps = 22/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
             +C  D Y   +  DK+   G   Y  P             GP   AF          P
Sbjct: 209 VKQC-EDGYAPVYGSDKHF--GATSYGVPSSEKEIMAEIYKNGPVEGAF----LVYADFP 261

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++++   VY      E+  +A +KI+GWG ENG PYW   +++   +GD G  KILRG++
Sbjct: 262 MYKSG--VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 249 EAIIESLVNGALPKD 263
              IES +   +PK+
Sbjct: 319 HCGIESEIVAGIPKN 333


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 95/195 (48%), Gaps = 23/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    G+VTGG + +  GCQP  FPPC H +     P C T   P P+C
Sbjct: 32  CNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 89

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
              C  + Y + + +DK      Y I+             GP    F  +  F   Y   
Sbjct: 90  AKTC-REGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGPVEADFNVYADF-PSYKSG 147

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q +        S E++    ++I+GWG E+G PYW + +++ E +GDKG  KI RG +
Sbjct: 148 VYQRH--------SKEMLGGHAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGND 199

Query: 249 EAIIESLVNGALPKD 263
           E  IE+ +N  +PK+
Sbjct: 200 ECGIENDINAGIPKE 214


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 92/190 (48%), Gaps = 18/190 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
             +C    Y   + QDK      Y +  N   +  +   +GP   AF       Y   L 
Sbjct: 218 KQKCQK-GYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+ S  IV    ++I+GWG E G+PYW I +++ E +G+ G  +++RGR+E 
Sbjct: 272 YKSGIYRHVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDEC 329

Query: 251 IIESLVNGAL 260
            IES V   L
Sbjct: 330 SIESHVVAGL 339


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 104/252 (41%), Gaps = 40/252 (15%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  S      ++F F +     C W          C+ G   
Sbjct: 101 SCGSCWAFGAVEAMSDRVCIHSNGE---SNFHFSSDDLVSCCWTCGM-----GCNGGYPG 152

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W +  ++GLV+GG + +  GC+P   PPC H +   S P C       PKC   C   
Sbjct: 153 AAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH-HTNGSRPACDASEGNTPKCAKSC--- 208

Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
                  +  Y+IN      D HFG    A+  S   K  +     NG V  A S  A+ 
Sbjct: 209 -------ESNYKIN---YSNDLHFGS--KAYSISSDVKQIQAEILQNGPVEGAFSVYADF 256

Query: 206 VAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           V Y T               ++I GWG EN  PYW I +++   +GD GT KILRG +  
Sbjct: 257 VNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIANSWNTDWGDSGTFKILRGSDHC 316

Query: 251 IIESLVNGALPK 262
            IES +   LPK
Sbjct: 317 GIESGIVAGLPK 328


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKY--------QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK+        Q N   +  D   +GP   AF       Y   L 
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAF-----DVYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+ S  IV    ++I+GWG E   PYW I +++ E +G+KG  +++RGR+E 
Sbjct: 272 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRMVRGRDEC 329

Query: 251 IIESLVNGALPK 262
            IES V   L K
Sbjct: 330 SIESDVVAGLIK 341


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/265 (29%), Positives = 114/265 (43%), Gaps = 37/265 (13%)

Query: 7   SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
           S+IRD S             SC    AV+ A  ++  +C +S      T     A     
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASNGK---TQLSISADDINA 157

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
           C  +V        C+ G     W    K+G VTGG++   TGC+P  +PPC H    T  
Sbjct: 158 CCGMVCGNG----CNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHY 213

Query: 127 PECKTLATPQPKCHTRC--------TND-NYGRGFFQDKYQINGLGLYFDPHFGPFWPAF 177
             C +   P  KC   C        T D ++G+  +    ++  +      H GP   AF
Sbjct: 214 KPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTH-GPVEVAF 272

Query: 178 WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
                          +G VY  +A A +  +A VK++GWG +NG PYW   +++ E +G+
Sbjct: 273 ------SVYEDFEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWGE 325

Query: 238 KGTIKILRGRNEAIIESLVNGALPK 262
            G  +I+RG NE  IES V G +PK
Sbjct: 326 NGYFRIIRGVNECGIESGVVGGIPK 350


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/195 (33%), Positives = 90/195 (46%), Gaps = 26/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLVTGG + S+ GC+P S PPC H +   + P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEEGDTPQC 206

Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
             +C    Y  G+ QDK+             QI    L   P  G F         T Y 
Sbjct: 207 SNQCET-GYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAF---------TVYE 256

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G    VS SA  V    +K++GWGEE G PYW   +++   +G+ G  KILRG
Sbjct: 257 DFLLYKSGVYQHVSGSA--VGGHAIKVLGWGEEGGTPYWLAANSWNTDWGENGFFKILRG 314

Query: 247 RNEAIIESLVNGALP 261
           ++   IES +   +P
Sbjct: 315 KDHCGIESEMVAGVP 329


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 22/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPSCKGEEGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
              C  + Y   +  DK+   G   Y  P             GP   AF          P
Sbjct: 209 MKTC-EEGYTPAYGSDKHF--GATSYGVPSSEKEIMADIYKNGPVEGAF----VVYADFP 261

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L+++   VY      E+  +A +KI+GWG ENG PYW   +++   +GD G  KILRG++
Sbjct: 262 LYKSG--VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 249 EAIIESLVNGALPKD 263
              IES V   +PK+
Sbjct: 319 HCGIESEVVAGIPKN 333


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/192 (35%), Positives = 94/192 (48%), Gaps = 24/192 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W +   +GLVTGG   S  GC+P +  PC H +   S P C+      PKC
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEH-HVNGSRPPCQG-EVETPKC 205

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
            T+C N+ Y   + +DK+   G   Y  P             GP   AF     + Y   
Sbjct: 206 VTQC-NNGYSLSYPKDKH--FGQRSYSIPSQQEQIMTELYKNGPVEAAF-----SVYADF 257

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   NG    V+   +++    VKI+GWGEENG PYW + +++   +GDKG  KI RG +
Sbjct: 258 LLYKNGVYQHVTG--DMLGGHAVKILGWGEENGTPYWLVANSWNSDWGDKGFFKIKRGND 315

Query: 249 EAIIES-LVNGA 259
           E  IES +V GA
Sbjct: 316 ECGIESEMVAGA 327


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/195 (29%), Positives = 89/195 (45%), Gaps = 17/195 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W ++ K G+ TGG++ S  GC+P S PPC       + P C    +P P C
Sbjct: 148 CEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSC 207

Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPL 189
             +CT         + +   G   D+   + + +  D    GP    F       Y   L
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATF-----EVYDDFL 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
             T G    ++ + +   + +V+I+GWG   G PYW   +++G Q+G+ GT ++LRG NE
Sbjct: 263 QYTTGIYVHLTGNKQ--GHLSVRIIGWGVWQGVPYWLCANSWGRQWGENGTFRVLRGTNE 320

Query: 250 AIIESLVNGALPKDN 264
             +ES     +PK N
Sbjct: 321 CGLESNCVSGMPKLN 335


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 90/208 (43%), Gaps = 29/208 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  GI S  W+WVH +G+ TGG + +      + GC P  FPPC H    +  P+C   +
Sbjct: 210 CDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 269

Query: 134 TPQPKCHTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCT 183
              P C  +C N  Y      D+          Y +N          GP  P ++     
Sbjct: 270 YETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTD-GPVGPIYFCDPSV 328

Query: 184 KYTR---------PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
            + +                 VY  ++  E+  +A VKI+GWGEE G+ YW +V+++ E 
Sbjct: 329 NFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHA-VKIIGWGEETGQAYWLVVNSWNED 387

Query: 235 FGDKGTIKILRGRNEAIIESLVNGALPK 262
           +GD G  KI  G  E  I+  + G  PK
Sbjct: 388 WGDNGLFKIALGNCE--IDDDLLGGTPK 413


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W++  + GLVTGG ++S+ GCQP +   C+H      +P C     P PKC
Sbjct: 158 CEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKC 216

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
              C    Y   + +DK      Y ++G+          GP   AF     T Y     Q
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGMSAYSVHGVEKIMTEIMTNGPVEGAF-----TVYAD-FPQ 269

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY  +    +  +A +KI+GWG ENG  YW + +++   +GD+G  KILRG++E  
Sbjct: 270 YKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECG 328

Query: 252 IESLVNGALPK 262
           IES ++   PK
Sbjct: 329 IESQISAGEPK 339


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 90/194 (46%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLV+GG + S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
             +C +  Y   + QDK+   G   Y  P             GP   AF     T Y   
Sbjct: 207 VRQCES-GYTPSYIQDKHY--GKTSYSVPSDEQQIQTEIYKNGPVEGAF-----TVYEDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G    VS SA  V    +K++GWGEENG PYW   +++   +GD G  KILRG +
Sbjct: 259 LLYKTGVYQHVSGSA--VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGYFKILRGSD 316

Query: 249 EAIIESLVNGALPK 262
              IES +   +PK
Sbjct: 317 HCGIESEIVAGIPK 330


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 95/194 (48%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG + S+ GC+P S PPC H    T  P C       P+C
Sbjct: 140 CNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGT-RPPCSGEGGETPEC 198

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
             +C  D Y   + QDK+   G+  Y  P             GP   AF       Y+  
Sbjct: 199 VKKC-EDGYTPAYKQDKHY--GVTSYGIPRSEKEIMAEIYKNGPVEGAF-----VVYSDF 250

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G VY   +  E+  +A ++I+GWG +NG PYW   +++   +G+ G  +ILRG++
Sbjct: 251 LMYKSG-VYQHVSGEEVGGHA-IRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQD 308

Query: 249 EAIIESLVNGALPK 262
              IES +   +PK
Sbjct: 309 HCGIESEIVAGIPK 322


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 96/195 (49%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G  +  W +  + GLVTGG ++S+ GC+P S  PC H +   + P C       PKC
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEH-HVNGTRPPCSG-EQDTPKC 201

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR- 187
              C    Y   + QDK+   G  +Y  P             GP   AF     T Y   
Sbjct: 202 TGVCI-PKYSVPYKQDKH--FGSKVYNVPSDQQQIMTELYTNGPVEAAF-----TVYEDF 253

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           PL+++   VY     + +  +A VKI+GWGEENG P+W + +++   +GD G  KILRG 
Sbjct: 254 PLYKSG--VYQHLTGSALGGHA-VKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGH 310

Query: 248 NEAIIESLVNGALPK 262
           +E  IES +   LPK
Sbjct: 311 DECGIESEMVAGLPK 325


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 91/195 (46%), Gaps = 27/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G  SS W +  K+GLVTGG   S  GC+P S  PC H    T  P   T  TP  KC
Sbjct: 146 CSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQETP--KC 203

Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
             +C  D Y   + +DK+             QI    LY +   GP   AF     T Y 
Sbjct: 204 EKKCI-DGYLTSYLKDKHFGKRSYSLPSQQEQIM-TELYKN---GPVEAAF-----TVYA 253

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L    G    V+   E++    +KI+GWGEE+G PYW   +++   +GDKG  KI RG
Sbjct: 254 DFLLYKTGVYQHVTG--EVLGGHAIKILGWGEESGTPYWLAANSWNGDWGDKGFFKIKRG 311

Query: 247 RNEAIIESLVNGALP 261
            +E  IES +    P
Sbjct: 312 NDECGIESEMVAGTP 326


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +   +GLV+GG + S+ GCQP    PC H    T +P  +   TP  KC
Sbjct: 149 CNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKPCAEGGRTP--KC 206

Query: 140 HTRCTNDNYGRGFFQD-KYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
           H  C N NY   + +D  +  +   +  DP          GP   AF     + Y+  + 
Sbjct: 207 HKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAF-----SVYSDFMS 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V  S  ++    ++I+GWG E G PYW + +++   +GD GT KILRG +  
Sbjct: 262 YKSGVYRHVKGS--LLGGHAIRILGWGMEKGTPYWLVANSWNTDWGDNGTFKILRGSDHC 319

Query: 251 IIESLVNGALPK 262
            IE  V   LP+
Sbjct: 320 GIEDSVVAGLPR 331


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 89/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +  K GLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 64  CNGGYPSAAWDFWTKDGLVSGGLYDSHIGCRPYTIPPCEH-HVNGSRPSCSGEGGETPQC 122

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
             RC    Y   + QDK Y      +  D           GP   AF     T Y   + 
Sbjct: 123 VYRC-EAGYTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAF-----TVYEDFVL 176

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
              G    V+ SA  +    +KI+GWGEENG PYW   +++   +G+ G  KILRG N  
Sbjct: 177 YKTGVYQHVTGSA--LGGHAIKILGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHC 234

Query: 251 IIESLVNGALPK 262
            IES +   +P 
Sbjct: 235 GIESEIVAGIPN 246


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI    W +    G+VTGG++ ++TGCQP  FP C H + + +   C+      P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
           +  C  D Y   +  DKY   G   Y+               GP    F+      +   
Sbjct: 221 YQTCQPD-YAIQYENDKYY--GKSSYYVTSDEVSIMKEILLNGPVEATFYV-----FDDF 272

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRG 246
           L    G    V+ S  ++    ++I+GWG    N  PYW   +++ +Q+GDKG  KILRG
Sbjct: 273 LNYKTGVYKYVTGS--LLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRG 330

Query: 247 RNEAIIESLVNGALPK 262
            NE  IES+V   LPK
Sbjct: 331 SNECGIESMVTAGLPK 346


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 97/198 (48%), Gaps = 23/198 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   + P+C       PKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH-HVNGTRPKCTGEGGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
              C    Y   + +DKY   G   Y  P             GP   AF     + ++  
Sbjct: 209 SKTC-EPGYSPSYKEDKYY--GYSSYSVPSTEKEIMAEIYKNGPVEAAF-----SVFSDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G VY    + E++    ++I+GWG+ENG PYW + +++   +GD G  KILRG +
Sbjct: 261 LTYKSG-VYK-HVAGEVLGGHAIRILGWGKENGVPYWLVGNSWNVDWGDNGFFKILRGED 318

Query: 249 EAIIESLVNGALPK-DNY 265
              IES V   +P+ D Y
Sbjct: 319 HCGIESEVVAGIPRTDQY 336


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 93/193 (48%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGSTPKC 231

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
            +R     Y   + +DK+   G   Y  P             GP   AF     + Y+  
Sbjct: 232 -SRICEAGYTPSYKEDKHF--GCSSYSVPSSETEIMAEIYKNGPVEAAF-----SVYSDF 283

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    V+   E++    V+I+GWG E+G PYW + +++   +GD G  KILRG++
Sbjct: 284 LLYKSGVYQHVTG--EMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQD 341

Query: 249 EAIIESLVNGALP 261
              IES +   LP
Sbjct: 342 HCGIESEIVAGLP 354


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/190 (34%), Positives = 89/190 (46%), Gaps = 18/190 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 121 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 179

Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK      Y +  N   +  +   +GP   AF       Y   L 
Sbjct: 180 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 233

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+ S  IV    ++I+GWG E   PYW I +++ E +G+KG  +I+RGR+E 
Sbjct: 234 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDEC 291

Query: 251 IIESLVNGAL 260
            IES V   L
Sbjct: 292 SIESHVVAGL 301


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
             W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C   
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223

Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            Y   + QDK      Y +  N   +  +   +GP   AF       Y   L   +G   
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
            V+ S  IV    ++I+GWG E G+PYW I +++ E +G+KG  +++RGR+E  IES V 
Sbjct: 279 HVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVV 336

Query: 258 GALPK 262
             L K
Sbjct: 337 AGLIK 341


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H N     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-NTLGPLPVCDG-DVETPPC 221

Query: 140 HTRCT--------NDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
              C         ND  YG+  ++ K     +      H GP    F  +  F   Y   
Sbjct: 222 KRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEEN  PYW I +++   +GD G  KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 332 ECGIESDVNAGIPK 345


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 88/193 (45%), Gaps = 18/193 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTG    +N GC+P  FPPC H +  T    C+    P PKC
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKC 249

Query: 140 HTRCT--------NDN--YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C         +D+  YGR  +  K  +  +      H GP   AF      +     
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTH-GPVEVAF------EVYEDF 302

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
               G +Y V    ++     VK++GWG + G PYW I +++   +G++G  +ILRG +E
Sbjct: 303 LHYAGGIY-VHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSWNTDWGEEGFFRILRGVDE 361

Query: 250 AIIESLVNGALPK 262
             IES V G +PK
Sbjct: 362 CGIESGVVGGIPK 374


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 163 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HVIGPLPSCDG-DVETPSC 220

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTRP 188
            T C    Y   + +DK Y      ++ +P          GP    F  +  F   Y   
Sbjct: 221 KTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADF-PNYKSG 278

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEEN  PYW I +++   +GDKG  KI+RG+N
Sbjct: 279 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNSDWGDKGYFKIVRGKN 330

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 331 ECGIESDVNAGIPK 344


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 94/191 (49%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W++  K GLVTGG ++S+ GC P +   C+H      +P  K++  P PKC
Sbjct: 158 CEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPCSKSIG-PTPKC 216

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
              C    Y   + +DK      Y ++G+          GP   AF     T Y     Q
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGSSAYSVHGVEKIMTEIMTNGPVEGAF-----TVYAD-FPQ 269

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY  +    +  +A +KI+GWG ENG  YW + +++   +GD+G  KILRG++E  
Sbjct: 270 YKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECG 328

Query: 252 IESLVNGALPK 262
           IES ++   PK
Sbjct: 329 IESQISAGEPK 339


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 109/247 (44%), Gaps = 30/247 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           SC    A      ++  VC +S  K+H     FRF A     C          + C+ G 
Sbjct: 109 SCGSCWAFGAVEAMSDRVCVASGGKIH-----FRFSAEDLVSCCHTCG-----FGCNGGF 158

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
             + W++  ++GLV+GG   SN GCQP +  PC H +   + P C+      PKC  +C 
Sbjct: 159 PGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPKCVKKC- 216

Query: 145 NDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGR 195
            ++Y   + +DK      Y I              GP   AF     T Y   L    G 
Sbjct: 217 QESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAF-----TVYEDLLHYKEGV 271

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
              V+   +++    ++I+GWG ENG  YW I +++   +GD G  KILRG +   IES 
Sbjct: 272 YQHVTG--KMLGGHAIRILGWGVENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESS 329

Query: 256 VNGALPK 262
           ++  LPK
Sbjct: 330 ISAGLPK 336


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 90/191 (47%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206

Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C          + ++G+  +      NG+      + GP   AF     T Y   L 
Sbjct: 207 DMKCEPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKN-GPVEAAF-----TVYEDFLL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    +S SA  +    +KI+GWGEENG PYW   +++   +GD G  KILRG +  
Sbjct: 261 YKSGVYQHMSGSA--LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHC 318

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 319 GIESEIVAGIP 329


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 89/193 (46%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W ++   G+ TGG +  ++ C+P  FPPC+H +     P C  +  P PKC
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDH-HVVGQYPPCGPIK-PTPKC 213

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
             +C +    + + QD +  + +  Y  P+            GP   +F      +    
Sbjct: 214 VKQCNSQYTEKTYQQDLHHPSKV--YQLPNNAEAIQREIMAHGPVQASF------RVASD 265

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
                  VY      +     +VKI+GWG E G PYW I +++ E +G+ G  K+LRG+N
Sbjct: 266 FLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPYWLIANSWNEDWGENGLFKMLRGKN 325

Query: 249 EAIIESLVNGALP 261
           E  IE+ V   LP
Sbjct: 326 ECGIEAEVVAGLP 338


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 23/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +     +VTGG + +  GCQP  FPPC H       P C T   P P+C
Sbjct: 7   CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHT-VGPLPNC-TGIKPTPEC 64

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
              C  + Y + + +DK      Y I+             GP    F  +  F   Y   
Sbjct: 65  AKTC-REGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADF-PSYKSG 122

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q +        S E++    ++I+GWG E+G PYW + +++ E +GDKG  KI RG +
Sbjct: 123 VYQRH--------SEEMLGGHAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGND 174

Query: 249 EAIIESLVNGALPKD 263
           E  IE  +N  +PK+
Sbjct: 175 ECGIEDDINAGIPKE 189


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 87/183 (47%), Gaps = 18/183 (9%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
             W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C   
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223

Query: 147 NYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            Y   + QDK Y      +  +          +GP   AF       Y   L   +G   
Sbjct: 224 GYKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
            V+ S  IV    ++I+GWG E G+PYW I +++ E +G+KG  +++RGR+E  IES V 
Sbjct: 279 HVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVV 336

Query: 258 GAL 260
             L
Sbjct: 337 AGL 339


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 91/193 (47%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +    G+VTGG +HS+ GCQP   P C H      +   K L  P PKC
Sbjct: 151 CNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPFKACGKEL--PTPKC 208

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
             +C    Y + F QDK      Y I              GP   AF     T Y   P 
Sbjct: 209 SQKC-QPGYNKTFNQDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAF-----TVYADFPS 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY  +    +  +A VKI+GWG EN  PYW I +++   +GDKG  KI+RG++E
Sbjct: 263 YKSG--VYQHTTGGPLGGHA-VKILGWGTENNTPYWLIANSWNPTWGDKGYFKIIRGKDE 319

Query: 250 AIIESLVNGALPK 262
             IES +   +PK
Sbjct: 320 CGIESSIVAGMPK 332


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)

Query: 28  CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           C  + AV+    ++  +C  S  K  VE ++   I+  K   +           C  G++
Sbjct: 115 CASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGVT 164

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
             +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C   C  
Sbjct: 165 GYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 223

Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             Y   + QDK      Y + G+          + P    ++   Y   L   +G +Y  
Sbjct: 224 -GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPV--EAYLQIYEDFLNYKSG-IYRY 279

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGR+E +IES +
Sbjct: 280 TTGKYISGHA-VRLIGWGVENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 89/181 (49%), Gaps = 18/181 (9%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y
Sbjct: 168 WDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GY 225

Query: 149 GRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              + QDK      Y +  N   +  +   +GP   AF       Y   L   +G    V
Sbjct: 226 KTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYRHV 280

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           + S  IV    ++I+GWG E G+PYW I +++ E +G+KG  +++RGR+E  IES V   
Sbjct: 281 TGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAG 338

Query: 260 L 260
           L
Sbjct: 339 L 339


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + W+ WVHK GLV+GG   SN GCQP +  PC H +   + P C+      PK
Sbjct: 152 CNGGFPGAAWSYWVHK-GLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPK 209

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
           C  +C  D+Y   + +DK    G   Y  P             GP   AF     T Y  
Sbjct: 210 CVKKC-QDSYTVPYAKDKRY--GSKSYSIPRHEDQIRKEIMTNGPVEGAF-----TVYED 261

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L    G    V+   +++    ++I+GWG EN   YW I +++   +GD G  KILRG 
Sbjct: 262 LLHYKEGVYQHVTG--KMLGGHAIRILGWGVENNTKYWLIANSWNSDWGDNGFFKILRGE 319

Query: 248 NEAIIESLVNGALPK 262
           +   IES +   LPK
Sbjct: 320 DHLGIESSIAAGLPK 334


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 96/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W ++ ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W    K GLVTGG ++S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNT--CR--GKPAEKN 209

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +N   +  D   +GP   ++  +  F      
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHRYTRDAYYLNYQIIQNDLMTYGPIEASYDVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PNYKSG--VYMKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 22/192 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S+  W ++ + G+VTGG ++S   C+   FPPC+H       P+C T     PKC
Sbjct: 138 CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHG-IEGQYPQCSTKPPVVPKC 196

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAF--WRSFCTKYTRP 188
            T C  + Y   + +D+Y+ + +    +            GP   +F  +  F T  +  
Sbjct: 197 ETTC-QEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGI 255

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
                G+          +   TVKI+GWGEENG  YW  V+++  ++G+ G  +I  G N
Sbjct: 256 YHHVEGK---------FMNLHTVKIIGWGEENGEAYWKAVNSWNSEWGENGLFRIRLGTN 306

Query: 249 EAIIESLVNGAL 260
           E  IES V G L
Sbjct: 307 ECTIESQVEGGL 318


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209

Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    + +D   +GP   +F  +  F      
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHHYTRDAYYLTYGTIQYDVLAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 89/193 (46%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCSGEGGDTPNC 206

Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
             +C    Y   + QDK+           Q + +   F    GP   AF     T Y   
Sbjct: 207 DMKC-EPGYSPSYKQDKHFGKTSYSVPSNQNSIMAELFKN--GPVEGAF-----TVYEDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    +S S   V    +KI+GWGEENG PYW   +++   +GD G  KILRG +
Sbjct: 259 LLYKSGVYQHMSGSP--VGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 249 EAIIESLVNGALP 261
              IES +   +P
Sbjct: 317 HCGIESEIVAGIP 329


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 88/201 (43%), Gaps = 29/201 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K GLVTGG+  S  GC+P S  PC       + PEC    +  PKC
Sbjct: 145 CEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKC 204

Query: 140 HTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL--------- 189
              CT N++Y   + QDK            HFG    A  RS     T  L         
Sbjct: 205 EHHCTGNNSYPIPYDQDK------------HFGASAYAIGRSAKQIQTEILAHGPVEVGF 252

Query: 190 ------FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
                 +     +Y   A  E+  +A VK++GWG +NG PYW   +++   +G+KG  +I
Sbjct: 253 IVYEDFYLYKTGIYTHVAGGELGGHA-VKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRI 311

Query: 244 LRGRNEAIIESLVNGALPKDN 264
           LRG +E  IES     +P  N
Sbjct: 312 LRGVDECGIESAAVAGMPDLN 332


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 89/204 (43%), Gaps = 43/204 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+  + W +    G+V+GG + S+ GC+P   PPC H + + + P+CK   +  PKC
Sbjct: 154 CNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEH-HTSGNRPDCKG-NSKTPKC 211

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             +C         F  KYQ        D HF          +  + +         VY  
Sbjct: 212 QRQCVES------FDGKYQA-------DKHFAS------NVYNVRASEEDIMNEILVYG- 251

Query: 200 SASAEIVAYA---------------------TVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
              A+ + YA                      VKI+GWGEENG PYW   +++   +GD 
Sbjct: 252 PVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCANSWNTDWGDG 311

Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
           G  KILRG N   IE+ +N  +PK
Sbjct: 312 GFFKILRGYNHCKIEADINAGIPK 335


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H         +     P CK 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223

Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             T Q   +    ND  YG+  ++ K     +      H GP    F  +  F   Y   
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEEN  PYW I +++   +GD G  KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 332 ECGIESDVNAGIPK 345


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/193 (34%), Positives = 89/193 (46%), Gaps = 20/193 (10%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   S W+ WVHK G+VTGG + S+ GC P     C+H    T  P C     P P+
Sbjct: 190 CNGGFPGSAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKTIPPTPR 247

Query: 139 CHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
           C  R     Y   F  DK      Y +              GP    F     T Y   L
Sbjct: 248 C-VRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADF-----TVYEDFL 301

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY     + +  +A ++++GWG ENG PYW   +++  ++GDKG  KILRG +E
Sbjct: 302 HYKSG-VYQRHTDSALGGHA-IRLLGWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDE 359

Query: 250 AIIESLVNGALPK 262
             IES +   LPK
Sbjct: 360 CGIESDIVAGLPK 372


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H         +     P CK 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223

Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             T Q   +    ND  YG+  ++ K     +      H GP    F  +  F   Y   
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEEN  PYW I +++   +GD G  KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 332 ECGIESDVNAGIPK 345


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 91/196 (46%), Gaps = 25/196 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K+GLV+GG++ S  GC+P S  PC       + P+C       P+C
Sbjct: 147 CDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPEC 206

Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDP-------------HFGPFWPAFWRSFCTKY 185
            + CT+  +Y   + +DK+   GL  Y  P               GP    F        
Sbjct: 207 ASHCTSKSSYSVAYEKDKHY--GLSAY--PVGRKEAQIQTEILQHGPVEAGFL------V 256

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
               ++    +Y   +  E+  +A VKI+GWG ENG  YW + +++   +G+KG  +ILR
Sbjct: 257 YSDFYRYKSGIYTHVSGQELGGHA-VKILGWGVENGTKYWLVANSWNINWGEKGYFRILR 315

Query: 246 GRNEAIIESLVNGALP 261
           GRNE  IES V   +P
Sbjct: 316 GRNECGIESAVVAGIP 331


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H         +     P CK 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223

Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             T Q   +    ND  YG+  ++ K     +      H GP    F  +  F   Y   
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    V+++GWGEEN  PYW I +++   +GD G  KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 249 EAIIESLVNGALPK 262
           E  IES VN  +PK
Sbjct: 332 ECGIESDVNAGIPK 345


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 96/203 (47%), Gaps = 33/203 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +  K+GLV+GG + S+ GC+P S PPC H +   + P+C       PKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPQCTGEGGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
              C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 209 SKTC-EPGYSPSYKEDK------------HFG--YDSYSVSSNEKEIMAEIYKNGPVEGA 253

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ + Y T               ++I+GWG+ENG PYW + +++   +GD G  KI
Sbjct: 254 FTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGNSWNVDWGDSGFFKI 313

Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
           +RG +   IES +   +P+ D Y
Sbjct: 314 VRGEDHCGIESEIVAGIPRTDQY 336


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/190 (33%), Positives = 94/190 (49%), Gaps = 20/190 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S+ W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKSC-E 212

Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK      Y + G+           GP   AF     + Y+  L   +G  
Sbjct: 213 PGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 268 QHVTG--EMMGGHAIRILGWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEI 325

Query: 257 NGALPK-DNY 265
              +P+ D Y
Sbjct: 326 VAGIPRTDQY 335


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 90/199 (45%), Gaps = 26/199 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C+ G  +S W+WVH +G+ TGG + +      + GC P  FPPC H    +  P+C   +
Sbjct: 89  CNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 148

Query: 134 TPQPKCHTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCT 183
              P C  +C N  Y      D+          Y +N          GP   +F     T
Sbjct: 149 YETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTD-GPVSASF-----T 202

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G VY    S E +    VKI+GWGEE+G+ YW +V+++ E +GD G  KI
Sbjct: 203 VYEDFLAYKSG-VYK-HTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGDHGLFKI 260

Query: 244 LRGRNEAIIESLVNGALPK 262
             G     I+  + G  PK
Sbjct: 261 ALGN--CGIDDYLLGGTPK 277


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 92/191 (48%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y +              GP   AF     + Y+  L 
Sbjct: 208 SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGMP 330


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 112/266 (42%), Gaps = 39/266 (14%)

Query: 7   SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
           S+IRD S             SC    AV+ A  ++  +C +SK     T     A     
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASKGQ---TQVSISADDINA 157

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
           C  +         C+ G     W    K G VTGG++   TGC+P  +PPC H    T  
Sbjct: 158 CCGMACGNG----CNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHY 213

Query: 127 PECKTLATPQPKCHTRCTNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAF 177
             C +   P  KC   C    Y   + QD       Y ++             GP   AF
Sbjct: 214 KPCPSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAF 272

Query: 178 WRSFCTKYTRPLFQT-NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG 236
                T Y    F+  +G VY  +A A +  +A VK++GWG +NG PYW   +++ E +G
Sbjct: 273 -----TVYAD--FEVYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWG 324

Query: 237 DKGTIKILRGRNEAIIESLVNGALPK 262
           + G  +I+RG NE  IE  V G +PK
Sbjct: 325 ENGYFRIIRGVNECGIEHGVVGGIPK 350


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)

Query: 28  CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           C  + AV+    ++  +C  S  K  VE ++   I+  K   +           C  G++
Sbjct: 115 CASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGVT 164

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
             +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C   C  
Sbjct: 165 GYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 223

Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             Y   + QDK      Y + G+          + P    ++   Y   L   +G +Y  
Sbjct: 224 -GYNTSYEQDKHYGEFSYNVIGVESVIQKEIMMYGPV--EAYLHIYEDFLNYKSG-IYRY 279

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGR+E +IES +
Sbjct: 280 TTGQFISGHA-VRLIGWGVENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 93/190 (48%), Gaps = 20/190 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     P+C   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 212

Query: 146 DNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK+                      GP   AF     T Y+  L   +G V
Sbjct: 213 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAF-----TVYSDFLMYKSG-V 266

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y    + +I+    ++I+GWGEENG PYW + +++   +GDKG  KILRG++   IES +
Sbjct: 267 YQ-HTTGDIMGGHAIRILGWGEENGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEI 325

Query: 257 NGALPK-DNY 265
              +P+ D Y
Sbjct: 326 VAGIPRTDQY 335


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 113/255 (44%), Gaps = 31/255 (12%)

Query: 23  PYALSCIEARAVATATPLAFAVCRSSKMHV--ECTSFRFIAGVKQRCAWLVSRWMTIWVC 80
           P ++ C  + A      ++  +C  +  HV  E ++   +      C            C
Sbjct: 6   PLSIPCRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGD---------GC 56

Query: 81  SSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCH 140
           + G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC 
Sbjct: 57  NGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCS 114

Query: 141 TRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQ 191
             C    Y   + QDK Y  N   +              GP   AF     + Y+  L  
Sbjct: 115 KIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLLY 168

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++   
Sbjct: 169 KSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCG 226

Query: 252 IESLVNGALPK-DNY 265
           IES V   +P+ D Y
Sbjct: 227 IESEVVAGIPRTDQY 241


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  +RGLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+   G+  Y  P             GP   AF       Y   
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    VS   E V    ++I+GWG ENG PYW   +++   +GD G  KILRG +
Sbjct: 261 LMYKSGVYQHVSG--EQVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGED 318

Query: 249 EAIIESLVNGALPK 262
              IES +   +P+
Sbjct: 319 HCGIESEIVAGVPR 332


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 92/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   +   +    GLVTG  + +N+ CQ  +F PC H   +   P C T   P P C
Sbjct: 161 CDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPC-TGELPTPPC 219

Query: 140 HTRC-TNDNYGRGFFQDKYQ-INGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPL 189
              C +N  +   + +D ++     G+  D           GP   A      T Y   L
Sbjct: 220 INSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVAL-----TVYEDFL 274

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
               G VY      E+  +A VK+VGWG ENG PYWTIV+++ E +GDKGT KILRG+NE
Sbjct: 275 TYKTG-VYQHVTGDELGGHA-VKMVGWGVENGTPYWTIVNSWNESWGDKGTFKILRGKNE 332

Query: 250 AIIESLVNGALP 261
             IES    ALP
Sbjct: 333 CGIESSCVTALP 344


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 89/183 (48%), Gaps = 12/183 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G++  +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              C    Y   + QDK      Y + G+          + P    ++   Y   L   +
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGR+E +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRDECLIE 332

Query: 254 SLV 256
           S +
Sbjct: 333 SFI 335


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +  K GLV+GG ++S+ GC+P + PPC H +   S P C       PKC
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEH-HVNGSRPHCSGEGGDTPKC 206

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C          + +YG+  +  +  +  +      + GP   AF       Y   + 
Sbjct: 207 VHSCEAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQN-GPVEGAF-----IVYEDFVM 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +  + +  +A +K++GWGEE+G PYW   +++   +G+ G  KILRG +  
Sbjct: 261 YKSG-VYQHTTGSALGGHA-IKVLGWGEEDGVPYWLCANSWNTDWGENGFFKILRGSDHC 318

Query: 251 IIESLVNGALPK 262
            IES +   +PK
Sbjct: 319 GIESEIVAGIPK 330


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 72  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 183

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 184 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 241

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 242 GIESEVVAGIPRTDQY 257


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 92/194 (47%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W++  K GLVTGG + ++ GC+P  F PCNH +  T  P C     P P C
Sbjct: 171 CQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVC 229

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
              C +  Y   + +DKY   GL  Y   +            GP   AF       Y   
Sbjct: 230 KKACQS-TYKIQYNKDKYY--GLKAYSLHNKASDLQKELMMNGPMEVAF-----EVYEDF 281

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G VY     + +  +A V+++GWGEENG PYW + +++  ++GDKG  KI RGRN
Sbjct: 282 LLYKTG-VYQHHTGSVLGGHA-VRLLGWGEENGVPYWLLANSWNTEWGDKGFFKIYRGRN 339

Query: 249 EAIIESLVNGALPK 262
           E  IES     L K
Sbjct: 340 ECGIESEAVAGLYK 353


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 12/188 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +    G+V+GG+++S+ GCQP +  PC H    T +P C    TP  +C
Sbjct: 162 CNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKP-CGEGDTP--RC 218

Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
             RC    +  YG  R F +  Y + G             PA   +  T Y   L    G
Sbjct: 219 VKRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPA--EAALTVYDDFLHYRTG 276

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
               VS  A  +    V+++GWG E+G PYW + +++   +GD G  +ILRG++E  IES
Sbjct: 277 VYQHVSGGA--LGGHAVRLLGWGVEDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIES 334

Query: 255 LVNGALPK 262
            +NG LPK
Sbjct: 335 DINGGLPK 342


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 93/197 (47%), Gaps = 27/197 (13%)

Query: 78  WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           + C  G     W++  + G+VTGG + S  GC P   PPC       SE +       QP
Sbjct: 157 FACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPC------FSEEDGNNTCRGQP 210

Query: 138 -KCHTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTK 184
            + H RCT   YG           F +D Y +    +  D   +GP   +   +  F   
Sbjct: 211 MEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYASIQKDVMTYGPIEASMEVYDDF--- 267

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
              P +++   VY  S +A  +    VK++GWGEE+G PYW +V+++ E +GDKG  KI 
Sbjct: 268 ---PSYKSG--VYEKSENATYLGGHAVKLIGWGEEDGVPYWLMVNSWSEMWGDKGLFKIR 322

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  +++ +   +P
Sbjct: 323 RGTNECSVDNSMTAGVP 339


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 71  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 128

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 129 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 182

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 183 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 240

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 241 GIESEVVAGIPRTD 254


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/189 (31%), Positives = 89/189 (47%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y + G+            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 97/199 (48%), Gaps = 20/199 (10%)

Query: 77  IWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           ++ C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     
Sbjct: 84  LFSCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDT 141

Query: 137 PKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
           PKC   C    Y   + QDK Y  N   +              GP   AF     + Y+ 
Sbjct: 142 PKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSD 195

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG+
Sbjct: 196 FLLYKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQ 253

Query: 248 NEAIIESLVNGALPK-DNY 265
           +   IES V   +P+ D Y
Sbjct: 254 DHCGIESEVVAGIPRTDQY 272


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 22/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G+ S  W +  ++G+VTGG + +  GCQP S     +       P    L +P P C
Sbjct: 152 CKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDL-SPMPPC 210

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
              C   +YG+ + +DK      Y ++G            GP    F  +  F + Y   
Sbjct: 211 KRECRK-SYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYS-YKSG 268

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q + RV   S +        ++I+GWG ENG PYW   +++ E +GDKG  KI RG N
Sbjct: 269 VYQAHSRVRCGSHA--------IRILGWGTENGVPYWLAANSWTEHWGDKGYFKIRRGNN 320

Query: 249 EAIIESLVNGALPKD 263
           E  IE  +N  +PK+
Sbjct: 321 ECGIEEDINAGIPKE 335


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 73  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 130

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 131 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 184

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 185 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 242

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 243 GIESEVVAGIPRTD 256


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 80/267 (29%), Positives = 112/267 (41%), Gaps = 41/267 (15%)

Query: 7   SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
           S+IRD S             SC    AV+ A  ++  +C +S      T     A     
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASNGK---TQISISADDINA 157

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
           C  +V        C+ G     W    K+G VTGG++   +GC+P  +PPC H    T  
Sbjct: 158 CCGMVCGNG----CNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHY 213

Query: 127 PECKTLATPQPKCHTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWP 175
             C +   P  KC   C        T D +   F Q  Y ++             GP   
Sbjct: 214 KPCPSNMYPTDKCEHSCQAGYPLTYTQDLH---FGQSAYAVSKKPAEIQKEIMTHGPVEV 270

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
           AF     T Y       +G VY  +A A +  +A VK++GWG +NG PYW   +++ E +
Sbjct: 271 AF-----TVY-EDFEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDW 323

Query: 236 GDKGTIKILRGRNEAIIESLVNGALPK 262
           G+ G  +I+RG NE  IES V G  PK
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGTPK 350


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 93/197 (47%), Gaps = 27/197 (13%)

Query: 78  WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           + C  G     W++  + G+VTGG + S  GC P   PPC       SE +       QP
Sbjct: 157 FACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPC------FSEEDGNNTCRGQP 210

Query: 138 -KCHTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTK 184
            + H RCT   YG           F +D Y +    +  D   +GP   +   +  F   
Sbjct: 211 MEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYASIQKDVMTYGPIEASMEVYDDF--- 267

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
              P +++   VY  S +A  +    VK++GWGEE+G PYW +V+++ E +GDKG  KI 
Sbjct: 268 ---PSYKSG--VYEKSENATYLGGHAVKLIGWGEEDGVPYWLMVNSWSEMWGDKGLFKIR 322

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  +++ +   +P
Sbjct: 323 RGTNECSVDNSMTAGVP 339


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 88/191 (46%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W + HK G+V+GG + S  GCQP S  PC H+ + +S P C  + T  PKC
Sbjct: 151 CLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSS-PACGGV-TDTPKC 208

Query: 140 HTRCTND---NYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
             +C       Y + F+  Q  Y I              GP   +F           LF 
Sbjct: 209 KKQCEKGYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFL------VYEDLFS 262

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY    + E +    +KI GWG ENG PYW + +++   +G+ G  KI RG++E  
Sbjct: 263 YKEGVYQ-HVAGEFLGGHVIKIFGWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECG 321

Query: 252 IESLVNGALPK 262
           IE  V+  LP+
Sbjct: 322 IEIDVSAGLPR 332


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 245

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 246 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 303

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 304 GIESEVVAGIPRTD 317


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +  ++GLVTGG ++S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 140 HTRC---------TNDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
            T+C          + +YG+   G   ++ QI        P  G F    +  F      
Sbjct: 207 VTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAF--IVYEDF------ 258

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY     + +  +A +K++GWGEENG PYW   +++   +GD G  KILRG 
Sbjct: 259 PSYKSG--VYQHVTGSALGGHA-IKMIGWGEENGVPYWLCANSWNTDWGDNGFFKILRGS 315

Query: 248 NEAIIESLVNGALPK 262
           N   IES V   +PK
Sbjct: 316 NHCGIESEVVAGIPK 330


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 22  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 79

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 80  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 133

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 134 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 191

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 192 GIESEVVAGIPRTD 205


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 20  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 77

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 78  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 131

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 132 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 189

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 190 GIESEVVAGIPRTD 203


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 97/194 (50%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 6   CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 63

Query: 140 HTRCTNDNYGRGFFQDK-YQINGL-------GLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N         G+  + +  GP   AF     + Y+  L 
Sbjct: 64  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAF-----SVYSDFLL 117

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 118 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 175

Query: 251 IIESLVNGALPKDN 264
            IES V   +P+ +
Sbjct: 176 GIESEVVAGIPRTD 189


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 7/187 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P P C
Sbjct: 155 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 214

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C    Y + + QDK+  N        H         ++   + T  +FQ  G VY  
Sbjct: 215 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 271

Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
                 + + +    V+++GWG ENG  YW + +++ E++G+ G  +++RGRNE  IES 
Sbjct: 272 GIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESE 331

Query: 256 VNGALPK 262
           V   +P+
Sbjct: 332 VVAGMPR 338


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 94/193 (48%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +   +G+VTGG ++S+ GCQP + P C+H    +  P   +L  P PKC
Sbjct: 148 CNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCNGSL--PTPKC 205

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTR-PL 189
              C    Y   +  DK Y +    +  D +         GP   AF     T +   P 
Sbjct: 206 EKVCEK-GYNITYKNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAF-----TVFADFPN 259

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY   +  E+  +A +KI+GWG EN  PYW + +++   +GD G  KILRG +E
Sbjct: 260 YKSG--VYQHVSGEELGGHA-IKILGWGVENNTPYWLVANSWNPSWGDNGFFKILRGSDE 316

Query: 250 AIIESLVNGALPK 262
             IE  V   LPK
Sbjct: 317 CGIEDEVVAGLPK 329


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 71/192 (36%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   +   +    GLVTG  + +N+ CQ  S  PC H   +   P C T   P P C
Sbjct: 161 CDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPPC-TGELPTPPC 219

Query: 140 HTRC-TNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
              C +N  Y   + +D       Y I+             GP   AF     T Y   L
Sbjct: 220 VKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAF-----TVYEDFL 274

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY     +E+  +A VK+VGWG ENG PYW IV+++ E +GDKGT KILRG+NE
Sbjct: 275 TYKSG-VYQHVTGSELGGHA-VKMVGWGVENGTPYWIIVNSWNESWGDKGTFKILRGQNE 332

Query: 250 AIIESLVNGALP 261
             IES    ALP
Sbjct: 333 CGIESECVTALP 344


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 18/183 (9%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
             W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C   
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223

Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            Y   + QDK      Y +  N   +  +   +GP   AF       Y   L   +G   
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
            V+ S  IV    ++I+GWG E G+PYW I +++ E +G+ G  +++RGR+E  IES V 
Sbjct: 279 HVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVV 336

Query: 258 GAL 260
             L
Sbjct: 337 AGL 339


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 93/196 (47%), Gaps = 26/196 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 152 CNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEGGDTPKC 210

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
           + +C    Y   +  DK+   G   Y  P             GP   AF  +  F  +Y 
Sbjct: 211 NKKC-EAGYSPDYKDDKHY--GTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADF-LQYK 266

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             ++Q          + +++    ++++GWG E+G PYW   +++   +GD G  KILRG
Sbjct: 267 SGVYQ--------HVTGDMLGGHAIRVLGWGVEDGVPYWLAANSWNTDWGDNGFFKILRG 318

Query: 247 RNEAIIESLVNGALPK 262
           ++   IES +   +P+
Sbjct: 319 KDHCGIESEMVAGIPR 334


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 7/187 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P P C
Sbjct: 63  CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 122

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C    Y + + QDK+  N        H         ++   + T  +FQ  G VY  
Sbjct: 123 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 179

Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
                 + + +    V+++GWG ENG  YW + +++ E++G+ G  +++RGRNE  IES 
Sbjct: 180 GIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESE 239

Query: 256 VNGALPK 262
           V   +P+
Sbjct: 240 VVAGMPR 246


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 18/183 (9%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
             W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C   
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223

Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            Y   + QDK      Y +  N   +  +   +GP   AF       Y   L   +G   
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
            V+ S  IV    ++I+GWG E G+PYW I +++ E +G+ G  +++RGR+E  IES V 
Sbjct: 279 HVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVV 336

Query: 258 GAL 260
             L
Sbjct: 337 AGL 339


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 88/197 (44%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K GLVTGG++ S  GC+P S  PC       + P+C       PKC
Sbjct: 153 CEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKC 212

Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
              CT N +Y   + +DK+             QI    L      GP    F     T Y
Sbjct: 213 VDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEIL----KNGPVEVGF-----TVY 263

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
               +Q    VY   A  E+  +A VK++GWG +NG PYW   +++   +G+ G  +ILR
Sbjct: 264 AD-FYQYKSGVYVHVAGPELGGHA-VKLLGWGVDNGTPYWLAANSWNTNWGENGYFRILR 321

Query: 246 GRNEAIIESLVNGALPK 262
           G NE  IES V   +P 
Sbjct: 322 GVNECGIESQVVAGMPD 338


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 63/205 (30%), Positives = 88/205 (42%), Gaps = 38/205 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G   S W+WVH  G+ TGG + +        GC P  FPPC H    T  P+C   +
Sbjct: 128 CDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGS 187

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              P C  +C N  Y      D++ +    L   P+           +     +   +T+
Sbjct: 188 YETPNCVEQCHNPKYSTSLKNDRHYM----LESSPY----------QYSVNNAKNAIRTD 233

Query: 194 GRVYAVSASAE-IVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
           G V A     E  +AY +               VKI+GWGEENG  YW +V+++ E +GD
Sbjct: 234 GPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 293

Query: 238 KGTIKILRGRNEAIIESLVNGALPK 262
            G  KI  G N  I + L+ G  PK
Sbjct: 294 HGLFKIALG-NCQIDDDLL-GGTPK 316


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 59/189 (31%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              C    Y   + QDK      Y + G+            P    ++   Y   L   +
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 64/197 (32%), Positives = 93/197 (47%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W   + RGLVTGG + S  GC+P   PPC +     +E        P+ K 
Sbjct: 157 CNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPREKN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           F +D Y +    +  D   +GP   +F  +  F      
Sbjct: 213 H-RCTRTCYGNQDLDYNDDHRFTRDSYYLTYSSIQKDVMRYGPIEASFDMYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  S +A  +    VK++GWGEE+G  YW +V+++ E +GD G  KI RG 
Sbjct: 266 PSYKSG--VYVRSENASYLGGHAVKLIGWGEEHGVLYWLMVNSWNEGWGDNGLFKIRRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           NE  I++   G +P  N
Sbjct: 324 NECGIDNSTTGGVPVAN 340


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 20/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H +     P C       PKC
Sbjct: 159 CQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCEH-HTKGKYPACGEKIYKTPKC 217

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C    Y   + +DKY          + + +      H GP   AF     T Y+  L
Sbjct: 218 QQKCQK-GYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMH-GPVEAAF-----TVYSDFL 270

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G +Y       I  +A V+I+GWG E   PYW I +++ E +G+KG  +ILRG++ 
Sbjct: 271 NYKSG-IYKHMKGTVIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDV 328

Query: 250 AIIESLVNGALPKD 263
             IES V   LP +
Sbjct: 329 CGIESAVTAGLPHN 342


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 91/191 (47%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G  S+ W+W    G+VTGG ++S+ GCQP S P C+H + +   P C     P P C
Sbjct: 159 CSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDH-HVSGQYPACSGEG-PTPAC 216

Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
              C    N+ Y   + F    Y + G            GP   AF     T Y   L  
Sbjct: 217 KKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAF-----TVYEDLLTY 271

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G VY    + +++    +KI+GWG E+G  YW + +++   +GD G  KI +G +E  
Sbjct: 272 KSG-VYQ-HTTGQVLGGHAIKIIGWGVESGVDYWWVANSWNNDWGDNGFFKIKKGVDECG 329

Query: 252 IESLVNGALPK 262
           IES +   +PK
Sbjct: 330 IESQIVAGMPK 340


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 266 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 323

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 324 NECGIDNSTTGGVP 337


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 220 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 273

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    ++   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 274 YKSGVYQHITG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 331

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 332 GIESEVVAGIPRTDQY 347


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 17/190 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    K GLVTGG + S  GCQP   PPC    Y  +    K +     +C
Sbjct: 153 CNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGKPMEKNH-RC 211

Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTRPLFQ 191
              C  D     N    + +D Y +    +  D   +GP   +F  +  F      P ++
Sbjct: 212 TRMCYGDQDLDFNNDHHYTRDAYYLTYGTIQNDVLTYGPIEASFEVYDDF------PSYK 265

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           +   VY  + +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG NE  
Sbjct: 266 SG--VYVKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECG 323

Query: 252 IESLVNGALP 261
           I++   G +P
Sbjct: 324 IDNSTTGGVP 333


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 92/194 (47%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W +  + GLVTGG + SN GC+P S  PC H +   + P C T     PKC
Sbjct: 148 CMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEH-HVNGTRPPC-TGEGDTPKC 205

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
            + C N  Y   + +DK    G   Y  P             GP   AF     + Y   
Sbjct: 206 VSEC-NAGYTPSYKKDKR--FGKQTYSVPPKEQQIMTELYKNGPVEAAF-----SVYEDF 257

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G    V+   +++    +KI+GWG+EN  PYW + +++   +GD G  KILRG++
Sbjct: 258 LLYKTGVYQHVTG--QMLGGHAIKILGWGKENNTPYWLVANSWNTDWGDNGFFKILRGKD 315

Query: 249 EAIIESLVNGALPK 262
           E  IES +   +P+
Sbjct: 316 ECGIESEIVAGIPR 329


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 266 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 323

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 324 NECGIDNSTTGGVP 337


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 90/191 (47%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    GLV+GG+++S  GC+P   PPC H       P C    T  PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLP-CSG-DTKTPKC 208

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C  DNY   + QDK      Y + G   +        GP   AF     T Y   L 
Sbjct: 209 IKKC-EDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAF-----TVYADLLS 262

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A   +  +A +KI+GWG ENG  YW I +++   +GD G  KILRG +  
Sbjct: 263 YKSG-VYKHVAGDALGGHA-IKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320

Query: 251 IIESLVNGALP 261
            IES +    P
Sbjct: 321 GIESSIVAGEP 331


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/189 (35%), Positives = 90/189 (47%), Gaps = 16/189 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +S  ++  K GLVTG  +++   CQ  SF PC H   T   P C T   P PKC
Sbjct: 160 CNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPAC-TGELPTPKC 218

Query: 140 HTRCTNDNYGRGFFQDK----YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQT 192
              C +   G+ +   K    Y +              GP   AF     T Y   L   
Sbjct: 219 AKTC-DSGSGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAF-----TVYEDFLNYK 272

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
           +G    V+  A  +    +KIVGWG EN  PYW +V+++ + +GD GT KILRG+NE  I
Sbjct: 273 SGVYKHVTGKA--LGGHAIKIVGWGVENNTPYWIVVNSWNQTWGDNGTFKILRGKNECGI 330

Query: 253 ESLVNGALP 261
           E+ V  ALP
Sbjct: 331 EAQVVTALP 339


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 91/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGK----PTEKN 209

Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 321 NECGIDNSTTGGVP 334


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y ++             GP   AF     T Y+  L 
Sbjct: 208 SKIC-EPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAF-----TVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY    + E++    V+I+GWG E+G PYW + +++   +GD G  KILRGR+  
Sbjct: 262 YKSG-VYQ-HVTGEMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILRGRDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGIP 330


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 94/194 (48%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +  K GLVTGG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             +C          + +YG+  +  +   N +      + GP   AF  +  F      P
Sbjct: 207 INQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKN-GPVEGAFMVYEDF------P 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++++   VY    S  ++    +KI+GWG E+G PYW   +++   +GD G  KILRG +
Sbjct: 260 MYKSG--VYQ-HVSGSLIGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGYFKILRGSD 316

Query: 249 EAIIESLVNGALPK 262
              IES V   +PK
Sbjct: 317 HCGIESEVVAGIPK 330


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 91/193 (47%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGDTPKC 208

Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+           +   +   F    GP   AF     T Y+  
Sbjct: 209 SKIC-EPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKN--GPVEAAF-----TVYSD- 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             Q    VY   A   +  +A V+I+GWG ENG PYW + +++   +GD G  KILRG++
Sbjct: 260 FLQYKSGVYQHVAGDMMGGHA-VRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQD 318

Query: 249 EAIIESLVNGALP 261
              IES +   +P
Sbjct: 319 HCGIESEIVAGIP 331


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 88/194 (45%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
              C    Y   + QDK+   G   Y  P             GP   AF     T Y   
Sbjct: 207 DMSC-EPGYSPSYKQDKHF--GKTSYSVPSNQKDIMKELYKNGPVEGAF-----TVYEDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    VS  A  +    +KI+GWGEENG PYW   +++   +GD G  KILRG +
Sbjct: 259 LSYKSGVYQHVSGPA--LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 249 EAIIESLVNGALPK 262
              IES +   +P+
Sbjct: 317 HCGIESEIVAGIPQ 330


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 89/200 (44%), Gaps = 31/200 (15%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C  G     W  WVH  GLVTGG++ S  GC+P S  PC       + P+C       P+
Sbjct: 147 CEGGYPIQAWRYWVHN-GLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPE 205

Query: 139 CHTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL-------- 189
           C  +CT+  +Y   + QDK            H+G    A  ++     T  +        
Sbjct: 206 CVKQCTSKSDYAVPYDQDK------------HYGSSAYAIRQNVAQIQTEIMRNGPVEVG 253

Query: 190 -------FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
                  +Q    +Y   A  E+  +A VKI+GWG ENG PYW   +++   +G+KG  +
Sbjct: 254 FLVYSDFYQYKSGIYKHVAGRELGGHA-VKILGWGVENGTPYWLAANSWNVNWGEKGYFR 312

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I RG NE  IES V   +P 
Sbjct: 313 IRRGTNECGIESSVVAGIPD 332


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/241 (27%), Positives = 107/241 (44%), Gaps = 20/241 (8%)

Query: 28  CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           C  + AV+    ++  +C  S  K  VE ++   I+  K   +           C  G++
Sbjct: 82  CASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGS----------GCDGGVT 131

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
             +W +    G+VTGG+  ++TGC+P  FP C+H         C       P+C   C  
Sbjct: 132 GYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 190

Query: 146 DNYGRGFFQDK----YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
             Y   + QDK    +  N L +               ++   Y   L   +G +Y  + 
Sbjct: 191 -GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSG-IYRYTT 248

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
              I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IES +   L 
Sbjct: 249 GQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLI 307

Query: 262 K 262
           K
Sbjct: 308 K 308


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 220 SKSC-EPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF-----SVYSDFLL 273

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 274 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 331

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 332 GIESEVVAGIPRTDQY 347


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 88/192 (45%), Gaps = 20/192 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +    G+VTGGA++S+ GC+  S  PC H     S P+C +L    P+C
Sbjct: 156 CDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPEC 215

Query: 140 HTRC--TNDNYGRGF--------FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
              C  ++ +Y            F ++ Q+    L      GP   AF     T Y   L
Sbjct: 216 VRSCYESSLDYTESLTFGQQVSTFTNEKQMQLEIL----KNGPIEAAF-----TVYNDFL 266

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY  +A  E V    +K++GWG E G  YW I +++   +GD G  K LRG + 
Sbjct: 267 SYKSG-VYQATAQDESVGGHAIKVLGWGVEEGTKYWLIANSWNTDWGDNGYFKFLRGVDH 325

Query: 250 AIIESLVNGALP 261
             IES    +LP
Sbjct: 326 CGIESETAASLP 337


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/197 (32%), Positives = 93/197 (47%), Gaps = 31/197 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC---NHANYTTSEPECKTLATPQ 136
           CS G     W    K GLVTGG + S  GC+P   PPC   +  N T S         P 
Sbjct: 94  CSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCS-------GQPM 146

Query: 137 PKCHTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTK 184
            K H RCT   YG           + +D Y +   G+  D  ++GP   +F  +  F   
Sbjct: 147 EKNH-RCTRMCYGDQDLDFDEDHRYTRDHYYLTYRGIQKDVINYGPIEASFDVYDDF--- 202

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
              P +++   +Y  S +A  +   +VK++GWGEE G  YW +V+++   +GDKG  KI 
Sbjct: 203 ---PSYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKGLFKIR 257

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  +++   G +P
Sbjct: 258 RGTNECGVDNSTTGGVP 274


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 21/190 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W++    G+ TGG + S  GCQP S  PC H +   ++ +C TL    P C
Sbjct: 149 CEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEH-HTEGNKVQCSTLDYDTPSC 207

Query: 140 HTRCTND--------NYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
             +C +          +G G  ++ Y +  +      + GP   AF  +  F   Y   +
Sbjct: 208 KHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTN-GPVEAAFDVYSDF-VNYKSGV 265

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q          + E +    V+I+GWGEE+G PYW + +++ E +GDKG  KI RG NE
Sbjct: 266 YQ--------HVAGEYLGGHAVRILGWGEESGVPYWLVANSWNEDWGDKGLFKIRRGNNE 317

Query: 250 AIIESLVNGA 259
           +  E  +  A
Sbjct: 318 SGFEDSIVAA 327


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 93/195 (47%), Gaps = 27/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
           C+ G     W      GLVTGG + S  GC+P   PPC +      + + K   + QP  
Sbjct: 155 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY------DKDGKNTCSGQPME 208

Query: 138 ---KCHTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYT 186
              KC  +C  D     N    + +D Y +   G+  D  ++GP   +F  +  F     
Sbjct: 209 SNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTYRGIQKDVINYGPIETSFDVYDDF----- 263

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
            P +++   +Y  S +A  +   +VK++GWGEE G  YW +V+++   +GDKG  KI RG
Sbjct: 264 -PNYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKGLFKIRRG 320

Query: 247 RNEAIIESLVNGALP 261
            NE  +++   G +P
Sbjct: 321 TNECRVDNSTTGGVP 335


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   +  DK      Y ++             GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    VS   E++    ++I+GWG EN  PYW + +++   +GDKG  KILRG++  
Sbjct: 262 YKSGVYQHVSG--EMMGGHAIRILGWGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGMP 330


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/189 (31%), Positives = 89/189 (47%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            PA   ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPA--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   +  DK      Y ++             GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    VS   E++    ++I+GWG EN  PYW + +++   +GDKG  KILRG++  
Sbjct: 262 YKSGVYQHVSG--EMMGGHAIRILGWGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGMP 330


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 87/194 (44%), Gaps = 18/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +   +G+V+GG+  SN GC+P    PC H +   + P C       P C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEH-HVNGTRPPCTGDDNKTPSC 207

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C          + N+G+  +    ++  +      + GP   AF      +    L 
Sbjct: 208 KQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTN-GPVEGAF------EVYEDLL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                VY      E +    ++I+GWG E G PYW I +++   +GD GT KILRG +  
Sbjct: 261 SYKKGVYQ-HVKGEALGGHAIRILGWGTEKGTPYWLIANSWNSDWGDNGTFKILRGEDHC 319

Query: 251 IIESLVNGALPKDN 264
            IES +   +PKD+
Sbjct: 320 GIESSIVAGIPKDS 333


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 89/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++GLV+GG   S+ GCQP +  PC H +   S P C+      PKC
Sbjct: 157 CNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEH-HVNGSRPSCEGEGGKTPKC 215

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C   +Y   + +DK      Y I              GP   AF     T Y   L 
Sbjct: 216 VKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAF-----TVYEDLLN 269

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
              G  + V    +++    ++I+GWG E+G  YW I +++   +GD G  KILRG +  
Sbjct: 270 YKEGVYHHVHG--KMLGGHAIRILGWGVEDGTKYWLIANSWNSDWGDNGFFKILRGEDHL 327

Query: 251 IIESLVNGALPK 262
            IES +   LPK
Sbjct: 328 GIESSIAAGLPK 339


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 95/194 (48%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  + GLVTGG + S+ GCQP    PC H +   S P C  L  P P+C
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEH-HINGSRPACGKL-EPTPRC 206

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR-P 188
              C +  Y   F +DK+          ++  + +    + GP   AF     T Y   P
Sbjct: 207 KKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTN-GPVEAAF-----TVYADFP 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            +++   VY   + AE+  +A VK++GWG E   PYW I +++   +G+ G  KILRG++
Sbjct: 260 HYKSG--VYQHESGAELGGHA-VKMIGWGTEGSTPYWLIANSWNTDWGNMGFFKILRGQD 316

Query: 249 EAIIESLVNGALPK 262
           E  IE  +    PK
Sbjct: 317 ECGIERDIVAGEPK 330


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 97/199 (48%), Gaps = 26/199 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
              C    Y   + +DK+            +     +Y +   GP   AF     T Y+ 
Sbjct: 208 SKFC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKN---GPVEAAF-----TVYSD 258

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   +G VY    + E++    V+I+GWG ENG PYW + +++   +GD G  KILRGR
Sbjct: 259 FLLYKSG-VYQ-HVTGEMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGR 316

Query: 248 NEAIIESLVNGALP-KDNY 265
           +   IES +   +P  D Y
Sbjct: 317 DHCGIESEIVAGIPCTDQY 335


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC  A+   + P C T     PKC
Sbjct: 77  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC-EAHVNGARPPC-TGEGDTPKC 134

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 135 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 188

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 189 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 246

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 247 GIESEVVAGIPRTDQY 262


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 88/195 (45%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +  K GLV+GG + S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEH-HVNGSRPSCTGEGGDTPQC 206

Query: 140 HTRCT---------NDNYGRG---FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
            T+C          + ++G+       D+ QI        P  G F           Y  
Sbjct: 207 ITKCEAGYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAF---------IVYED 257

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            +   +G    VS SA  V    +KI+GWG E+G PYW   +++   +GD G  K LRG 
Sbjct: 258 FVLYKSGVYQHVSGSA--VGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGFFKFLRGS 315

Query: 248 NEAIIESLVNGALPK 262
           +   IES V   +PK
Sbjct: 316 DHCGIESEVVAGIPK 330


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 141 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 198

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  +   +  +           GP   AF     + Y   L 
Sbjct: 199 SKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAF-----SVYADFLL 252

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+   E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 253 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 310

Query: 251 IIESLVNGALPK-DNY 265
            IES V   +P+ D Y
Sbjct: 311 GIESEVVAGIPRTDQY 326


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 88/199 (44%), Gaps = 32/199 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 33  CQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------------GPFWPAFWRSFCT 183
              C    Y   + QDK+       Y D  +                GP   AF      
Sbjct: 92  KQTCQK-GYKTPYEQDKH-------YGDESYNVISNEKAIQKEIMMNGPVEAAF-----D 138

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G    V+ S  IV    ++I+GWG E   PYW I +++ E +G+KG  +I
Sbjct: 139 VYEDFLNYKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRI 196

Query: 244 LRGRNEAIIESLVNGALPK 262
           +RGR+E  IES V   L K
Sbjct: 197 VRGRDECSIESHVVAGLIK 215


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 21/181 (11%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPP----CNHANYTTSEPECKTLATPQPKCHTR 142
           S W ++   G+V+GG ++SN GCQP  FPP      H  +T  +      +      H R
Sbjct: 147 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPIANILTHLQHTCDDHCYGNTSINYNHDHVR 206

Query: 143 CTNDNYGR-GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
             N    R G+ Q + Q           +GP    F    C  +   L   +G VY  S 
Sbjct: 207 VRNYYTIRTGYIQKEVQT----------YGPVAVQF--KVCDDF---LLYKSG-VYVKSD 250

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +A+++     K++GWG ENG  YW +++++G ++G KG  KI RG N+  +ES+V   +P
Sbjct: 251 NAKVIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVP 310

Query: 262 K 262
           +
Sbjct: 311 E 311


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  + GLVTGG ++S+ GCQP +   C+H      +P C       P C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQP-CSKKEEHTPVC 241

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTR-PLF 190
              C +  Y   + +DK      Y + G+          GP   AF     T Y   P +
Sbjct: 242 KHECES-GYNVSYTKDKHYGATAYSVRGVQQIMTEIMTNGPVEGAF-----TVYADFPQY 295

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           ++   VY  +  + +  +A +KI+GWG E G  YW + +++   +G++GT KILRGR+E 
Sbjct: 296 KSG--VYKHTTGSPLGGHA-IKIMGWGTEGGDDYWLVANSWNPDWGNQGTFKILRGRDEC 352

Query: 251 IIESLVNGALPK 262
            IES +    PK
Sbjct: 353 GIESQIAAGEPK 364


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 91/193 (47%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+   +  K GLV+GG + S+ GC+P S PPC H +   + P CK      P+C
Sbjct: 148 CNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEH-HVNGTRPPCKGEEGDTPQC 206

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
             +C    Y  G+ QDK+   G   Y  P             GP   AF     T Y   
Sbjct: 207 TNQC-EPGYTPGYKQDKHF--GKRSYSVPSDEKEIMKELYKNGPVEGAF-----TVYEDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    VS SA  V    +K++GWGEE G PYW   +++   +G+ G  KI+RG +
Sbjct: 259 LLYKSGVYRHVSGSA--VGGHAIKVLGWGEEGGIPYWLAANSWNTDWGENGFFKIVRGED 316

Query: 249 EAIIESLVNGALP 261
              IES +   +P
Sbjct: 317 HCGIESEMVAGIP 329


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 90/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE   ++   G +P
Sbjct: 321 NECGTDNSTTGGVP 334


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/193 (32%), Positives = 93/193 (48%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+VTGG + ++ GC P   P C+H    T  P  +    P PKC
Sbjct: 157 CNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 214

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
             R     Y   F  DK      Y ++             GP   AF     T Y   PL
Sbjct: 215 -VRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAF-----TVYADFPL 268

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY  S S + +    ++I+GWG ENG P+W + +++  ++GDKG  KILRG NE
Sbjct: 269 YKSG--VYK-SHSTDALGGHAIRILGWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNE 325

Query: 250 AIIESLVNGALPK 262
             IE  +   +PK
Sbjct: 326 CGIEEDIVAGIPK 338


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/195 (32%), Positives = 93/195 (47%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W++   +G+VTGG + ++ GC P   P C+H    T  P  +    P PKC
Sbjct: 158 CNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 215

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR- 187
             R     Y   F  DK+   G   Y  P             GP   AF     T Y   
Sbjct: 216 -VRLCRKGYNVDFKDDKHY--GKSSYSVPSNETQIQMEIMKNGPVEGAF-----TVYADF 267

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           PL+++   VY  S S + +    ++I+GWG EN  PYW + +++  ++GDKG  KILRG 
Sbjct: 268 PLYKSG--VYK-SHSTDALGGHAIRILGWGVENDVPYWLVANSWNTEWGDKGYFKILRGS 324

Query: 248 NEAIIESLVNGALPK 262
           NE  IE  +   +PK
Sbjct: 325 NECGIEEDIVAGIPK 339


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 22/192 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W W    G+V+GG + +N GC P S P C+H  +TT + +      P PKC
Sbjct: 154 CNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH--HTTGKYQPCPAVVPTPKC 211

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAF--WRSFCTKYTRPL 189
             +C    Y + +  DK      Y + G+          GP   AF  +  F +  T   
Sbjct: 212 EKKCLT-GYPKSYSNDKTRGKKSYGVRGVQSIMQELVDNGPVTAAFDVYSDFLSYKTGVY 270

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
             T G      A         VKI+G+G E+G+ YW + +++ E +GDKG  KI +G++E
Sbjct: 271 RHTTGSYEGGHA---------VKIIGYGTESGQDYWLVANSWNEDWGDKGFFKIAKGKDE 321

Query: 250 AIIESLVNGALP 261
             IES +    P
Sbjct: 322 CGIESSIVAGDP 333


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 82  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 139

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 140 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 193

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 194 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 251

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 252 GIESEIVAGIPR 263


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 87/183 (47%), Gaps = 18/183 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +   +GLVTG     N+ C+P +FPPC+H         C   + P P C
Sbjct: 143 CNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCGD-SQPTPAC 201

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              CT  + GR +  DK + I+   +             FGP   +F     T Y   L 
Sbjct: 202 VKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASF-----TVYEDFLT 255

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A A +  +A VKI+GWG E   PYW +V+++ E +G+ G  KILRG N  
Sbjct: 256 YKSG-VYQNVAGANLGGHA-VKIIGWGVEKNVPYWLVVNSWNEGWGENGLFKILRGSNHV 313

Query: 251 IIE 253
            IE
Sbjct: 314 GIE 316


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYIEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 91/197 (46%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W      GLVTGG + S  GC+P   PPC H     +E        P  K 
Sbjct: 157 CNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHH----AEGNNSCSDKPMEKN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D  ++GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGDQDLDFDDDHRYTRDSYYLTYGSIQKDVMNYGPIEASFDVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  S +A  +    VK++GWGEE+G PYW +V+++   +GDKG  KI RG 
Sbjct: 266 PSYKSG--VYIRSDNASYLGGHAVKLIGWGEESGVPYWLMVNSWNTDWGDKGLFKIQRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           NE  +++     +P  N
Sbjct: 324 NECGVDNSTTAGVPVTN 340


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 94/192 (48%), Gaps = 24/192 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  +RG+ TGG + SN GC P   PPC        + + + L   +P  
Sbjct: 155 CQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207

Query: 140 HT-RCTNDNYGRGFFQDKYQINGLGLYFDPH---------FGPFWPAFWRSFCTKYTRPL 189
           H  +C    YG    +++Y++  + +  D           +GP   +F       Y   +
Sbjct: 208 HNHKCPRACYGNSTVENRYKVESIYV-LDSFKTIEQDIRTYGPVEASF-----DVYDDFI 261

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G +Y  + +A  V   +VK++GWGEE+G PYW +V+++ + +G++GT +I++GRNE
Sbjct: 262 TYKSG-IYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNE 320

Query: 250 AIIESLVNGALP 261
             IE      +P
Sbjct: 321 CGIERSATAGIP 332


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 88/181 (48%), Gaps = 18/181 (9%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           ++ KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C  +C    Y  
Sbjct: 9   YLVKRGIVTGGSKENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQKCQK-GYKT 66

Query: 151 GFFQDK------YQI--NGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
            + QDK      Y +  N   +  +    GP   AF       Y   L   +G    V+ 
Sbjct: 67  PYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAF-----DVYEDFLNYKSGIYRHVTG 121

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           S  IV    ++I+GWG E   PYW I +++ E +G+KG  +I+RGR+E  IES V   L 
Sbjct: 122 S--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLI 179

Query: 262 K 262
           K
Sbjct: 180 K 180


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 24/192 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   +   +   RGLVTGG + +   CQP +   C H +     P C T     PKC
Sbjct: 163 CNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEH-HVPGDRPPC-TEGGGTPKC 220

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR- 187
             +C  D   + +  DK  ++G   Y  P           H+GP   AF     T Y+  
Sbjct: 221 SHQCIPDYTTKAYKDDK--VHGHKAYSVPNDVGKIQQEIMHYGPVEAAF-----TVYSDF 273

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  ++ +E+  +A +KI+GWG E G  YW I +++   +GDKGT KILRG 
Sbjct: 274 PSYKSG--VYRHTSGSELGGHA-IKIIGWGTEGGDDYWLINNSWNSDWGDKGTFKILRGS 330

Query: 248 NEAIIESLVNGA 259
           NE  IE  V  A
Sbjct: 331 NECGIEGEVVAA 342


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 89/185 (48%), Gaps = 19/185 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 77  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133

Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK      Y +              GP   AF     + Y+  L   +G  
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 188

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 189 QHV--SGEIMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEI 246

Query: 257 NGALP 261
              +P
Sbjct: 247 VAGMP 251


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 91/193 (47%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +   +GLVTGG ++S+ GCQP +   C H       P    + TPQ  C
Sbjct: 152 CNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQ--C 209

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
              C    Y   +  DKY          Q + +      + GP   AF     T Y   +
Sbjct: 210 VHMCEK-GYNVSYRADKYFGKKSYSIDEQEDQIKTEISTN-GPVEAAF-----TVYADFV 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY      E+  +A V+I+GWG E+G PYW + +++   +GDKG  KILRG +E
Sbjct: 263 TYKSG-VYRHVTGEEMGGHA-VRILGWGTESGTPYWLVANSWNTDWGDKGYFKILRGSDE 320

Query: 250 AIIESLVNGALPK 262
             IES +   LPK
Sbjct: 321 CGIESSIVAGLPK 333


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 94/192 (48%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + S+ GC+P    PC H +   + P C+      P+C
Sbjct: 157 CNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C         T+ ++G   +     ++ +      H GP   AF     T Y   + 
Sbjct: 216 QHKCQASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTH-GPVEGAF-----TVYEDLIL 269

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY      E+  +A ++I+GWG E   PYW + +++   +G+ G  KILRG++  
Sbjct: 270 YKDG-VYEHVHGKELGGHA-IRIIGWGVEKDIPYWLVANSWNTDWGNNGFFKILRGKDHC 327

Query: 251 IIESLVNGALPK 262
            IES ++  LPK
Sbjct: 328 GIESSISAGLPK 339


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 91/195 (46%), Gaps = 28/195 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +  + G+VTGG + S  GCQP S  PC     T  E +  T     P C
Sbjct: 160 CDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDT-----PDC 214

Query: 140 HTR-CTNDNYGRGFFQDKYQINGL------------GLYFDPHFGPFWPAFWRSFCTKYT 186
             + CTN NY + +  D + ++ +             LY +   GP   AF+      YT
Sbjct: 215 SIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKN---GPVQAAFY-----VYT 266

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             ++  +G VY+ +   +I     +KI+GWG ++G  YW   +++   +G+ G  +ILRG
Sbjct: 267 DFMYYKSG-VYSYT-RGQIEGGHAIKILGWGVDDGTKYWLCANSWSRSWGENGLFRILRG 324

Query: 247 RNEAIIESLVNGALP 261
            NE  IE  V   +P
Sbjct: 325 NNECHIEDRVIAGMP 339


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 320 GIESEIVAGIPR 331


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 320 GIESEIVAGIPR 331


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
           +IES +   L K
Sbjct: 330 LIESEIAAGLIK 341


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 60/193 (31%), Positives = 88/193 (45%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+   G+  Y  P             GP   AF       Y   
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    V+   E V    ++++GWG +NG PYW   +++   +GD G  KILRG +
Sbjct: 261 LMYKSGVYQHVTG--EQVGGHAIRLLGWGVDNGTPYWLAANSWNTDWGDNGFFKILRGED 318

Query: 249 EAIIESLVNGALP 261
              IES +   +P
Sbjct: 319 HCGIESEIVAGIP 331


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/193 (32%), Positives = 95/193 (49%), Gaps = 16/193 (8%)

Query: 67  CAWLVSRWMTIWVC--SSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
           CA+     M+   C  S G  +   + V   G+VTG +  +NTGC+P  FP C H  +T 
Sbjct: 118 CAFGAVEAMSERSCIQSGGKQNVELSAVDLEGIVTGSSKENNTGCEPYPFPKCEH--FTK 175

Query: 125 SE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
            + P C +     P+C T C    Y   + QDK++     +     +GP   +F     T
Sbjct: 176 GQYPPCGSKIYKTPRCKTTCQK-RYKTSYAQDKHRAIQKEIM---KYGPVEASF-----T 226

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G +Y    + E +    ++I+GWG EN  PYW I +++ E +G+ G  +I
Sbjct: 227 VYEDFLNYKSG-IYK-HITGETLGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRI 284

Query: 244 LRGRNEAIIESLV 256
           +RGR+E  IES V
Sbjct: 285 VRGRDECSIESEV 297


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 86/185 (46%), Gaps = 24/185 (12%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCH-TRCTND 146
           W ++ K GL TGG + SN GCQP S  PC  +AN  + E E        P+C+  +CTN+
Sbjct: 165 WKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENE------DTPQCYKDQCTNN 218

Query: 147 NYGRGFFQDKYQINGL-GLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVY 197
           NY      D Y    +  +   P          GP   A       K         G +Y
Sbjct: 219 NYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAM------KVYDDFLCYKGGIY 272

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
             +       +A VKI+GWGE++G  YW   +T+G  +G  G  KI RGRNE  IE+ + 
Sbjct: 273 QYTTGGLKGDHA-VKIMGWGEDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRIT 331

Query: 258 GALPK 262
           G LPK
Sbjct: 332 GGLPK 336


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 114/264 (43%), Gaps = 37/264 (14%)

Query: 7   SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVK 64
           S+IRD S              C  + AV+    ++  +C  S  K  VE ++   I+   
Sbjct: 107 SQIRDQS-------------QCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISC-- 151

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C +  S       C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H     
Sbjct: 152 --CKYCGSG------CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKG 202

Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFW 178
               C       P+C+  C    Y   + QDK      Y +  +            P   
Sbjct: 203 KYRACGDKLYKTPQCNQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-- 259

Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
            ++   Y   L   +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+K
Sbjct: 260 EAYLEIYEDFLNYKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEK 317

Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
           G  +I+RGRNE +IES +   L K
Sbjct: 318 GYFRIVRGRNECLIESEIAAGLIK 341


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 77  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 134

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 135 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 188

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 189 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 246

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 247 GIESEIVAGIPR 258


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 89/199 (44%), Gaps = 32/199 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 33  CQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------------GPFWPAFWRSFCT 183
             +C    Y   + QDK+       Y D  +                GP   AF      
Sbjct: 92  KQKCQK-GYKTPYKQDKH-------YGDESYNVISNEKAIQKEIMMNGPVEAAF-----D 138

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G    V+ S  IV    ++I+GWG +   PYW I +++ E +G+KG  +I
Sbjct: 139 VYEDFLNYKSGIYRHVTGS--IVGGHAIRIIGWGVKKRTPYWLIANSWNEDWGEKGLFRI 196

Query: 244 LRGRNEAIIESLVNGALPK 262
           +RGR+E  IES V   L K
Sbjct: 197 VRGRDECSIESNVVAGLIK 215


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 71  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 128

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 129 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 182

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 183 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 240

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 241 GIESEIVAGIPR 252


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
           +IES +   L K
Sbjct: 330 LIESEIAAGLIK 341


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 210

Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
           PKCH  C + +Y   + +DK      Y ++    +        GP   AF     T Y+ 
Sbjct: 211 PKCHKTCES-SYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAF-----TVYSD 264

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   NG VY  +    +  +A +KI+GWG ENG  YW I +++   +GD G  KILRG 
Sbjct: 265 LLNYKNG-VYKHTVGNALGGHA-IKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 322

Query: 248 NEAIIESLVNGALP 261
           +   IES +    P
Sbjct: 323 DHCGIESSIVAGEP 336


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 91/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W    K GLVTGG + S  GCQP    PC    Y  +   C+    P  K 
Sbjct: 152 CHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           F +D Y +    +  D   +GP   ++  +  F      
Sbjct: 208 H-RCTRMCYGNQDLDFKKDHHFTRDAYYLTFGIIQRDVMAYGPIEASYDVYDDF------ 260

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWGEE G PYW +V+++ +Q+GDKG  KI RG 
Sbjct: 261 PSYKSG--VYVRTENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGT 318

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 319 NECGIDNSTTGGVP 332


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 114/264 (43%), Gaps = 37/264 (14%)

Query: 7   SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVK 64
           S+IRD S              C  + AV+    ++  +C  S  K  VE ++   I+   
Sbjct: 74  SQIRDQS-------------QCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISC-- 118

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C +  S       C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H     
Sbjct: 119 --CKYCGSG------CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKG 169

Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFW 178
               C       P+C+  C    Y   + QDK      Y +  +            P   
Sbjct: 170 KYRACGDKLYKTPQCNQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-- 226

Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
            ++   Y   L   +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+K
Sbjct: 227 EAYLEIYEDFLNYKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEK 284

Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
           G  +I+RGRNE +IES +   L K
Sbjct: 285 GYFRIVRGRNECLIESEIAAGLIK 308


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 95/203 (46%), Gaps = 34/203 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W++  K+GLV+GG ++S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
           +  C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ + Y +               ++I+GWG ENG PYW   +++   +GD G  KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
           LRG N   IES +   +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 92/194 (47%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W    K GLVTGG + S+ GCQP    PC    Y  +   C+    P  K 
Sbjct: 152 CHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           F +D Y +    +  D   +GP   ++  +  F      
Sbjct: 208 H-RCTRMCYGDQDRDFKEDHRFTRDAYYLTYGTIQKDVMTYGPIEASYEVYDDF------ 260

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 261 PSYKSG--VYVRTENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNDQWGDRGLFKIRRGT 318

Query: 248 NEAIIESLVNGALP 261
           NE  I++   G +P
Sbjct: 319 NECGIDNSTTGGVP 332


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 93/194 (47%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + GLVTGG + S  GCQP    PC H +   S P C  +  P P+C
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEH-HINGSRPACGKI-EPTPRC 206

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR-P 188
              C +  Y   F +DK+          ++  + +    + GP   AF     T Y   P
Sbjct: 207 KKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTN-GPVEAAF-----TVYADFP 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            +++   VY   + AE+  +A VK++GWG E   PYW I +++   +GD G  KILRG++
Sbjct: 260 HYKSG--VYQHESGAELGGHA-VKMIGWGMEGSTPYWLIANSWNSDWGDMGFFKILRGQD 316

Query: 249 EAIIESLVNGALPK 262
           E  IE  +    P+
Sbjct: 317 ECGIERDIVAGEPR 330


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 94/191 (49%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y I  N   +  + +  GP   AF     T Y+    
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAF-----TVYSD-FL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q    VY    + +++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 261 QYKSGVYQ-HVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGIP 330


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 84/190 (44%), Gaps = 15/190 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+VTGG+     GC+   FP C+H   +   P C       PKC
Sbjct: 149 CQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG-SKKYPPCPHRIYDTPKC 207

Query: 140 HTRCTNDNYG------RGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQT 192
             +C   N        R       Q + + +  +    GP   AF      +     F  
Sbjct: 208 VPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEAAF------EVYEDFFGY 261

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
              VY   ++ E +    ++I+GWGEENG PYW I +++ E +G+ G  K+LRG+NE  I
Sbjct: 262 KQGVY-FHSTGEFIGGHAIRILGWGEENGTPYWLIANSWNEGWGEDGYFKMLRGKNECGI 320

Query: 253 ESLVNGALPK 262
           E  V   LP+
Sbjct: 321 EDEVTAGLPE 330


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 61/194 (31%), Positives = 92/194 (47%), Gaps = 26/194 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 138
           C+ G  +  WA+  + G+VTGG + +  GC+  + PPC H  +T  + P C  +  P P+
Sbjct: 154 CNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEH--HTEGDLPACGDI-VPTPQ 210

Query: 139 CHTRCT--------NDNYGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
           C   C         +D      +Q   D+ QI    +   P    F    +  F   Y  
Sbjct: 211 CKKECDAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVEADF--DVYEDF-LNYKS 267

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            ++Q     YA   +        +KI+GWG E+G PYW   +++ E +GDKG  KILRG+
Sbjct: 268 GVYQQTTGNYAGGHA--------IKILGWGVEDGTPYWLAANSWNEDWGDKGYFKILRGQ 319

Query: 248 NEAIIESLVNGALP 261
           NE  IES + G +P
Sbjct: 320 NECGIESDIIGGIP 333


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 94/191 (49%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y I  N   +  + +  GP   AF     T Y+    
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAF-----TVYSD-FL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q    VY    + +++    ++I+GWG ENG PYW + +++   +GD G  KILRG++  
Sbjct: 261 QYKSGVYQ-HVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319

Query: 251 IIESLVNGALP 261
            IES +   +P
Sbjct: 320 GIESEIVAGIP 330


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 91/193 (47%), Gaps = 20/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +    GLVTGG ++S  GCQP     C+H      +P C +     P+C
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-CASKEEHTPRC 203

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
              C    Y   F +DK      Y +              GP   AF     T Y   P 
Sbjct: 204 SKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAF-----TVYADFPT 257

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY  ++ A +  +A ++I+GWG ENG PYW + +++ E +G  G  KI+RG+++
Sbjct: 258 YKSG--VYQHTSGAMLGGHA-IRILGWGTENGTPYWLVANSWNEDWGAMGYFKIIRGKDD 314

Query: 250 AIIESLVNGALPK 262
             IES +   +PK
Sbjct: 315 CGIESQITAGMPK 327


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 208 NKMCEA-GYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 320 GIESEIVAGIPR 331


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 93/190 (48%), Gaps = 32/190 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE--------PECKT 131
           C  G+ +  W +  K G+V+GG + S  GCQP + PPCNH  +   E        P+CK 
Sbjct: 153 CEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN 212

Query: 132 L-ATPQ--------PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPH-FGPFWP 175
           +   P+        P+C  +C N NY   + +DK      Y++    ++ + + +GP   
Sbjct: 213 IPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRVKKSEIFKEIYEYGPV-- 269

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
               S+ T Y   L    G +Y  + S + +   +VKI+GWGEE G  YW   ++F   +
Sbjct: 270 ---TSYFTVYEDFLNYKEG-IYNYT-SGQKLGLHSVKIIGWGEERGIKYWLAANSFNTDW 324

Query: 236 GDKGTIKILR 245
           GDKG  KI+R
Sbjct: 325 GDKGFFKIIR 334


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 92/195 (47%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +    GLV+GG + ++ GC+P S  PC H    T  P C     P PKC
Sbjct: 191 CNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEG-PTPKC 248

Query: 140 HTRC---------TNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
              C          + N+G   +    D+ QI    +   P  G F    +  F      
Sbjct: 249 ERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAF--TVYADF------ 300

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY   +  E+  +A ++++GWG E+G PYW + +++   +GD G  KILRG+
Sbjct: 301 PTYKSG--VYQHVSGGELGGHA-IRVLGWGVEDGTPYWLVANSWNSDWGDNGFFKILRGQ 357

Query: 248 NEAIIESLVNGALPK 262
           NE  IE  +   LPK
Sbjct: 358 NECGIEGEIVAGLPK 372


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 190

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +  C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 191 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 244

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N  
Sbjct: 245 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 302

Query: 251 IIESLVNGALPK 262
            IES +   +P+
Sbjct: 303 GIESEIVAGIPR 314


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 118/251 (47%), Gaps = 33/251 (13%)

Query: 21  RRPYALS-CIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTI 77
           +R Y  S C  + A+A+   ++  +C  +   + VE ++   ++    +CA         
Sbjct: 101 KRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELVSCC-SKCAV-------- 151

Query: 78  WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
             C+ G S S W +  + GLVTG ++ +N+GC P  FP C+H + + S P C  +    P
Sbjct: 152 -GCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGS-SDSYPMCGYVVYTPP 209

Query: 138 KCHTRCT-------NDN--YGRGFFQDKYQINGLGLYFDPHFGPFWPA-FWRSFCTKYTR 187
            C+  C        ND+  +G+  +Q K   + +       +GP   + F       Y  
Sbjct: 210 VCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREI-MLYGPVEASIFIYDDFVDYKS 268

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            +++          +  ++   +V+I+GWG ENG PYW   +++ E++G  G  KILRG 
Sbjct: 269 GVYK--------HLTGRLITIQSVRIIGWGIENGIPYWLCANSWNEEWGLNGFFKILRGS 320

Query: 248 NEAIIESLVNG 258
           NE  IE+ VN 
Sbjct: 321 NECEIEAFVNA 331


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 89/194 (45%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  ++G+V+GG++ S +GC+P  FPPC H    T    C     P   C
Sbjct: 162 CDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221

Query: 140 HTRC--------TNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             +C        TND  YG   +    ++  +      H GP   A+  +  F   Y + 
Sbjct: 222 EHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLH-GPVEVAYDVYEDF-EHYLKG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           +++     Y        +    VK++GWG ENG PYW   +++   +G+ G  +ILRG +
Sbjct: 280 IYKHTAGSY--------LGGHAVKMIGWGTENGIPYWICSNSWNSDWGENGFFRILRGTD 331

Query: 249 EAIIESLVNGALPK 262
           E  IES V   LPK
Sbjct: 332 ECGIESGVVAGLPK 345


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 59/186 (31%), Positives = 85/186 (45%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + GLV+         CQP  FPPC H+   +  P C  +    PKC
Sbjct: 159 CDGGYPDEAWLYFTESGLVS-------DYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKC 211

Query: 140 HTRCTNDNYG--RGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +  CT+      R F  + Y + G   Y    +  GPF  AF     T Y   L   +G 
Sbjct: 212 NATCTDKRIPVVRYFASESYSLQGEEDYKRELYLRGPFEVAF-----TVYEDFLAYESG- 265

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY   +   +  +A V++VGWGE NG PYW I +++   +G+ G +   RG++E  IES 
Sbjct: 266 VYKHVSGGPVGGHA-VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIESQ 324

Query: 256 VNGALP 261
            +   P
Sbjct: 325 GSAGTP 330


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 94/185 (50%), Gaps = 19/185 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC+  C  
Sbjct: 29  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCNKTC-E 85

Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK      Y +  N   +  + +  GP   AF     + Y+  L   +G  
Sbjct: 86  PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 140

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 141 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 198

Query: 257 NGALP 261
              +P
Sbjct: 199 VAGMP 203


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 87/191 (45%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK Y  N   +              GP   AF             
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAF------SVFSDFL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q    VY    + E++    V+I+GWG EN  PYW + +++   +GD G  KILRGR+  
Sbjct: 261 QYKSGVYQ-HVTGEMMGGHAVRILGWGVENDTPYWLVGNSWNTDWGDHGFFKILRGRDHC 319

Query: 251 IIESLVNGALP 261
            IES V   +P
Sbjct: 320 GIESEVVAGIP 330


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 14/190 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C  G +   W +   +G+VTGG + SN GCQP    PC+H    +S   C +L   Q   
Sbjct: 134 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 192

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           C  +C N NY   +  D Y+ + + +               + P    +F   Y   +  
Sbjct: 193 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 250

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             G VY  S + E++ Y  VK++GWG +E G  YW  ++++   +G+ G  KILRG N  
Sbjct: 251 KEG-VYK-STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFC 308

Query: 251 IIESLVNGAL 260
            IE LV   L
Sbjct: 309 SIELLVMAGL 318


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 92/185 (49%), Gaps = 19/185 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 77  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133

Query: 146 DNYGRGFFQDKY--------QINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK+          N   +  + +  GP   AF     + Y+  L   +G  
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 188

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 189 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 246

Query: 257 NGALP 261
              +P
Sbjct: 247 VAGMP 251


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/189 (31%), Positives = 87/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +      F        P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE  IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 19/185 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212

Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK      Y +  N   +  + +  GP   AF     + Y+  L   +G  
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 268 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 325

Query: 257 NGALP 261
              +P
Sbjct: 326 VAGMP 330


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 27/198 (13%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H +     P+C       PK
Sbjct: 149 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKCSE-GGGTPK 205

Query: 139 CHTRCTN----------DNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           C  RC N           + G+ +   +D+ QI     Y     GP   AF     T Y 
Sbjct: 206 CVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIK----YEIMKNGPVEGAF-----TVYV 256

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G VY       +  +A ++I+GWGEENG PYW   +++   +GD G  KILRG
Sbjct: 257 DFLHYKSG-VYQHRHGLPLGGHA-IRILGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 314

Query: 247 RNEAIIESLVNGALPKDN 264
            +   IES ++  LPK N
Sbjct: 315 SDHCGIESEISAGLPKLN 332


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 110/246 (44%), Gaps = 30/246 (12%)

Query: 28  CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           C  + AV++   ++  +C  S  K  VE ++   I+  K   +           C  G  
Sbjct: 115 CASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGYF 164

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECKTLATPQ 136
             +W +    G+VTGG+  ++TGC+P  FP C+H          +     P+CK   T Q
Sbjct: 165 LPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQ--TCQ 222

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
              +T    D +  GF  +   +  +        GP       ++   Y   L   +G +
Sbjct: 223 KGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLNYKSG-I 276

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE +IES +
Sbjct: 277 YRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEI 335

Query: 257 NGALPK 262
              L K
Sbjct: 336 AAGLIK 341


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 92/191 (48%), Gaps = 22/191 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+ TGG + SN GC P   PPC        + + + L   +P  
Sbjct: 155 CQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207

Query: 140 HT-RCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
           H  +C    YG    +++Y++  + +             +GP   +F       Y   + 
Sbjct: 208 HNHKCPRACYGNSTVENRYKVKSIYVLDSSKTIEQDIRKYGPVEASF-----DVYDDFIT 262

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  + +A  V   +VK++GWGEE+G PYW +V+++ + +G++GT +I++GRNE 
Sbjct: 263 YKSG-IYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNEC 321

Query: 251 IIESLVNGALP 261
            IE      +P
Sbjct: 322 GIERSATAGVP 332


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 90/199 (45%), Gaps = 32/199 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W++  K G+VTGG + S+ GC P     C+H    T  P C     P P+C
Sbjct: 163 CNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKACDHHVNGTLGP-CDKKIPPTPRC 221

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
              C      +G+  D +         D H+G    ++      K  +    TNG V A 
Sbjct: 222 VHMCR-----KGYDVDYHD--------DKHYGK--SSYSVPSEEKQIQAEIMTNGPVEAD 266

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ V Y +               ++++GWG ENG PYW   +++  ++GDKG  KI
Sbjct: 267 FTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVENGVPYWLAANSWNTEWGDKGFFKI 326

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG +E  IE  V   LPK
Sbjct: 327 LRGSDECGIEDDVVAGLPK 345


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 90/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  S +A  +    VK++GWGEE G PYW +V+++   +GD G  KI RG 
Sbjct: 266 PSYKSG--VYVKSENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           NE  I++     +P  N
Sbjct: 324 NECGIDNSTTAGVPVTN 340


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 64/195 (32%), Positives = 87/195 (44%), Gaps = 26/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +S W +    G+V+GG + S  GCQP S  PC H +   S P C       P C
Sbjct: 150 CDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEH-HVPGSRPACSG-GGDTPDC 207

Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
             +C ++  G  + QD Y            QI    L      GP   AF     T Y  
Sbjct: 208 RNQC-DEGSGISYDQDHYYGETVYTLDEAKQIQAEIL----KNGPVEAAF-----TVYED 257

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L    G VY   A   +  +A +KI+GWG EN  PYW + +++   +G+ G  KILRG 
Sbjct: 258 LLNYKEG-VYQHVAGEALGGHA-IKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGS 315

Query: 248 NEAIIESLVNGALPK 262
           +E  IE  +   LP+
Sbjct: 316 DECGIEDQIVAGLPR 330


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 92/193 (47%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W + +  GLVTGG + S TGC P    PC H +     P+C       P C
Sbjct: 182 CNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEH-HVPGDRPKCSE-GGGTPSC 239

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTR-PL 189
            ++C   N    + QDK Y ++   +  DP          GP   AF     T Y   P 
Sbjct: 240 VSKCKG-NTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAF-----TVYADFPT 293

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY    +  ++    ++I+GWG ENG  YW + +++   +GDKG  KILRG +E
Sbjct: 294 YKSG--VYK-HVTGGVLGGHAIRILGWGSENGVAYWLVANSWNTDWGDKGYFKILRGSDE 350

Query: 250 AIIESLVNGALPK 262
             IES V   +P+
Sbjct: 351 CGIESSVVAGIPQ 363


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 80/194 (41%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+    TGC+P  FP C H +     P C     P PKC
Sbjct: 155 CDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 213

Query: 140 HTRC-----------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
              C           T  N      Q +  I    L   P           +F      P
Sbjct: 214 VKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGP--------VEATFEVHEDFP 265

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            +++    +A   S   V    ++I+GWGEENG PYW I +++ E +G+KG ++ LRG N
Sbjct: 266 EYKSGIYFHAWGGS---VGGHAIRILGWGEENGVPYWLIANSWNEDWGEKGYLRFLRGHN 322

Query: 249 EAIIESLVNGALPK 262
           E  IE      LP 
Sbjct: 323 ECGIEEEATAGLPD 336


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 19/185 (10%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212

Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
             Y   + +DK      Y +  N   +  + +  GP   AF     + Y+  L   +G  
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
             V  S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +
Sbjct: 268 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 325

Query: 257 NGALP 261
              +P
Sbjct: 326 VAGMP 330


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G + + W + HK G+V+GG + S  GCQP S  PC H+    S P C+ +    PKC
Sbjct: 150 CLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHS-IPGSRPACEGVRD-TPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
             +C    YG  +  D  Y   G  +  D           GP   +            LF
Sbjct: 208 KKQCEK-GYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASIL------VYEDLF 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                VY    + E++    +KI+GWG EN  PYW + +++   +G+ G  KILRG +E 
Sbjct: 261 SYKAGVYQ-HVAGEVLGGHVIKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGSDEC 319

Query: 251 IIESLVNGALPK 262
            IE  +   +P+
Sbjct: 320 GIEDQIVAGIPR 331


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 90/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  S +A  +    VK++GWGEE G PYW +V+++   +GD G  KI RG 
Sbjct: 266 PSYKSG--VYVKSENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           NE  I++     +P  N
Sbjct: 324 NECGIDNSTTAGVPVTN 340


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + WA+  ++G+V+GG + S+ GC+P    PC H +   + P C       P C
Sbjct: 161 CNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCDGEHGKTPSC 219

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C         T+ ++G   +  K  +  +      + GP   AF     T Y   + 
Sbjct: 220 RHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQN-GPVEGAF-----TVYEDLIL 273

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY      E+  +A ++I+GWG EN  PYW I +++   +G+ G  K+LRG +  
Sbjct: 274 YKDG-VYQHVHGRELGGHA-IRILGWGVENKTPYWLIANSWNTDWGNNGFFKMLRGEDHC 331

Query: 251 IIESLVNGALPK 262
            IES +   LPK
Sbjct: 332 GIESAIAAGLPK 343


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 86/191 (45%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPHQYYPTPEC 213

Query: 140 HTRCTNDNYGRGFFQDKYQIN-GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C  D  G  + +DK + N    +Y             GP    F     T Y     
Sbjct: 214 VQHC--DTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVF-----TVYED-FL 265

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q    VY  S  A +  +A ++I+GWGEE   PYW I +++ E +G+KG +K LRG NE 
Sbjct: 266 QYKFGVYFHSWGAPLSEHA-IRILGWGEEGDVPYWLIANSWNEDWGEKGYMKFLRGLNEC 324

Query: 251 IIESLVNGALP 261
            IE  V   LP
Sbjct: 325 GIEDDVTAGLP 335


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 104/255 (40%), Gaps = 59/255 (23%)

Query: 28  CIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           C    AVA+A  ++   C  +   M V+ ++   I+  K +             C  G S
Sbjct: 109 CDSGWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG-----------CQIGFS 157

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-- 143
             +W +  K GLVTG      TGC P  FP C+H + + S P+C  +    P C   C  
Sbjct: 158 EFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPCTKTCRS 212

Query: 144 -------TNDNYGRGFF---------QDKYQING---LGLYFDPHFGPFWPAFWRSFCTK 184
                   + +YGR  +         + +  +NG    G++    F  +    +R     
Sbjct: 213 GYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHI--- 269

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
                            + ++V   +V+I+GWG EN  PYW   +++ E +G  G  KIL
Sbjct: 270 -----------------TGQLVTIHSVRIIGWGIENDIPYWLCANSWNEDWGLNGYFKIL 312

Query: 245 RGRNEAIIESLVNGA 259
           RG NE  IES VN  
Sbjct: 313 RGSNECEIESFVNAG 327


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 88/186 (47%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 163 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 221

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK++  +   +  D          +GP   +F     T Y   L 
Sbjct: 222 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 275

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    ++  A  +    ++I+GWG EN  PYW I +++ E +G+ G  +I+RGR+E 
Sbjct: 276 YKSGIYKHITGEA--LGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 333

Query: 251 IIESLV 256
            IES V
Sbjct: 334 FIESEV 339


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK++  +   +  D          +GP   +F     T Y   L 
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 270

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y    + E +    ++I+GWG EN  PYW I +++ E +G+ G  +I+RGR+E 
Sbjct: 271 YKSG-IYK-HITGEALGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328

Query: 251 IIESLV 256
            IES V
Sbjct: 329 FIESEV 334


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 87/189 (46%), Gaps = 24/189 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     WA+    G+ +G        CQP  FP C+H   +T+ P+C  L    P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210

Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +  CT+    +  ++    Y ++G       LYF    GPF   F           LF  
Sbjct: 211 NPACTDSTISKKKYRGLKSYSLSGEEDFRRELYFR---GPFQAVF------DVWSDLFAY 261

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
              VY     A I A+A V+IVGWG ++G PYW I +++  ++GD+G   +LRG NE  I
Sbjct: 262 KHGVYKHVGGAFIGAHA-VRIVGWGNQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGI 320

Query: 253 ESLVNGALP 261
           E   +  +P
Sbjct: 321 EDSGSAGVP 329


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 88/195 (45%), Gaps = 20/195 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+ TGG + +  GC P   PPC       +         P  + 
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNT-----CGGKPMERN 208

Query: 140 HTRCTNDNYGRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQ 191
           H +C    YG+   QD+Y+      IN +         +GP   +F       Y      
Sbjct: 209 H-QCPKTCYGKTTVQDRYKTKNEYVINSIETIEQDLMTYGPVEASF-----DVYDDFSVY 262

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G +Y  +  A+     ++KI+GWGEENG PYW  V+++ + +GD GT KI++GRNE  
Sbjct: 263 KSG-IYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECG 321

Query: 252 IESLVNGALPKDNYG 266
           IE  V   +P  + G
Sbjct: 322 IERAVTAGIPSTSRG 336


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 94/194 (48%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG ++S+ GC P + PPC H +   S P+C T     PKC
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH-HVNGSRPQC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y ++             GP   AF     T ++  L 
Sbjct: 208 TKSC-EAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY   A  +I+    ++I+GWG EN  PYW + +++   +GD G  KILRG +  
Sbjct: 262 YKSG-VYKHEA-GDIMGGHAIRILGWGVENSVPYWLVANSWNVDWGDNGLFKILRGEDHC 319

Query: 251 IIESLVNGALPKDN 264
            IES +   +P+ +
Sbjct: 320 GIESEIVAGIPRTD 333


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/183 (31%), Positives = 86/183 (46%), Gaps = 12/183 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKMYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKY----QINGLG--LYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
             +C    Y   +  DK+     IN +   L        + P    ++   +   L   +
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGIAINVIKNELAIQKEIMMYGPV--EAYLLIFEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  + +   V    V+I+GWG ENG  YW   +T+ E +G+KG  +I+RGRNE  IE
Sbjct: 275 G-IYKYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332

Query: 254 SLV 256
           S+V
Sbjct: 333 SVV 335


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 33/195 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 135
           C  G     + W  +RG+VTGG  +  TGC+P    PCN  N       C  L TP    
Sbjct: 153 CDGGFPYRAFQWWARRGVVTGG-DYLGTGCKPYPIRPCNSDN-------CVNLQTPPCRL 204

Query: 136 --QPKCHTRCTND-NYGRGFFQDKYQINGL--GLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             QP   T  TND NYG   +     +  +   +Y++   GP   AF  +  F  KY   
Sbjct: 205 SCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYN---GPVVAAFIVYEDF-EKYKSG 260

Query: 189 LFQ-TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           +++   GR     A         VK++GWG E G PYW  V+++G Q+G+ GT +ILRG 
Sbjct: 261 IYRHIAGRSKGGHA---------VKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGV 311

Query: 248 NEAIIESLVNGALPK 262
           +E  IES +   LP+
Sbjct: 312 DECGIESRIVAGLPR 326


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 90/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +    G+VTGG  +S+ GCQP     C+H    T  P C+    P P+C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTRP 188
             +C   +Y   + QDK Y ++   +  +P          GP    F  +  F T  +  
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              T G V    A         +KI+GWG E G  YW + +++  ++GD G  KILRG N
Sbjct: 290 YQHTTGGVLGGHA---------IKILGWGVEEGTKYWLVANSWNNEWGDNGFFKILRGSN 340

Query: 249 EAIIESLVNGALPK 262
           E  IES +N  +PK
Sbjct: 341 ECGIESDINFGIPK 354


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 97/196 (49%), Gaps = 27/196 (13%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + +  WVH  G+V+GGA +S  GCQP    PC H + +   P+C    +  PK
Sbjct: 148 CNGGFPGAAFQYWVHS-GIVSGGAFNSTQGCQPYEIAPCEH-HVSGPRPKCAEGGS-TPK 204

Query: 139 CHTRCTND---------NYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           CH  C ++         ++G   +   +D+ QI     Y     GP   AF     T Y 
Sbjct: 205 CHKNCESNYVVDYESDLHHGSKHYSVDKDETQIK----YDIMTNGPVEGAF-----TVYV 255

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G VY  +    +  +A ++++GWGEE+G PYW   +++   +GD G  KILRG
Sbjct: 256 DFLHYKSG-VYQHTHGLPLGGHA-IRVLGWGEEDGTPYWLCANSWNTDWGDNGYFKILRG 313

Query: 247 RNEAIIESLVNGALPK 262
            +   IES ++  LPK
Sbjct: 314 SDHCGIESEISAGLPK 329


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 90/198 (45%), Gaps = 23/198 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       P C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEEGDTPTC 209

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
             +C  + Y   +  DK    G   Y  P             GP   AF     + Y   
Sbjct: 210 RKKC-EEGYSTQYKDDKNY--GSTSYSVPSSEQEIMAEIYKNGPVEGAF-----SVYEDF 261

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    V  + E++    ++I+GWG ENG  YW   +++   +GD G  K LRG+N
Sbjct: 262 LHYKSGVYQHV--AGEMLGGHAIRILGWGVENGIRYWLAANSWNIDWGDNGFFKFLRGKN 319

Query: 249 EAIIESLVNGALPK-DNY 265
              IES +   +P+ D Y
Sbjct: 320 HCGIESEIIAGIPRTDQY 337


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 88/186 (47%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK++  +   +  D          +GP    F     T Y   L 
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF-----TVYEDFLN 270

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y    + E +    ++I+GWG EN  PYW I +++ E +G+ G  +I+RGR+E 
Sbjct: 271 YKSG-IYK-HITGETLGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328

Query: 251 IIESLV 256
            IES V
Sbjct: 329 SIESEV 334


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
            +   Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 88/194 (45%), Gaps = 19/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +  K GL TGG++ S  GC+P S  PC       + P C     P P C
Sbjct: 100 CGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 159

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
             +CT+ N G     DK +  G  +   P+            GP    F       Y   
Sbjct: 160 EKKCTSKN-GYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTF-----EVYDDF 213

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L  T G    ++ + +   + +V+I+GWG   G PYW + +++G+++G+ GT + LRG N
Sbjct: 214 LQYTTGIYVHLTGNKQ--GHLSVRILGWGMYEGVPYWLLANSWGKEWGENGTFRALRGTN 271

Query: 249 EAIIESLVNGALPK 262
           E  +E+     +PK
Sbjct: 272 ECGLEANCVSGMPK 285


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 88/190 (46%), Gaps = 20/190 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+ TGG +++  GC P   PPC +      E  C     P  + 
Sbjct: 154 CGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQ---GENICD--EQPMERN 208

Query: 140 HTRCTNDNYGRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQ 191
           H +C    YG+   Q++Y+      IN +         +GP   +F           L  
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASF------DCYDDLSV 261

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               +Y  S +A+     ++KI+GWG+E+G PYW  V+++ + +GD GT KI++GRNE  
Sbjct: 262 YKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECG 321

Query: 252 IESLVNGALP 261
           IE  V   +P
Sbjct: 322 IERAVTAGIP 331


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 87/193 (45%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +   +G+ TGG  +S+ GCQP   P C H + T   P C  +    PKC
Sbjct: 65  CDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEH-HTTGDRPPCSDIVD-TPKC 122

Query: 140 HTRCT---NDNY--GRGFFQDKYQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRPL 189
              C    N +Y   + F +  Y I  L           GP   AF  +  F   Y   +
Sbjct: 123 VHLCEKGYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDF-INYKSGV 181

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q +        S E +    ++++GWG EN  PYW   +++   +GDKG  KILRG +E
Sbjct: 182 YQHH--------SGESLGGHAIRVLGWGYENDVPYWLCANSWNTDWGDKGYFKILRGSDE 233

Query: 250 AIIESLVNGALPK 262
             IES +   +PK
Sbjct: 234 CGIESSIVAGIPK 246


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
            +   Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 93/195 (47%), Gaps = 24/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  ++G+V+GG + S+ GC+P    PC H +   + P C+      P+C
Sbjct: 157 CNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215

Query: 140 HTRC---------TNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
             +C         T+ ++G   +   ++   I G  +   P  G F         T Y  
Sbjct: 216 QHKCQASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAF---------TVYED 266

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            +   +G VY      E+  +A ++I+GWG E   PYW I +++   +G+ G  KILRG+
Sbjct: 267 LILYKDG-VYEHVHGKELGGHA-IRIIGWGVEKDTPYWLIANSWNTDWGNNGFFKILRGK 324

Query: 248 NEAIIESLVNGALPK 262
           +   IES ++  LPK
Sbjct: 325 DHCGIESSISAGLPK 339


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 64/200 (32%), Positives = 95/200 (47%), Gaps = 28/200 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W++  K+GLV+GG ++S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           +  C    Y   + +DK+             +I       DP  G F         T ++
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAF---------TVFS 257

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G VY   A   +  +A ++I+GWG  NG PYW   +++   +GD G  KILRG
Sbjct: 258 DFLTYKSG-VYKHEAGDMMGGHA-IRILGWGVGNGVPYWLAANSWNLDWGDNGFFKILRG 315

Query: 247 RNEAIIESLVNGALPK-DNY 265
            N   IES +   +P+ D Y
Sbjct: 316 ENHCGIESEIVAGIPRTDQY 335


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 95/192 (49%), Gaps = 24/192 (12%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPAC-TGEGDTPKCSKTC-E 212

Query: 146 DNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTNG 194
             Y   + +DK+   G   Y  P             GP   AF     + Y+  L   +G
Sbjct: 213 PGYSPTYKEDKH--FGYTSYSLPTNEWEIMAEIYKNGPVEGAF-----SVYSDFLLYKSG 265

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY    + +++    ++I+GWGEENG PYW + +++   +GD G  +ILRG++   IES
Sbjct: 266 -VYQ-HLTGDMMGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIES 323

Query: 255 LVNGALPK-DNY 265
            V   +P+ D Y
Sbjct: 324 EVVAGIPRTDQY 335


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 89/194 (45%), Gaps = 18/194 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +  K GL TGG++ +  GC+P S  PC       + P C     P P C
Sbjct: 144 CGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 203

Query: 140 HTRCTNDN-YGRGFFQDKY-------QINGLGLYFDPHF---GPFWPAFWRSFCTKYTRP 188
             +CT+ N Y     +D++       Q+    +         GP    F       Y   
Sbjct: 204 EKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTF-----EVYDDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L  T G    ++ + +   + +V+I+GWG   G PYW + +++G+++G+ GT + LRG N
Sbjct: 259 LQYTTGIYVHLTGNKQ--GHLSVRILGWGMYEGVPYWLLANSWGKEWGENGTFRALRGTN 316

Query: 249 EAIIESLVNGALPK 262
           E  +E+    A+PK
Sbjct: 317 ECGLEANCVSAMPK 330


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
            +   Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 88/193 (45%), Gaps = 17/193 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K GLVTGG + S  GC+P   PPC   +   +    K +     +C
Sbjct: 157 CHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNNTCAGKPIEKNH-RC 215

Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQ 191
              C  D     N    F +D Y +    +  D   +GP   +F  +  F      P ++
Sbjct: 216 TRMCYGDQDLDYNDDHRFTRDFYYLTYGSIQKDVMTYGPIEASFDVYDDF------PSYK 269

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           +   VY  + +A  +    VK++GWG E G PYW +V+++  Q+GDKG  KI RG NE  
Sbjct: 270 SG--VYEKTENASYLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECG 327

Query: 252 IESLVNGALPKDN 264
           I++     +P  N
Sbjct: 328 IDNSTTAGVPVTN 340


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 88/202 (43%), Gaps = 39/202 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN---YTTSEPECKTLATPQ 136
           C+ G     W +  K GLVTGG + S+ GCQP   P CNH     Y     E KT     
Sbjct: 149 CAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGEGKT----- 203

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
           P+C   C      R  +   Y+        D H+G    A  R    +  +    TNG V
Sbjct: 204 PQCERTC------RSGYTTSYEA-------DLHYGEKAYAVHRE--VEAIQTEIMTNGPV 248

Query: 197 ------------YAVSASAEIVAYA----TVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
                       Y       +V +A     ++I+GWG ENG PYW I +++   +GDKG 
Sbjct: 249 EGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTENGVPYWLIANSWNPSWGDKGY 308

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            K++RG+++  IES +    PK
Sbjct: 309 FKMIRGKDDCGIESNIVAGTPK 330


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 101/202 (50%), Gaps = 32/202 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAH-----HSNTGCQPVSFPPCNHANYTTSEPECKTLAT 134
           C+ G ++  W +  K GLV+G  +     +S T CQP SFPPC+H +       C  L  
Sbjct: 158 CNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSH-HVQGEYQACTDL-- 214

Query: 135 PQ---PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRS 180
           PQ   PKC+T C +      + QD ++  G+  Y  P            +G    +F   
Sbjct: 215 PQFNTPKCYTECNSQYTQNSYEQDLHK--GVSSYSVPKSEEQIKAEIYQYGSTTASF--- 269

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
               Y+  L  ++G VY  ++ + +  +A +K++GWG ENG PYW   +++   +G+ G 
Sbjct: 270 --NVYSDFLTYSSG-VYQNTSGSYMGGHA-IKMLGWGVENGTPYWLCANSWNSSWGENGF 325

Query: 241 IKILRGRNEAIIES-LVNGALP 261
            KILRG NE  IES +V G +P
Sbjct: 326 FKILRGSNECGIESGMVAGFVP 347


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W    K GLVTGG + S  GCQP    PC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGK----PAEKN 209

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY    +A  +    VK++GWGEE G PYW +V+++ +Q+GD+G  KI RG 
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320

Query: 248 NEAIIESLVNGALP 261
           NE   ++   G +P
Sbjct: 321 NECGTDNSTTGGVP 334


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 68/253 (26%), Positives = 104/253 (41%), Gaps = 59/253 (23%)

Query: 30  EARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSS 87
           E  AVA+A  ++   C  +   M V+ ++   I+  K +             C  G S  
Sbjct: 28  ELWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG-----------CQIGFSEF 76

Query: 88  TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC---- 143
           +W +  K GLVTG      TGC P  FP C+H + + S P+C  +    P C   C    
Sbjct: 77  SWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPCTKTCRSGY 131

Query: 144 -----TNDNYGRGFF---------QDKYQING---LGLYFDPHFGPFWPAFWRSFCTKYT 186
                 + +YGR  +         + +  +NG    G++    F  +    +R       
Sbjct: 132 PIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHI----- 186

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                          + ++V   +V+I+GWG EN  PYW   +++ E +G  G  KILRG
Sbjct: 187 ---------------TGQLVTIHSVRIIGWGIENDIPYWLCANSWNEDWGLNGYFKILRG 231

Query: 247 RNEAIIESLVNGA 259
            NE  IES VN  
Sbjct: 232 SNECEIESFVNAG 244


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 86/189 (45%), Gaps = 24/189 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     WA+    G+ +G        CQP  FP C+H   +T+ P+C  L    P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210

Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +  CT+    +  ++    Y  +G       LYF    GPF   F           LF  
Sbjct: 211 NPACTDSTISKKKYRGLKSYSFSGEEDFRRELYFR---GPFQAVF------DVWSDLFAY 261

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
              VY     A I A+A V+IVGWG ++G PYW I +++  ++GD+G   +LRG NE  I
Sbjct: 262 KHGVYKHVGGAFIGAHA-VRIVGWGNQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGI 320

Query: 253 ESLVNGALP 261
           E   +  +P
Sbjct: 321 EDSGSAGVP 329


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 87/189 (46%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE  I+
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSID 332

Query: 254 SLVNGALPK 262
           S +   L K
Sbjct: 333 SEIAAGLIK 341


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H          +     P+CK
Sbjct: 146 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 205

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 206 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 258

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 259 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 316

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 317 SIESEIAAGLIK 328


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 110/251 (43%), Gaps = 35/251 (13%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHV-ECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           +C    AVAT++  A  +C ++     E  S   I      C +          C+ G  
Sbjct: 110 NCGSCWAVATSSAFADRLCVATNADFNELLSAEEITFCCHTCGF---------GCNGGYP 160

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
              W    K+GLVTGG + S  GC+P   PPC + +   +     T A    + + RCT 
Sbjct: 161 IKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNN-----TCAGKPMESNHRCTR 215

Query: 146 DNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQTN 193
             YG           + +D Y +    +  D   +GP   +F  +  F      P +++ 
Sbjct: 216 MCYGDQDLDFDEDHRYTRDYYYLTYGSIQKDVMTYGPIEASFDVYDDF------PSYKSG 269

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
             VY  S +A  +    VK++GWGEE G PYW +V+++ E +GD G  KI RG NE  ++
Sbjct: 270 --VYVKSENASYLGGHAVKLIGWGEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVD 327

Query: 254 SLVNGALPKDN 264
           +     +P  N
Sbjct: 328 NSTTAGVPVTN 338


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 86/186 (46%), Gaps = 9/186 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
             +C  D    G+ +DK + N     +            R    +  T Y   L  ++G 
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 270

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +  A +  +A V+I+GWGE    PYW I +++ E +G++G +K LRG NE  IE  
Sbjct: 271 VYFHALGAPMSGHA-VRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDD 329

Query: 256 VNGALP 261
           V   LP
Sbjct: 330 VTAGLP 335


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 20/190 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+ TGG + S  GC P   PPC       +         P  + 
Sbjct: 154 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQ 191
           H +C    YG    Q +Y++    +   P+        +GP   +F           L  
Sbjct: 209 H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASF------NLFDDLSA 261

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               +Y  +  A+ ++  ++KI+GWG+ENG PYW  V+++ + +G++GT +I++GRNE  
Sbjct: 262 YKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECG 321

Query: 252 IESLVNGALP 261
           IE      +P
Sbjct: 322 IERSATAGIP 331


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)

Query: 88  TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-ND 146
            W ++   GLV+GG +++N GCQP   PP  +      E  C      + +C+   T N 
Sbjct: 167 VWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLYENTC------EKRCYGNNTINY 220

Query: 147 NYGRGFFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEI 205
           N      ++ Y I    +  +  ++GP   AF R F   +    F     VY  + ++E 
Sbjct: 221 NQDHVKIKNHYDIEYEDIQREVQNYGPVSMAF-RVFDNDF----FLYKSGVYEKTTNSEF 275

Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           + +   K++GWG ENG  YW +V+++G ++G  G  KI RG +E  IE+ V+   P+
Sbjct: 276 IQWQYAKLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 108/242 (44%), Gaps = 31/242 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C +S    ++HV  T      G   +C +          C+ 
Sbjct: 26  NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC  +
Sbjct: 75  GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133

Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
           C      +    R   +D Y++              GP   AF     T Y    +   G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 188

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  +A      +A +KI+GWG+ENG PYW I +++   +G+ G  +ILRG N   IE 
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 246

Query: 255 LV 256
            V
Sbjct: 247 NV 248


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 93/190 (48%), Gaps = 27/190 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +    G VTGG ++S+ GCQP   P C H    + +P C+  + P PKC
Sbjct: 148 CNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEG-SEPTPKC 205

Query: 140 HTRCTNDNYGRGFFQDKYQINGL------------GLYFDPHFGPFWPAFWRSFCTKYTR 187
              C  + Y   +  DK++++               +Y +   GP   AF     T Y+ 
Sbjct: 206 KRSC-REGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLN---GPVEAAF-----TVYSD 256

Query: 188 -PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
            P +++   VY  +    +  +A +KI+GWG EN  PYW + +++   +GDKG  KILRG
Sbjct: 257 FPNYKSG--VYKYTTGNALGGHA-IKILGWGVENNVPYWLVANSWNPDWGDKGFFKILRG 313

Query: 247 RNEAIIESLV 256
            NE  IE+ V
Sbjct: 314 SNECGIEASV 323


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 84/195 (43%), Gaps = 21/195 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +    GLVTGG + S  GC+P   PPC      TS         P  K 
Sbjct: 156 CNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTS----SCAGQPIEKN 211

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPL 189
           H RCT   YG           F +D Y +    +  D  ++GP   +F            
Sbjct: 212 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLTYGSIQKDVMNYGPIEASF------DVYDDF 264

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +     VY  + +A  +    VK++GWG E G PYW +V+++  Q+GD G  KI RG +E
Sbjct: 265 YSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDE 324

Query: 250 AIIESLVNGALPKDN 264
             I+S     +P  N
Sbjct: 325 CGIDSATTAGVPVTN 339


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 61/194 (31%), Positives = 87/194 (44%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  +RGLV+GG + S+ GC+  + PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    Y   + +DK+   G+  Y  P             GP   AF       Y   
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L   +G    VS   E V    ++I+GWG ENG PYW   +++   +G  G  KILRG +
Sbjct: 261 LMYKSGVYQHVSG--EQVGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGED 318

Query: 249 EAIIESLVNGALPK 262
              IES +   +P+
Sbjct: 319 HCGIESEIVAGVPR 332


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 92/195 (47%), Gaps = 27/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +  + GLV+GG +H  TGCQP +  PC H +     P C       PKC
Sbjct: 165 CNGGFPQAAWEYWVQNGLVSGGLYHG-TGCQPYAIEPCEH-HTEGDRPPCTGEEGTTPKC 222

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
             +C  D Y   F QDK+   G   Y  P             GP   AF  +  F     
Sbjct: 223 SHKCV-DGYTGNFAQDKHY--GSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDF----- 274

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
            P +++   VY+    + +  +A ++++GWGEENG  YW   +++   +G+ G  KI RG
Sbjct: 275 -PTYKSG--VYSHHTGSALGGHA-IRVLGWGEENGEKYWLCGNSWNTDWGNNGFFKIKRG 330

Query: 247 RNEAIIESLVNGALP 261
            NE  IES + G +P
Sbjct: 331 VNECGIESEMVGGIP 345


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 88/192 (45%), Gaps = 17/192 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    K+G VTGG++   TGC+P  +PPC H    T    C +   P  KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226

Query: 140 HTRCTNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QD       Y ++             GP   AF     T Y     
Sbjct: 227 ERSC-QAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAF-----TVY-EDFE 279

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +A A +  +A VK++GWG +NG PYW   +++ E +G+ G  +I+RG NE 
Sbjct: 280 HYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNEC 338

Query: 251 IIESLVNGALPK 262
            IE  V G +PK
Sbjct: 339 GIEGGVVGGIPK 350


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/233 (27%), Positives = 96/233 (41%), Gaps = 65/233 (27%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTL- 132
           C+ G  +S W+WVH +G+ TGG + +      + GC P  FPPC H    T  PEC  + 
Sbjct: 605 CNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVS 664

Query: 133 -------ATPQ-------------PKCHTRCTNDNYGRGFFQDK----------YQINGL 162
                  AT +             P C  +C N  Y      D+          Y +N  
Sbjct: 665 CSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDA 724

Query: 163 --GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYAT---------- 210
              +  D   GP +      FC     P    +    + S   + +AY +          
Sbjct: 725 KNAIRTDGPVGPIY------FCD----PNVNFDQVSASFSVYEDFLAYKSGVYKHTSGEY 774

Query: 211 -----VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
                VKI+GWGEE+G+ YW +V+++ E +GD G  KI  G N  I ++L+ G
Sbjct: 775 LGGHAVKIIGWGEESGQAYWIVVNSWNEDWGDHGLFKIALG-NCGIDDNLLGG 826


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 90/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +    GLVTGG + S  GC+P   PPC        + +      P+ K 
Sbjct: 156 CNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR----NEDGKSSCAGKPKEKN 211

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           F +D Y +    +  D  ++GP   +F  +  F      
Sbjct: 212 H-RCTRMCYGNQDLDYDDDHRFTRDFYYLTYGSIQKDVLNYGPIEASFDVYDDF------ 264

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWG E G PYW +V+++  Q+GD G  KI RG 
Sbjct: 265 PSYKSG--VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGT 322

Query: 248 NEAIIESLVNGALPKDN 264
           +E  I+S     +P  N
Sbjct: 323 DECRIDSATTAGVPVTN 339


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/191 (33%), Positives = 87/191 (45%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI S  W +    G+V+GG ++S+ GC P   PPC H       P C    T  PKC
Sbjct: 151 CNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIP-CNG-ETSTPKC 208

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           H  C  + Y   +  DK      Y + G   +        GP   AF     T Y   L 
Sbjct: 209 HRSCRKE-YTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAF-----TVYADLLT 262

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +    +  +A +KI+GWG ENG  YW I +++   +GD G  KILRG +  
Sbjct: 263 YKSG-VYKHTEGEALGGHA-IKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320

Query: 251 IIESLVNGALP 261
            IES +    P
Sbjct: 321 GIESSIVAGEP 331


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 20/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
             +C    Y   +  DK+   G+ +    +          +GP   A+   F        
Sbjct: 218 KRKCQK-GYTTPYEHDKH-YGGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDF 269

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                 +Y  + +   V    V+I+GWG ENG  YW   +T+ E +G+KG  +I+RGRNE
Sbjct: 270 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 328

Query: 250 AIIESLV 256
             IES+V
Sbjct: 329 CSIESVV 335


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 20/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
             +C    Y   +  DK+   G+ +    +          +GP   A+   F        
Sbjct: 218 KRKCQK-GYTTPYEHDKH-YGGISINVIKNESAIQNEIMMYGPV-EAYLLIF-----EDF 269

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                 +Y  + +   V    V+I+GWG ENG  YW   +T+ E +G+KG  +I+RGRNE
Sbjct: 270 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 328

Query: 250 AIIESLV 256
             IES+V
Sbjct: 329 CSIESVV 335


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 89/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    K GLVTGG + S  GC+P   PPC +      E    T A    + 
Sbjct: 157 CNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY-----DESGNNTCAGKPMEA 211

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           + RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 212 NHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYGSIQKDVLTYGPVEASFDVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  S +A  +     K++GWGEE G PYW +V+++   +GD G  KI RG 
Sbjct: 266 PSYKSG--VYIRSENASYLGGHAAKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIQRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           NE  I++   G +P  N
Sbjct: 324 NECGIDNSTTGGVPITN 340


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 70/215 (32%), Positives = 85/215 (39%), Gaps = 36/215 (16%)

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC--NHANYTT 124
           C  L+S     W C          W    GL TGG +    GC+P S  PC  N+ N TT
Sbjct: 153 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTT 212

Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
           S P C    TP   C   CT N  +   + QDK            HFG       +    
Sbjct: 213 SVP-CPGYHTP--PCEDHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 257

Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
             T     TNG V A                V  + +       KI+GWG +NG PYW  
Sbjct: 258 IQTE--IMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 315

Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  +G  FG+ G ++ILRG NE  IE  V  ALP 
Sbjct: 316 VHQWGTDFGENGFVRILRGVNEVNIEHQVLAALPD 350


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 94/203 (46%), Gaps = 34/203 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W++  K+GLV+GG ++S+ GC P + PPC H +   S P C T      +C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTHRC 207

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
           +  C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ + Y +               ++I+GWG ENG PYW   +++   +GD G  KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
           LRG N   IES +   +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 56/187 (29%), Positives = 84/187 (44%), Gaps = 20/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 133 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-KGKYPSCGDKMYKTPQC 191

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
             +C    Y   +  DK+   G+ +    +          +GP   A+   F        
Sbjct: 192 KRKCQK-GYKTPYEHDKH-YGGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDF 243

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                 +Y  + +   V    V+I+GWG ENG  YW   +T+ E +G+KG  +I+RGRNE
Sbjct: 244 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 302

Query: 250 AIIESLV 256
             +ES+V
Sbjct: 303 CSVESVV 309


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 86/186 (46%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI    W +  K G+VTG +  ++ GC+P  FP C H +     P C +     P+C
Sbjct: 72  CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK++  +   +  D          +GP    F     T Y   L 
Sbjct: 131 KQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF-----TVYEDFLN 184

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    ++   E +    ++I+GWG EN  PYW I +++ E +G+ G  +I+RGR+E 
Sbjct: 185 YKSGIYKHITG--ETLGGHAIRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGRDEC 242

Query: 251 IIESLV 256
            IES V
Sbjct: 243 SIESEV 248


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 89/197 (45%), Gaps = 25/197 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +    G+VTGG + S  GC+P   PPC        E +      P  K 
Sbjct: 157 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 212

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           F +D Y +    +  D  ++GP   +F  +  F      
Sbjct: 213 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLTYGSIQKDVMNYGPIEASFDVYDDF------ 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   VY  + +A  +    VK++GWG E G PYW +V+++  Q+GD G  KI RG 
Sbjct: 266 PSYKSG--VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGT 323

Query: 248 NEAIIESLVNGALPKDN 264
           +E  I+S     +P  N
Sbjct: 324 DECGIDSAATAGVPVTN 340


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 85/178 (47%), Gaps = 14/178 (7%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-N 145
           S W ++   G+V+GG ++SN GCQP  FPP        + P+     T    C+   T N
Sbjct: 141 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPP------IANIPKHLHKHTCDDHCYGNSTIN 194

Query: 146 DNYGRGFFQDKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
            N+     ++ Y I    +  +   +GP    F    C  +    F     VYA S  A+
Sbjct: 195 YNHDHVRVRNYYTIRTRDIQKEVQTYGPVVVRF--MVCDDF----FLYKSGVYAKSDKAK 248

Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
            +     K++GWG ENG  YW +++++G ++G KG  KI  G N+  +ES V   LP+
Sbjct: 249 GIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 92/189 (48%), Gaps = 19/189 (10%)

Query: 79  VCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           + + G+  S + +  K G+ TGG +   + CQP S  PC+  +YT S P CK     Q  
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCKY--DCQAD 395

Query: 139 CHTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQTN 193
                ++D +   +  + Y +  N   +  + +  GP    F  +  F T Y   ++Q  
Sbjct: 396 YDIPISDDKF---YASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDF-TYYISGIYQQT 451

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
             V A+   A       ++I+GWGEENG PYW I +++   FG+KG  +I RG NE  IE
Sbjct: 452 TYV-AMGGHA-------IRIIGWGEENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIE 503

Query: 254 SLVNGALPK 262
           S V   +PK
Sbjct: 504 SEVYTGIPK 512



 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 42/95 (44%), Gaps = 6/95 (6%)

Query: 69  WLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE 128
           +++SR   +  C SG   + + +  + GLVTGG +     C P S  PC         P+
Sbjct: 59  FVISRIAALVGCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTMCRPYMLAPK 118

Query: 129 CKTLATPQPKCHTRCTNDN-YGRGFF---QDKYQI 159
           C+   T Q   +     D  YG+  +   QD++ I
Sbjct: 119 CQR--TCQASYNLSLKRDKYYGKSHYYVNQDEFDI 151


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 25/195 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W+   K+G+VTGG  +S+ GCQP   P C H + T   P C       PKC
Sbjct: 158 CNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEH-HTTGDRPPCSE-GGGTPKC 215

Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAF--WRSFCTKYTR 187
              C  D Y   + QD +          ++  + L    + GP   A   +  F T  + 
Sbjct: 216 LKTC-EDGYTVDYTQDLHYGASSYSVHKRMEDIQLEI-MNNGPVEGALTVYEDFPTYKSG 273

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
                +G+     A         ++I+GWG E G PYW I +++   +GD G IK+LRG+
Sbjct: 274 VYQHVHGKALGGHA---------IRILGWGVEEGVPYWLIANSWNTDWGDNGYIKLLRGK 324

Query: 248 NEAIIESLVNGALPK 262
           +   IES +   LPK
Sbjct: 325 DHCGIESQITAGLPK 339


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 85/183 (46%), Gaps = 12/183 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
           +  C    Y   + QDK      Y +  +            P    ++   Y   L   +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE  IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332

Query: 254 SLV 256
           S +
Sbjct: 333 SEI 335


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 87/187 (46%), Gaps = 8/187 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDK----YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
           +  C    Y   + QDK    +  N L +               ++   Y   L   +G 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y  +    I  +A V+++G G ENG  YW   +T+ E +G+KG  +I+RGRNE +IES 
Sbjct: 276 IYRYTTGKYISGHA-VRLIGCGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESE 334

Query: 256 VNGALPK 262
           +   L K
Sbjct: 335 IAAGLIK 341


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 89/186 (47%), Gaps = 19/186 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G+    W +    GLV+GG+++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 152 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C +  Y   + QDK      Y ++G   +        GP   AF     T Y+  L 
Sbjct: 210 TKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAF-----TVYSDLLS 263

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +    +  +A VKI+GWG EN   YW I +++   +GD G  KILRG +  
Sbjct: 264 YKSG-VYKHTQGDALGGHA-VKILGWGVENDNKYWLIANSWNSDWGDNGFFKILRGEDHC 321

Query: 251 IIESLV 256
            IES +
Sbjct: 322 GIESSI 327


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 88/186 (47%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI    W +  K G+VT  +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYNTPRC 216

Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK++  +   +  D          +GP   +F     T Y   L 
Sbjct: 217 KQTCQR-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 270

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y    + E +    ++I+GWG EN  PYW I +++ E +G+ G  +I+RGR+E 
Sbjct: 271 YKSG-IYK-HITGEALGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328

Query: 251 IIESLV 256
            IES V
Sbjct: 329 SIESEV 334


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 84/177 (47%), Gaps = 13/177 (7%)

Query: 88  TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-ND 146
            W ++   GLV+GG +++N GCQP   PP  +      E  C      + +C+   T N 
Sbjct: 167 VWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLYENTC------EKRCYGNNTINY 220

Query: 147 NYGRGFFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEI 205
           N      ++ Y I    +  +  ++GP   AF       +    F     VY  + ++E 
Sbjct: 221 NQDHVKIKNHYDIEYEDIQREVQNYGPVSMAF-----KVFDNDFFLYKSGVYEKTTNSEF 275

Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           + +   K++GWG ENG  YW +V+ +G ++G  G  KI RG +E  IE+ V+   P+
Sbjct: 276 IQWQYAKLIGWGVENGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 89/192 (46%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
           C  G    +W +   RG+VTGG+  ++T C+P  FP C+H          +     P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCK 218

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              T Q   +T    D +  GF  +   +  +        GP       ++   Y   L 
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G +Y  +    I  +A V+++GWG ENG  YW   +T+ E +G+KG  +I+RGRNE 
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 330 SIESEIAAGLIK 341


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 94/198 (47%), Gaps = 27/198 (13%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H + +   P+C       PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVSGPRPKCSE-GGGTPK 204

Query: 139 CHTRCT--------NDNYGRG----FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           C   C         +D +  G      +D+ QI     Y   + GP   AF     T Y 
Sbjct: 205 CAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIK----YEIMNNGPVEGAF-----TVYV 255

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G VY       +  +A ++++GWGEENG PYW   +++   +GD G  KILRG
Sbjct: 256 DFLHYKSG-VYQHRHGLPLGGHA-IRVLGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 313

Query: 247 RNEAIIESLVNGALPKDN 264
            +   IES ++  LPK N
Sbjct: 314 SDHCGIESEISAGLPKVN 331


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 89/207 (42%), Gaps = 37/207 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           CS G   ++W ++H  G+V+GG         +  GC P +FP C H    +    C    
Sbjct: 174 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYNFPKCAHHQKESDYKPCAKEI 233

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              P C + C N  YG  F +D++    L   F   FG           T   +    TN
Sbjct: 234 YDTPSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFGS----------TSSIKKEIMTN 280

Query: 194 GRVYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
           G   A  S   + ++Y +               V+I+GWG E G  YW +++++ E++GD
Sbjct: 281 GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 340

Query: 238 KGTIKILRGRNEAIIESLVNGALPKDN 264
            GT KI++G  +  I+ ++    P  N
Sbjct: 341 HGTFKIVQG--DCGIDDMILAGTPAIN 365


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 26/196 (13%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + W+ WVHK G+VTGG + S+ GC P     C+H    T  P C     P P+
Sbjct: 158 CNGGFPGAAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKSIPPTPR 215

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
           C  R     Y   F  DK+   G   Y  P             GP    F     T Y  
Sbjct: 216 C-VRMCRKGYNVDFADDKHY--GKKSYSVPSNVTQIQVEIMTNGPVEADF-----TVYAD 267

Query: 188 -PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
            PL+++   VY       +  +A ++++GWG E G PYW   +++  ++GDKG  KILRG
Sbjct: 268 FPLYKSG--VYQRHTDQALGGHA-IRLLGWGVEKGVPYWLAANSWNTEWGDKGFFKILRG 324

Query: 247 RNEAIIESLVNGALPK 262
            +E  IE  V   +P+
Sbjct: 325 SDECGIEDDVVAGIPR 340


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 94/199 (47%), Gaps = 26/199 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +    GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
              C    Y   + +DK+            +     +Y +   GP   AF     + Y+ 
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKN---GPVEAAF-----SVYSD 258

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   +G    V+   E++    V+I+GWG ENG PYW + +++   +GD G  KILRG+
Sbjct: 259 FLMYKSGVYQHVTG--EMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQ 316

Query: 248 NEAIIESLVNGALP-KDNY 265
           +   IES +   +P  D Y
Sbjct: 317 DHCGIESEIVAGIPCTDQY 335


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/185 (30%), Positives = 85/185 (45%), Gaps = 9/185 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
             +C  D    G+ +DK + N     +            R    +  T Y   L  ++G 
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 270

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +  A +  +A V+I+GWGE    PYW I +++ E +G++G +K LRG NE  IE  
Sbjct: 271 VYFHALGAPMSGHA-VRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDD 329

Query: 256 VNGAL 260
           V   L
Sbjct: 330 VTAVL 334


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 90/194 (46%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 136
           C+ G   + W++  ++GLV+GG + S+ GCQP +  PC H    T  P   E KT     
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCNGEGKT----- 206

Query: 137 PKCHTRCT---NDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRP 188
           PKC  +C    N  Y +  F  +  Y I              GP   AF     T Y   
Sbjct: 207 PKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAF-----TVYEDL 261

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G VY  +A   +  +A ++I+GWG EN   +W I +++   +GD G  KILRG +
Sbjct: 262 LNYKEG-VYQHTAGKMLGGHA-IRILGWGVENDTKFWLIANSWNSDWGDNGYFKILRGSD 319

Query: 249 EAIIESLVNGALPK 262
              IES +   LPK
Sbjct: 320 HLGIESSIAAGLPK 333


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 92/203 (45%), Gaps = 34/203 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
           +  C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ + Y +               ++I+ WG ENG PYW   +++   +GD G  KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAANSWNLDWGDNGFFKI 312

Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
           LRG N   IES +   +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 95/198 (47%), Gaps = 22/198 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C  G   + W +    G+VTG  + +++GC+P  FPPC +H+N T  EP CK    P PK
Sbjct: 190 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 248

Query: 139 CHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPL 189
           C+ +C + NY + +  DKY       +  D           GP   +F       YT  L
Sbjct: 249 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF-----EVYTDFL 302

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK---GTIKILRG 246
             T+G    V+ S  +     VKI+GWG + G  YW   +++   +G+    G  +ILRG
Sbjct: 303 HYTSGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNNDWGEDVFSGYFRILRG 360

Query: 247 RNEAIIESLVNGALPKDN 264
            +E  IES +   +P+ +
Sbjct: 361 ADECGIESGIVAGIPRKD 378


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 87/191 (45%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLP-CSG-DTKTPKC 208

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C +  Y   + QDK      Y + G   +        GP   AF     T Y   L 
Sbjct: 209 VKECES-GYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAF-----TVYADLLS 262

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V+  A  +    +KI+GWG ENG  YW I +++   +GD G  KILRG +  
Sbjct: 263 YKSGVYKHVTGDA--LGGHAIKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320

Query: 251 IIESLVNGALP 261
            IES +    P
Sbjct: 321 GIESSIVAGEP 331


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 106/242 (43%), Gaps = 31/242 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C +S    ++HV  T      G   +C +          C+ 
Sbjct: 114 NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGY---------GCNG 162

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC  +
Sbjct: 163 GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 221

Query: 143 CTNDNY-----GRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
           C           R   +D Y++              GP   AF     T Y    +   G
Sbjct: 222 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 276

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  +A      +A +KI+GWG+E G PYW I +++   +G+ G  +ILRG N   IE 
Sbjct: 277 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 334

Query: 255 LV 256
            V
Sbjct: 335 NV 336


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 107/242 (44%), Gaps = 31/242 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C +S    ++HV  T      G   +C +          C+ 
Sbjct: 26  NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC  +
Sbjct: 75  GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133

Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
           C      +    R   +D Y++              GP   AF     T Y    +   G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 188

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  +A      +A +KI+GWG+E G PYW I +++   +G+ G  +ILRG N   IE 
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 246

Query: 255 LV 256
            V
Sbjct: 247 NV 248


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 83/193 (43%), Gaps = 31/193 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +T CT           ND+Y     +D ++     LYF+   GPF  AF       Y+  
Sbjct: 211 NTTCTDKAIPLIKYRGNDSYVLLHGEDDFKRE---LYFN---GPFVVAF-----QVYSDF 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G    VS   + +    V+IVGWG+ NG PYW I +++   +G  G   ILRG N
Sbjct: 260 LAYKTGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGNN 317

Query: 249 EAIIESLVNGALP 261
           E  IES     LP
Sbjct: 318 ECGIESTGYAGLP 330


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207

Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
           PKC   C + NY   + +DK      + ++    +        GP   AF     T Y+ 
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAF-----TVYSD 261

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L    G VY  +    +  +A VKI+GWG ENG  YW I +++   +GD G  KILRG 
Sbjct: 262 LLNYKTG-VYKHTIGDALGGHA-VKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 319

Query: 248 NEAIIESLVNGALP 261
           +   IES +    P
Sbjct: 320 DHCGIESSIVAGEP 333


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 83/199 (41%), Gaps = 34/199 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W      G+VTGG +    GC+  SF PC H +     P C     P P C
Sbjct: 154 CNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEH-HVDGDLPPCGP-TKPTPDC 211

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
              C   + G          +G     DP+              K  +    TNG V A 
Sbjct: 212 KKEC---DSGSSLTYQNDLTHGSNYGIDPY-------------PKQIQTEIMTNGPVEAS 255

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            S   + ++Y +               +KI+GWG EN  PYW + +++ E +GDKG  KI
Sbjct: 256 FSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVANSWNEDWGDKGYFKI 315

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG NE  IE  +   +P+
Sbjct: 316 LRGSNECGIEGSIVAGIPE 334


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 91/199 (45%), Gaps = 25/199 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--------CKT 131
           CS G +++ W ++ K+G+VTGG + SN GCQP    PCN A+ T ++P         C  
Sbjct: 165 CSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCN-ASTTAADPSSVLGPHGVCGG 223

Query: 132 LATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--------HFGPFWPAFWRSFCT 183
                PKC   C N  +   +  D  +   +   FD           GP+          
Sbjct: 224 DPATTPKCDLSCYNARHEGKYLDDIIKAKKV-FTFDGCSARKNLRKHGPYVVTM-----R 277

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G  + V  + + +   +V+++GWG E G+ +W + +++G  +GDKG  KI
Sbjct: 278 VYEDFLAYKSGVYHHV--TGDYLGLLSVRMIGWGLEGGQAFWLLANSWGTSWGDKGFFKI 335

Query: 244 LRGRNEAIIESLVNGALPK 262
            R  NE  IE+     +P 
Sbjct: 336 RRFVNECWIENFRYAGVPN 354


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 86/191 (45%), Gaps = 15/191 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +  K G+ TGG++ S  GC+P S  PC       + P C     P P C
Sbjct: 157 CAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTC 216

Query: 140 HTRC-------TNDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQ 191
             +C        + +   G   D+     + +  D    GP       +    Y   L  
Sbjct: 217 EKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPV-----EATMEIYDDFLQY 271

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           T G    ++ + +   + +V+I+GWG   G PYW + +++G+++G+ GT ++LRG NE  
Sbjct: 272 TTGIYVHLAGNKQ--GHLSVRILGWGMFEGVPYWLLANSWGKEWGENGTFRVLRGVNECG 329

Query: 252 IESLVNGALPK 262
           +E+     +PK
Sbjct: 330 LEANCISGMPK 340


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207

Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
           PKC   C + NY   + +DK      + ++    +        GP   AF     T Y+ 
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAF-----TVYSD 261

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L    G VY  +    +  +A VKI+GWG ENG  YW I +++   +GD G  KILRG 
Sbjct: 262 LLNYKTG-VYKHTIGDALGGHA-VKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 319

Query: 248 NEAIIESLVNGALP 261
           +   IES +    P
Sbjct: 320 DHCGIESSIVAGEP 333


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
           C+ G+ +  W +    GLV+GG+++S  GC+P   PPC H    N      + KT     
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDSKT----- 210

Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
           PKCH  C   +Y   + +DK      Y ++    +        GP   AF     T Y+ 
Sbjct: 211 PKCHKTCEA-SYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAF-----TVYSD 264

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   NG VY  +    +  +A +KI+GWG ENG  Y  I +++   +GD G  KILRG 
Sbjct: 265 LLNYKNG-VYKHTVGNALGGHA-IKILGWGVENGNKYRLIANSWNSDWGDNGFFKILRGE 322

Query: 248 NEAIIESLVNGALP 261
           +   IES +    P
Sbjct: 323 DHCGIESSIVAGEP 336


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 104/240 (43%), Gaps = 30/240 (12%)

Query: 33  AVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWAWV 92
           AV++A+ ++  +C  S         R  A     C    S   ++  C  GI S T+   
Sbjct: 111 AVSSASVMSDRICIQSDQK---NQLRISAADMIECC--ESCTFSVDGCHGGIPSFTFTEW 165

Query: 93  HKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-------- 144
              G V+GG ++S  GC     P CN        P CKTL    P C   C         
Sbjct: 166 KDSGFVSGGEYNSTNGCMSYPLPRCN--------PSCKTLYDA-PTCKKECDKGSPLKYE 216

Query: 145 -NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA 203
            + +Y +  ++   ++           GP   +F     T Y   +   +G VY     +
Sbjct: 217 EDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF-----TVYADFIHYLSG-VYKFDGES 270

Query: 204 EIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +++    V+I+GWG ENG  PYW + +++ E++GD+G  KI RG+NE  IE  +   LP+
Sbjct: 271 KLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 17/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +S W +    G+V+GG + S  GCQP S  PC H +     P C    +  P C
Sbjct: 150 CDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEH-HVPGPRPACSGEGS-TPDC 207

Query: 140 HTRC---TNDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
             +C   +  +Y +  +  +  Y +              GP   AF     T Y   +  
Sbjct: 208 RNQCDKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGPVEAAF-----TVYEDLVNY 262

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
             G    V+ S  ++    +KI+GWG EN  PYW + +++   +G+ G  KILRG++E  
Sbjct: 263 KEGVYQHVAGS--VLGGHAIKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGKDECG 320

Query: 252 IESLVNGALPK 262
           IE  V+  LP+
Sbjct: 321 IEIDVSAGLPR 331


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 84/194 (43%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +    G+VTG  + + +GC+P  +PPC H        +C     P   C
Sbjct: 163 CEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTC 222

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
             +C  DNY   + +DK      Y + G   +        GP    F  +  F   Y+  
Sbjct: 223 EYKC-QDNYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDF-EHYSSG 280

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           +++          + E V    VK++GWG ENG  YW   +++   +G+ G  +ILRG N
Sbjct: 281 IYK--------HMAGEYVGVHAVKMLGWGTENGVDYWICANSWNSDWGENGFFRILRGEN 332

Query: 249 EAIIESLVNGALPK 262
           E  IES V    PK
Sbjct: 333 ECGIESNVVAGKPK 346


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 107/252 (42%), Gaps = 37/252 (14%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHV-ECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           +C    A+AT++  A  +C ++     +  S   I     +C +          C+ G  
Sbjct: 112 NCGSCWAIATSSAFADRLCVATNADFNQLLSAEEITFCCHKCGY---------GCNGGYP 162

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQPKCHTR 142
              W    K GLVTGG + S  GC+P   PPC +    N T S         P  + H R
Sbjct: 163 IKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCS-------GKPMEQNH-R 214

Query: 143 CTNDNYGRGFF---------QDKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQT 192
           CT   YG             +D Y +    +  D   +GP   +F       Y   L   
Sbjct: 215 CTRMCYGDQDLDFDDDHRHTRDSYYLTIGSIQKDVMTYGPIEASF-----DVYDDFLSYK 269

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
           +G VY  S +A  +    VK++GWGEE G PYW +++++   +GD+G  KI RG NE  +
Sbjct: 270 SG-VYVRSENASYLGGHAVKLIGWGEEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGV 328

Query: 253 ESLVNGALPKDN 264
           ++     +P  N
Sbjct: 329 DNSTTAGVPVTN 340


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 86/181 (47%), Gaps = 24/181 (13%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           WV   GLV+G  ++S+ GC+P  F PC++  +     E K      PKC   C N  Y R
Sbjct: 173 WV-DAGLVSGAPYNSSEGCKPYPFEPCSYP-FVGCHHEKK-----NPKCLHHCIN-GYDR 224

Query: 151 GFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
            + +DK      Y+I              GP    F      +     +  +  VY    
Sbjct: 225 KYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGF------EVFEDFYFYHSGVYKHVV 278

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
             ++  +A ++IVGWG ENG PYW I +++G+ +GDKG  K+LRG N   IES V   LP
Sbjct: 279 GKKVGMHA-IRIVGWGTENGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAGLP 337

Query: 262 K 262
           +
Sbjct: 338 Q 338


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 19/182 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    GLV+GG ++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 112 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 169

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C + +Y   F +DK      Y ++G   +        GP   AF     T Y+  L 
Sbjct: 170 QKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAF-----TVYSDLLS 223

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             NG VY  +    +  +A +KI+GWG EN   YW I +++   +GD G  KILRG +  
Sbjct: 224 YKNG-VYKHTEGNALGGHA-IKIIGWGVENNNKYWLIANSWNSDWGDNGFFKILRGEDHC 281

Query: 251 II 252
            I
Sbjct: 282 GI 283


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 86/194 (44%), Gaps = 23/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +  + GLVTG  +++   C+P SFPPC H      +P      TPQ  C
Sbjct: 157 CQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--C 214

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
             +C  + Y + +  DK Y +    ++ D        A  R   T    PL + +  VYA
Sbjct: 215 VKKCQPE-YPKTYENDKWYGLKAYSIHSDQE------AIMRDLMT--YGPL-EVDFEVYA 264

Query: 199 VSASAEIVAY----------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              S     Y            V++VGWG E+G  YW I +++   +GD G  KI RG N
Sbjct: 265 DFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANSWNTDWGDGGYFKIRRGVN 324

Query: 249 EAIIESLVNGALPK 262
           E  IES  N   PK
Sbjct: 325 ECGIESDANAGHPK 338


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 89/184 (48%), Gaps = 19/184 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    GLV+GG ++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 113 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 170

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C + +Y   F +DK      Y ++G            GP   AF     T Y+  L 
Sbjct: 171 EKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAF-----TVYSDLLS 224

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +    +  +A +KI+GWG ENG  YW I +++   +GD G +KILRG +  
Sbjct: 225 YKSG-VYQHTHGNALGGHA-IKILGWGVENGSKYWLIANSWNSDWGDNGFLKILRGEDHC 282

Query: 251 IIES 254
            IES
Sbjct: 283 GIES 286


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++   RG+V+GG+++S  GC+P    PC H +     P C + +TP   C
Sbjct: 157 CNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEH-HVDGPRPPCHSGSTPH--C 213

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C   NY   + +DK      Y IN             GP   AF     T Y   + 
Sbjct: 214 KHQC-QPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              G VY      ++  +A ++I+GWG   E+  PYW I +++   +GD G  +ILRG++
Sbjct: 268 YKTG-VYQHVHGKQLGGHA-IRIIGWGVWGESKVPYWLIANSWNTDWGDNGFFRILRGKD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESQISAGLPK 339


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 88/207 (42%), Gaps = 37/207 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           CS G   ++W ++H  G+V+GG         +  GC P SFP C H    +    C    
Sbjct: 53  CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEI 112

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              P C + C N  YG  F +D++    L   F   FG           T   +    TN
Sbjct: 113 YDTPSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFGS----------TSSIKKEIMTN 159

Query: 194 GRVYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
           G   A  S   + ++Y +               V+I+GWG E G  YW +++++ E++GD
Sbjct: 160 GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 219

Query: 238 KGTIKILRGRNEAIIESLVNGALPKDN 264
            GT KI++G  +  I+  +    P  N
Sbjct: 220 HGTFKIVQG--DCGIDDTILAGTPAMN 244


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 86/192 (44%), Gaps = 23/192 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 138
           C  G +     +    G+VTGG  ++  GC P SFPPC  +     S P CKT       
Sbjct: 165 CQGGYTIEAMKYWMNSGVVTGG-DYNGAGCMPYSFPPCKKSPCVEFSTPSCKT------T 217

Query: 139 CHTRCTNDNY--GRGFFQDKYQINGLG------LYFDPHFGPFWPAFWRSFCTKYTRPLF 190
           C  + T  +Y   + F    Y+++          Y   H GP   A +R F        +
Sbjct: 218 CQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPV-EASYRVF-----EDFY 271

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q    VY    S  +V    VKI+GWG ENG  YW + +++G  FG+KG  KI RG NE 
Sbjct: 272 QYKSGVYH-HVSGNLVGGHAVKIIGWGTENGVDYWLVANSWGTSFGEKGFFKIRRGTNEC 330

Query: 251 IIESLVNGALPK 262
            IES +   L K
Sbjct: 331 QIESNIVAGLAK 342


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 93/198 (46%), Gaps = 27/198 (13%)

Query: 80  CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H +     P+C +     PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKC-SEGGGTPK 204

Query: 139 CHTRCT--------NDNYGRG----FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           C   C         +D +  G      +D+ QI     Y     GP   AF     T Y 
Sbjct: 205 CAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIK----YEIMKNGPVEGAF-----TVYV 255

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
             L   +G VY       +  +A ++++GWGEENG PYW   +++   +GD G  KILRG
Sbjct: 256 DFLHYKSG-VYQHRHGLPLGGHA-IRVLGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 313

Query: 247 RNEAIIESLVNGALPKDN 264
            +   IES ++  LPK N
Sbjct: 314 SDHCGIESEISAGLPKLN 331


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 23/192 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W W  + G+V+      ++GC P +FP C+H   T     CK   +P P C
Sbjct: 202 CDGGQPDSAWRWFSEHGVVS----ELDSGCWPYNFPECSHHVETKGMEPCKG-NSPSPVC 256

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAFWRSFCTKYTRPLF 190
            T C N ++   F  D++     G   D            GP   AF     T Y   L+
Sbjct: 257 STTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAF-----TVYEDFLY 311

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY     +E+  +A VKI+GWG +    YW +++++   +GD+G  KI  G  E 
Sbjct: 312 YKSG-VYKHVNGSELGGHA-VKIIGWGTDQNEQYWLVMNSWNVNWGDQGIFKIAIG--EC 367

Query: 251 IIESLVNGALPK 262
            I+S V   +PK
Sbjct: 368 GIDSEVTAGIPK 379


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 92/200 (46%), Gaps = 31/200 (15%)

Query: 80  CSSGISSSTW-AWVHK---RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP 135
           C  G  ++ W  W  K    G+VTGG + SN GCQP + P C+H      E    + +TP
Sbjct: 153 CDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGPYENCSGSQSTP 212

Query: 136 QPKCHTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFC 182
              C   C + +Y + +  DK+              I    +   P  G F  + +  F 
Sbjct: 213 S--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAF--SVYADFP 267

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
           T YT  ++Q          +   +    +KI+GWG ENG PYW + +++   +GD G  K
Sbjct: 268 T-YTSGVYQ--------HTTGSFLGGHAIKILGWGTENGVPYWLVANSWNPSWGDSGFFK 318

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I+RG++E  IES +   +P+
Sbjct: 319 IIRGKDECGIESSIVAGMPE 338


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 110/245 (44%), Gaps = 38/245 (15%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C  S    +MH+  +S  F++   + C++          C  
Sbjct: 118 NCGSCWAVSTASALSDRICIESNGETQMHI--SSIDFVSCC-ESCSY---------GCDG 165

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +    G VTGG + S  GC+P  F PC H    T   EC   A   PKC  R
Sbjct: 166 GWPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRR 224

Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
           C   +Y + ++ DK    G   Y  PH            GP   AF     T Y    + 
Sbjct: 225 CQR-SYKKAYYMDKSY--GEDAYEVPHSVKAIQREIMKNGPVVGAF-----TVYEDFSYY 276

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
             G +Y  +A      +A +KI+GWG EN  PYW I +++   +G++G  +++RG NE  
Sbjct: 277 KKG-IYKHTAGQARGGHA-IKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECG 334

Query: 252 IESLV 256
           IE  V
Sbjct: 335 IEQEV 339


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 12/180 (6%)

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G  +  W +  K G+VTG +  ++T CQP  FP C H +     P C       P C   
Sbjct: 139 GFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEH-HTKGKYPACFEEIYKTPNCENT 197

Query: 143 CTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
           C   +Y   + QDK      Y +             + P    +    Y   L   +G +
Sbjct: 198 CQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGPV--EANFIVYEDFLNYKSG-I 253

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y    + ++V++  ++I+GWG EN  PYW I +++ E +G+ G  +ILRGR+E  IES V
Sbjct: 254 YK-HITGKLVSWHAIRIIGWGVENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEV 312


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 87/197 (44%), Gaps = 26/197 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN----YTTSEPECKTLATP 135
           C  G     W ++   G+VTGG ++  + C+P SFPPC+H N    Y+  E +   L   
Sbjct: 145 CDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEV 204

Query: 136 QPKCHTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH--------FGPFWPAF--WRSFCT 183
            P C  +C +  + R +  DK +   N   L  D           GP    F  +  F  
Sbjct: 205 TPSCTKKC-HPQFSRTYDVDKIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLN 263

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
             +    QT G+     A         VKI+GWG ENG PYW  ++++ + +G  G  KI
Sbjct: 264 YKSGVYQQTTGQRRGKHA---------VKIIGWGTENGVPYWEAINSWNDGWGINGKFKI 314

Query: 244 LRGRNEAIIESLVNGAL 260
           LRG N   IE  V  ++
Sbjct: 315 LRGFNHLDIEGEVYASI 331


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/190 (30%), Positives = 93/190 (48%), Gaps = 19/190 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W ++ + G+VTGG ++S+ GC P     C+H      +P CK    P P+C
Sbjct: 155 CNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRC 212

Query: 140 HTRCT---NDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTR-PLFQ 191
              C    N+ Y +     K  + + G+          GP   AF     T Y+  P ++
Sbjct: 213 KKECESGYNNTYSKDEHHAKTVHAVEGVEQIMTEIMTNGPVEAAF-----TVYSDFPTYK 267

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           +   VY   +   +  +A +K +GWG E+G+ YW + +++   +GD G  KILRGR+E  
Sbjct: 268 SG--VYEHKSGGPLGGHA-IKTLGWGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECG 324

Query: 252 IES-LVNGAL 260
           IES +V G +
Sbjct: 325 IESNIVAGMM 334


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 84/191 (43%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    GLV+GG ++S+ GC P   PPC H       P C    T  PKC
Sbjct: 32  CNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPPCEHHVPGNRLP-CNG-DTKTPKC 89

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C N  Y   + +DK      Y + G   +        GP   AF     T Y   L 
Sbjct: 90  SKTCEN-GYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNGPVEAAF-----TVYADLLA 143

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    V   A  +    +KI+GWG ENG  YW I +++   +G+ G  KILRG +  
Sbjct: 144 YKSGVYKHVEGDA--LGGHAIKIIGWGVENGNKYWLIANSWNTDWGNNGFFKILRGEDHC 201

Query: 251 IIESLVNGALP 261
            IES +    P
Sbjct: 202 GIESSIVAGEP 212


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 110/247 (44%), Gaps = 29/247 (11%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  S  +V   +FRF A     C          + C+ G   
Sbjct: 112 SCGSCWAFGAVEAMSDRVCIHSNGNV---NFRFSADDLVSCCHTCG-----FGCNGGFPG 163

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
           + W++  ++G+V+GG + S TGC+P    PC H    T  P C    +  PKC  +C   
Sbjct: 164 AAWSYWTRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAP-CNH-DSKTPKCQHQCEAG 221

Query: 144 ------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
                  + ++G   +  +  +  +      + GP   AF     T Y   +   +G VY
Sbjct: 222 YNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTN-GPVEGAF-----TVYEDLILYKSG-VY 274

Query: 198 AVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
                 E+  +A ++I+GWG   +   PYW I +++ + +GDKG  +ILRG +   IES 
Sbjct: 275 QHEHGKELGGHA-IRILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESS 333

Query: 256 VNGALPK 262
           ++  LPK
Sbjct: 334 ISAGLPK 340


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 109/245 (44%), Gaps = 38/245 (15%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C  S    +MH+  +S  F++   + C +          C  
Sbjct: 118 NCGSCWAVSTASALSDRICIESNGETQMHI--SSIDFVSCC-ESCGY---------GCDG 165

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +    G VTGG + S  GC+P  F PC H    T   EC   A   PKC  R
Sbjct: 166 GWPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRR 224

Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
           C   +Y + ++ DK    G   Y  PH            GP   AF     T Y    + 
Sbjct: 225 CQR-SYKKAYYMDKSY--GEDAYEVPHSVKAIQREIMKNGPVVGAF-----TVYEDFSYY 276

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
             G +Y  +A      +A +KI+GWG EN  PYW I +++   +G++G  +++RG NE  
Sbjct: 277 KKG-IYKHTAGQARGGHA-IKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECG 334

Query: 252 IESLV 256
           IE  V
Sbjct: 335 IEQEV 339


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 86/193 (44%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 149 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207

Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
             +C        T+       +  K + N +      + GP   AF  +  F    +   
Sbjct: 208 VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 266

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           F ++G +    A         ++I+GWGEENG  YW I +++ + +G+ G  K+LRG+NE
Sbjct: 267 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNE 317

Query: 250 AIIESLVNGALPK 262
             IE  V   LP+
Sbjct: 318 CGIEDEVTAGLPE 330


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 94/206 (45%), Gaps = 20/206 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C +G     W +    G+VTGG+    +GC+   FP C H       P C     P P+C
Sbjct: 120 CGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRR-KGRYPPCPRHIYPTPEC 178

Query: 140 HTRCTNDNYGRGFFQDKYQIN--------GLGLYFDPHF-GPFWPAFWRSFCTKYTRPLF 190
             +C  D     + +DK + N         + +  +    GP   +F             
Sbjct: 179 IKQC--DEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEASF------GIYADFL 230

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           + NG VY       I  +A ++I+GWGE++G PYW I +++ E +G+KG ++ LRG NE 
Sbjct: 231 EYNGGVYFHCWGGPISRHA-IRILGWGEDDGVPYWLIANSWNEDWGEKGYVRFLRGHNEC 289

Query: 251 IIESLVNGALPKDNYGVEFGEESGER 276
            IE  V  A+P D +  +  ++S  R
Sbjct: 290 GIEEEVT-AVPIDWFLRQMIKQSTLR 314



 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 36/52 (69%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ++I+GWGEE+G PYW + +++ E +G+KG +++LR RNE  I   V   LP 
Sbjct: 895 IRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPD 946



 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S   W +    G+VTGG+    TGC+   FP C H       P C     P P+C
Sbjct: 708 CRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 766

Query: 140 HTRC 143
             RC
Sbjct: 767 IKRC 770


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 85/214 (39%), Gaps = 36/214 (16%)

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
           C  L+S     W C          W    GL TGG ++   GC+P S  PC+  +AN TT
Sbjct: 154 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTT 213

Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
           S P C    TP   C   CT N  +   + QDK            HFG       +    
Sbjct: 214 SVP-CPGYHTP--TCEEHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 258

Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
                   TNG V A                V  + +       KI+GWG +NG PYW  
Sbjct: 259 IQIE--IMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 316

Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V  +G  FG+ G ++ LRG NE  IE  V  ALP
Sbjct: 317 VHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 15/191 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +  K GL TGG++ S  GC+P S  PC+      + P C       P C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248

Query: 140 HTRCT-------NDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQ 191
             +C        + +   G   D+     + +  D    GP       S   +      Q
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPI------SATMEVYDDFLQ 302

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               +Y V  +     + +V+I+GWG   G PYW + +++G+Q+G+ GT ++LRG NE  
Sbjct: 303 YTTGIY-VHLTGNKQGHLSVRILGWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECG 361

Query: 252 IESLVNGALPK 262
           +E+     +P+
Sbjct: 362 LEANCVSGMPR 372


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 87/192 (45%), Gaps = 31/192 (16%)

Query: 82  SGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 141
            G S   W  V   GLV+G A++S  GC+P  F PC +  +    PE        P C  
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNSTDGCKPYPFKPCLYP-FVGCHPE------KTPSCTH 209

Query: 142 RCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLF 190
            CT + Y   + +DKY   G   Y  P+            GP    F         + L+
Sbjct: 210 HCT-EGYDGTYRRDKYY--GSAAYKLPNDERMIQLEIMTNGPVESGF------SVYQDLY 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                VY      E+  +A V+++GWG+E G PYW I +++GE +G+ G  K LRG N  
Sbjct: 261 LYKTGVYQHVVGREVGKHA-VRLIGWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHL 319

Query: 251 IIESLVNGALPK 262
            IES+V   LPK
Sbjct: 320 GIESVVIAGLPK 331


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 19/180 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C  G   + W +    G+VTG  + +++GC+P  FPPC +H+N T  EP CK    P PK
Sbjct: 146 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 204

Query: 139 CHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPL 189
           C+ +C + NY + +  DKY       +  D           GP   +F       YT  L
Sbjct: 205 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF-----EVYTDFL 258

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
             T+G    V+ S  +     VKI+GWG + G  YW   +++   +G+ G  +ILRG +E
Sbjct: 259 HYTSGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNNDWGEDGYFRILRGADE 316


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 85/198 (42%), Gaps = 29/198 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G++   W +++K G+ TGG      +  +  GC P +FP C H    +    C   +
Sbjct: 157 CKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAHYQKKSKYGPCPKKS 216

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR------ 187
              P C  RC N+ YG    +D++        F     P+W    RS   +  +      
Sbjct: 217 YETPSCLDRCPNEKYGTPLDKDRH--------FTARAVPYWFNGIRSIKKEIMKHGPTSA 268

Query: 188 ------PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
                   F     VY  ++ A  V + TV+++GWG E G  YW   + + E++ D GT 
Sbjct: 269 SFFTYEDFFSYKSGVYKYTSGA-YVEFHTVELIGWGTEKGVDYWLAKNDWNEEWADLGTF 327

Query: 242 KILRGRNEAIIESLVNGA 259
           KI +G  +  I  LV GA
Sbjct: 328 KIAQG--DCGINDLVLGA 343


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 85/201 (42%), Gaps = 32/201 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G+  + W+++   G+ T G+  +  GC P +FP C H    +    C       P C
Sbjct: 133 CKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSC 192

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             RC N+ YG    +D+            HF    P  +    T   +    TNG   A 
Sbjct: 193 LDRCPNEKYGIPLDKDR------------HFTAHSPDLFEG--TDNIKKEIMTNGPTSAT 238

Query: 200 -SASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            S   + V+Y                +V+I+GWG E G  YW +++++ E +GD GT KI
Sbjct: 239 FSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKI 298

Query: 244 LRGRNEAIIESLVNGALPKDN 264
            +G  +  I+  V G+ P  N
Sbjct: 299 AQG--DCGIDDAVLGSPPAMN 317


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 86/193 (44%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 40  CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 98

Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
             +C        T+       +  K + N +      + GP   AF  +  F    +   
Sbjct: 99  VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 157

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           F ++G +    A         ++I+GWGEENG  YW I +++ + +G+ G  K+LRG+NE
Sbjct: 158 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNE 208

Query: 250 AIIESLVNGALPK 262
             IE  V   LP+
Sbjct: 209 CGIEDEVTAGLPE 221


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 20/195 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 205 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 264

Query: 140 HTRCTNDNYGRGFFQDKYQ-------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLF 190
             +C + NY + +  DKY         N + L        GP   +F       YT  L 
Sbjct: 265 DRQC-DKNYKKPYKADKYYGEQAYNVENDVELIQKEIMTLGPVEASF-----EVYTDFLH 318

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK---GTIKILRGR 247
              G    V+ S  +     VKI+GWG + G  YW   +++   +G+    G  +ILRG 
Sbjct: 319 YIGGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNTDWGEDVFSGYFRILRGV 376

Query: 248 NEAIIESLVNGALPK 262
           +E  IES +   +P+
Sbjct: 377 DECGIESGIVAGIPR 391


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 9/187 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S   W      G+VTGG+    TGC+   FP C H       P C     P P+C
Sbjct: 155 CRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 213

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
             RC  D     + +DK + N     +            R    +    Y   L   +G 
Sbjct: 214 IKRC--DTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGV 271

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
            + V      +    ++I+GWGEE+G PYW + +++ E +G+KG +++LR RNE  I   
Sbjct: 272 YFHVWGGH--LGEHGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQ 329

Query: 256 VNGALPK 262
           V   LP 
Sbjct: 330 VTAGLPD 336


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 24/244 (9%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A+AT + ++  +C  S       +FR        C  +       + C  G   
Sbjct: 113 SCGSCWAIATTSVMSDRLCIGSN---GVMNFRLSGLDMLSCCAICG-----FACQGGYPG 164

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + WA+  ++GLV+GG + S  GCQP +  PC+H+    S P C        +C   C   
Sbjct: 165 AAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSG-NGSRPVCTVGGG--VRCQHLC-EP 220

Query: 147 NYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           +Y   F +DK      Y I+   L          P   ++  T Y   L    G  Y + 
Sbjct: 221 SYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPV--QAILTVYEDFLSYKTGVYYHL- 277

Query: 201 ASAEIVAYATVKIVGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
              E V    V+I+GWG    +  PYW + +++G  +GD G   I RG N   IE  +  
Sbjct: 278 -EGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMA 336

Query: 259 ALPK 262
            LPK
Sbjct: 337 GLPK 340


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 87/186 (46%), Gaps = 19/186 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+ +  W +    G+V+GG+++S  GC P   PPC H       P C    T  PKC
Sbjct: 153 CNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLP-CNG-DTKTPKC 210

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   F +DK      Y ++G            GP   AF     T Y+  L 
Sbjct: 211 QKTCEA-GYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAF-----TVYSDLLS 264

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G VY  +  + +  +A VKI+GWG ENG  YW I +++   +GD G  KILRG +  
Sbjct: 265 YKSG-VYQHTDGSALGGHA-VKILGWGVENGSKYWLIANSWNSDWGDNGFFKILRGEDHC 322

Query: 251 IIESLV 256
            IES +
Sbjct: 323 GIESSI 328


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/193 (32%), Positives = 83/193 (43%), Gaps = 31/193 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +T CT           N++Y     +D Y+     LYF+   GPF   F       Y+  
Sbjct: 211 NTTCTDKAIPLIKYRGNNSYMLLNGEDDYKRE---LYFN---GPFVVDF-----GVYSDF 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G    VS   +++    V+IVGWG+ NG PYW I +++   +G  G   ILRG N
Sbjct: 260 LAYKTGVYRHVSG--DVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGNN 317

Query: 249 EAIIESLVNGALP 261
           E  IES     LP
Sbjct: 318 ECGIESTGYAGLP 330


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 85/192 (44%), Gaps = 30/192 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 140 HTRCTND---------NYGRGFF-QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           +T CT+          N+  G   +D Y+     LYF+   GPF  AF       Y+  L
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYGLDGEDDYKRE---LYFN---GPFVVAF-----QVYSDFL 259

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
               G    VS   +++    V+IVGWG+ NG PYW I +++   +G  G   ILRG++E
Sbjct: 260 AYKTGVYRHVSG--DVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDE 317

Query: 250 AIIESLVNGALP 261
             IES     LP
Sbjct: 318 CGIESEGYAGLP 329


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++   +G+V+GG++ S  GC+P    PC H +   + P C + +T  P+C
Sbjct: 155 CNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPYEVEPCEH-HVNGTRPPCHSGST--PRC 211

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
             +C +  Y   + +DK      Y +N   L         GP   AF     T Y   + 
Sbjct: 212 MHKCES-GYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAF-----TVYEDLIL 265

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              G VY      ++  +A ++I+GWG   +N  PYW I +++   +GD G  +ILRG +
Sbjct: 266 YKTG-VYQHVHGRQLGGHA-IRILGWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGED 323

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 324 HCGIESAISAGLPK 337


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 89/191 (46%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W      G+VTGG ++S  GCQP     C+H      +P CK      P+C
Sbjct: 28  CNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDHHVVGKLKP-CKGDGK-TPRC 85

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
             +C    Y   F  DK      Y ++ +    +     GP   AF     T Y+    Q
Sbjct: 86  EKKCEA-GYNVTFKDDKHYGQRSYSVSSVNDIMEELVTRGPVEAAF-----TVYSD-FLQ 138

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +  VY  +  + +  +A VKI+G+G ENG  YW + +++   +GD+G  KILRG +E  
Sbjct: 139 YHSGVYRHTTGSALGGHA-VKILGYGVENGDKYWLVANSWNPDWGDQGFFKILRGVDECG 197

Query: 252 IESLVNGALPK 262
           IE  +    PK
Sbjct: 198 IEGQIVAGEPK 208


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 18/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  ++ W +   +G+VTGG + SN GCQP S   C H      +P C  +  P P C
Sbjct: 160 CNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDIV-PTPAC 217

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
              C    Y   +  DK      Y + G+          GP   AF     T Y+  L  
Sbjct: 218 KRSC-RQGYNVTYPNDKHFGASSYGVRGVDQIATEIMTNGPVEAAF-----TVYSDFLSY 271

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G VY  ++   +  +A +KI+GWG ++G  YW + +++ + +G+ G   I +G +E  
Sbjct: 272 KSG-VYQHTSGQPLGGHA-IKIIGWGVQDGTDYWIVANSWNDSWGNDGFFWIKKGTDECG 329

Query: 252 IESLVNGALPK 262
           IES V   LPK
Sbjct: 330 IESQVVAGLPK 340


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 85/193 (44%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 149 CQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207

Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
             +C        T+       +  K + N +      + GP   AF  +  F    +   
Sbjct: 208 VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 266

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           F ++G +    A         ++I+GWGEENG  YW I +++ + +G+ G  K+LRG+NE
Sbjct: 267 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGCFKMLRGKNE 317

Query: 250 AIIESLVNGALPK 262
             IE  V   LP+
Sbjct: 318 CGIEDEVTAGLPE 330


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/198 (29%), Positives = 90/198 (45%), Gaps = 26/198 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G   S W+WVH  G+ TGG +        + GC P  FPPC H       P C   A
Sbjct: 211 CRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPCAHFFKDPKYPACPKFA 270

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKY-QINGLGLYFDPHF--------GPFWPAFWRSFCTK 184
               +C ++  +      +F D+Y  +  +  +F            GP    F+      
Sbjct: 271 RVNLRCVSKLRH--MMVVYFSDRYFMVESVPYHFSADDAKNAIRTDGPVSATFYV----- 323

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
           Y   L   +G VY  ++ + + A+A VKI+GWGE+ G  YW +V+++ E +GD G  KI 
Sbjct: 324 YEDFLAYKSG-VYKHTSGSLLGAHA-VKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIA 381

Query: 245 RGRNEAIIESLVNGALPK 262
            G  +  I++ + G  PK
Sbjct: 382 LG--DCGIDNELLGGTPK 397


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 86/196 (43%), Gaps = 9/196 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S     +    G VTGG +  + GC P SF PC      ++ P CKT      K 
Sbjct: 162 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 220

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-HFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
                + +YG   ++     +   +  +  H+GP   ++      K     +     VY 
Sbjct: 221 EEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASY------KVYEDFYHYKSGVYH 274

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
            + S ++V    VKI+GWG ENG  YW I +++G  FG+KG  KI RG NE  IE  V  
Sbjct: 275 YT-SGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVA 333

Query: 259 ALPKDNYGVEFGEESG 274
            + K     E  E+ G
Sbjct: 334 GIAKLGTHSETYEDDG 349


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 89/213 (41%), Gaps = 39/213 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTG---------------------CQPVSFPPCN 118
           C+ G  SS W +    GLV+GG + S+ G                     C+P + PPC 
Sbjct: 148 CNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCE 207

Query: 119 HANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF-- 170
           H +   S P C       P+C  RC    Y   + QDK      Y ++            
Sbjct: 208 H-HVNGSRPSCSGEGGDTPECIFRC-EAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYK 265

Query: 171 -GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVS 229
            GP   AF     T Y   +   +G    VS SA  +    +K++GWGEENG PYW   +
Sbjct: 266 NGPVEGAF-----TVYEDFVLYKSGVYQHVSGSA--LGGHAIKMLGWGEENGVPYWLCAN 318

Query: 230 TFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ++   +GD G  KILRG +   IES +    PK
Sbjct: 319 SWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 87/187 (46%), Gaps = 14/187 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHA---NYTTSEPECKTLATPQ 136
           C  G     W +   +G+ TGG + +  GC P   PPC +    N    +P  +    P+
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPK 213

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQTNG 194
             C+ + T  N  R   + +Y IN +         +GP   +F       Y       +G
Sbjct: 214 T-CYGKTTVQN--RYKTKSEYSINSIKTIEQDLKTYGPVEASF-----DVYDDFSVYKSG 265

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  +  A+     ++KI+GWG+ENG  YW  V+++ + +G+ GT KI++GRNE  IE 
Sbjct: 266 -IYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIER 324

Query: 255 LVNGALP 261
            V   +P
Sbjct: 325 AVTAGIP 331


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 84/195 (43%), Gaps = 26/195 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W    ++GLV+GG   S+ GC+P +  PC H       P CK   TP  KC
Sbjct: 152 CDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSP-CKDSITP--KC 208

Query: 140 HTRCT---------NDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
             +C          + ++G+  +    D+ QI        P    F    +  F + Y  
Sbjct: 209 IKKCLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATF--TVFDDFAS-YKH 265

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            ++Q          S  +     V+I+GWG ENG  YW   +++   +GD G  KILRG 
Sbjct: 266 GIYQ--------HTSGNLAGEHAVRILGWGVENGTKYWLAANSWNSDWGDNGYFKILRGS 317

Query: 248 NEAIIESLVNGALPK 262
           N   IES +   LPK
Sbjct: 318 NHVDIESAIVAGLPK 332


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 100/248 (40%), Gaps = 30/248 (12%)

Query: 27  SCIEARAVATATPLAFAVC-RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
           SC    AVA    ++  +C  S   H+     R  AG    C  L  +      C  G  
Sbjct: 137 SCASCWAVAPTDVMSDRICIHSGSRHI----VRLSAGNLLSCCKLCGK-----GCKGGFP 187

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPKCHTRCT 144
              W    K G+VTGG++ S+ GCQ   F PC       S + +C        +C   C 
Sbjct: 188 GGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETCR 247

Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV----- 199
             +Y + + QD Y   G  +Y  P+      A           P+ Q N R+Y       
Sbjct: 248 T-SYNKSYKQDLYY--GESVYRIPN-----DARAIQLEIMENGPV-QANLRIYEDFLHYK 298

Query: 200 -----SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
                    + + Y  VKI GWG E G PYW   + + +++G+ G  KILRG N A IE 
Sbjct: 299 FGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIED 358

Query: 255 LVNGALPK 262
            V   +PK
Sbjct: 359 HVMAGIPK 366


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/152 (33%), Positives = 71/152 (46%), Gaps = 14/152 (9%)

Query: 116 PCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD-----KYQINGLGLYFDPH- 169
           PC H       P C       P+C  +C N +YG  + +D     +Y+I G     + + 
Sbjct: 1   PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQYRIPGYTAMKEIYE 59

Query: 170 FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVS 229
            GP   +F+        +        VYA + S + V    VKI+GWGEENG PYW   +
Sbjct: 60  NGPITASFYMY------QDFVNYQSGVYAFN-SGKYVTTQAVKILGWGEENGTPYWLAAN 112

Query: 230 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +F   +GD G +KILRG NE  IE  +   LP
Sbjct: 113 SFNTYWGDNGFVKILRGANECYIEEFMYAGLP 144


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 16/191 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L  T   
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLF 190
            C  +C N NY   +  D ++ + + +               + P    +F   Y   + 
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMG 249

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              G +Y  S + E++ Y  VK++GWG + +G  YW  ++++   +G+ G  KILRG N 
Sbjct: 250 YKEG-IYK-STTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNF 307

Query: 250 AIIESLVNGAL 260
             IE LV   +
Sbjct: 308 CSIELLVMAGI 318


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 87/192 (45%), Gaps = 31/192 (16%)

Query: 82  SGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 141
            G S   W  V   GLV+G A+++  GC+P  F PC +  +    PE        P C  
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNNTDGCKPYPFKPCLYP-FVGCHPE------KTPSCTH 209

Query: 142 RCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLF 190
            CT + Y   + +DKY   G   Y  P+            GP    F         + L+
Sbjct: 210 HCT-EGYDGTYRRDKYY--GSAAYKLPNDERMIQLEIMTNGPVESGF------SVYQDLY 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                VY      E+  +A V+++GWG+E G PYW I +++GE +G+ G  K LRG N  
Sbjct: 261 LYKTGVYQHVVGREVGKHA-VRLIGWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHL 319

Query: 251 IIESLVNGALPK 262
            IES+V   LPK
Sbjct: 320 GIESVVIAGLPK 331


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 80/197 (40%), Gaps = 31/197 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           CS G     W WV K G+ TGG + +   C+P +F PC +         C   + P P+C
Sbjct: 155 CSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRC 214

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C    Y + + +DK+                  ++W     K  R     NG V A 
Sbjct: 215 EKFCQR-GYIKPYKKDKFYAK--------------KSYWLPNDEKEIRLDIMKNGPVQAA 259

Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
               E                      VKI+GWG++NG  YW I +++ + +G+ G  ++
Sbjct: 260 FDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTDYWLIANSWSKDWGESGFFRM 319

Query: 244 LRGRNEAIIESLVNGAL 260
           +RG N+  IE ++   +
Sbjct: 320 VRGENDCEIEDMITAGI 336


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 87/199 (43%), Gaps = 33/199 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  ++ W    +RG+V+GG + +  GC+P S  PC + +     P C  +    P+C
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY-HTKCRIPNCIPIVH-TPEC 211

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
              C    Y + + +DK            HFG    +  R    K  +    TNG V A 
Sbjct: 212 VHHCRK-GYDKDYQEDK------------HFGQKVYSISRD--EKQIQTEIFTNGPVEAD 256

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
                + + Y +               ++I+GWG ENG PYW   +++ E +GDKG  KI
Sbjct: 257 FHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGTENGTPYWLAANSWNENWGDKGYFKI 316

Query: 244 LRGRNEAIIESLVNGALPK 262
           LR  NE  IE  +   +PK
Sbjct: 317 LRRTNECGIEEHIYAGIPK 335


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 23/195 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +    GLV+GG + +   C+    PPC H +   + P C+  A P PKC
Sbjct: 169 CNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEH-HVNGTRPPCEGDA-PTPKC 226

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAF--WRSFCTKYTRP 188
              C  + Y   + +DK Y +    ++ +           GP    F  +  F T Y   
Sbjct: 227 KNVC-QEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPT-YKSG 284

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q          S  ++    +K++GWGEE+G PYW   +++   +G+ G  KILRG+N
Sbjct: 285 VYQ--------HVSGALLGGHAIKLMGWGEEDGVPYWLCANSWNTDWGEGGFFKILRGKN 336

Query: 249 EAIIESLVNGALPKD 263
              IES +   +P++
Sbjct: 337 HCGIESDIVAGIPQN 351


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 89/187 (47%), Gaps = 14/187 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHA---NYTTSEPECKTLATPQ 136
           C  G     W +   +G+ TGG + +  GC+P    PC +    N    +P  +    P+
Sbjct: 154 CEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNTCGGKPMERNHQCPK 213

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNG 194
             C+ + T+    R   + +Y IN +         +GP   +F       Y       +G
Sbjct: 214 T-CYGKTTDQK--RYKTKSEYVINSIKTIEQDIKTYGPVEASF-----DVYDDFSVYKSG 265

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  + +A+     +VKI+GWG+ENG PYW  V+++ + +GD GT KI++G+NE  IE 
Sbjct: 266 -IYRKTPNAKYQNGHSVKIIGWGQENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIER 324

Query: 255 LVNGALP 261
            V   +P
Sbjct: 325 AVTAGIP 331


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 87/197 (44%), Gaps = 11/197 (5%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN-YTTSEPECKTLATPQPK 138
           C  G S     +    G VTGG ++ N GC P SF PC  +    ++ P CKT       
Sbjct: 163 CQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQKSPCVESTTPTCKTTCQSSYT 221

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL--YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                T+ +YG   ++     N +    Y   H GP   ++      K     +Q    V
Sbjct: 222 TANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASY------KVYEDFYQYKSGV 275

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y    S ++V    VKI+GWG EN   YW + +++G +FG+ G  KI RG NE  IES V
Sbjct: 276 YHY-VSGKLVGGHAVKIIGWGTENDVDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNV 334

Query: 257 NGALPKDNYGVEFGEES 273
              + K     E G++ 
Sbjct: 335 VAGVAKLGTHAEKGDDD 351


>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 203

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 88/186 (47%), Gaps = 24/186 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL 132
           C+ G      +++   G+VTG      G      GC P  F  CNH     SE P+CK +
Sbjct: 12  CNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNHVPTENSEYPKCKDV 71

Query: 133 A-TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAFWRSFC 182
           A  P P C T CTN  Y +   +D ++       F+            GP + AF     
Sbjct: 72  AHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGPVFSAF----- 126

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             Y    +  +G VY V  + E++++  VKI+GWG ++ + YW  ++++ E++GD G IK
Sbjct: 127 KMYEDFRYYKSG-VY-VPTTKEVLSFHLVKIIGWGADSVQEYWLAMNSWNEEWGDHGLIK 184

Query: 243 ILRGRN 248
           +  G+N
Sbjct: 185 MAFGKN 190


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 80/185 (43%), Gaps = 16/185 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +  + G+VTGG + +   C+P   PPC H    T    C  +A   P C
Sbjct: 163 CDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIAD-TPDC 221

Query: 140 HTRC-----TNDNYGRGFFQDKYQINGLGLYFDPH---FGPFWPAFWRSFCTKYTRPLFQ 191
            T C      + +  + F +D Y I             +GP   AF            F 
Sbjct: 222 VTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAF------IVYEDFFH 275

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +  +Y   +  E   +A V+I+GWGEE G  YW + +++   +G+ G  +ILRG NE  
Sbjct: 276 YHRGIYKHVSGGEEGGHA-VRILGWGEEKGTAYWLVANSWNTDWGENGYFRILRGSNECG 334

Query: 252 IESLV 256
           IE  V
Sbjct: 335 IEENV 339


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 87/202 (43%), Gaps = 40/202 (19%)

Query: 79  VCSSGISSST------WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL 132
           +  SGI S+       W +  K+GLV+GG +++N GCQP   PP                
Sbjct: 144 ISCSGIKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP--------------IF 189

Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPH------------FGPFWPAFWRS 180
             P+   +  C N  YG       Y  + + + +  H            +GP    F   
Sbjct: 190 NLPKKIYNRTCDNFCYGNSLID--YNHDHVKVSYTYHVLYKNIQREVQTYGPVSAYF--- 244

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
             + Y      T+G VYA +  ++ V Y + K++GWG ENG  YW +V+++G ++G  G 
Sbjct: 245 --SLYDDLFLYTSG-VYARTEKSKFVRYQSAKLIGWGVENGVDYWLLVNSWGNEWGQNGL 301

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            KI RG +E          +PK
Sbjct: 302 FKIKRGTDECQFGRHTYAGVPK 323


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 82/182 (45%), Gaps = 24/182 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   ++W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLG-----LYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +T CT+       ++    Y+++G       LYF+   GPF   FW      Y+  L   
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEDDYKRELYFN---GPFVVVFW-----VYSDFLAYK 262

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
            G    VS   + +    V+IVGWG+ NG PYW I +++   +G  G +  LRG NE  I
Sbjct: 263 TGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGI 320

Query: 253 ES 254
           E+
Sbjct: 321 EA 322


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 105/248 (42%), Gaps = 33/248 (13%)

Query: 27  SCIEARAVATATPLAFAVC--RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           SC    AV++A  ++  +C   +S + V  +    ++     C +          C  G 
Sbjct: 113 SCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYG---------CQGGW 163

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
               + W+ + G+VTGG +     C+P +F PC H         C     P PKC   C 
Sbjct: 164 PIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQ 223

Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTN 193
              Y + + +DK+       Y+ P+            GP   AF       Y    +   
Sbjct: 224 R-KYNKSYQEDKH--FATRAYYLPNNERNIRQEIYKNGPVVAAF-----RVYQDFSYYKK 275

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y      +  A+A VK+VGWG EN   YW I +++   +G+ G  +I+RG NE  IE
Sbjct: 276 G-IYVHKWGGQTGAHA-VKVVGWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIE 333

Query: 254 S-LVNGAL 260
           + +V GA+
Sbjct: 334 AQMVGGAM 341


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 82/191 (42%), Gaps = 19/191 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI S  W +    G+V+GG ++S  GC+P   PPC H       P C    T  PKC
Sbjct: 152 CNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C N  Y   + +DK      Y ++    +        GP   AF     T Y   L 
Sbjct: 210 QKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAF-----TVYADLLA 263

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             +G    +   A  +    +KI+GWG EN   YW + +++   +GD G  KILRG N  
Sbjct: 264 YKSGVYKHIQGDA--LGGHAIKILGWGVENDNKYWLVANSWNTDWGDNGFFKILRGENHC 321

Query: 251 IIESLVNGALP 261
            IE  +    P
Sbjct: 322 GIEGSIIAGEP 332


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 83/194 (42%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTGG   + TGC P  FP C H    +    C     P P C
Sbjct: 145 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPRYTYPTPSC 204

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-FDPH----------FGPFWPAFWRSFCTKYTRP 188
           +  C    Y + + +DK  + G   Y  D H           GP    F       YT  
Sbjct: 205 YPYC-QAGYDKTYEKDK--VYGKTSYNVDRHEYTIMEEIMKNGPVEAGF-----IVYTDF 256

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
               +G  + V  S        ++I+GWG ENG  YW   +++   +G+ G  +ILRG +
Sbjct: 257 AVYKSGIYHHV--SGRYAGKHAIRIIGWGVENGVKYWLTANSWNVGWGENGYFRILRGTD 314

Query: 249 EAIIESLVNGALPK 262
           E  IES+V   +P+
Sbjct: 315 ECRIESIVVAGMPR 328


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 82/182 (45%), Gaps = 24/182 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   ++W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLG-----LYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +T CT+       ++    Y+++G       LYF+   GPF   FW      Y+  L   
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEDDYKRELYFN---GPFVVVFW-----VYSDFLAYK 262

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
            G    VS   + +    V+IVGWG+ NG PYW I +++   +G  G +  LRG NE  I
Sbjct: 263 TGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGI 320

Query: 253 ES 254
           E+
Sbjct: 321 EA 322


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 91/203 (44%), Gaps = 34/203 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H +   S P C      + +C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEH-HVNGSRPPCTGEGDTR-RC 207

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
           +  C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKKIMAEIYKNGPVEGA 252

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +  ++ + Y +               ++I+ WG ENG PYW   +++   +GD G  KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAANSWNLDWGDNGFFKI 312

Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
           LRG N   IES +   +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 19/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +    G+V+GG + S  GCQP S  PC H +   S P C+        C
Sbjct: 150 CFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEH-HIPGSRPPCRGEGH-TADC 207

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C          + +Y    +  +  +  +      + GP   AF+          L 
Sbjct: 208 RKQCEKGYSIPYDKDLHYAEFVYSTERDVKEIQTEILKN-GPVEAAFF------VYEDLL 260

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                VY   A A +  +A +KI+GWG ENG PYW I +++   +G+ G  KILRG +E 
Sbjct: 261 TYKEGVYKHVAGAPVGGHA-IKILGWGVENGTPYWLIANSWNTDWGNNGFFKILRGSDEC 319

Query: 251 IIESLVNGALPK 262
            IE  V+  LP+
Sbjct: 320 GIEIDVSAGLPR 331


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 90/186 (48%), Gaps = 15/186 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G+ S+ W +  + G+ +GGA+ S+ GCQ   F  C          +   L   QP  
Sbjct: 204 CDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLICLRQCQPGY 263

Query: 140 HTRCTND-NYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T    D ++GR  +   +D+ +I    LY   +FGP   +F     T YT    Q    
Sbjct: 264 NTTYLEDKHFGRVAYSVPRDEDRI----LYELFYFGPVQASF-----TVYT-DFIQYKSG 313

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +    +  + +VKIVGWG ENG  +W   +++G ++G+ G  KI+RG +   +ES 
Sbjct: 314 VYRHTYGVRVGDH-SVKIVGWGVENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESN 372

Query: 256 VNGALP 261
           V   LP
Sbjct: 373 VVAGLP 378


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 73/261 (27%), Positives = 107/261 (40%), Gaps = 40/261 (15%)

Query: 5   TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
           T S IRD S   + +     A++ +EA +    T       R S  H+   S  F+ G+ 
Sbjct: 113 TISEIRDQSNCGSCW-----AIAAVEAMSDRYCTVAGITDLRVSTGHL--LSCCFVCGMG 165

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
                          C  GI +  W W    GL       ++  CQP  FPPC H     
Sbjct: 166 ---------------CQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGG 203

Query: 125 SEPECKTLATPQPKCHTRCTNDNYG--RGFFQDKYQINGLGLYFDP--HFGPFWPAFWRS 180
             P C +     P C++ C + +    +   +  Y + G   Y      +GPF  AF   
Sbjct: 204 KYPACPSTIYDTPTCNSTCADSHTALTKHKGEKSYSLRGEREYMIELMTYGPFEVAF--- 260

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
               Y   +   +G VY+ +    +  +A VK+VGWG +NG PYW I +++   +GD G 
Sbjct: 261 --DVYADFVSYKSG-VYSHTTGERLGGHA-VKLVGWGVQNGTPYWKIANSWNSDWGDNGY 316

Query: 241 IKILRGRNEAIIESLVNGALP 261
             I RG +E  IES     LP
Sbjct: 317 FLIRRGTDECGIESTGVAGLP 337


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 84/215 (39%), Gaps = 36/215 (16%)

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
           C  L+S     W C          W    GL TGG +    GC+P S  PC+  + N TT
Sbjct: 156 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTT 215

Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
           S P C    TP   C   CT N  +   + QDK            HFG       +    
Sbjct: 216 SVP-CPGYHTP--TCEEHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 260

Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
             T     TNG V A                V  + +       KI+GWG ++G PYW  
Sbjct: 261 IQTE--IMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDSGVPYWLC 318

Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  +G  FG+ G ++ LRG NE  IE  V  ALP 
Sbjct: 319 VHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALPD 353


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 84/215 (39%), Gaps = 36/215 (16%)

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
           C  L+S     W C          W    GL TGG +    GC+P +  PC+  + N TT
Sbjct: 156 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTT 215

Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
           S P C    TP   C  RCT N  +   + QDK            HFG       +    
Sbjct: 216 SVP-CPGYHTP--VCEERCTSNITWPISYKQDK------------HFGKAHYNVGKKMTD 260

Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
             T      NG V A                V  + +       KI+GWG +NG PYW  
Sbjct: 261 IQTE--IMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 318

Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  +G  FG+ G ++ILRG NE  IE  V  A P 
Sbjct: 319 VHQWGTDFGENGFVRILRGVNEVNIEHQVLAAQPD 353


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 93/192 (48%), Gaps = 18/192 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP--QP 137
           C+ G   + W++  ++G+V+GG + S  GC+P    PC H +   + P C   +TP  Q 
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPSCQH 213

Query: 138 KCHTR-----CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           KC          + N+G   +  +  +  +      + GP   AF     T Y   +   
Sbjct: 214 KCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTN-GPVEGAF-----TVYEDLILYK 267

Query: 193 NGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +G VY      E+  +A ++I+GWG   E+  PYW I +++   +GD G  +ILRG++  
Sbjct: 268 SG-VYQHEHGKELGGHA-IRILGWGVWGESKVPYWLIGNSWNTDWGDNGFFRILRGQDHC 325

Query: 251 IIESLVNGALPK 262
            IES ++  LPK
Sbjct: 326 GIESSISAGLPK 337


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 85/200 (42%), Gaps = 34/200 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L  T   
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            C  +C N NY   +  D ++ +             +   W +   K  +    T+G V 
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTS-----------IVYMTSWTN--VKQIQQEIMTHGPVT 238

Query: 198 AV----------------SASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGT 240
           A                 S + E++ Y  VK++GWG + +G  YW  ++++   +G+ G 
Sbjct: 239 AFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGL 298

Query: 241 IKILRGRNEAIIESLVNGAL 260
            KILRG N   IE LV   +
Sbjct: 299 FKILRGYNFCSIELLVMAGI 318


>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
          Length = 188

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 75/185 (40%), Gaps = 26/185 (14%)

Query: 95  RGLVTGGAHHSNT-------GCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTND 146
           RG++TG              GCQP + PPC   N       C T    + P C  +C N 
Sbjct: 11  RGIITGDMGLCQVEIITPTQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNP 70

Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
           NY   F  D Y+  G      P+         GP    F+        R L      VY 
Sbjct: 71  NYYTSFRTDIYK--GKYYKLSPYMAMKDIFDNGPITTQFYMY------RDLVDYKSGVYQ 122

Query: 199 VSASAEIVAYA--TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
               ++   +   +VKI GWGEENG PYW + ++FG  +G  GT KI RG +    +  +
Sbjct: 123 YDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKM 182

Query: 257 NGALP 261
              LP
Sbjct: 183 YAGLP 187


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 24/181 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    GL       +++ CQP  FP C+H      +P C       PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKC 210

Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +T CT+       ++    Y+++G       LYF+   GPF  AF      +     F  
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEEDYKRELYFN---GPFVVAF------QVYSDFFAY 261

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
              VY    S +++    V+IVGWG+ NG PYW I +++   +G  G   ILRG++E  I
Sbjct: 262 KTGVYR-HVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDECGI 320

Query: 253 E 253
           E
Sbjct: 321 E 321


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 86/195 (44%), Gaps = 36/195 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH----SNTGCQPVSFPPCNHANYTTSEPECKTLATP 135
           C  G   + W +  + GLVTGG ++     +  CQP   P C H +   S+P C +    
Sbjct: 147 CEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYPLPSCEH-HINGSKPACPSKIAK 205

Query: 136 QPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
            P+C   C +  Y   + QD             H+G    +  R      T     TNG 
Sbjct: 206 TPECVHTC-HAGYPTSYEQDL------------HYGESAYSVRRRVAEIQTE--IMTNGP 250

Query: 196 VYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
           V A  +  A+  AY +               VK++GWGEE+G PYW I +++   +GD G
Sbjct: 251 VEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIPYWLIANSWNSDWGDHG 310

Query: 240 TIKILRGRNEAIIES 254
             KI+RG++E  IES
Sbjct: 311 YFKIVRGQDECGIES 325


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 86/194 (44%), Gaps = 25/194 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W      GLVTGG + S  GC+P   PPC +     +         P  K 
Sbjct: 13  CHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN----TCAGKPMEKN 68

Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
           H RCT   YG           + +D Y +    +  D   +GP   +F  +  F      
Sbjct: 69  H-RCTRICYGDQELDFDEDHRYTRDYYYLTYGSIQKDVMTYGPIEASFDVYSDF------ 121

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           P +++   +Y  + +A  +    VK++GWGE+ G PYW +V+++ E +GD G  KI RG 
Sbjct: 122 PSYKSG--IYERTENATYLGGHAVKLIGWGEQYGIPYWLMVNSWNEDWGDNGLFKIRRGT 179

Query: 248 NEAIIESLVNGALP 261
           NE  +++     +P
Sbjct: 180 NECGVDNSTTAGVP 193


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 86/183 (46%), Gaps = 30/183 (16%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           WV K G V+GG H+SN GCQP S   C H +     P C+    P+  C   C ++ YG+
Sbjct: 157 WVTK-GFVSGGRHNSNEGCQPYSVEECEH-HIEGPRPPCEG-DMPELVCSETC-HEEYGK 212

Query: 151 GFFQD-KYQINGLGLYFDPHF-----------GPFWPAF--WRSFCTKYTRPLFQTNGRV 196
            + +D +Y   GL  Y  P             GP   AF  +  F + Y   ++Q     
Sbjct: 213 TYEEDLEY---GLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLS-YKSGVYQ----- 263

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
                +  +  Y  V+++GWGEE G PYW + +++   +GD G  KILRG +E   E  +
Sbjct: 264 ---HETGLLDGYHAVRVIGWGEEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFEGDM 320

Query: 257 NGA 259
             A
Sbjct: 321 AAA 323


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 20/181 (11%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216

Query: 149 GRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           G+   Q++Y+      IN +         +GP   +F           L      +Y  +
Sbjct: 217 GKTTVQNRYKTKSEYVINSIKTIERDIMTYGPVEASF------DVYDDLSAYKSGIYRKT 270

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
             A+     ++KI+GWG++NG PYW  V+++ + +G+ GT KI++GRNE  IE  V   +
Sbjct: 271 PKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGI 330

Query: 261 P 261
           P
Sbjct: 331 P 331


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG   S  GC+P    PC H +   + P C + +T  P+C
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEH-HVNGTRPPCSSGST--PRC 213

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C + +Y   + +DK      Y I    L         GP   AF     T Y   + 
Sbjct: 214 QHVCES-SYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   +   PYW I +++   +GD G  +I+RG++
Sbjct: 268 YKSG-VYEHVHGKELGGHA-IRILGWGVWGDEKIPYWLIANSWNTDWGDNGFFRIVRGKD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 85/196 (43%), Gaps = 32/196 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++++  G+ T G+  +  GC P +FP C H    +    C       P C
Sbjct: 159 CTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDTPPC 218

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             RC N NYG    +D+        +F  HF P+     +   T   +    TNG   A 
Sbjct: 219 LDRCPNKNYGTPLDKDR--------HFTAHFSPY-----QLKGTDNIKKEIMTNGPTSAA 265

Query: 200 -SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            S   + ++Y +               V+I+GWG + G  YW +++++ E +G  GT KI
Sbjct: 266 FSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGWGTKQGVDYWLVMNSWNEGWGVHGTFKI 325

Query: 244 LRGR---NEAIIESLV 256
            +G    N+  IE  +
Sbjct: 326 AQGDCGINDMAIERFM 341


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 91/200 (45%), Gaps = 35/200 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W +    G+VTGG + ++ GCQP  FPPC H +     P C T   P PKC
Sbjct: 94  CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEH-HTKGPLPNC-TDTKPTPKC 151

Query: 140 HTRCTNDNYGRGFFQDKYQINGL-GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
              C    Y + + +DKY    +  L+ D               T+    +++ NG V A
Sbjct: 152 LQVCRK-GYEKSYSEDKYFAKTVYSLHSDE--------------TQIKTEIYK-NGPVEA 195

Query: 199 -VSASAEIVAY--------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
             S   + +AY              A  + +GW  +  R  W + +++ + +GDKG  KI
Sbjct: 196 DFSVYTDFLAYKSGVYQRHSYELWEARHQNLGWALKR-RSVWLVANSWNQDWGDKGYFKI 254

Query: 244 LRGRNEAIIESLVNGALPKD 263
            RG NE  IE+ +N  +PK+
Sbjct: 255 RRGNNECGIENDINAGIPKE 274


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 92/194 (47%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H    T  P     AT  PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGAT--PKC 213

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C          + ++G   +  +  +  +      + GP   AF     T Y   + 
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTN-GPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   +   PYW I +++   +GD+G  +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 86/185 (46%), Gaps = 15/185 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G+ S+ W +  + G+ +GGA  S+ GCQ   F  C  +  +   P C     P    
Sbjct: 200 CDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDVCKKSGDSNDTPRCLRFCQPGYNV 259

Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
            T   + +YGR  +   +D+ +I    +Y   +FGP    F     T YT    Q    V
Sbjct: 260 -TYPEDKHYGRVAYTVPKDEERI----MYEVFNFGPAQATF-----TMYT-DFVQYKSGV 308

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y  +    +  + +VK++GWG EN   YW   +++G Q+GD G  KI+RG +    E+ V
Sbjct: 309 YRHTFGVRVGTH-SVKVMGWGVENDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFETNV 367

Query: 257 NGALP 261
              LP
Sbjct: 368 VAGLP 372


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 88/193 (45%), Gaps = 20/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    +W +   +G+VTG  +++   C+P  FP C H   +   P+C +     PKC
Sbjct: 148 CDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPDCPSTDYSTPKC 207

Query: 140 HTRC----TNDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
              C    T + Y       Q  Y +              GP   AF     T Y+  P 
Sbjct: 208 TKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAF-----TVYSDFPT 262

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +++   VY  ++ + +  +A + IVGWG E+G PYW + +++   +GD G  KILRG  +
Sbjct: 263 YRSG--VYKHTSGSVLGGHA-ISIVGWGTESGSPYWLVKNSWNPSWGDGGFFKILRG--D 317

Query: 250 AIIESLVNGALPK 262
             I + V G LPK
Sbjct: 318 CGINNDVVGGLPK 330


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 91/194 (46%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEH-HVNGTRPPCGH-GGGTPKC 213

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C          + ++G   +  K  +  +      + GP   AF     T Y   + 
Sbjct: 214 SHVCESGYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTN-GPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   E   PYW I +++   +GD G  +ILRG++
Sbjct: 268 YKDG-VYQHQHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 91/192 (47%), Gaps = 27/192 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     + W + RG+VTGG     +GC+P  F PCN    +   PE KT     P C
Sbjct: 106 CEGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPCN----SYKCPEEKT-----PTC 155

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK + ++   +  +           GP   AF     T Y   ++
Sbjct: 156 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMY 208

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +    VY  +A   +  +A +KI+GWG +NG PYW I +++G  +G+ G +K+ RG NE 
Sbjct: 209 KYKSGVYRHTAGRLLGGHA-IKIIGWGTQNGIPYWLIANSWGADWGENGFLKMRRGVNEC 267

Query: 251 IIESLVNGALPK 262
            IES V   +PK
Sbjct: 268 GIESAVVAGMPK 279


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 95/193 (49%), Gaps = 22/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++   +G+V+GG+++SN GC+P    PC H +   + P CK   TP   C
Sbjct: 159 CNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCEH-HVNGTRPPCKNGRTPS--C 215

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
             +C + +Y   + +DK +      +  +P          GP   AF     T Y   + 
Sbjct: 216 KHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGPVEGAF-----TVYEDLIL 269

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   ++  PYW I +++   +GD G  +I+RG +
Sbjct: 270 YKSG-VYKHVHGKELGGHA-IRILGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGED 327

Query: 249 EAIIESLVNGALP 261
              IES ++  LP
Sbjct: 328 HCGIESAISAGLP 340


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 90/189 (47%), Gaps = 12/189 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W      GLVTGG ++S  GC+P   PP N  N ++S+         +  C
Sbjct: 157 CHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHC 216

Query: 140 HTRCTND-NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
           +   + D N    + +D Y +    +  D   +GP   +F  +  F      P +++   
Sbjct: 217 YGNQSIDFNDDHRYTRDYYYLTYGSIQKDVLTYGPIEASFDVYDDF------PSYKSG-- 268

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  S +A  +    VK++GWGEE+G PYW +V+++  Q+GD G  KI RG NE  +++ 
Sbjct: 269 VYVKSDNASYLGGHAVKLIGWGEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNS 328

Query: 256 VNGALPKDN 264
               +P  N
Sbjct: 329 TTAGVPVTN 337


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 83/185 (44%), Gaps = 24/185 (12%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDN 147
           + W+ +  +VTGG +     C+P +F PC NH N     P C     P PKC   C    
Sbjct: 165 YRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGP-CPRGLWPTPKCRKACQR-K 222

Query: 148 YGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
           Y + + +DKY       Y+ P             GP   AF      K  +      G +
Sbjct: 223 YNKSYNEDKY--FATRSYYLPSNERSIREEIYKNGPVVAAF------KVYQDFSYYRGGI 274

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SL 255
           Y      +  A+A VK+VGWG ENG  YW I +++   +G+ G  +I RG NE  IE  +
Sbjct: 275 YVHKWGGQTGAHA-VKVVGWGRENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQM 333

Query: 256 VNGAL 260
           V+G +
Sbjct: 334 VSGVM 338


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 105/242 (43%), Gaps = 31/242 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA+ L+  +C +S    ++HV  T      G   +C +          C+ 
Sbjct: 26  NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC  +
Sbjct: 75  GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133

Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
           C      +    R   +D Y+               GP   AF     T Y    +   G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAF-----TVYEDFSYYKKG 188

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            +Y  +A      +A +KI+GWG+E G PYW I +++   +G+ G  +IL G N   IE 
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILCGSNHCGIEE 246

Query: 255 LV 256
            V
Sbjct: 247 NV 248


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 80/192 (41%), Gaps = 39/192 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PAC 205

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---------- 189
              C    Y   + +DK            HFG    A  RS     T  +          
Sbjct: 206 SLSC-QSGYSTAYAKDK------------HFGASAYAVARSVAAIQTEIMTNGPVEAAFT 252

Query: 190 -----FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
                ++    VY  +A   +  +A +KI+GWG E+G PYW + +++G  +G+ G  KIL
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTNWGESGFFKIL 311

Query: 245 RGRNEAIIESLV 256
           RG ++  IE  V
Sbjct: 312 RGDDQCGIEGAV 323


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 33/199 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W+W H  G+ TGG + S   C    FP C+H +     P C     P P+C
Sbjct: 138 CNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDH-HVEGKYPPCGE-TQPTPEC 195

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
             +C  + Y   + +DK            HF  F  A+      +  +    TNG +   
Sbjct: 196 VEKC-QEGYPVEYKKDK------------HF--FGEAYHVPSNVEAIKTELMTNGPIEVD 240

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            S   + + Y +               VK+VGWG E+G  YW I +++ E +G+ G  +I
Sbjct: 241 FSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGVEYWKIANSWNEDWGENGYFRI 300

Query: 244 LRGRNEAIIESLVNGALPK 262
           + G+NE  IES     +P+
Sbjct: 301 IAGKNECGIESDGVAGIPE 319


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 89/204 (43%), Gaps = 40/204 (19%)

Query: 79  VCSSGISSSTWAWVHK---------RGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEP 127
           +  SGI +S   WV            GLV+GG+ +++N GCQP   PP CN         
Sbjct: 146 ISCSGIKASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN--------- 196

Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGP---------FWPAFW 178
                  P       C +  YG      KY  + + + +  H  P         + P   
Sbjct: 197 ------LPTKINKRTCVDYCYGNDTI--KYNHDHVKVRYYYHVKPKDIQKEVQTYGPV-- 246

Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
            +        +F     VY ++ +A+ V    VK++GWG ENG  YW +V+++G ++G  
Sbjct: 247 -TAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLVNSWGNEWGQN 305

Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
           G +KI RG+    +ES V  A+PK
Sbjct: 306 GLLKIKRGKYGCAVESFVYAAVPK 329


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 89/212 (41%), Gaps = 25/212 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S     +    G VTGG +  + GC P SF PC      ++ P CKT      K 
Sbjct: 100 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 158

Query: 140 HTRCTNDNYGRGFFQ--DKYQ--INGLGLYFDP-------------HFGPFWPAFWRSFC 182
                + +YG   +   +++Q  +N    Y                H+GP   ++     
Sbjct: 159 EEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASY----- 213

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
            K     +     VY  + S ++V    VKI+GWG ENG  YW I +++G  FG+KG  K
Sbjct: 214 -KVYEDFYHYKSGVYHYT-SGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFK 271

Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEESG 274
           I RG NE  IE  V   + K     E  E+ G
Sbjct: 272 IRRGTNECQIEGNVVAGIAKLGTHSETYEDDG 303


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 91/194 (46%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H +   + P C    +  PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAN-GSGTPKC 213

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C          + ++G   +  K  +  +      + GP   AF     T Y   + 
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTN-GPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG       PYW I +++   +GD G  +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 86/181 (47%), Gaps = 20/181 (11%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216

Query: 149 GRGFFQDKYQ------INGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           G+   Q++Y+      +N +         +GP   +F       Y       +G +Y  +
Sbjct: 217 GKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASF-----DVYDDFSVYKSG-IYRKT 270

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
             A+     ++KI+GWG++NG PYW  V+++ + +G+ GT KI++GRNE  IE  V   +
Sbjct: 271 PKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGI 330

Query: 261 P 261
           P
Sbjct: 331 P 331


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 74/165 (44%), Gaps = 18/165 (10%)

Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK-------- 156
           ++TGCQP  FP C H       P C T     P+C   C    Y   F QDK        
Sbjct: 184 NHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSN 241

Query: 157 YQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
            Q N      D   +GP   AF       Y   L   +G    V+ S  IV    ++I+G
Sbjct: 242 VQNNEKVFQRDIMMYGPVEAAF-----DVYEDFLNSKSGISRHVTGS--IVGGHPIRIIG 294

Query: 216 WGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           WG E G PYW I +++ E +G+ G  +++RGR+E  IES V   L
Sbjct: 295 WGVEKGNPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 11/180 (6%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 147
           W +    G+VTGG +  +  C P  FPPC H     SE P C       P+C + C    
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223

Query: 148 YGRGFFQDKYQIN-GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAVSAS 202
           Y   +  DK + +    LY            W     + T  ++       G VY    +
Sbjct: 224 YATKYEDDKIRASTSYNLYRS--VTAIQKEIWMRGPVEATMNVYTDFANYAGGVYK-HTT 280

Query: 203 AEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
            E++    ++++GWG EE+G PYW   +++   +G+KG  +ILRG +   IES V+  LP
Sbjct: 281 GELLGGHAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 89/204 (43%), Gaps = 40/204 (19%)

Query: 79  VCSSGISSSTWAWVHK---------RGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEP 127
           +  SGI +S   WV            GLV+GG+ +++N GCQP   PP CN         
Sbjct: 91  ISCSGIKASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN--------- 141

Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGP---------FWPAFW 178
                  P       C +  YG      KY  + + + +  H  P         + P   
Sbjct: 142 ------LPTKINKRTCVDYCYGNDTI--KYNHDHVKVRYYYHVKPKDIQKEVQTYGPV-- 191

Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
            +        +F     VY ++ +A+ V    VK++GWG ENG  YW +V+++G ++G  
Sbjct: 192 -TAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLVNSWGNEWGQN 250

Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
           G +KI RG+    +ES V  A+PK
Sbjct: 251 GLLKIKRGKYGCAVESFVYAAVPK 274


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 11/180 (6%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 147
           W +    G+VTGG +  +  C P  FPPC H     SE P C       P+C + C    
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223

Query: 148 YGRGFFQDKYQIN-GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAVSAS 202
           Y   +  DK + +    LY            W     + T  ++       G VY    +
Sbjct: 224 YATKYEDDKIRASTSYNLYRS--VTTIQKEIWMRGPVEATMNVYTDFANYAGGVYK-HTT 280

Query: 203 AEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
            E++    ++++GWG EE+G PYW   +++   +G+KG  +ILRG +   IES V+  LP
Sbjct: 281 GELLGGHAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 105/247 (42%), Gaps = 32/247 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  S       +F F A     C W        + C+ G   
Sbjct: 115 SCGSCWAFGAVEAMSDRVCIHSN---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 166

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
           + W +   +G+V+GG + SN GC P    PC H    T  P CK      P C  +C   
Sbjct: 167 AAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEG 224

Query: 144 ------TNDNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
                  + ++G+  +  +  ++ +   +Y +   GP   AF     T Y   +    G 
Sbjct: 225 YKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG- 275

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY   A   +  +A ++I+GWG +NG  PYW + +++   +G  G  KILRG +E  IE 
Sbjct: 276 VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEG 334

Query: 255 LVNGALP 261
            +N  LP
Sbjct: 335 QINAGLP 341


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 80/171 (46%), Gaps = 7/171 (4%)

Query: 96  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
           G+VTGG++   +GCQP   P C++ +  +   +C       P+C   C  D Y + +  D
Sbjct: 172 GIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQCTNEC-QDGYNKTYDDD 229

Query: 156 KYQ----INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
           K+      N  G   D            +  +  T  L   +G VY  +  +  + + T+
Sbjct: 230 KFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSG-VYLPTPRSRNLGWITL 288

Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +I+GWG E   PYW   +++ E++GD G +KI RG     IES V   +PK
Sbjct: 289 RIIGWGYEGKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPK 339


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 90/194 (46%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 213

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C +  Y   + +DK      Y +              GP   AF     T Y   + 
Sbjct: 214 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   E   PYW I +++   +GD G  +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 83/199 (41%), Gaps = 31/199 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W++    G+VTG  + S +GC+P  +PPC H        +C     P   C
Sbjct: 164 CDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTC 223

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV-YA 198
             +C          QD Y I+      D H+G    A  +   +   +    TNG V  A
Sbjct: 224 EYKC----------QDGYSIS---YNSDKHYGASVYAVAQDVAS--IQKEIMTNGPVEVA 268

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
                +   Y++               VK++GWG ENG  YW   +++   +G+ G  +I
Sbjct: 269 FDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICANSWNSDWGENGFFRI 328

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG +E  IES V    PK
Sbjct: 329 LRGVDECQIESSVVAGEPK 347


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 103/244 (42%), Gaps = 42/244 (17%)

Query: 24  YALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
           YA + + A  +  AT   +    S++  + C      +G+K+R    V+R +        
Sbjct: 117 YAATGVFADRMCIATNGNYNQLLSTEELISC------SGIKEREDGYVNRVLV------- 163

Query: 84  ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC---- 139
                W +    GLV+GG +++N GCQP   P   ++     +  C      +       
Sbjct: 164 -----WEYFKTHGLVSGGKYNTNEGCQPSKVPTVYNSQTKIYKRTCVEYCYGKDTINYNH 218

Query: 140 -HTRCTNDNYGR-GFFQDKYQING-LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
            H + +N  + R    Q + Q  G + ++FD H                   LF     V
Sbjct: 219 DHVKVSNHYFIRIKDIQKEVQTYGPVSVFFDLH-----------------DDLFLYKSGV 261

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           YA +  ++   Y   K++GWG ENG  YW +V+++G ++G  G  KI RG +E  +ES V
Sbjct: 262 YAKTEKSKDKRYHHAKLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHV 321

Query: 257 NGAL 260
              L
Sbjct: 322 YAGL 325


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 90/194 (46%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H    T  P      T  PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGGT--PKC 213

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
              C          + ++G   +  K  +  +      + GP   AF     T Y   + 
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTN-GPVEGAF-----TVYEDLIL 267

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   +   PYW I +++   +GD G  +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 326 HCGIESSISAGLPK 339


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 81/193 (41%), Gaps = 31/193 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +T CT           ND+Y     +D ++     LYF+   GPF  AF       ++  
Sbjct: 211 NTTCTDKAIPLIEYRGNDSYVLLHGEDDFKRE---LYFN---GPFVVAF-----QVFSDF 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L    G    VS   + +    V+IVGWG+ NG PYW I +++   +G  G    LRG N
Sbjct: 260 LAYKTGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLFLRGNN 317

Query: 249 EAIIESLVNGALP 261
           E  IE      LP
Sbjct: 318 ECGIEFEGYAGLP 330


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 90/194 (46%), Gaps = 21/194 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W++  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 146 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 203

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C +  Y   + +DK      Y +              GP   AF     T Y   + 
Sbjct: 204 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAF-----TVYEDLIL 257

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
             +G VY      E+  +A ++I+GWG   E   PYW I +++   +GD G  +ILRG++
Sbjct: 258 YKDG-VYQHEHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQD 315

Query: 249 EAIIESLVNGALPK 262
              IES ++  LPK
Sbjct: 316 HCGIESSISAGLPK 329


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 112/282 (39%), Gaps = 58/282 (20%)

Query: 10  RDMSYGATV----------YNRRPYALSC-IEARAVATATPLAFAVCRSSKMHVECTSFR 58
           +D  YG  V          Y R P  LS   EA    + +P      RS +      SF+
Sbjct: 274 KDHIYGKDVGSHTDEVCIFYERVPLGLSFPKEATKEISGSPGE----RSQEWRQLIQSFK 329

Query: 59  FIAG--VKQRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPP 116
            +AG   + R A L     +I V  S I     A    RG +T G      GC P  FPP
Sbjct: 330 KLAGGRPRDRTALLSIDRSSIEVQPSRICGDYVA----RGNLTKG-----DGCWPYDFPP 380

Query: 117 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPA 176
           C H    T  P+C   +   P C  +C N  Y      D++ +    L   P+       
Sbjct: 381 CAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYM----LESSPY------- 429

Query: 177 FWRSFCTKYTRPLFQTNGRVYAVSASAE-IVAYAT---------------VKIVGWGEEN 220
               +     +   +T+G + A     E  +AY +               VKI+GWGEEN
Sbjct: 430 ---QYSVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEEN 486

Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           G  YW +V+++ E +GD+G  KI  G  E  I+  + G  PK
Sbjct: 487 GEAYWLVVNSWNEDWGDQGLFKIALGNCE--IDDDLLGGTPK 526


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 53/186 (28%), Positives = 81/186 (43%), Gaps = 18/186 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W +  + G+VTGG + +   C+P   PPC      T    C T     P C
Sbjct: 135 CDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDC 193

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
            T C          +  YG+  +     ++ +       +GP   AF     T Y    F
Sbjct: 194 KTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEI-MTYGPVVAAF-----TVYDD-FF 246

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
                +Y   + AE   +A V+I+GWG++ G PYW + +++   +G+ G  +ILRG +E 
Sbjct: 247 HYKTGIYKHVSGAEAGGHA-VRILGWGQQGGVPYWLVANSWNTDWGENGYFRILRGSDEC 305

Query: 251 IIESLV 256
            IE  V
Sbjct: 306 GIEDGV 311


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 79/184 (42%), Gaps = 10/184 (5%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 138
           C  G S     +    G VTGG  ++  GC P SF PC   +    + P CKT      K
Sbjct: 167 CQGGYSIEALRFWKSSGAVTGG-DYNGAGCMPYSFAPCKKDSCAQGTTPSCKTTCQSSYK 225

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
                 + ++G   ++    +  +      H GP   +F      K     ++    VY 
Sbjct: 226 TAEYTKDKHFGTTAYKITNSVAAIQTEI-YHNGPVEASF------KVYEDFYKYKSGVYQ 278

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
            + S ++V    VKI+GWG ENG  YW I +++G  FGD G  K+ RG NE  IE  V  
Sbjct: 279 YT-SGKLVGGHAVKIIGWGTENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEGNVVA 337

Query: 259 ALPK 262
              K
Sbjct: 338 GTAK 341


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 82/188 (43%), Gaps = 31/188 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 155 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 204

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C +  Y   + +DK+   G+  Y  P             GP   AF           
Sbjct: 205 SMSCQS-GYSTAYAKDKH--FGVSAYAVPKNAASIQAEIYANGPVEAAF------SVYED 255

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            ++    VY  +A   +  +A +KI+GWG E+G PYW + +++G  +G+ G  KI RG +
Sbjct: 256 FYKYKSGVYKHTAGKYLGGHA-IKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDD 314

Query: 249 EAIIESLV 256
           +  IES V
Sbjct: 315 QCGIESAV 322


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/198 (29%), Positives = 82/198 (41%), Gaps = 33/198 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S  W +  + G+V+GG   + TGC P  FP C+H   T     C       PKC
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCPRELYATPKC 214

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ--TNGRVY 197
             +C    Y +   +DK  I G   Y              +   + T  + +  TNG V 
Sbjct: 215 EKQC-QAGYSKTSEEDK--IKGKSSY--------------NVGDRETDIMMEIITNGPVS 257

Query: 198 AVSASAE--------IVAYATVK------IVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            +    E        I  Y +        I+GWG ENG  YW   +++ E +G+ G  +I
Sbjct: 258 TIYYIFEDFTVYKSGIYQYTSGSLMGGHGIIGWGVENGVKYWLAANSWNEGWGENGYFRI 317

Query: 244 LRGRNEAIIESLVNGALP 261
            RG NE  IES +N  LP
Sbjct: 318 RRGTNECGIESRINAGLP 335


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 86/192 (44%), Gaps = 20/192 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   + W +   +G+V+GG + S  GC P    PC H    T  P CK      P C
Sbjct: 158 CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 215

Query: 140 HTRCTNDNYGRGFFQDKYQ---INGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
             +C  D Y   + QD ++      LG   D         GP   AF     T Y   + 
Sbjct: 216 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF-----TVYEDFIA 269

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNE 249
              G VY   A   +  +A ++I+GWG +NG  PYW + +++   +G  G  KILRG +E
Sbjct: 270 YRAG-VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSWNSDWGSDGFFKILRGSDE 327

Query: 250 AIIESLVNGALP 261
             IE  +N  LP
Sbjct: 328 CGIEGQINAGLP 339


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 87/194 (44%), Gaps = 29/194 (14%)

Query: 80  CSSG-ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           C+ G +  +++ +    GLV+GGA++S  GC+P  F PC +       P         PK
Sbjct: 156 CNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEY-------PFNDCHVEISPK 208

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
           C   C  D   R + +DK  + G   Y  P             GP    F       Y  
Sbjct: 209 CTHHC-RDGVDRHYSKDK--LFGKVAYSVPRDERAIRYEIMTNGPVEAGF-----DVYED 260

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
            L   +G VY      +I  +A V+I+GWG + G PYW I +++G+ +GD G  K +RG 
Sbjct: 261 VLLYKSG-VYRHVYGEQIGKHA-VRIIGWGRDGGIPYWLIANSYGDDWGDHGYFKFVRGS 318

Query: 248 NEAIIESLVNGALP 261
           N   IES +   LP
Sbjct: 319 NHLGIESKIITGLP 332


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 102/245 (41%), Gaps = 29/245 (11%)

Query: 27  SCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           +C    AV+TA  ++  +C ++K    V  ++   +      C +          C  G 
Sbjct: 109 NCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGF---------GCDGGW 159

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC- 143
           S   W +    GLV+GG + S   C+P    PC H    T   EC   A+  P C  +C 
Sbjct: 160 SIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEAS-TPSCKKKCQ 218

Query: 144 --------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
                    +  YG   FQ    +  +      + GP       SF       L+++   
Sbjct: 219 PGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKN-GPVTA----SFAVYEDFSLYKSG-- 271

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y  +A  E+  Y  VK++GWG EN   YW I +++ + +G+ G  +I+RG N+  IE  
Sbjct: 272 IYRHTA-GELRGYHAVKMIGWGTENRTDYWLIANSWHDDWGENGYFRIIRGINDCGIEEN 330

Query: 256 VNGAL 260
           V   L
Sbjct: 331 VAAGL 335


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/175 (32%), Positives = 77/175 (44%), Gaps = 22/175 (12%)

Query: 99  TGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQ 158
           +GG   SN GC P    PC H +   + P C       PKC   C   +Y   + QDK  
Sbjct: 5   SGGPFGSNQGCHPYKIAPCEH-HVNGTRPACNGEEGKTPKCIKHC-QASYTVAYEQDKSY 62

Query: 159 INGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVA 207
             G   Y  PH            GP   AF     T Y   L Q    VY    + +++ 
Sbjct: 63  --GAKSYSVPHHVAQIQKEIMTNGPVEGAF-----TVY-EDLVQYKDGVYQ-HVTGKMLG 113

Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
              ++I+GWG EN  PYW I +++   +G+ G  KILRG +   IES ++  +PK
Sbjct: 114 GHAIRILGWGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 109/265 (41%), Gaps = 36/265 (13%)

Query: 10  RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
           RD+    T +  R  A +C    AV+TA  ++  +C +SK    V  ++   +   + +C
Sbjct: 94  RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152

Query: 68  AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
                       C  G     W +    G+V+GG + +   C+P    PC H    T   
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYG 203

Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
           EC+  A P P C  +C          +  YG+  +  K  +  +    L   P    F  
Sbjct: 204 ECRGTA-PTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASF-- 260

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
           A +  F   Y   +++          + E+  Y  VK++GWG EN   +W I +++   +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311

Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
           G+KG  +I+RG N+  IE  +   +
Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGI 336


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 89/193 (46%), Gaps = 23/193 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH---SNTGCQPVSFPPCNH--ANYTTSEPECKTLAT 134
           C+ G   S W W  K G+VTGG +    + T C+P  F PC H      +  P C     
Sbjct: 367 CNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEY 426

Query: 135 PQPKCHTRCTNDNYGRGFF-QDK------YQINGL-GLYFDP-HFGPFWPAFWRSFCTKY 185
           P P+C + C+  N+  G + +DK      Y + G+  +  D   +G    AF        
Sbjct: 427 PTPECLSECSETNFSGGSYGEDKKMAREAYSLAGIENIQRDMMKYGSVTAAF------SV 480

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
                  +G VY   + + +  +A VK++GWG  E +G  YW I +++   +G+ G  +I
Sbjct: 481 FSDFLTYSGGVYTHESGSFMGGHA-VKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRI 539

Query: 244 LRGRNEAIIESLV 256
           LRG NE  IE  +
Sbjct: 540 LRGVNECGIEGQI 552


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/187 (29%), Positives = 81/187 (43%), Gaps = 31/187 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++GCQP  FP C H     ++  C       PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 211 NATCTDKSIPLVKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFF-----VYTD- 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY  +   + +    V+IVGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 259 LFAYKSGVYR-NVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNN 317

Query: 249 EAIIESL 255
           E  IE L
Sbjct: 318 ECNIEHL 324


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 109/265 (41%), Gaps = 36/265 (13%)

Query: 10  RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
           RD+    T +  R  A +C    AV+TA  ++  +C +SK    V  ++   +   + +C
Sbjct: 94  RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152

Query: 68  AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
                       C  G     W +    G+V+GG + +   C+P    PC H    T   
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYG 203

Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
           EC+  A P P C  +C          +  YG+  +  K  +  +    L   P    F  
Sbjct: 204 ECRGTA-PTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASF-- 260

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
           A +  F   Y   +++          + E+  Y  VK++GWG EN   +W I +++   +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311

Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
           G+KG  +I+RG N+  IE  +   +
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAGI 336


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 21/191 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K GLVTGG  +S  GCQP  FPPC      T    C   +    KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206

Query: 140 HTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH------FGPFWPAFWRSFCTKYTRPLFQ 191
             +C   N    +  D+  +  +   L +D        +GP   +F              
Sbjct: 207 QKKCFG-NTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSF------DVYDDFIS 259

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY  S +A  +   +VK +GWG E    YW +++++   +GD G  KI RG NE  
Sbjct: 260 YKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQ 319

Query: 252 IESLVNGALPK 262
           +E      +P+
Sbjct: 320 VEDSSTAGMPE 330


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 81/192 (42%), Gaps = 36/192 (18%)

Query: 90  AWVH--KRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 147
           AW H  K G+V+GG++ S  GCQP   PPC H +       C T   P P C   C    
Sbjct: 128 AWDHWVKHGIVSGGSYGSKEGCQPYHLPPCEH-HRAGPRRNC-TKYGPTPSCARVC---- 181

Query: 148 YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE--- 204
                 Q  Y+I+      D HFG  W A       K  R     NG V A  A+ E   
Sbjct: 182 ------QPDYKIS---YEDDLHFGKQWYAL-APHNEKIIRTEIFHNGPVEATMAAYEDFY 231

Query: 205 -------------IVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
                         V    VKI+GWG  ++   PYW + ++F   +G+ G  KI RG NE
Sbjct: 232 TYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDWGEYGFFKIKRGVNE 291

Query: 250 AIIESLVNGALP 261
             IE+ +   +P
Sbjct: 292 CGIENKITAGIP 303


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 88/192 (45%), Gaps = 30/192 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C        + W +K+G+VTGG  +  +GC+P  F PC     T SE          P+C
Sbjct: 145 CKGASPLQAFRWWNKKGVVTGG-DYRGSGCKPYPFAPCTALPCTKSE---------TPRC 194

Query: 140 HTRCTNDNYGRGFFQDKY-----QINGL---GLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
              C    Y + + +DKY      I G+    +  +   GP   AF  +  F   Y   +
Sbjct: 195 SLNC-QPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEITNGPVEAAFIVYDDF-NHYRSGV 252

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           ++          + ++V    VKI+GWG +NG PYW + +++G  +G+ G  K+LRG +E
Sbjct: 253 YR--------HVAGKLVGGHAVKIIGWGIQNGAPYWLMANSWGPYWGENGFFKMLRGVDE 304

Query: 250 AIIESLVNGALP 261
             IES +    P
Sbjct: 305 CGIESTIVAGKP 316


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 51/172 (29%), Positives = 80/172 (46%), Gaps = 25/172 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 135
           C+ G     W      GLVTGG + S  GC+P   PPC    +  N  + +P       P
Sbjct: 97  CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQP-----MEP 151

Query: 136 QPKCHTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
             KC  +C  D     N    + +D Y +   G+  D  ++GP   +F  +  F      
Sbjct: 152 NHKCSKKCYGDEDIDFNKDHRYTRDDYYLTYRGIQKDVINYGPIEASFDVYDDF------ 205

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
           P +++   +Y  S +A  +   +VK++GWGEE G  YW +V+++   +GDKG
Sbjct: 206 PNYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKG 255


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 84/197 (42%), Gaps = 31/197 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  S+ W +    GLVTGG  +SN GC P     C+H      +P C  +  P P C
Sbjct: 286 CEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDI-QPTPAC 343

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF---WRSFCTK-YTRPLFQTNGR 195
              C N+                    D HFG    +     +S  T+ YT    + +  
Sbjct: 344 ANSCQNNATWSS---------------DKHFGASSYSVGTDQQSIMTEIYTNGPVEASYD 388

Query: 196 VYA--VSASAEIVAYAT--------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           VYA  VS  + +  + T        VKI+GWG +   PYW + +++   +G+ G   ILR
Sbjct: 389 VYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVANSWNNDWGNNGFFNILR 448

Query: 246 GRNEAIIESLVNGALPK 262
           G +E  IE  +   +PK
Sbjct: 449 GSDECGIEDGIVAGIPK 465


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 108/265 (40%), Gaps = 36/265 (13%)

Query: 10  RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
           RD+    T +  R  A +C    AV+TA  ++  +C +SK    V  ++   +   + +C
Sbjct: 94  RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152

Query: 68  AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
                       C  G     W +    G+V+GG + +   C+P    PC H    T   
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYG 203

Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
           EC+  A P P C   C          +  YG+  +  K  +  +    L   P    F  
Sbjct: 204 ECRGTA-PTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASF-- 260

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
           A +  F   Y   +++          + E+  Y  VK++GWG EN   +W I +++   +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311

Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
           G+KG  +I+RG N+  IE  +   +
Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGI 336


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 107/241 (44%), Gaps = 30/241 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           +C    AV+TA+ L+  +C +SK    V  +S  F++     C +          C  G 
Sbjct: 118 NCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCC-DSCGFG---------CEGGW 167

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
               + +   +G+VTGG + S TGC+P  F PC H    T   EC    +  P+C  +C 
Sbjct: 168 PIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEES-TPECVKQCQ 226

Query: 145 ---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
                    +  +G  +++ +  +  +        GP   +F     T Y    +   G 
Sbjct: 227 KGYKNSYRRDKTWGEDYYEVENSVKAIQREI-MRSGPVVSSF-----TVYDDFSYYVKG- 279

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y  +A     ++A +KI+GWG E   PYW I +++   +G+KG  +++RG N   IE  
Sbjct: 280 IYKHTAGKARGSHA-IKIIGWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEED 338

Query: 256 V 256
           V
Sbjct: 339 V 339


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 70/162 (43%), Gaps = 16/162 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   S W +  K G+VTG  H +N GC+P  FP C H +  T    CK    P PKC
Sbjct: 43  CNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKC 102

Query: 140 HTRCTNDNYGRGFFQDKY---QINGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
              C      R + +DKY      G+  + +        +GP   AF      +      
Sbjct: 103 EKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIITYGPVEVAF------EVYEDFL 156

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFG 232
              G +Y     A    +A VK++GWG +NG PYW  + T G
Sbjct: 157 NYAGGIYVHQGGALGGGHA-VKMIGWGIDNGVPYWXHLPTHG 197


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 81/192 (42%), Gaps = 39/192 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 192 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 241

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---------- 189
              C +  Y   + +DK            HFG    A  R   +  T  +          
Sbjct: 242 SLSCQS-GYTTAYAKDK------------HFGTSAYAVARKVASIQTEIMTNGPVEAAFT 288

Query: 190 -----FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
                ++    VY  +A   +  +A +KI+GWG E+G PYW + +++G  +G+ G  +I 
Sbjct: 289 VYEDFYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGNSWGESGFFRIF 347

Query: 245 RGRNEAIIESLV 256
           RG ++  IES V
Sbjct: 348 RGDDQCGIESAV 359


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 85/196 (43%), Gaps = 31/196 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C      S+ A +  R LV      +  GCQP S PPC         P C T   P PKC
Sbjct: 156 CDGRCHCSSVAILQGRRLVPEPVR-TEDGCQPYSLPPC--------VPNC-THPEPTPKC 205

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAF--WRSFCTKYTRP 188
              C    Y + + +DK+    +                 GP   AF  +  F   Y   
Sbjct: 206 QHVCRK-GYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADF-PSYKSG 263

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           ++Q +          + +    +KI+GWG E+G PYW + +++   +GDKG  KILRG++
Sbjct: 264 VYQQH--------MIKFMGVHAIKILGWGTEDGVPYWLVANSWNVGWGDKGYFKILRGKD 315

Query: 249 EAIIESLVNGALPKDN 264
           E  IE +++  +P ++
Sbjct: 316 ECGIEEVIDAGIPMED 331


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 82/194 (42%), Gaps = 31/194 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 212 NATCTDKSVPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY  +   + +    VK+VGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 260 LFAYKSGVYR-NVDGDFLGGTAVKVVGWGKLNGTPYWKVANSWDTDWGMDGYLLILRGNN 318

Query: 249 EAIIESLVNGALPK 262
           E  IE L     P+
Sbjct: 319 ECNIEHLGFAGTPE 332


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 80/172 (46%), Gaps = 19/172 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK Y  N   +              GP   AF     + Y+  L 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             +G    V  + E++    ++I+GWG ENG PYW + +++   +GD G  K
Sbjct: 262 YKSGVYQHV--TGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 311


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 78/182 (42%), Gaps = 29/182 (15%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           WV   GLV+GGA++S  GC+P  F PC +       P         PKC   C +    +
Sbjct: 174 WVDA-GLVSGGAYNSTEGCKPYPFKPCLY-------PFTDCHREESPKCKHHCQH-GVDK 224

Query: 151 GFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
            + +DK  + G   Y  P             GP    F           +F     VY  
Sbjct: 225 RYARDK--VFGSVAYSVPRDERVIRYEIMTNGPVEGGF------DVYEDVFLYKSGVYR- 275

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
               E V    V+I+GWG E G PYW I +++GE +GD G  KI+RG N   IES V   
Sbjct: 276 HVYGEHVGKHAVRIIGWGREGGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITG 335

Query: 260 LP 261
           LP
Sbjct: 336 LP 337


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 21/191 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K GLVTGG  +S  GCQP  FPPC      T    C   +    KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206

Query: 140 HTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH------FGPFWPAFWRSFCTKYTRPLFQ 191
             +C   N    +  D+  +  +   L +D        +GP   +F              
Sbjct: 207 QKKCFG-NTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSF------DVYDDFIS 259

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
               VY  S +A  +   +VK +GWG E    YW +++++   +GD G  KI RG NE  
Sbjct: 260 YKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQ 319

Query: 252 IESLVNGALPK 262
           +E      +P+
Sbjct: 320 VEDSSTAGVPE 330


>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 280

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 81/208 (38%), Gaps = 27/208 (12%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           +C  + A++ A+ +   +C  S    E  +    A     C +L       + C  G   
Sbjct: 87  NCRSSYAISVASAVTDRICIHSN---ETKNPIMSAQQIISCCYLCG-----YGCDGGSQF 138

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTN 145
            +W +  + G V+GG ++SN GCQP   PPC   N  +    C T    + P C  +C N
Sbjct: 139 ESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSCTTYNREETPACEIKCNN 198

Query: 146 DNYGRGFFQDKYQINGLGLY--------FDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            NY   F  D Y+     +Y        FD   GP    F+        R L      VY
Sbjct: 199 PNYYSSFKTDIYKGKYYQVYPFMAMKEIFDN--GPITTQFYM------YRDLIDYKSGVY 250

Query: 198 AVSAS--AEIVAYATVKIVGWGEENGRP 223
                   +       KI+GWGEENG P
Sbjct: 251 QYDEGFYGDFFTVQGXKIIGWGEENGDP 278


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 83/177 (46%), Gaps = 22/177 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTGG + S  GCQP S  P      T  + +  T     P C
Sbjct: 45  CDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDT-----PDC 99

Query: 140 HTR-CTNDNYGRGFFQDKYQINGL--------GLYFDPH-FGPFWPAFWRSFCTKYTRPL 189
             R CTN NY +G+  D + ++ +         +  D +  GP   AF+      YT  +
Sbjct: 100 SIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFY-----VYTDFM 154

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           +  +G VY+ +   +I     +KI+GWG ++   YW   +++   +G+ G  +ILRG
Sbjct: 155 YYKSG-VYSYT-RGQIEGGHAIKILGWGVDDNTKYWLCANSWSRSWGENGLFRILRG 209


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 84/187 (44%), Gaps = 31/187 (16%)

Query: 96  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
           G VTGG +  + GC+P SF PC++   + + P C      Q KC +  T  NY +G   D
Sbjct: 156 GAVTGGDYKGD-GCKPYSFAPCSNCVESKTTPSC------QSKCQSTYTVTNY-KG---D 204

Query: 156 KYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRV-YAVSASAEIVAYAT 210
           K+     G   + H      + +R   +    P+ Q     NG V  A +   +   Y +
Sbjct: 205 KHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHYKS 264

Query: 211 ---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
                          VKI+GWG E G  YW + +++G  FGDKG  KI RG NE  IES 
Sbjct: 265 GVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESN 324

Query: 256 VNGALPK 262
           V   + K
Sbjct: 325 VVAGMAK 331


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 71/158 (44%), Gaps = 24/158 (15%)

Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFFQDKY 157
           CQP  FP C H     ++  C       PKC+  CT           N  Y     ++ Y
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPLVKYRGNATYLLLHGEEDY 239

Query: 158 QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG 217
           +     LYF+   GPF   F+      YT  LF     VY  +   +I+    V+IVGWG
Sbjct: 240 KRE---LYFN---GPFVAVFF-----VYTD-LFAYKSGVYR-NVDGDILGGQAVRIVGWG 286

Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           + NG PYW + +T+   +G  G + ILRG NE  IE L
Sbjct: 287 KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 90/228 (39%), Gaps = 59/228 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  GI+ + W+++   G+VTG      G+  +  GC P SFP C H    +    C  + 
Sbjct: 133 CQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVR 192

Query: 134 TP--------------------QPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGP 172
            P                     P C  RC N+ YG    +D+ +    L   F+     
Sbjct: 193 VPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEG---- 248

Query: 173 FWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE----------------IVAYATVKIVGW 216
                     T   +    TNG   A  ++ E                 +   +V+I+GW
Sbjct: 249 ----------TDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGW 298

Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
           G E G  YW +++++ E +GD GT KI +G  +  I+  V G+LP  N
Sbjct: 299 GTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 344


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 89/192 (46%), Gaps = 29/192 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     + W + RG+VTGG     +GC+P  F PC       S PE KT     P C
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTC 198

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK + ++   +  +           GP   AF     T Y   ++
Sbjct: 199 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMY 251

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +    VY  +A   +  +A +KI+GWG +NG PYW I +++G  +G+ G +K+ RG NE 
Sbjct: 252 KYKSGVYRHTAGRLLGGHA-IKIIGWGTQNGIPYWLIANSWGANWGENGFLKMRRGVNEC 310

Query: 251 IIESLVNGALPK 262
            IE  V   +P+
Sbjct: 311 GIERAVVAGMPR 322


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 79/171 (46%), Gaps = 7/171 (4%)

Query: 96  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
           G+VTGG++   +GCQP   P C++ +  +   +C       P+C   C  D Y + +  D
Sbjct: 111 GIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQCTNEC-QDGYNKTYDDD 168

Query: 156 KYQ----INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
           K+      N  G   D            +  +  T  L   +G VY  +  +  + + T+
Sbjct: 169 KFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSG-VYLPTPRSRNLGWITL 227

Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +I+GWG E   PYW   +++ E++G  G +KI RG     IES V   +PK
Sbjct: 228 RIIGWGYEGKIPYWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAPIPK 278


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 81/192 (42%), Gaps = 31/192 (16%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W++    G+VTG  + S +GC+P  +PPC H        +C     P   C  +C   
Sbjct: 56  AAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKC--- 112

Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV-YAVSASAEI 205
                  QD Y I+      D H+G    A  +   +   +    TNG V  A     + 
Sbjct: 113 -------QDGYSIS---YNSDKHYGASVYAVAQDVAS--IQKEIMTNGPVEVAFDVYEDF 160

Query: 206 VAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             Y++               VK++GWG ENG  YW   +++   +G+ G  +ILRG +E 
Sbjct: 161 EHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICANSWNSDWGENGFFRILRGVDEC 220

Query: 251 IIESLVNGALPK 262
            IES V    PK
Sbjct: 221 EIESGVVAGEPK 232


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 79/167 (47%), Gaps = 20/167 (11%)

Query: 91  WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 143
           ++   GLVTGG +       ++ GC P  FP CNH     S+ P C  +    P C T C
Sbjct: 231 FMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 289

Query: 144 TNDNYGRGFFQDKYQINGLGLYFDPHFGP-------FWPAFWRSFCTKYTRPLFQTNGRV 196
            N  YG    +D ++    G       GP       F      +  T Y    F  +G V
Sbjct: 290 PNKAYGTSMQKDTHRAKSWGRL---PIGPEKIKQEIFDNGPVAAMMTLYEDFRFYKSG-V 345

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
           Y V  + +++A  T+K++GWG E+G+ YW  V+ + E++GD G IK+
Sbjct: 346 Y-VHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKL 391


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY      + +    VK+VGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318

Query: 249 EAIIESLVNGALPK 262
           E  IE L     P+
Sbjct: 319 ECNIEHLGFAGTPE 332


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY      + +    VK+VGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318

Query: 249 EAIIESLVNGALPK 262
           E  IE L     P+
Sbjct: 319 ECNIEHLGFAGTPE 332


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY      + +    VK+VGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318

Query: 249 EAIIESLVNGALPK 262
           E  IE L     P+
Sbjct: 319 ECNIEHLGFAGTPE 332


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/187 (29%), Positives = 80/187 (42%), Gaps = 45/187 (24%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C    Y   + QDK            H+G                         Y+V
Sbjct: 208 SKIC-EPGYSPTYKQDK------------HYG----------------------YNSYSV 232

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           S S + +     K       NG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 233 SNSEKDIMAEIYK-------NGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 285

Query: 260 LPK-DNY 265
           +P+ D Y
Sbjct: 286 IPRTDQY 292


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 76/185 (41%), Gaps = 32/185 (17%)

Query: 88  TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 147
            W +    G+VTGG +     C+P  FPPC          EC   A   PKC   C    
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAK-TPKCQKTCQ--- 56

Query: 148 YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVA 207
             RG+ +   +        D HFG    A+      K  +     NG V A     E  A
Sbjct: 57  --RGYLKAYKE--------DKHFGK--SAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFA 104

Query: 208 Y----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           +                  VKI+GWG+E G PYW I +++ + +G+KG  +++RG N   
Sbjct: 105 HYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCR 164

Query: 252 IESLV 256
           IE +V
Sbjct: 165 IEEMV 169


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 97/226 (42%), Gaps = 31/226 (13%)

Query: 33  AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
           AV+ A  ++  +C  SK  V+   +    +A   + C            C+ G+    W 
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
           +V + G+VTGG +     C+P    PC NH     S P   +  TP   C   C    YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232

Query: 150 RGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           + + +DK  +  + +  +            GP   AF       Y    F T G +Y  +
Sbjct: 233 KRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFSFYTKG-IYVHT 286

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
              +  A+A VK+VGWG ENG  YW + +++   +G+ G  +ILRG
Sbjct: 287 RGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 97/226 (42%), Gaps = 31/226 (13%)

Query: 33  AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
           AV+ A  ++  +C  SK  V+   +    +A   + C            C+ G+    W 
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
           +V + G+VTGG +     C+P    PC NH     S P   +  TP   C   C    YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232

Query: 150 RGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           + + +DK  +  + +  +            GP   AF       Y    F T G +Y  +
Sbjct: 233 KRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFSFYTKG-IYVHT 286

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
              +  A+A VK+VGWG ENG  YW + +++   +G+ G  +ILRG
Sbjct: 287 RGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 77/169 (45%), Gaps = 21/169 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G   S W+WVH +G+ TGG + +      + GC P  FPPC H    T  P+C  ++
Sbjct: 47  CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPKVS 106

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
                 H    +  Y       K  I   G        P   +F     T Y   L   +
Sbjct: 107 CSGDDRHFMLESSPYHYSVNDAKNAIRTDG--------PVSASF-----TVYEDFLAYRS 153

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
           G VY  ++ + +  +A VKI+GWGE++G+ YW  V+++ E +GD G  +
Sbjct: 154 G-VYKHTSGSYLGGHA-VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFR 200


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 82/186 (44%), Gaps = 27/186 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       W   +G+VTGG +H   GC+P    PC     + S PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C +  Y   + +DK      Y +              GP   AF     T Y    +
Sbjct: 206 SLSCQS-GYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAF-----TVY-EDFY 258

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +    VY  +A   +  +A +KI+GWG E+G PYW + +++G  +G+ G  KI RG ++ 
Sbjct: 259 KYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQC 317

Query: 251 IIESLV 256
            IES V
Sbjct: 318 GIESAV 323


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 31/187 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------TSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210

Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
           +  CT           N  Y     ++ Y+     LYF+   GPF   F+      YT  
Sbjct: 211 NATCTDKSIPLVKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFF-----VYTD- 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           LF     VY  +   + +    V+IVGWG+ NG PYW + +++   +G  G + ILRG N
Sbjct: 259 LFAYKSGVYR-NVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNN 317

Query: 249 EAIIESL 255
           E  IE L
Sbjct: 318 ECNIEHL 324


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 80/177 (45%), Gaps = 20/177 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C+ G+    W +V + G+VTGG +     C+P    PC NH     S P   +  TP   
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTP--A 222

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPL 189
           C   C    YG+ + +DK  +  + +  +            GP   AF       Y    
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFS 276

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           F T G +Y  +   +  A+A VK+VGWG ENG  YW + +++   +G+ G  +ILRG
Sbjct: 277 FYTKG-IYVHTRGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGENGYFRILRG 331


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 54/182 (29%), Positives = 81/182 (44%), Gaps = 23/182 (12%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC----- 143
           W++  K+G+ +GG + SN GC P   PP          P+       +P C TRC     
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRCNAGYN 193

Query: 144 -TNDNYGRGFFQDKYQI--NGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
            T D   R F +  Y I  +   +  D    GP    F      ++   +   +G VY  
Sbjct: 194 VTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAVF------QWYEDIVNYSGGVYR- 246

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
             S  +     VK++GWG E+G  YW + +++G  +GD G  K++RG N   IE  V+  
Sbjct: 247 HQSGRLKGGHAVKLIGWGVEDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAG 306

Query: 260 LP 261
           LP
Sbjct: 307 LP 308


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 53/186 (28%), Positives = 80/186 (43%), Gaps = 14/186 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+SG+    + +  ++G+ +GG + +   C+P  F PC +  +      C     P P C
Sbjct: 164 CTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTC 223

Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQT 192
              C +D     N  R F      + G        F  GP    +     T Y    +  
Sbjct: 224 EKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVATY-----TVYEDFAYYK 278

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
           NG +Y         A+A VKI+GWGEENG  YW I +++   +G+ G  ++LRG N   I
Sbjct: 279 NG-IYMTGLGRATGAHA-VKIIGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDI 336

Query: 253 ESLVNG 258
           E    G
Sbjct: 337 ELSATG 342


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 96/245 (39%), Gaps = 47/245 (19%)

Query: 33  AVATATPLAFAVCRSSK----MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSST 88
           A ATA  LA  +C ++       +      F  G+K + +  V                 
Sbjct: 116 AYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVR------------GDDV 163

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W ++   GLV+GG +++N GCQP   PP    N  T              C  RC  +N 
Sbjct: 164 WEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYGNNT 213

Query: 149 GRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
              ++ D  +++    Y++             +GP    F      +     F     VY
Sbjct: 214 IH-YYHDHVKVSH---YYNIKSNEDIQKEVQTYGPVSVKF------RVYDDFFLYKSGVY 263

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
             +  +  V     K++GWG ENG  YW +V+++G ++G  G  KI RG NE  +E  V 
Sbjct: 264 VKTEKSLYVRRHFAKLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVY 323

Query: 258 GALPK 262
              P+
Sbjct: 324 AGEPE 328


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 78/183 (42%), Gaps = 13/183 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S   +    G+V+GG  +SN GC+P +    +          C+   +     
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPYTADAHDQGQTPACTKSCRNGYSTSYSA 210

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
                +++Y      D+ Q   +        GP    F      +  +  +     VY  
Sbjct: 211 DKHYGSNDYVVSSVIDQIQYEVMTN------GPIIVNF------EVFQDFYNYVSGVYR- 257

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
             S E V +  VKIVGWG ENG PYW I +++G  +GD G  K+LRG+NE  IE+     
Sbjct: 258 HVSGESVGFHVVKIVGWGVENGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAV 317

Query: 260 LPK 262
           +P+
Sbjct: 318 MPR 320


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 43/188 (22%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           WV K G+V+GG ++SN GCQP          Y  S      L +  PKC T+C N  Y  
Sbjct: 163 WVAK-GIVSGGDYNSNEGCQP----------YEGSA----FLNSVTPKCSTKCLNSKYTT 207

Query: 151 GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYAT 210
            + +DK            H+G  +         +    +      V  +    +  +Y +
Sbjct: 208 PYAKDK------------HYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKS 255

Query: 211 ---------------VKIVGWGEENGRPYWTIVSTFGEQFGD-KGTIKILRGRNEAIIES 254
                          VKI+GWG E G PYW I +++G ++ D  G  KILRG+N   IE+
Sbjct: 256 GVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIET 315

Query: 255 LVNGALPK 262
            + G  P+
Sbjct: 316 YIYGGTPQ 323


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 69/158 (43%), Gaps = 24/158 (15%)

Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFFQDKY 157
           CQP  FP C H     ++  C       PKC+  CT           N  Y     ++ Y
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPLVKYRGNATYLLLHGEEDY 239

Query: 158 QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG 217
           +     LYF+   GPF   F+      YT  LF     VY      + +    VK+VGWG
Sbjct: 240 KRE---LYFN---GPFVAVFY-----VYTD-LFAYKSGVYR-HVDGDFLGGTAVKVVGWG 286

Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           + NG PYW + +T+   +G  G + ILRG NE  IE L
Sbjct: 287 KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 90/188 (47%), Gaps = 20/188 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC-KTLATPQPK 138
           C+ G    T+ +    G+ TGG + SN  C+P   PPC++ + T + P+C K+  +  P 
Sbjct: 359 CNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSNCSETRT-PKCSKSCISTYPL 417

Query: 139 CHTRCTNDNYGRGFFQ----DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
                 + +YG  ++Q    +K  +  + LY     GP          + Y   L    G
Sbjct: 418 SLNE--DRHYGSTYYQFWLGEKSMMKDISLY-----GPIVAGM-----SVYEDFLHYKEG 465

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY   +   +  +A V+I+GWGE++  PYW + +++   FG+ G  KI RG +E  IES
Sbjct: 466 -VYTQESGIFLGGHA-VRIIGWGEQDNIPYWLVANSWNTTFGEDGLFKIRRGFDECGIES 523

Query: 255 LVNGALPK 262
            V+    K
Sbjct: 524 YVSAGRAK 531


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 86/199 (43%), Gaps = 30/199 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +    G+ TGG + +   C+P +F PC   +Y     +C   + P PKC
Sbjct: 159 CGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG----KCPKDSFPTPKC 214

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    Y + +  DKY  N    Y  P             GP   +F       Y   
Sbjct: 215 RKICQY-KYSKKYADDKYYANSA--YRIPQNETWIKLEIMRNGPVTASF-----RIYPDF 266

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD-KGTIKI 243
            F   G VY  S   E+  +A +KI+GWG E  NG   PYW I +++G  +G+  G  +I
Sbjct: 267 GFYEKG-VYVTSGGRELGGHA-IKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRI 324

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG+N   IE  V   + K
Sbjct: 325 LRGQNHCQIEQKVIAGMIK 343


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 81/186 (43%), Gaps = 27/186 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       W   +G+VTGG +H   GC+P    PC     + S PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + +DK      Y +              GP   AF     T Y    +
Sbjct: 206 SLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAF-----TVY-EDFY 258

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +    VY  +A   +  +A +KI+GWG E+G PYW + +++G  +G+ G  KI RG ++ 
Sbjct: 259 KYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQC 317

Query: 251 IIESLV 256
            IES V
Sbjct: 318 GIESAV 323


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 77/180 (42%), Gaps = 27/180 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G S   W W+H  G+VTGG + +      + GC P   PPC H   +T  P+C    
Sbjct: 207 CHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTNSTLYPKCPKTK 266

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------GPFWPAFWRSFCT 183
              P C   C N  Y     +D++ +    L                GP   ++      
Sbjct: 267 YDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALRSIDAIKKEIMTNGPVSASY-----L 321

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y   L   +G VY  ++   +  +A VKI+GWGE+    YW +V+++ + +GD G  KI
Sbjct: 322 VYDDFLTYKSG-VYKRTSHNALGGHA-VKIIGWGED----YWLVVNSWNKNWGDNGMFKI 375


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/200 (29%), Positives = 85/200 (42%), Gaps = 24/200 (12%)

Query: 67  CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
           C  L+S     W C          W    GL TGG +    GC+P +  PC+  + N TT
Sbjct: 5   CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTT 64

Query: 125 SEPECKTLATPQPKCHTRCTND-----------NYGRGFFQDKYQINGLGLYFDPHFGPF 173
           S P C    TP   C  RCT++           ++G+  +    ++  +      + GP 
Sbjct: 65  SVP-CPGYHTPV--CEERCTSNITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRN-GPV 120

Query: 174 WPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
             +F            +     +Y  +A  +     T KI+GWG +NG PYW  V  +G 
Sbjct: 121 IASF------IIYDDFWDYKSGIYVHTAGDQEGGMDT-KIIGWGVDNGVPYWLCVHQWGT 173

Query: 234 QFGDKGTIKILRGRNEAIIE 253
            FG+ G ++ILRG NE  IE
Sbjct: 174 DFGENGFMRILRGVNEVHIE 193


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 78/184 (42%), Gaps = 26/184 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+       +++ CQP  FP C H      +P C       P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------TSSQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQC 211

Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFW-RSFCTKYTRPLFQ 191
           +  CT+ +     ++    Y++ G       LYF+   GPF   F   S    Y   ++Q
Sbjct: 212 NATCTDKSVPLIKYRGNHSYEVRGEEDYKRELYFN---GPFVVRFQVHSDFLAYKSGVYQ 268

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
                     +   +    V+IVGWG+ NG PYW + +++   +G  G   ILRG NE  
Sbjct: 269 --------HVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECN 320

Query: 252 IESL 255
           IE L
Sbjct: 321 IEHL 324


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 73/164 (44%), Gaps = 9/164 (5%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +    G+VTGG+    +GC+   FP C+H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDH-HVQGHYPPCPRQIYPTPEC 213

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
              C  D    G+ +DK + N     +            R    +  T Y     Q   R
Sbjct: 214 VQDC--DTPELGYLEDKTRANISYNIYASEISIMKEIMLRGPVEAVFTVY-EDFLQYKSR 270

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
           VY  +  A +  +A ++I+GWGEE   PYW I +++ E +G+KG
Sbjct: 271 VYFHAWGAPMSGHA-IRILGWGEEGDVPYWLIANSWNEDWGEKG 313


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 53/193 (27%), Positives = 80/193 (41%), Gaps = 21/193 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W    G+ TGG + S   C   SFP C H       P  ++  TP+  C
Sbjct: 138 CDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGESQETPE--C 195

Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRPL 189
             +C           + FF + Y + G            GP   +F  +  F T Y   +
Sbjct: 196 VKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLT-YKSGI 254

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +Q          + + +    VK+VGWG E+G  YW I +++ E +G+ G  +I+ G+ E
Sbjct: 255 YQ--------HVAGKYLGGHAVKLVGWGVEDGIEYWKIANSWNEDWGENGYFRIVAGKGE 306

Query: 250 AIIESLVNGALPK 262
             IE    G +PK
Sbjct: 307 CGIEVGPIGGIPK 319


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 84/184 (45%), Gaps = 24/184 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTL 132
           C  G       ++   GLVTGG +       ++ GC P  FP CNH     S+ P C  +
Sbjct: 39  CMFGSVPEGLNFMKNHGLVTGGEYKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQV 98

Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLG-LYFDPH--------FGPFWPAFWRSFCT 183
               P C T C N  YG    +D ++    G L   P          GP       +  T
Sbjct: 99  RD-LPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPV-----AAMMT 152

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            Y    +  +G VY V  + +++A  T+K++GWG E+G+ YW  ++ + E++GD G IK+
Sbjct: 153 LYEDFRYYKSG-VY-VHKTGQLLAAHTLKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKL 210

Query: 244 LRGR 247
             G+
Sbjct: 211 AVGK 214


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 68/155 (43%), Gaps = 18/155 (11%)

Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG----- 163
           CQP  FP C H     ++  C       P+C+T CT+       ++ K     L      
Sbjct: 180 CQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPLIKYRGKDAYMLLPGEEEF 239

Query: 164 ---LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
              LYF+   GPF      +    YT  LF     VY  +     +    VK+VGWG+ N
Sbjct: 240 KRELYFN---GPF-----VAILFVYTD-LFAYKSGVYR-NVDGSYMGVTAVKVVGWGKLN 289

Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           G PYW + +T+   +G  G + ILRG NE  IE L
Sbjct: 290 GTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 84/192 (43%), Gaps = 28/192 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  +RG+ +GG ++S  GC P     C+ A+     P+C        KC
Sbjct: 158 CQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADEDADTPKCTR------KC 211

Query: 140 HT--RCTNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAF-----WRSFCTKYTRPL 189
            +    TN +  R F +  Y ++             GP   +F     ++++ T   R +
Sbjct: 212 QSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHV 271

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           F              +     VK++GWG ENG  YW   +++GE +G++G  KI+RG N 
Sbjct: 272 F------------GPMEGGHAVKMIGWGVENGTKYWLCSNSWGEDWGERGFFKIVRGENH 319

Query: 250 AIIESLVNGALP 261
             IES V+  LP
Sbjct: 320 CGIESDVHAGLP 331


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 69/179 (38%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C +     PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C         ++    Y + G           GP           +           
Sbjct: 219 NTTCERSEMDLVKYKGSTSYSVKGEKELMIELMTNGPL------ELTMQVYSDFVGYKSG 272

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY      E +    VK+VGWG ++G PYW + +++   +GDKG   I RG NE  IES
Sbjct: 273 VYK-HVLGEFLGGHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIES 330


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 86/181 (47%), Gaps = 29/181 (16%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
           W + RG+VTGG     +GC+P  F PC       S PE KT     P C   C    Y  
Sbjct: 162 WWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTCSLSC-QFGYST 208

Query: 151 GFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
            + +DK + ++   +  +           GP   AF     T Y   +++    VY  +A
Sbjct: 209 AYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMYKYKSGVYRHTA 262

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
              +  +A +KI+GWG +NG PYW I +++G  +G+ G +K+ RG NE  IE  V   +P
Sbjct: 263 GRLLGGHA-IKIIGWGTQNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321

Query: 262 K 262
           +
Sbjct: 322 R 322


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 68/155 (43%), Gaps = 18/155 (11%)

Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG----- 163
           CQP  FP C H     ++  C       P+C+T CT+       ++ K     L      
Sbjct: 180 CQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPLIKYRGKDAYMLLPGEEEF 239

Query: 164 ---LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
              LYF+   GPF      +    YT  LF     VY  +     +    VK+VGWG+ N
Sbjct: 240 KRELYFN---GPF-----VAILFVYTD-LFAYKSGVYR-NVDGSYMGVTAVKVVGWGKLN 289

Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           G PYW + +T+   +G  G + ILRG NE  IE L
Sbjct: 290 GTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 79/190 (41%), Gaps = 48/190 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G       ++   G+VTG      G   S  GC P  FP C HA Y++         
Sbjct: 112 CQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSS--------- 162

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              P C T+CTN  Y     QD ++    G           PA  ++      + +F TN
Sbjct: 163 ---PACQTKCTNKAYKTSLQQDLHRAKSFGRL---------PAIPQNI----KQEIF-TN 205

Query: 194 G------------RVYA----VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
           G            RVY     V  +       T+KI+GWG E+G+ YW  V+++ E++GD
Sbjct: 206 GPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGD 265

Query: 238 KGTIKILRGR 247
            G IK+  GR
Sbjct: 266 HGMIKLAVGR 275


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 70/179 (39%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C +     PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C  +      ++    Y + G           GP           +           
Sbjct: 219 NTTCERNEMDLVKYKGSTSYSVKGEKELMIELMTNGPL------ELTMQVYSDFVGYKSG 272

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY      + +    VK+VGWG ++G PYW + +++   +GDKG   I RG NE  IES
Sbjct: 273 VYK-HVLGDFLGGHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIES 330


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 95/245 (38%), Gaps = 47/245 (19%)

Query: 33  AVATATPLAFAVCRSSK----MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSST 88
           A ATA  LA  +C ++       +      F  G+K + +  V                 
Sbjct: 116 AYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVR------------GDDV 163

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W ++   GLV+GG +++N GCQP   PP    N  T              C  RC  +N 
Sbjct: 164 WEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYGNNT 213

Query: 149 GRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
              ++ D  +++    Y++             +GP    F      +     F     VY
Sbjct: 214 IH-YYHDHVKVSH---YYNIKSNEDIQKEVQTYGPVSVKF------RVYDDFFLYKSGVY 263

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
             +  +  V     K++GWG ENG  YW +V+ +G ++G  G  KI RG NE  +E  V 
Sbjct: 264 VKTEKSLYVRRHFAKLIGWGVENGVDYWLLVNFWGNEWGQNGLFKIKRGTNEVHVEDYVY 323

Query: 258 GALPK 262
              P+
Sbjct: 324 AGEPE 328


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 79/185 (42%), Gaps = 14/185 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G+    W +  ++G+ +GG ++S  GC    F  C+  +     P+C          
Sbjct: 167 CQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCHSPDEDDDAPKCSRKCQSSYSV 226

Query: 140 HTRCTNDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                +  +GR       D+++I    ++ +   GP   AF            F+T    
Sbjct: 227 QDVSKDRRFGRVAYSVVADEHRIME-EIFVN---GPVQAAF-------QVYLDFKTYKSG 275

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
                +  +     +KI+GWG ENG  YW   +++GE +GD G  KI+RG N   IE+ V
Sbjct: 276 VYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDV 335

Query: 257 NGALP 261
           +  LP
Sbjct: 336 HAGLP 340


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 83/190 (43%), Gaps = 31/190 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     + + ++ G+ TGG + S +GC+P S  P          P   + A   P C
Sbjct: 163 CNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAP----------PTSSSTAAQTPLC 212

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH------------FGPFWPAF--WRSFCTKY 185
             +C +D Y R   +D+Y      L    +             GP   A   + SF   Y
Sbjct: 213 QLKCISD-YKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLY-Y 270

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
              ++  N R    S     +    VK++GWGE+   PYW +V+++   FG++G  KI R
Sbjct: 271 KSGVYSANKRNDDPS-----LGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRR 325

Query: 246 GRNEAIIESL 255
           G NE  IE+L
Sbjct: 326 GTNECGIENL 335


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 73/162 (45%), Gaps = 24/162 (14%)

Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFF 153
           +++GCQP  FP C H     ++  C       PKC+  CT           N  Y     
Sbjct: 176 ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPLVKYRGNATYLLLHG 235

Query: 154 QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKI 213
           ++ Y+     LYF+   GPF   F+      YT  LF     VY  +   + +    V+I
Sbjct: 236 EEDYKRE---LYFN---GPFVAVFF-----VYTD-LFAYKSGVYR-NVDGDFLGGQAVRI 282

Query: 214 VGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VGWG+ NG PYW + +++   +G  G + IL G NE  IE L
Sbjct: 283 VGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHL 324


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 82/186 (44%), Gaps = 24/186 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNT------GCQPVSFPPCNHA-NYTTSEPECKTL 132
           C+ G      +++   G+VTG             GC P  F  CNH     T  P+CK +
Sbjct: 108 CNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDV 167

Query: 133 AT-PQPKCHTRCTNDNYGRGFFQDKYQ-------INGLGLYFDPHF--GPFWPAFWRSFC 182
              P P C T CTN  Y +   +D ++       +N         F  GP + AF     
Sbjct: 168 VQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVFSAF----- 222

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             Y    +  +G VY V  + E+     +KI+GWG ++ R YW  ++ + E++GD G IK
Sbjct: 223 EMYKDFRYYKSG-VY-VPTTKEVDCLHVIKIIGWGADSVREYWLAMNAWNEEWGDHGLIK 280

Query: 243 ILRGRN 248
           +  G+N
Sbjct: 281 MAFGKN 286


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 78/194 (40%), Gaps = 48/194 (24%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   ++  P C       PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---FQTNGRV 196
           +T C N         +  +  G+                 S+  K  R L      NG +
Sbjct: 219 NTTCDN------VEMELVKYKGV----------------SSYSIKGERELMVELMNNGPL 256

Query: 197 -YAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
             A+   A+ VAY +               VK+VGWG ++G PYW I +++   +GDKG 
Sbjct: 257 EVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWNTDWGDKGY 316

Query: 241 IKILRGRNEAIIES 254
             I RG +E  IES
Sbjct: 317 FLIQRGNDECGIES 330


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 78/194 (40%), Gaps = 48/194 (24%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   ++  P C       PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---FQTNGRV 196
           +T C N         +  +  G+                 S+  K  R L      NG +
Sbjct: 219 NTTCDN------VEMELVKYKGV----------------SSYSIKGERELDHELMNNGPL 256

Query: 197 -YAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
             A+   A+ VAY +               VK+VGWG ++G PYW I +++   +GDKG 
Sbjct: 257 EVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWNTDWGDKGY 316

Query: 241 IKILRGRNEAIIES 254
             I RG +E  IES
Sbjct: 317 FLIQRGNDECGIES 330


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 171 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 223

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C         ++    Y + G           GP           +           
Sbjct: 224 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 277

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY    S +++    VK+VGWG + G PYW I +++   +GDKG   I RG NE  IES
Sbjct: 278 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 335


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 75/163 (46%), Gaps = 23/163 (14%)

Query: 108 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFD 167
           GC P  FPPC H    T  P+C     P P C  +C N  Y      D++ +     Y  
Sbjct: 2   GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPY-- 59

Query: 168 PHF------------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
            H+            GP   +F     T Y   L   +G VY  ++ + +  +A VKI+G
Sbjct: 60  -HYSVNDAKNAIRTDGPVSASF-----TVYEDFLAYRSG-VYKHTSGSYLGGHA-VKIIG 111

Query: 216 WGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
           WGE++G+ YW  V+++ E +GD G  KI  G N  I + L+ G
Sbjct: 112 WGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGG 153


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C         ++    Y + G           GP           +           
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 272

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY    S +++    VK+VGWG + G PYW I +++   +GDKG   I RG NE  IES
Sbjct: 273 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 45/202 (22%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S   W +    GLV+GG ++++ GCQP S    N                  P+C
Sbjct: 143 CKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSNFNDG--------------VSPEC 188

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
              C N  Y   +  D+            HFG       ++  T     L +  G V A 
Sbjct: 189 SKTCQNTKYPTSYLNDR------------HFGDGTYYILKNVTTIQQEILLR-GGPVMAG 235

Query: 199 ---------------VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI-K 242
                          V  S  ++    VKI+GWG ENG  YW + +++G+ +G  G + K
Sbjct: 236 FDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGALGGVFK 295

Query: 243 ILRGRNEAIIE-SLVNGALPKD 263
           I RG NE  IE S++ G + KD
Sbjct: 296 IRRGTNECKIEQSIITGHVRKD 317


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C         ++    Y + G           GP           +           
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 272

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY    S +++    VK+VGWG + G PYW I +++   +GDKG   I RG NE  IES
Sbjct: 273 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/179 (31%), Positives = 83/179 (46%), Gaps = 15/179 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
           C  G S     W    G+VTGG +  + GC+P  F  CN A    +  PEC    + Q K
Sbjct: 157 CDGGYSIQALRWWVFDGVVTGGDYQGD-GCKPYQF--CNSAGCPDAVTPECAL--SCQSK 211

Query: 139 CHTRCTND-NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            +T    D N+G   +     +N +      + GP   +F      K     ++    VY
Sbjct: 212 YNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTN-GPVEASF------KVYEDFYKYKSGVY 264

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
              A   +  +A +KI+GWG ENG  YW I +++G ++G+ G  KI RG NE  IE+ V
Sbjct: 265 KYIAGKMLGGHA-IKIIGWGTENGTAYWLIANSWGTKWGENGFFKIRRGVNECGIENNV 322


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/193 (26%), Positives = 80/193 (41%), Gaps = 14/193 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++G+ +GG ++S  GC P     C+ +      P+C          
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNV 215

Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                +  YGR  +    D+ +I    +Y +   GP   AF         + L      V
Sbjct: 216 TDVWQDRRYGRVAYSIPNDEQKIMEE-IYIN---GPVQAAF------MTYQDLHAYKSGV 265

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y       +     VK++GWG ENG  YW + +++G+ +GD G  KI+RG N   IE  V
Sbjct: 266 YR-HVWGHMAGGHAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDV 324

Query: 257 NGALPKDNYGVEF 269
           +  LP  N   E 
Sbjct: 325 HAGLPSFNKHKEL 337


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 96/247 (38%), Gaps = 41/247 (16%)

Query: 23  PYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           P   +C    AV++A  ++  +C +SK   +         V   C W          C  
Sbjct: 111 PDQANCGSCWAVSSAAAMSDRICIASKGAKQV--LISAQDVVSCCTWCGDG------CEG 162

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G   S + +    G+VTGG +++   C+P    PC H    T   EC  +A   P+C  R
Sbjct: 163 GWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGMAD-TPRCKRR 221

Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSAS 202
           C    Y + +  D+Y               +  A+      K  +     NG V A    
Sbjct: 222 CLL-GYPKSYPSDRY---------------YKKAYQLKNSVKAIQKDIMKNGPVVATYTV 265

Query: 203 AEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
            E  A+                  VK++GWGEE G PYW + +++ + +G+ G  ++ RG
Sbjct: 266 YEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRG 325

Query: 247 RNEAIIE 253
            N+   E
Sbjct: 326 SNDCGFE 332


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/193 (26%), Positives = 80/193 (41%), Gaps = 14/193 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++G+ +GG ++S  GC P     C+ +      P+C          
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNV 215

Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                +  YGR  +    D+ +I    +Y +   GP   AF         + L      V
Sbjct: 216 TDVWQDRRYGRVAYSIPNDEQKIMEE-IYIN---GPVQAAF------MTYQDLHAYKSGV 265

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           Y       +     VK++GWG ENG  YW + +++G+ +GD G  KI+RG N   IE  V
Sbjct: 266 YR-HVWGHMAGGHAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDV 324

Query: 257 NGALPKDNYGVEF 269
           +  LP  N   E 
Sbjct: 325 HAGLPSFNKHKEL 337


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 76/169 (44%), Gaps = 14/169 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C  G +   W +   +G+VTGG + SN GCQP    PC+H    +S   C +L   Q   
Sbjct: 98  CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 156

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           C  +C N NY   +  D Y+ + + +               + P    +F   Y   +  
Sbjct: 157 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 214

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKG 239
             G VY  S + E++ Y  VK++GWG +E G  YW  ++++   +G  G
Sbjct: 215 KEG-VYK-STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGTNG 261


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/185 (28%), Positives = 76/185 (41%), Gaps = 7/185 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C  G  +  + +  + GL TGG +     CQP +F PC NHA+     P C     P P 
Sbjct: 164 CKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGP-CPDELWPTPT 222

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT---KYTRPLFQTNGR 195
           C   C    Y   F +DK   +     F       +    R       K  R        
Sbjct: 223 CRRTCQL-GYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKG 281

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY +    E+     VKI+GWG+ N  PYW + +++   +GD G  +I+RG +   IE  
Sbjct: 282 VY-IHREGEVTGLHAVKIIGWGKGNDVPYWLVANSWNTDWGDNGYFRIVRGTDNCEIERQ 340

Query: 256 VNGAL 260
           + G +
Sbjct: 341 MVGGI 345


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 72/179 (40%), Gaps = 18/179 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  GI +  W W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
           +T C         ++    Y + G           GP            Y+  +   +G 
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL-----EVTMQVYSDFVGYKSGG 273

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
              VS   +++    VK+VGWG + G PYW I +++   +GDKG   I RG NE  IES
Sbjct: 274 YKHVSG--DLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 77/197 (39%), Gaps = 26/197 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+ TGG +     C+P +F PC H        EC     P P+C
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHHRNEIYYGECPKEIFPTPQC 224

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAF--WRSFCTKYT 186
              C    Y   +  DK  I G   Y  P+            GP   AF  +  F    +
Sbjct: 225 TQSC-QAGYASDYEDDK--IYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRS 281

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILR 245
                T GR     A         VK++GWG +++G  YW   +++   +G+ G  +I+R
Sbjct: 282 GIYVHTAGRREGGHA---------VKLIGWGVDDDGNKYWLAANSWNSDWGENGYFRIVR 332

Query: 246 GRNEAIIESLVNGALPK 262
           G +   IES V   +P 
Sbjct: 333 GVDHCGIESAVVAGMPD 349


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
           C+ G     WA+    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214

Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
           C+  C +       ++    Y + G   Y    F  GPF  AF               N 
Sbjct: 215 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 268

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY    S + +    V++VGWG  NG PYW I +++  ++G  G   I RG +E  IE 
Sbjct: 269 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 327

Query: 255 LVNGALP 261
             +  +P
Sbjct: 328 GGSAGIP 334


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 78/183 (42%), Gaps = 24/183 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +    G+       +++ CQP  FP C H      +  C       P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------ASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQC 211

Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           +  CT+       ++    Y++ G       LYF+   GPF   F       ++  L   
Sbjct: 212 NATCTDKTIPLIKYRGNHSYEVRGEEDYKRELYFN---GPFVVRF-----QVHSDFLAYK 263

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
           NG    V+ +   +    V+IVGWG+ NG PYW + +++   +G  G   ILRG NE  I
Sbjct: 264 NGVYQHVAGN--FLGGKAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNI 321

Query: 253 ESL 255
           E L
Sbjct: 322 EHL 324


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
           C+ G     WA+    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214

Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
           C+  C +       ++    Y + G   Y    F  GPF  AF               N 
Sbjct: 215 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 268

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY    S + +    V++VGWG  NG PYW I +++  ++G  G   I RG +E  IE 
Sbjct: 269 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 327

Query: 255 LVNGALP 261
             +  +P
Sbjct: 328 GGSAGIP 334


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
           C+ G     WA+    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 139 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 191

Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
           C+  C +       ++    Y + G   Y    F  GPF  AF               N 
Sbjct: 192 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 245

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY    S + +    V++VGWG  NG PYW I +++  ++G  G   I RG +E  IE 
Sbjct: 246 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 304

Query: 255 LVNGALP 261
             +  +P
Sbjct: 305 GGSAGIP 311


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 93/233 (39%), Gaps = 45/233 (19%)

Query: 33  AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
           AV+ A  ++  +C  SK  V+   +    +A   + C            C+ G+    W 
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
           +V + G+VTGG +     C+P    PC NH     S P   +  TP   C   C    YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232

Query: 150 RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAY- 208
           + + +DK  +  + +  +                K  +     NG V A S + E  ++ 
Sbjct: 233 KRYEKDKSYVKSVYILDEDE--------------KAIQREMMKNGPVQAASITYEDFSFY 278

Query: 209 ---------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                            VK+VGWG ENG  YW + +++   +G+ G  +ILRG
Sbjct: 279 RRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYFRILRG 331


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 79/203 (38%), Gaps = 36/203 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           CS G     W W    G+VTGG +   H+   C P   P C H +     P+C+      
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368

Query: 137 PKCHTRCTNDNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
           PKC   C    Y      F+D           D HF     A+      +  R L +   
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKD-----------DLHFAT--SAYSVEGRDQIKRELMENGT 415

Query: 195 RVYAVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
              A     + + Y                 VK++G+G E+GR YW  V+++ E +GDKG
Sbjct: 416 LTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKG 475

Query: 240 TIKILRGRNEAIIESLVNGALPK 262
           T KI  G  EA I+    G  PK
Sbjct: 476 TFKIEMG--EAGIDKEFCGGEPK 496


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 85/197 (43%), Gaps = 33/197 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +    GLV+GG ++++TGCQP S    N+   T             P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF------------GPFWPAFWRSFCTKYTR 187
           +T C ND Y   +  DK+   G  +Y+ P              GP   AF      K  R
Sbjct: 187 NTTCQNDKYPIPYVSDKHF--GDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYR 244

Query: 188 PLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKIL 244
              Q +  +  V    S  +     VKI+GWG ENG  YW   +++G+ +G  G   KI 
Sbjct: 245 DGEQHDTILEGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIR 304

Query: 245 RGRNE-AIIESLVNGAL 260
           RG NE    ES++ G +
Sbjct: 305 RGTNECGFEESIIAGQV 321


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 79/203 (38%), Gaps = 36/203 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           CS G     W W    G+VTGG +   H+   C P   P C H +     P+C+      
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368

Query: 137 PKCHTRCTNDNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
           PKC   C    Y      F+D           D HF     A+      +  R L +   
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKD-----------DLHFAT--SAYSVEGRDQIKRELMENGT 415

Query: 195 RVYAVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
              A     + + Y                 VK++G+G E+GR YW  V+++ E +GDKG
Sbjct: 416 LTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKG 475

Query: 240 TIKILRGRNEAIIESLVNGALPK 262
           T KI  G  EA I+    G  PK
Sbjct: 476 TFKIEMG--EAGIDKEFCGGEPK 496


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 76/187 (40%), Gaps = 19/187 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
           C+ G     WA+    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192

Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
           C   C +       ++    Y + G   Y    F  GPF  AF               N 
Sbjct: 193 CDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 246

Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
            VY    S + +    V++VGWG  NG PYW I +++  ++G  G   I RG +E  IE 
Sbjct: 247 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 305

Query: 255 LVNGALP 261
             +  +P
Sbjct: 306 GGSAGIP 312


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 94/237 (39%), Gaps = 38/237 (16%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    AVA A+ ++   C    +       R  AG    C  +       + C+ G   
Sbjct: 116 SCGSCWAVAAASAMSDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGFPE 166

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
             W +    GLV+         CQP  FP C H   ++    C       PKC++ CT  
Sbjct: 167 VAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCSG-DYKTPKCNSTCTEK 218

Query: 147 NYGRGFFQ--DKYQINGLGLYFDPHF-------GPFWPAFWRSFCTKYTRPLFQTNGRVY 197
                 ++    Y ++G     + HF       GPF  AF      +         G VY
Sbjct: 219 KIPLIRYRGNHSYVLSG-----EEHFKRELLLNGPFEVAF------EVYADFMAYTGGVY 267

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
               + +++    V++VGWGE NG PYW I +++  ++G  G   I RG NE  IES
Sbjct: 268 K-HVAGDLLGGHAVRLVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIES 323


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 54/179 (30%), Positives = 84/179 (46%), Gaps = 24/179 (13%)

Query: 95  RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC---------TN 145
           +G+V+GG + SN GC P    PC H    T  P CK      P C  +C          +
Sbjct: 15  KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEGYKVPYAQD 72

Query: 146 DNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA 203
            ++G+  +  +  ++ +   +Y +   GP   AF     T Y   +    G VY   A  
Sbjct: 73  LHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG-VYKHVAGK 123

Query: 204 EIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
            +  +A ++I+GWG +NG  PYW + +++   +G  G  KILRG +E  IE  +N  LP
Sbjct: 124 ALGGHA-IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 86/197 (43%), Gaps = 42/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W +    GLV+GG ++++TGCQP S    N+   T             P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF------------GPFWPAF--WRSFCTKY 185
           +T C ND Y   +  DK+   G  +Y+ P              GP   AF  +  F    
Sbjct: 187 NTTCQNDKYPIPYVSDKHF--GDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYR 244

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKIL 244
                 T+G ++  +A         VKI+GWG ENG  YW   +++G+ +G  G   KI 
Sbjct: 245 DGVYIYTSGALFGRTA---------VKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIR 295

Query: 245 RGRNE-AIIESLVNGAL 260
           RG NE    ES++ G +
Sbjct: 296 RGTNECGFEESIIAGQV 312


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 70/159 (44%), Gaps = 19/159 (11%)

Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ--DKYQING- 161
           +++ CQP  FP C H      +P C       P C+  CT+ +     ++    Y++ G 
Sbjct: 177 TSSQCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTDKSVPLIKYRGNHSYEVRGE 236

Query: 162 ----LGLYFDPHFGPFWPAFW-RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
                 LYF+   GPF   F   S    Y   ++Q          +   +    V+IVGW
Sbjct: 237 EDYKRELYFN---GPFVVRFQVHSDFLAYKSGVYQ--------HVAGNFLGGKAVRIVGW 285

Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           G+ NG PYW + +++   +G  G   ILRG NE  IE L
Sbjct: 286 GKMNGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHL 324


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 68/159 (42%), Gaps = 25/159 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 191

Query: 140 HTRCTNDNYGRGFFQDKYQ-------------INGLGLYFDPHFGPFWPAFWRSFCTKYT 186
             +C +  Y + + QDK +             I    +   P  G F+   +  F    +
Sbjct: 192 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFY--MFEDFLVYKS 248

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYW 225
                T GR         +V    ++++GWG ENG  YW
Sbjct: 249 GIYHYTTGR---------LVGGHAIRVIGWGVENGVNYW 278


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 80/177 (45%), Gaps = 21/177 (11%)

Query: 106 NTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLY 165
           ++GC P +FP C+H   T     CK   +P P C T C N ++   F  D++     G  
Sbjct: 219 DSGCWPYNFPECSHHVDTKGMEPCKG-NSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYS 277

Query: 166 FDP---------HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
            D            GP   AF     T Y    +  +G VY     +E+  +A VKI+GW
Sbjct: 278 LDEVDEIKREIIDNGPVAAAF-----TVYEDFPYYKSG-VYKHVNGSELGGHA-VKIIGW 330

Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK--DNYGVEFGE 271
           G +    YW +++++   +GD+G  KI  G  E  I+S V   +PK     GVE  E
Sbjct: 331 GIDQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPKYEKTSGVEQSE 385


>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 234

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 18/149 (12%)

Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
           ++ GC P  FP CNH     S+ P C  +    P C T C N  YG    +D ++    G
Sbjct: 38  NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 96

Query: 164 -LYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
            L   P          GP       +  T Y    F  +G VY V  + +++A  T+K++
Sbjct: 97  RLPIGPEKIKQEIFDNGPV-----AAMMTLYEDFRFYKSG-VY-VHKTGQMLAAHTLKLI 149

Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
           GWG E+G+ YW  V+ + E++GD G IK+
Sbjct: 150 GWGVESGQEYWLAVNAWNEEWGDHGMIKL 178


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/240 (27%), Positives = 101/240 (42%), Gaps = 39/240 (16%)

Query: 33  AVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS----ST 88
           AV+TA+ L+  +C +SK            G KQ              C  G         
Sbjct: 121 AVSTASALSDRICIASK------------GAKQVYVSATDILSCCHSCGDGCDGGYVIDA 168

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           + +  ++G VTGG + +   C+P  F PC H    T   EC    +  P+C  +C  + Y
Sbjct: 169 FKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGS-TPECVRKC-QEGY 226

Query: 149 GRGFFQDKYQINGLGLYFDP------------HFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
              + +D+  + G   Y  P              GP   AF       +    F   G +
Sbjct: 227 ETEYHEDR--VRGEDAYRLPIGSVKAIQKEIMRNGPVVAAF-----IVFDDFSFYRKG-I 278

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           YA  A +    +A VKI+GWG E+G PYW I +++   +G+ G  +++RG N+  IE+ V
Sbjct: 279 YAHVAGSPRGGHA-VKIIGWGTEHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNV 337


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 32/200 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI SS +++    G+V GG +   +GC      PC H   ++  P C       PKC
Sbjct: 57  CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPD-EVRAPKC 115

Query: 140 HTRCTNDNYGRGFFQD--KYQINGLGLYFDPHFGPFWPAFWRSFCT-KYTRPLFQTNGRV 196
             +C +++      +D  K ++ G   Y     G          C  K    ++Q     
Sbjct: 116 ARKCESED------KDWTKAKVKGEKGYSVCQQGEL-----EGTCAIKMAADIYQNGPIT 164

Query: 197 YAVSASAEIVAYAT----------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
                  + +AY +                +KI+G+G E+G+ YW + +++ E +GD G 
Sbjct: 165 GMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVANSWNEDWGDDGY 224

Query: 241 IKILRGRNEAIIES-LVNGA 259
            KI+RG+N   IE  ++NG 
Sbjct: 225 FKIIRGKNACQIEDPVINGG 244


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 84/200 (42%), Gaps = 45/200 (22%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     + +   RG+ TGG + S  GC+P S         + SE E +T     P C
Sbjct: 153 CDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIG-------SNSEDEAET-----PLC 200

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFG--PFWPAFWRSFCTKYTRPLFQTNGRVY 197
             +C N+ Y     QD+            HFG  P+W     S   +  + L++    V 
Sbjct: 201 TRQCINE-YPYNLSQDR------------HFGEKPYWV---NSNEEQIMQELYKNGPVVV 244

Query: 198 AVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
           A +   + + Y                 VK++GWG EN + YW I +++   +G+ G  K
Sbjct: 245 AFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNSWNTTWGENGFFK 304

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I+RG+N   IES V   + +
Sbjct: 305 IIRGKNCCAIESYVVAGMAR 324


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 79/189 (41%), Gaps = 23/189 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G ++  W W    G+VTGGA+     C+P  FP C  A+   +   C +     P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    YG+ +  DK  I     Y+ P+            GP    F           
Sbjct: 225 KPYCQY-GYGKRYENDK--IKAKTWYWLPNDERTIQLEIMKKGPVHATF------NIYED 275

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKILRGR 247
               NG VY  +A A +    ++KI+GWG + G  YW I +++   +G D G  +++RG 
Sbjct: 276 FEHYNGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334

Query: 248 NEAIIESLV 256
           N   IE  V
Sbjct: 335 NNCDIEGGV 343


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 81/182 (44%), Gaps = 18/182 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +    GL T   +       P  FPPC H    T    C   + P PKC
Sbjct: 85  CNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHYKPCGP-SQPTPKC 136

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
                ++   R   +  Y ++   +  +    GP   AF     T Y   L   +G VY 
Sbjct: 137 VR--ASEKKPRYHGKSVYSVSPAKIQAEIMTNGPVEAAF-----TVYQDFLAYQSG-VYR 188

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
             +  E+  +A +KI+GWG E G  YW + +++ E +GDKGT KI RG +E  IES V  
Sbjct: 189 HVSGPELGGHA-IKIMGWGVEAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVA 247

Query: 259 AL 260
            +
Sbjct: 248 GM 249


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 350 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 408

Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
             PKC   C    Y      F QD ++          + +      H GP   AF     
Sbjct: 409 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 462

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             Y   L   +G VY   +   +  +A +KI+GWG ENG  YW  V+++   +GD G  K
Sbjct: 463 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 520

Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
           I  G+     E +   A  ++  GV  GEE
Sbjct: 521 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 550


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 75/185 (40%), Gaps = 13/185 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL--ATPQP 137
           C  G +     +    G+VTGG +    GC P SF PC+        P CKT   A+ + 
Sbjct: 155 CQGGYTIEAMKYWMNSGVVTGGDYQG-AGCIPYSFRPCSTCKEPKDAPSCKTTCQASYKA 213

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
           K   R            +  Q+    +Y +   GP   A+      +     +     VY
Sbjct: 214 KSAYRLPTTTSSNAIVANAVQMIQTEIYNN---GPVEVAY------QVYDDFYHYKSGVY 264

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
                 +   +A VKI+GWG E    YW + +++   FG+ G  KI RG NE  IE  V 
Sbjct: 265 YHVYGDKPSGHA-VKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVV 323

Query: 258 GALPK 262
             LPK
Sbjct: 324 AGLPK 328


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405

Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
             PKC   C    Y      F QD ++          + +      H GP   AF     
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 459

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             Y   L   +G VY   +   +  +A +KI+GWG ENG  YW  V+++   +GD G  K
Sbjct: 460 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 517

Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
           I  G+     E +   A  ++  GV  GEE
Sbjct: 518 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 547


>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 422

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 71/153 (46%), Gaps = 16/153 (10%)

Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
           ++ GC P  FP CNH     S+ P C  +    P C T C N  YG    +D ++    G
Sbjct: 262 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 320

Query: 164 -LYFDPH--------FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
            L   P          GP       +  T Y    F     VY V  + +++A  T+K++
Sbjct: 321 RLPIGPEKIKQEIFDNGPL--RXXAAMMTLYED--FDLQVCVY-VHKTGQMLAAHTLKLI 375

Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           GWG E+G+ YW  V+ + E++GD G IK+  G+
Sbjct: 376 GWGVESGQEYWLAVNAWNEEWGDHGMIKLAVGK 408


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405

Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
             PKC   C    Y      F QD ++          + +      H GP   AF     
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 459

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
             Y   L   +G VY   +   +  +A +KI+GWG ENG  YW  V+++   +GD G  K
Sbjct: 460 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 517

Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
           I  G+     E +   A  ++  GV  GEE
Sbjct: 518 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 547


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 69/153 (45%), Gaps = 31/153 (20%)

Query: 112 VSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF 170
           + F   NHA+   S+ P+C + A  QP C T C N++Y     QD ++    G       
Sbjct: 5   IQFIXXNHASSAASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRL----- 59

Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE----------------IVAYATVKIV 214
            P  P        K  + +F  NG V  V +  E                +V   ++KI+
Sbjct: 60  -PTSP-------QKIKQEIFD-NGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKII 110

Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           GWG E+G+ YW  V+++ E++GD G IK+  G 
Sbjct: 111 GWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGE 143


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 75/161 (46%), Gaps = 19/161 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 44  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 101

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
              C    Y   + QDK      Y ++             GP   AF     + Y+  L 
Sbjct: 102 SKIC-EPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 155

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
             +G    V  + E++    ++I+GWG ENG PYW + +++
Sbjct: 156 YKSGVYQHV--TGEMMGGHAIRILGWGVENGTPYWLVANSW 194


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 22/192 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     + W+ + G+VTGG +     C+P SF PC           C     P PKC
Sbjct: 158 CQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPYYGPCPGGLWPTPKC 217

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
             + +   Y + + +DK+       Y  P+            GP   AF           
Sbjct: 218 R-KSSQRKYNKTYQEDKH--FATRSYSLPNNERSIRQEIYKNGPVVAAF-------KVYE 267

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
            + + G +Y      +  A+A  K++GWG ENG  YW I +++   +G+ G  +I+R  +
Sbjct: 268 DYSSTGGIYVHKWGIQTGAHAD-KVIGWGRENGTDYWLIANSWNTDWGEDGYYRIVRETD 326

Query: 249 EAIIESLVNGAL 260
              IE  + G  
Sbjct: 327 NCEIERQMVGEF 338


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 41/207 (19%)

Query: 96  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP----ECKT-LATPQPKCHTRCTNDNYGR 150
           G+VTGG  ++ TGCQP +FPPC+    + S P    +C+T       K   R  N+    
Sbjct: 162 GVVTGG-DYNGTGCQPYTFPPCSSCEASKSTPSCQKKCQTGYLEATYKNDKRFENEEQDS 220

Query: 151 GFFQDK-YQI-----NGLGLYFDP---------------------HFGPFWPAFWRSFCT 183
            +  +  YQ+      G   Y                        + GP   ++ R F  
Sbjct: 221 SYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVEVSY-RVF-- 277

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
                 +Q    VY    S ++     VKI+GWG EN   YW + +++G  FG+KG  KI
Sbjct: 278 ---EDFYQYKSGVYHY-VSGKLTGAHAVKIIGWGTENKVDYWLVANSWGTDFGEKGFFKI 333

Query: 244 LRGRNEAIIESLVNGALPKDNYGVEFG 270
            RG NE  IE  V   L K N G +FG
Sbjct: 334 RRGTNECGIEENVVAGLAK-NGGTKFG 359


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 32/192 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     + ++   G+ +GG +     C+P  F PC+  NY    P  K  A   PKC
Sbjct: 158 CEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD-GNYG---PCPKEGAFDTPKC 213

Query: 140 HTRCT---------NDNYGRG---FFQDKYQINGLGLYFDPHFGPFWPAFW--RSFCTKY 185
              C          +  +G+      QD        ++ +   GP    F+    F   Y
Sbjct: 214 RKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFIN---GPVGANFYVFEDF-IHY 269

Query: 186 TRPLF-QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
              ++ QT G+   V A         +K++GWG ENG  YW + +++   +G+ GT +IL
Sbjct: 270 KEGIYKQTYGKWIGVHA---------IKLIGWGTENGTDYWLVANSYNYDWGENGTFRIL 320

Query: 245 RGRNEAIIESLV 256
           RG N  +IES V
Sbjct: 321 RGTNHCLIESQV 332


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 77/193 (39%), Gaps = 31/193 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C  G   + + +    G+ TGG       C+P +F PC  H N     P C     P PK
Sbjct: 158 CDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGP-CPKELWPTPK 216

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
           C   C    Y   +  DK  I G   Y  P+             T+  + +F     V +
Sbjct: 217 CRKMC-QLKYNVAYKDDK--IYGNDAYSLPNNE-----------TRIMQEIFTNGPVVGS 262

Query: 199 VSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
            S  A+   Y                 VKI+GWG ++G  YW I +++   +GD+G ++ 
Sbjct: 263 FSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWNNDWGDEGYVRF 322

Query: 244 LRGRNEAIIESLV 256
           LRG N   IES V
Sbjct: 323 LRGDNHCGIESRV 335


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 73/165 (44%), Gaps = 25/165 (15%)

Query: 108 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT---------NDNYGRGFFQDKYQ 158
           GC     P CN        P CKTL    P C   C          + +Y +  ++   +
Sbjct: 111 GCMSYPLPRCN--------PSCKTLYD-APTCKKECDKGSPLKYEEDKHYAKQAYRIMSK 161

Query: 159 INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE 218
           +           GP   +F     T Y   +   +G VY     ++++    V+I+GWG 
Sbjct: 162 VERQIQLEIIKNGPVVASF-----TVYADFIHYLSG-VYKFDGESKLLGGHAVRIIGWGI 215

Query: 219 ENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ENG  PYW + +++ E++GD+G  KI RG+NE  IE  +   LP+
Sbjct: 216 ENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 82/191 (42%), Gaps = 21/191 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
           C  G   + W +    G+VTGG + +   C+P    PC NH N T     C  ++TP   
Sbjct: 162 CDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETFYR-NCTGVSTPS-- 218

Query: 139 CHTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           C T C          +   GR  +     ++ +      H GP    F     + Y   +
Sbjct: 219 CKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKH-GPLVATF-----SVYEDFM 272

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +   G +Y  +       +A V+I+GWG EN   YW I +++   +G+ G  +++RG N+
Sbjct: 273 YYKKG-IYRYTHGGYEGGHA-VRILGWGVENNVKYWIIANSWNTDWGEDGFFRMVRGIND 330

Query: 250 AIIESLVNGAL 260
             IE  V+  L
Sbjct: 331 CGIEESVSAGL 341


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 21/168 (12%)

Query: 107 TGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRG---------FFQDKY 157
            GC+P   PPC      TS         P  K H RCT   YG           F +D Y
Sbjct: 13  VGCEPYRVPPCPRNEDGTSS----CAGQPIEKNH-RCTRMCYGNQDLDYNDDHRFTRDYY 67

Query: 158 QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
            +    +  D  ++GP   +F            +     VY  + +A  +    VK++GW
Sbjct: 68  YLTYGSIQKDVMNYGPIEASF------DVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGW 121

Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
           G E G PYW +V+++  Q+GD G  KI RG +E  I+S     +P  N
Sbjct: 122 GVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVPVTN 169


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 14/169 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL-ATPQPK 138
           C  G +   W     +G+VTGG   SN GCQP    PCNH     +   C +L  T    
Sbjct: 96  CDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYG-NGNLKNCSSLRRTQMTV 154

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           C  +C N NY   +  D ++ + + +               + P    +F   Y   +  
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 212

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKG 239
             G +Y  S + E++ Y  VK++GWG + +G  YW  ++++   +G  G
Sbjct: 213 KEG-IYK-STAGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGTNG 259


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 62/204 (30%), Positives = 84/204 (41%), Gaps = 40/204 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  ++G+VT     +  N GC   S P C        EP     A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211

Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
           KCH +C   N  + R      + +N   +  DPH         GP   +F     T Y  
Sbjct: 212 KCHRKCVKQNLLWSR---SKHFGVNAYMISSDPHSIMTEVYKNGPVEVSF-----TVYED 263

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                +G VY    + +I+    VK++GWG  E+G  YW + + +   +GD G  KI RG
Sbjct: 264 FAHYKSG-VYK-HVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRG 321

Query: 247 RNEAIIESLVNGALPKD-NYGVEF 269
            NE  IE  V   LP   N  VE 
Sbjct: 322 TNECEIEDEVVAGLPSARNLNVEL 345


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 82/193 (42%), Gaps = 31/193 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W+WVH +G+ TG         +  + P   + +             P P C
Sbjct: 210 CGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDIY-----------PTPNC 258

Query: 140 HTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
             +C N  Y      D+          Y +N          GP   +F     T Y   L
Sbjct: 259 VEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTD-GPVSASF-----TVYEDFL 312

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
              +G VY  ++ + +  +A VKI+GWGE++G+ YW  V+++ E +GDKG  KI  G N 
Sbjct: 313 AYKSG-VYKHTSGSYLGGHA-VKIIGWGEKSGQAYWLAVNSWNEDWGDKGLFKIALG-NC 369

Query: 250 AIIESLVNGALPK 262
            I + L+ G  PK
Sbjct: 370 GIDDDLLGGT-PK 381


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 85/184 (46%), Gaps = 15/184 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC-KTLATPQPK 138
           CS G   + + +  K+G+V+GG  +SN GC+P +      A+     P C K+     P 
Sbjct: 149 CSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPYT----ADAHDKGVTPSCTKSCRKGYPT 204

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
            ++  ++ +YG   +     ++ +      + GP   +F      K  +  +     VY 
Sbjct: 205 SYS--SDKHYGSKDYIVDAGVSNIQYEIMTN-GPIIVSF------KVYQDFYNYGSGVYH 255

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
              S        VKIVGWG E  + YW I +++G  +G+ G  KILRG+NE  IE+    
Sbjct: 256 -HVSGNYTGNHIVKIVGWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYA 314

Query: 259 ALPK 262
            LPK
Sbjct: 315 VLPK 318


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 78/189 (41%), Gaps = 23/189 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G ++  W W    G+VTGGA+     C+P  FP C  A+   +   C +     P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
              C    YG+ +  DK  I     Y+ P+            GP    F           
Sbjct: 225 KPYCQY-GYGKRYENDK--IKARTWYWLPNDERTIQLEIMQKGPVHATF------NIYED 275

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKILRGR 247
                G VY  +A A +    ++KI+GWG + G  YW I +++   +G D G  +++RG 
Sbjct: 276 FEHYEGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334

Query: 248 NEAIIESLV 256
           N   IE  V
Sbjct: 335 NNCDIEGGV 343


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 96/253 (37%), Gaps = 42/253 (16%)

Query: 27  SCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           +C    AV+TA  ++  +C ++K    V  +    +     RC            C  G 
Sbjct: 113 NCGSCWAVSTAAAISDRICIATKGKKQVYASDTDILTCCGARCGL---------GCRGGW 163

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
               W +    G+V+GG +     C P    PC      T    C  +A P P C  +C 
Sbjct: 164 PIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMA-PTPPCKRKC- 221

Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
                +  F+  Y++       D  +G     +         R   +  G V AV A  E
Sbjct: 222 -----QPGFRGMYRV-------DKRYGEPGRTYTLPRSEVKIRRDIKERGSVVAVFAVYE 269

Query: 205 IVA-----------------YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
             +                 Y  VK++GWG++NG  YW I +++ + +G+ G  +++RG 
Sbjct: 270 DFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKDNGTDYWLIANSWHDDWGENGFFRMIRGI 329

Query: 248 NEAIIESLVNGAL 260
           N   IE  V+  +
Sbjct: 330 NNCGIEEQVDAGI 342


>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
          Length = 255

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 70/165 (42%), Gaps = 14/165 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL-ATPQPK 138
           C  G +   W     +G+VTGG + SN GCQP    PC+H    +S   C +L  T    
Sbjct: 96  CDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMTV 154

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           C  +C N NY   +  D ++ + + +               + P         Y    F 
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV----TALMYVYENFM 210

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQF 235
              +    S + E++ Y  VK++GWG +E+G  YW  ++++   +
Sbjct: 211 GYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWLAMNSWNSNW 255


>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           P15]
          Length = 627

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 41/68 (60%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y+   + ++     V IVGWGEENG PYW   +T+G  +GD+G  KI RG NE  IE+ 
Sbjct: 220 IYSSGPNTKLRGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279

Query: 256 VNGALPKD 263
              ALP D
Sbjct: 280 PGSALPID 287


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 79/192 (41%), Gaps = 29/192 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 136
           C  G ++  W W    G+VTGGA+     C+P  FP C  H     +       ATP  +
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARK 225

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKY 185
           P C        YG+ +  DK  I     Y+ P+            GP    F        
Sbjct: 226 PYCQY-----GYGKRYENDK--IKARTWYWLPNDERTIQLEIMQKGPVHATF------NI 272

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKIL 244
                  NG VY  +A A +    ++KI+GWG + G  YW I +++   +G D G  +++
Sbjct: 273 YEDFEHYNGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVV 331

Query: 245 RGRNEAIIESLV 256
           RG N   IE  V
Sbjct: 332 RGINNCDIEGGV 343


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 40/62 (64%)

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            S  ++    V+++GWGEEN  PYW I +++   +GD G  KI+RG+NE  IES VN  +
Sbjct: 57  VSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGI 116

Query: 261 PK 262
           PK
Sbjct: 117 PK 118


>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
 gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
          Length = 804

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/53 (56%), Positives = 35/53 (66%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 263
           V IVGWGEENG PYW   +T+G  +GD+G  KI RG NE  IE+    ALP D
Sbjct: 235 VMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 80/199 (40%), Gaps = 43/199 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G SS  W +    G+V+GG  +++ GC P S      A   ++ P C +        
Sbjct: 148 CGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSV----QAFRDSTTPNCSSF------- 196

Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPF--WPAFWRSFCTK 184
              CTN  Y + + +DK              QI    +   P    +  +  F+      
Sbjct: 197 ---CTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNGV 253

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKI 243
           Y   L   +GR              +VKI+GWG ENG  YW + +++G  +G  G   K 
Sbjct: 254 YQHVLGNVSGR-------------HSVKILGWGRENGTDYWLVANSWGRDWGRLGGFFKF 300

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG N   IES + G  PK
Sbjct: 301 LRGENHCDIESNILGGDPK 319


>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
          Length = 804

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 41/68 (60%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y+   + ++     V IVGWGEENG PYW   +T+G  +GD+G  KI RG NE  IE+ 
Sbjct: 220 IYSSGPNTKLRGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279

Query: 256 VNGALPKD 263
              ALP D
Sbjct: 280 PGSALPID 287


>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
 gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
          Length = 804

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 41/68 (60%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y+   + ++     V IVGWGEENG PYW   +T+G  +GD+G  KI RG NE  IE+ 
Sbjct: 220 IYSSGPNTKLGGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279

Query: 256 VNGALPKD 263
              ALP D
Sbjct: 280 PGSALPID 287


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 75/170 (44%), Gaps = 16/170 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L  T   
Sbjct: 96  CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 153

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLF 190
            C  +C N NY   +  D ++ + + +               + P    +F   Y   + 
Sbjct: 154 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMG 211

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKG 239
              G +Y  S + E++ Y  VK++GWG + +G  YW  ++++   +G+ G
Sbjct: 212 YKEG-IYK-STTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDG 259


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/184 (26%), Positives = 81/184 (44%), Gaps = 25/184 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     +    + G+VTG  + +N GC+P  F P     Y+T            P+C
Sbjct: 178 CNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLPHTTVEYST------------PEC 225

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-------FDPHFGPFWPAFWRSFCTKYTRPLFQT 192
             +C N  Y + + QDK+   G+ +Y        D  +         +    Y   +F  
Sbjct: 226 SKKCENYQYKKAYKQDKH--FGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYK 283

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           +G VY       +  +A V+IVGWG +     PYW + +++   +G+ G  +I RG +E+
Sbjct: 284 SG-VYQTVFPWPLGGHA-VRIVGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRGTDES 341

Query: 251 IIES 254
            IES
Sbjct: 342 YIES 345


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 78/189 (41%), Gaps = 27/189 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  + +    G+ +GG + S  GC+P          YT +      ++   P+C
Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKP----------YTAA------VSGETPQC 196

Query: 140 HTRCTNDNYGRGFFQD------KYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
              C +  Y + + +D       YQ+NG  L          P    ++   Y    F + 
Sbjct: 197 QKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPV--TAYMEVYED--FYSY 251

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G       S   V    VKI+GWG EN  PYW   +++G  FG+ G  +ILRG N A IE
Sbjct: 252 GTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAANSWGTGFGEDGFFRILRGSNCAGIE 311

Query: 254 SLVNGALPK 262
           S +    P 
Sbjct: 312 SYIVAGYPN 320


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 79/189 (41%), Gaps = 24/189 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    TW +    GL + G + S  GC    F      +Y  ++P    L T    C
Sbjct: 149 CDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPF-----GSYCVNDP----LPTCSRTC 199

Query: 140 H-----TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
                 T   +  YG   ++  +  N +      + GP    F      +     +Q   
Sbjct: 200 QAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQN-GPVVVQF------EVFADFYQYKS 252

Query: 195 RVY-AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            VY  V+ + E   +  V+++GWG ENG  YW + +++G ++GDKG  K +RG N   IE
Sbjct: 253 GVYRHVTGATE--GWHAVRVIGWGVENGVKYWLVANSWGVRWGDKGFFKFVRGENHLGIE 310

Query: 254 SLVNGALPK 262
             V   LPK
Sbjct: 311 DFVYAGLPK 319


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 72/168 (42%), Gaps = 20/168 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C+ G+    +    + G  TG     + GCQP  F  C H   +T  P C ++  P+ K 
Sbjct: 140 CNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDSV--PEYKA 197

Query: 139 --CHTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRP 188
             C   C  D Y R + +D Y       + D           GP   +F     T Y   
Sbjct: 198 DTCSHECQKD-YDRKYEEDLYYGKEQYGFSDEAPIQREIMTNGPVAVSF-----TVYESF 251

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG 236
           L+ + G +Y  +    I  Y  V++VGWG ENG  YW I +++ EQ+G
Sbjct: 252 LYYSGG-IYRSTPGERIKGYHAVRVVGWGVENGTKYWKIANSWNEQWG 298


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 81/197 (41%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  ++G+VT     +  N GC   S P C        EP     A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KCH +C   N      + F  + Y I+      DPH         GP   +F     T Y
Sbjct: 212 KCHRKCVKQNLLWSKSKHFGVNAYMISS-----DPHSIMTELYKNGPVEVSF-----TVY 261

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G VY    + +++    VK++GWG  E+G  YW + + +   +GD G  KI 
Sbjct: 262 EDFAHYKSG-VYK-HVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIR 319

Query: 245 RGRNEAIIESLVNGALP 261
           RG +E  IE  V   LP
Sbjct: 320 RGTDECEIEDEVVAGLP 336


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 62/150 (41%), Gaps = 7/150 (4%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P+P C
Sbjct: 132 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPC 191

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C    Y + + QDK+  N        H         ++   + T  +FQ  G VY  
Sbjct: 192 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 248

Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYW 225
                 + + +    V+++GWG ENG  YW
Sbjct: 249 GIYHHVAGKFIGRHAVRMIGWGVENGVNYW 278


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 85/194 (43%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KCH +C  +N  + + ++K + +N   ++ +PH         GP   AF     T Y   
Sbjct: 213 KCHRKCKVEN--QVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAF-----TVYEDF 265

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G    ++    ++    VK++GWG  + G  YW + + +   +GD G  KI+RG+
Sbjct: 266 AHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGK 323

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   +P
Sbjct: 324 NECGIEEDVTAGMP 337


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 86/213 (40%), Gaps = 43/213 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
            C  +C   N      + F  + Y++N      D H         GP   +F     T Y
Sbjct: 208 ACEKKCVKKNLLWSESKHFSVNAYRVNS-----DQHSIMTEVYTNGPAEVSF-----TVY 257

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G VY     +E+  +A VK++GWG  E+G  YW + + +   +GD G  KI+
Sbjct: 258 EDFAHYKSG-VYKHVTGSEMGGHA-VKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKII 315

Query: 245 RGRNEAIIESLVNGALPKDNYGVEFGEESGERL 277
           RG NE  IE +  G     N  +E G    + L
Sbjct: 316 RGTNECGIEDVTAGMPSTKNLDIESGVRDDDSL 348


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 26/172 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 136
           C  G     W WV + G+VTGG +     C+P +F PC  H       P   + +TP  +
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTR 187
           P C        YG+ + +DK+ +    +  +            GP   AF        T 
Sbjct: 224 PYCQF-----GYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAF-------ITY 271

Query: 188 PLFQT-NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
             F    G +Y      E  A+A VK++GWG ENG  YWT+ +++ + +G K
Sbjct: 272 EDFSPYKGGIYVHVKGRERGAHA-VKLIGWGVENGTKYWTVANSWHDDWGGK 322


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 100/244 (40%), Gaps = 25/244 (10%)

Query: 27  SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           +C    AV+TA  ++  +C ++  +  V  +S   +     +C +          C  G 
Sbjct: 108 NCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGF---------GCGGGW 158

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
           S   W +    G+V+GG + +   C+P    PC H    T   EC   A   P C  +C 
Sbjct: 159 SIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAA-TPPCKKKC- 216

Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR------SFCTKYTRPLFQTNGRVYA 198
              Y + F  DK Q   +    +P          R      SF       L++T   VY 
Sbjct: 217 QPGYKKIFRMDKRQ-GKVAYGVEPKEEAIQREILRHGPVVASFAVYEDFSLYKTG--VYK 273

Query: 199 VSASAEIVAYATVKIVGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
            +A A +  Y  VK++GWG ++     YW I +++   +G+ G  + +RG N+  IE  V
Sbjct: 274 HTAGA-LRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTV 332

Query: 257 NGAL 260
              +
Sbjct: 333 AAGI 336


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 82/204 (40%), Gaps = 36/204 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +  + G+VT     +   TGC               S P C+ L  P P
Sbjct: 168 CDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC---------------SHPGCEPL-YPTP 211

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPL 189
           KCH +C   N         Y +N   +  DP          GP   +F     T Y    
Sbjct: 212 KCHRKCVKGNV-LWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSF-----TVYEDFA 265

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY       +  +A VK++GWG  E G  YW IV+++   +G+ G  KI RG N
Sbjct: 266 HYKSG-VYKHVTGGNMGGHA-VKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTN 323

Query: 249 EAIIESLVNGALPKD-NYGVEFGE 271
           E  IE  V   LP   N  VE G+
Sbjct: 324 ECGIEHSVVAGLPSARNLNVELGD 347


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 91/220 (41%), Gaps = 38/220 (17%)

Query: 27  SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
           +C    AV+TA  L+  +C S+    ++++  T              L   +   + C  
Sbjct: 118 NCGSCWAVSTAAALSDRICISTNGTKQVNISATDI------------LTCCYKCGYGCQG 165

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G     W +V + G VTGG   + + C+   FPPC H    T   EC   A   PKC T 
Sbjct: 166 GWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRAR-TPKCRTS 224

Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
           CT   Y   +  DK  I G   Y  P+            GP   AF     T Y    + 
Sbjct: 225 CT-PGYKNSYSDDK--IRGKDAYELPNSVKAIQREIMKNGPVVAAF-----TVYADFSYY 276

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
             G +Y  +A     ++A VK++GWGEE   PYW + +++
Sbjct: 277 KKG-IYKHTAGRARGSHA-VKVIGWGEEGDVPYWIVKNSW 314


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 76/189 (40%), Gaps = 20/189 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             +C +       +QD++   G   Y  P+           F     +  F T   ++A 
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299

Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            +         +     VK++GWG ENG  YW + +++G ++G+ G  KI+RG N   IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKIVRGENHCGIE 359

Query: 254 SLVNGALPK 262
             ++  LP 
Sbjct: 360 ENIHAGLPN 368


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 67/152 (44%), Gaps = 22/152 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG + S  GC+P + PPC H +   S P C       PKC
Sbjct: 83  CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEH-HVNGSRPSCSGEGGDTPKC 141

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
             +C +  Y   + +DK  I G   Y  P             GP   AF     T Y   
Sbjct: 142 VQKC-DSGYTPAYEKDK--IYGQSAYSVPSSPESIMEEIYKDGPVEGAF-----TVYEDF 193

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
           L   +G VY    + E V    +KI+GWG EN
Sbjct: 194 LLYKSG-VYQ-HHTGEAVGGHAIKILGWGIEN 223


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 84/201 (41%), Gaps = 38/201 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPV-------------SFPPCNHANYTTSE 126
           C+ G     W      G  TGG      GC+P               + PC +  Y    
Sbjct: 153 CNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYG-- 210

Query: 127 PECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF 177
            EC  +A   P+C  RC         ++  YG+  +  K  +  +      + GP   +F
Sbjct: 211 -ECVGMAD-TPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKN-GPVVASF 267

Query: 178 --WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
             +  F   Y   +++          + E+  Y  VKI+GWG+EN   +W I +++ + +
Sbjct: 268 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKIIGWGKENNTDFWLIANSWHQDW 318

Query: 236 GDKGTIKILRGRNEAIIESLV 256
           G+KG  +I+RG+NE  IE+ V
Sbjct: 319 GEKGYFRIVRGKNECGIETDV 339


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 86/194 (44%), Gaps = 35/194 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C  +N  + + ++K + +N   ++ +PH         GP   AF  ++C      
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAF--TYCQILDFA 264

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
            +++   VY    +  ++    VK++GWG  + G  YW + + +   +GD G  KI+RG 
Sbjct: 265 HYKSG--VYK-HITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGE 321

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   +P
Sbjct: 322 NECGIEGDVTAGMP 335


>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
          Length = 155

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 67/162 (41%), Gaps = 20/162 (12%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +   +G+ TGG + S  GC P   PPC       +         P  + 
Sbjct: 5   CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 59

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQ 191
           H +C    YG    Q +Y++    +   P+        +GP   +F           L  
Sbjct: 60  H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASF------NLFDDLSA 112

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
               +Y  +  A+ ++  ++KI+GWG+ENG PYW  V+++ +
Sbjct: 113 YKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWSK 154


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 95/241 (39%), Gaps = 48/241 (19%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    AVA A+ ++   C    +       R  AG    C  +       + C+ G   
Sbjct: 116 SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGYPE 166

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-- 144
             W +    G+V+         CQP  FP C H   ++    C       P C++ CT  
Sbjct: 167 VAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTCNSTCTDK 218

Query: 145 ---------NDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
                    N +Y   G   F+ +  +NG          PF  +F     + Y   L  T
Sbjct: 219 KVPLIKYRGNTSYLLSGEESFKRELLLNG----------PFEVSF-----SVYADFLAYT 263

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
            G VY   A   +  +A V+IVGWGE NG PYW I +++  ++G  G   I RG +E  I
Sbjct: 264 GG-VYKHVAGTFLGGHA-VRIVGWGELNGEPYWKIANSWNREWGMNGYFLIARGVDECGI 321

Query: 253 E 253
           E
Sbjct: 322 E 322


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 66/238 (27%), Positives = 96/238 (40%), Gaps = 45/238 (18%)

Query: 52  VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTG--GAH 103
           VEC   RF   +    +  V+       +M    C  G   S W ++ + G+VT     +
Sbjct: 132 VECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPY 191

Query: 104 HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQING 161
               GC+    P C        EP     A P P C  +C   N     +Q+K  + IN 
Sbjct: 192 FDQVGCK---HPGC--------EP-----AYPTPACEKKCKVQNQ---VWQEKKHFSINA 232

Query: 162 LGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKI 213
             +  DPH         GP   AF     T Y       +G VY    + E++    VK+
Sbjct: 233 YRVNSDPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYE-HITGEMMGGHAVKL 285

Query: 214 VGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
           +GWG   +G+ YW + + +   +GD G  KI+RG+NE  IE  V   +P     V  G
Sbjct: 286 IGWGTSADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNTVRTG 343


>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
          Length = 807

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 38/66 (57%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y    + ++     V IVGWGEENG PYW   +T+G  +GD G  +I RG NE  IE+ 
Sbjct: 220 IYVSGPNTKLSGGHAVMIVGWGEENGVPYWDCANTYGTNWGDHGYFRIKRGSNELKIETW 279

Query: 256 VNGALP 261
              ALP
Sbjct: 280 PGAALP 285


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 76/189 (40%), Gaps = 20/189 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             +C +       +QD++   G   Y  P+           F     +  F T   ++A 
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299

Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            +         +     VK++GWG ENG  YW + +++G ++G+ G  K++RG N   IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359

Query: 254 SLVNGALPK 262
             ++  LP 
Sbjct: 360 ENIHAGLPN 368


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 76/189 (40%), Gaps = 20/189 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             +C +       +QD++   G   Y  P+           F     +  F T   ++A 
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299

Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            +         +     VK++GWG ENG  YW + +++G ++G+ G  K++RG N   IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359

Query: 254 SLVNGALPK 262
             ++  LP 
Sbjct: 360 ENIHAGLPN 368


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 78/196 (39%), Gaps = 41/196 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPK 138
           C  G     W +  + G+VT         C P   P  C H       P C+  A P PK
Sbjct: 22  CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 66

Query: 139 CHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYT 186
           C  +C   N      + F  D Y+IN      DPH         GP   AF     T Y 
Sbjct: 67  CEKKCKEQNQVWQEKKHFSIDAYRINS-----DPHDIMAEVYKNGPVEVAF-----TVYE 116

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILR 245
                 +G    ++    I+    VK++GWG  + G  YW + + +   +GD G  KI+R
Sbjct: 117 DFAHYKSGVYKHITGG--IMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIR 174

Query: 246 GRNEAIIESLVNGALP 261
           G+NE  IE  V   +P
Sbjct: 175 GKNECGIEEGVVAGMP 190


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 47/230 (20%)

Query: 52  VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHS 105
           VEC   RF   +       V+       +M    C  G     W +  + G+VT      
Sbjct: 127 VECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVT------ 180

Query: 106 NTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTNDNY----GRGFFQDKYQIN 160
              C P   P  C H       P C+  A P PKC  +C   N      + F  D Y+IN
Sbjct: 181 -DECDPYFDPVGCKH-------PGCEP-AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231

Query: 161 GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
                 DPH         GP   AF     T Y       +G    ++    I+    VK
Sbjct: 232 S-----DPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--IMGGHAVK 279

Query: 213 IVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           ++GWG  + G  YW + + +   +GD G  KI+RG+NE  IE  V   +P
Sbjct: 280 LIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/188 (25%), Positives = 76/188 (40%), Gaps = 20/188 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             +C +       +QD++   G   Y  P+           F     +  F T   ++A 
Sbjct: 244 SNKCRSGYNVTDVWQDRHI--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299

Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
            +         +     VK++GWG ENG  YW + +++G ++G+ G  K++RG N   IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359

Query: 254 SLVNGALP 261
             ++  LP
Sbjct: 360 ENIHAGLP 367


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 67/159 (42%), Gaps = 23/159 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
           C+ G   S W +   +G+VTG  +++  GCQP  FPPC H         +     P CK 
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223

Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
             T Q   +    ND  YG+  ++ K     +      H GP    F  +  F   Y   
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 227
           ++Q          S  ++    V+++GWGEEN  PYW I
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLI 310


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 47/230 (20%)

Query: 52  VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHS 105
           VEC   RF   +       V+       +M    C  G     W +  + G+VT      
Sbjct: 127 VECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVT------ 180

Query: 106 NTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTNDNY----GRGFFQDKYQIN 160
              C P   P  C H       P C+  A P PKC  +C   N      + F  D Y+IN
Sbjct: 181 -DECDPYFDPVGCKH-------PGCEP-AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231

Query: 161 GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
                 DPH         GP   AF     T Y       +G    ++    I+    VK
Sbjct: 232 S-----DPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--IMGGHAVK 279

Query: 213 IVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           ++GWG  + G  YW + + +   +GD G  KI+RG+NE  IE  V   +P
Sbjct: 280 LIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 76/195 (38%), Gaps = 34/195 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G    T+ +  K GL +GG +HS  GC+P  F                       KC
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGGATQD------------VNIVLKC 162

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP------------HFGPFWPAFWRSFCTKYTR 187
             +C    Y   + QD    +G   Y  P              GP   +F          
Sbjct: 163 DRQC-QAGYPLTYSQD--LKHGASSYILPWGDENAMKAEIYQNGPIVTSF------DVYG 213

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
             FQ    VY     A   ++A V+++GWG ENG  YW   +++ E++G+ G  KI+RG 
Sbjct: 214 DFFQYRSGVYRHVTGAYKGSHA-VRVIGWGVENGVKYWLCANSWNERWGENGFFKIVRGE 272

Query: 248 NEAIIESLVNGALPK 262
           N   +E +    LPK
Sbjct: 273 NHVGVEDISYAGLPK 287


>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
          Length = 71

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ++I+GWG E G PYW I + +   +GD G IK+LRG++   IES + G LPK
Sbjct: 19  IRILGWGVEEGVPYWLIANNWNTDWGDNGYIKLLRGKDHCGIESQITGGLPK 70


>gi|123483120|ref|XP_001323959.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121906833|gb|EAY11736.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 255

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 6/77 (7%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
           YT  LF+   R Y    +       TV+I+GWG+E G PYW I++ +G  +G+ G ++I 
Sbjct: 181 YTGGLFEDPPRDYIADRTH------TVEIIGWGQEKGIPYWIILNQYGRLWGENGMMRIR 234

Query: 245 RGRNEAIIESLVNGALP 261
            GR++A +ES V  A P
Sbjct: 235 MGRDDARVESYVLAAEP 251


>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
          Length = 125

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 36/57 (63%)

Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +    V+++GWG EN  PYW + +++ + +GD GT KILRG NEA IE   N   P+
Sbjct: 65  IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVGYPQ 121


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 38/52 (73%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           VKI+GWG E+G  YW + +++ E++G+ G  +I+RGR+E  IES ++ ALP 
Sbjct: 273 VKILGWGVEDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALPD 324


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 76/199 (38%), Gaps = 39/199 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G       +   +G+VTGG    + GC P S+  C+  +   + P+CK       +C
Sbjct: 148 CQGGFVLEAMKFWKSKGVVTGGDFQGD-GCIPYSYGSCSDCHTAQTTPKCKN------EC 200

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             + T + Y     +DKY            +G        S   +  +     NG V A 
Sbjct: 201 QVKYTKNEYK----EDKY------------YGSSAYRLSTSNAVRTIQSEILRNGPVEAT 244

Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
               E   Y                  VKI+GWG E    YW I +++G  FG+ G  K+
Sbjct: 245 YQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGVEENVNYWLIANSWGTGFGENGFFKM 304

Query: 244 LRGRNEAIIESLVNGALPK 262
            RG NE  IE+ V   + K
Sbjct: 305 RRGNNECGIENYVVAGMAK 323


>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
          Length = 228

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++ K G  TGG++ +  GC+P S  PC      T+ P C T     P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209

Query: 140 HTRCTNDNYGRGFFQDKY 157
             +CTN NY   +  DK+
Sbjct: 210 VNKCTNSNYNVAYKDDKH 227


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 38/57 (66%)

Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           E+  + +V+I+GWGE+ G  YW + +++G Q+G+ G  +I RG NEA IES V   L
Sbjct: 364 ELKGFHSVRIIGWGEDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/191 (27%), Positives = 75/191 (39%), Gaps = 32/191 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 138 --KCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
              C T    D          Y +N          H GP           +  R  F  +
Sbjct: 296 ANGCQTPVNVDRDTLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYS 349

Query: 194 GRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           G VY  +A+    +  + +VK+VGWGEE NG  YW   +++G  +G+ G  +ILRG NE 
Sbjct: 350 GGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNEC 409

Query: 251 IIESLVNGALP 261
            IE  V  + P
Sbjct: 410 GIEDYVLASWP 420


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/250 (26%), Positives = 97/250 (38%), Gaps = 48/250 (19%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    AVA A+ ++   C    +       R  AG    C  +       + C+ G   
Sbjct: 116 SCGSCWAVAAASAMSDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGYPE 166

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-- 144
             W +    G+V+         CQP  FP C H   ++    C       P C++ CT  
Sbjct: 167 VAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTCNSTCTDK 218

Query: 145 ---------NDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
                    N +Y   G   F+ +  +NG          PF  +F     + Y   +  T
Sbjct: 219 KIPLIKYRGNTSYILSGEESFKRELLLNG----------PFEVSF-----SVYADFVAYT 263

Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
            G VY       +  +A V+IVGWGE NG PYW I +++  ++G  G   I RG +E  I
Sbjct: 264 GG-VYKHVTGVFLGGHA-VRIVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVDECGI 321

Query: 253 ESLVNGALPK 262
           E      +P+
Sbjct: 322 EGSGVAGIPR 331


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 77/191 (40%), Gaps = 32/191 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQQRDTCKIRHNSRSLR 295

Query: 140 HTRC-TNDNYGRGFFQD---KYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
              C T  N  R  F      Y +N          H GP           +  R  F   
Sbjct: 296 ANGCQTPYNVDRDTFYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYA 349

Query: 194 GRVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           G VY  +A+  +    + +VK+VGWGEE NG  YW   +++G  +G++G  +ILRG NE 
Sbjct: 350 GGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWIAANSWGPWWGERGYFRILRGSNEC 409

Query: 251 IIESLVNGALP 261
            IE  V  + P
Sbjct: 410 GIEEYVLASWP 420


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 85/213 (39%), Gaps = 43/213 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
            C  +C   N      + F  + Y++N      D H         GP   +F     T Y
Sbjct: 208 ACEKKCVKKNLLWSESKHFSVNAYRVNS-----DQHSIMTEVYTNGPAEVSF-----TVY 257

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G VY     +E+  +A VK++GWG  E+G  YW + + +   +G  G  KI+
Sbjct: 258 EDFAHYKSG-VYKHVTGSEMGGHA-VKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKII 315

Query: 245 RGRNEAIIESLVNGALPKDNYGVEFGEESGERL 277
           RG NE  IE +  G     N  +E G    + L
Sbjct: 316 RGTNECGIEDVTAGTPSTKNLDIESGVRDDDSL 348


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            + E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES V   +
Sbjct: 204 VTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 263

Query: 261 PK-DNY 265
           P+ D Y
Sbjct: 264 PRTDQY 269


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 85/194 (43%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W +  + G+VT     +   TGCQ    P C        EP     A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---HPGC--------EP-----AYPTP 212

Query: 138 KCHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KCH +C  +N  + + ++K+  +N   ++ +PH         GP   AF     T Y   
Sbjct: 213 KCHRKCKVEN--QVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAF-----TVYEDF 265

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G    ++    ++    VK++GWG  + G  YW + + +   +G  G  KI+RG+
Sbjct: 266 AHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGGDGYFKIIRGK 323

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   +P
Sbjct: 324 NECGIEEDVTAGMP 337


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 69/161 (42%), Gaps = 19/161 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G+    W +V + G+VTGG +     C+P    PC       S P   +  TP   C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSCPRDHSFRTPA--C 222

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLF 190
              C    YG+ + +DK  +  + +  +            GP   AF     T Y    F
Sbjct: 223 KKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----TTYEDFSF 276

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
              G +Y  S   +  A+A VK+VGWG ENG  YW + +++
Sbjct: 277 YRKG-IYVHSYGRQRGAHA-VKVVGWGVENGTKYWNVANSW 315


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 101/236 (42%), Gaps = 33/236 (13%)

Query: 33  AVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
           AV+TA+ L+  +C +SK    +  +S   ++  K          +  + C  G     + 
Sbjct: 124 AVSTASALSDRICIASKGETQLHISSIDIVSCCK----------LCGYGCDGGWPIEAFD 173

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEP---ECKTLATP------QPKCH 140
           +  ++G VTG    S  GC+P  F P   + N T        CK   T         + H
Sbjct: 174 YFSRQGAVTGETT-SKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNH 232

Query: 141 TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
           TR T     R    +  Q +  G   D   GP    F     T Y    +   G +Y   
Sbjct: 233 TRRTGLTARRLRITEFCQSHSEG---DHGNGPVVAVF-----TVYEDFSYYKKG-IYVHI 283

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           A     A+A +KI+GWG ENG PYW I +++ + +G++G  +I+RG NE  IE  V
Sbjct: 284 AGKARGAHA-IKIIGWGVENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEV 338


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 1/66 (1%)

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            + E++    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES V   +
Sbjct: 84  VTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 143

Query: 261 PK-DNY 265
           P+ D Y
Sbjct: 144 PRTDQY 149


>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 180

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 36/75 (48%), Gaps = 6/75 (8%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G   S W+WVH +G+ TGG + +      + GC P  FPPC H    T  P+C    
Sbjct: 100 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPEGL 159

Query: 134 TPQPKCHTRCTNDNY 148
            P P C  +C N  Y
Sbjct: 160 YPTPNCVEQCHNPKY 174


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 39/55 (70%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           + ++ ++  ++I+GWGEEN  PYW I +++ E +G+ G  +ILRGR+E  IES V
Sbjct: 35  TGKLFSWHAIRIIGWGEENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEV 89


>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 328

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 67/165 (40%), Gaps = 24/165 (14%)

Query: 88  TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH-------ANYTTSEPECKTLATPQPKCH 140
            W ++   GLV+GG ++++ GCQP   PP           NYT ++             H
Sbjct: 161 VWEYLKSHGLVSGGKYNTSDGCQPSKIPPIEEYMEYSEIKNYTCNDHCYGNKTINYNDDH 220

Query: 141 TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
            + +N      ++Q +Y+     +    ++GP    F+         P    N R     
Sbjct: 221 VKVSN------YYQVQYEDIQEEV---QNYGPVSVEFY--IRDDIFTPFLSINPRFQRRK 269

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
                     VK++GWG ENG  YW +V ++G + G  G  K+ R
Sbjct: 270 YKG------YVKLIGWGVENGEDYWLLVDSWGYERGQNGVFKVER 308


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 54/202 (26%), Positives = 86/202 (42%), Gaps = 22/202 (10%)

Query: 27  SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           SC    A      ++  +C  S  K++VE ++   ++  K  C            C+ G 
Sbjct: 19  SCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLECGN---------GCNGGY 69

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
            S  W +    GLV+GG ++S+ GC+P S  PC H +   S P+C +     P+C  RC 
Sbjct: 70  PSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEH-HVNGSRPKC-SGEIETPRCSRRC- 126

Query: 145 NDNYGRGFFQDKYQINGLGLY-FDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAV 199
              Y   + +DK+   GL  Y             +++   +    +F+        VY  
Sbjct: 127 EAGYSPKYSEDKHY--GLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSGVYQH 184

Query: 200 SASAEIVAYATVKIVGWGEENG 221
                I  +A +KI+GWGEENG
Sbjct: 185 KTGGSIGGHA-IKILGWGEENG 205


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 40/69 (57%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  + +A  +    VK++GWG E G PYW +V+++  Q+GD G  KI RG +E  I+S 
Sbjct: 61  VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSA 120

Query: 256 VNGALPKDN 264
               +P  N
Sbjct: 121 TTAGVPVTN 129


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 79/198 (39%), Gaps = 33/198 (16%)

Query: 74  WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           WM    C  G     W +  + G+VT         C P         +   S P C+   
Sbjct: 145 WMCGAGCDGGSPIDAWRYFVQSGVVT-------EECDPY------FDDIGCSHPGCEP-G 190

Query: 134 TPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTK 184
            P PKC  +C + N  + + + K + +N   +  DPH         GP   AF     T 
Sbjct: 191 FPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAF-----TV 243

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKI 243
           Y       +G    ++  A  +    VK++GWG  E+G  YW + + +   +GD G  KI
Sbjct: 244 YEDFAHYKSGVYKHITGDA--MGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI 301

Query: 244 LRGRNEAIIESLVNGALP 261
            RG NE  IE  V   LP
Sbjct: 302 KRGTNECGIEGAVVAGLP 319


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 87/213 (40%), Gaps = 35/213 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 174 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 217

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           KC  +C + N        YG   ++ +   + +      + GP   AF     T Y    
Sbjct: 218 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 271

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY       I  +A VK++GWG  ++G  YW + + +   +GD G  KI RG N
Sbjct: 272 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 329

Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERLSEEF 281
           E  IE  V   LP D   V+    S + L   F
Sbjct: 330 ECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF 362


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%)

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            S EI+    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES +   +
Sbjct: 57  VSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGM 116

Query: 261 P 261
           P
Sbjct: 117 P 117


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 98/248 (39%), Gaps = 53/248 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           C  G  +    ++   G+VTGG +       ++ GC P  FP CNH            + 
Sbjct: 114 CRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCNHV---------PGMK 164

Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYF--DPHFGPFWPAFWRSFCTKYTRPLFQ 191
              P+C ++        G        +GL      D H    W     S   K  + +F 
Sbjct: 165 VKYPRCGSKV-------GRLAAPSHCDGLHCRRAGDVHRAKSWGRLPISP-EKIKQEIFD 216

Query: 192 TNGRVYAVSASAE----------------IVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
            NG V A+    E                +V   T+K++GWG E G+ YW  V+++ E++
Sbjct: 217 -NGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVNSWNEEW 275

Query: 236 GDKGTIKILRGRNEAIIES-------LVNGALPKDNYGVEFG---EESGERLSEEFGVRA 285
           GD+G IK+  G+N    ES        VN  L +D    E G   +++  +L E+  V  
Sbjct: 276 GDQGKIKLAVGKNALDEESRQQVPRRAVN-ELDEDAMMAESGAKTQKAMAQLKEDVFVEK 334

Query: 286 ESSEEFRE 293
           +    F E
Sbjct: 335 QVHSHFEE 342


>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 254

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 1/86 (1%)

Query: 74  WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
           ++  + C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T  
Sbjct: 126 YLCGYGCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGHSCTTFN 185

Query: 134 TPQ-PKCHTRCTNDNYGRGFFQDKYQ 158
             + P C  +C N NY   F  D Y+
Sbjct: 186 REETPTCEKKCNNPNYYTSFRADIYR 211


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 87/213 (40%), Gaps = 35/213 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 105 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 148

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           KC  +C + N        YG   ++ +   + +      + GP   AF     T Y    
Sbjct: 149 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 202

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY       I  +A VK++GWG  ++G  YW + + +   +GD G  KI RG N
Sbjct: 203 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 260

Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERLSEEF 281
           E  IE  V   LP D   V+    S + L   F
Sbjct: 261 ECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF 293


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 67/253 (26%), Positives = 100/253 (39%), Gaps = 41/253 (16%)

Query: 5   TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
           T + IRD S             SC    AVA A+ ++   C    +       R  AG  
Sbjct: 107 TVTEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 149

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C  +       + C+ G     W +    G+V+         CQP  FP C H   ++
Sbjct: 150 MSCCDVCG-----FGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 197

Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRS 180
               C       P C++ CT+       ++    Y ++G   +       GPF  +F   
Sbjct: 198 DLSPCSG-EYDTPTCNSTCTDKKIPLIKYRGNTSYVLSGEEPFKRELILNGPFEVSF--- 253

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
             + Y   +  T G VY   A   +  +A V+IVGWGE NG PYW I +++  ++G  G 
Sbjct: 254 --SVYADFVAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANSWNREWGMNGY 309

Query: 241 IKILRGRNEAIIE 253
             I RG +E  IE
Sbjct: 310 FLIARGVDECGIE 322


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 73/182 (40%), Gaps = 36/182 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G ++  W +    G+V+GG ++S+ GCQP S     +A  +     C+         
Sbjct: 146 CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVASKCVKACQNDKYDV--- 202

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
                + +YG  F+  +  +  +                        +    TNG V A 
Sbjct: 203 -KYDDDKHYGDSFYTLETNVTQI------------------------QTEILTNGPVMAT 237

Query: 200 SASAEIVAY-------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAI 251
               E + Y       + V I+ WG E G PYW I +++G  +GD G  IKI RG NE  
Sbjct: 238 FNVFEDIIYYKSGIQLSNVSILRWGTEEGVPYWLIANSWGTWWGDLGGFIKIKRGTNECA 297

Query: 252 IE 253
           IE
Sbjct: 298 IE 299


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 74/190 (38%), Gaps = 31/190 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C  G   + W ++HK+G+V       +  C P          YT     CK   +   K 
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHSRSLKA 295

Query: 139 --CHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNG 194
             C      D          Y +N          H GP           +  R  F  +G
Sbjct: 296 NGCQKPVNVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYSG 349

Query: 195 RVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            VY  +A+       + +VK+VGWGEE NG  YW   +++G  +G+ G  +ILRG NE  
Sbjct: 350 GVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECG 409

Query: 252 IESLVNGALP 261
           IE  V  + P
Sbjct: 410 IEEYVLASWP 419


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 69/162 (42%), Gaps = 33/162 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H    +  P      T  P+C
Sbjct: 46  CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDT--PRC 103

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
           +  C    Y   + +DK            HFG  + ++  S   K        NG V  A
Sbjct: 104 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 148

Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYW 225
            +  ++ + Y +               ++I+GWG ENG PYW
Sbjct: 149 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYW 190


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 86/206 (41%), Gaps = 29/206 (14%)

Query: 27  SCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
           SC    A      ++  VC  SK  ++VE ++   ++     C +          C+ G 
Sbjct: 37  SCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLLSCCGMECGF---------GCNGGY 87

Query: 85  SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
            S  W +  + GLV+GG   S+ GC+P + PPC H +   S P C       PKC  +C 
Sbjct: 88  PSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEH-HVNGSRPSCTGEEGDTPKCVMQC- 145

Query: 145 NDNYGRGFFQDKY--------QINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGR 195
              Y   +F+DK+          N   +  + +  GP   AF     T Y   L   +G 
Sbjct: 146 EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAF-----TVYEDFLQYKSGV 200

Query: 196 VYAVSASAEIVAYATVKIVGWGEENG 221
              V+  A  V    ++I+GWG E+G
Sbjct: 201 YKHVTGDA--VGGHAIRILGWGVESG 224


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 78/191 (40%), Gaps = 32/191 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 252 CDGGHLDAAWRFLHKKGVV-------DDSCYP----------YTQQRDTCKIRHNSRSLK 294

Query: 140 HTRCT-NDNYGRGFFQD---KYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
              C  + N  R  F      Y +N  G       H GP           +  R  F  +
Sbjct: 295 ANGCRPSPNVDRDSFYTVGPAYTLNREGDIMAEIYHSGPVQATM------RVYRDFFSYS 348

Query: 194 GRVYAVSASAEIV--AYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           G +Y  +A+       + +VK+VGWGEE NG  YW   +++G  +G++G  +ILRG NE 
Sbjct: 349 GGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNEC 408

Query: 251 IIESLVNGALP 261
            IE  V  + P
Sbjct: 409 GIEEYVLASWP 419


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 85/222 (38%), Gaps = 42/222 (18%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  SK      +F F A     C W        + C+ G   
Sbjct: 52  SCGSCWAFGAVEAMSDRVCIHSK---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 103

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W +   +G+V+GG + SN GC P    PC H    T  P CK      PKC  +C  D
Sbjct: 104 AAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPKCVKKC-ED 160

Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
            Y   + QD ++                 A+  S      R    TNG V  A +   + 
Sbjct: 161 GYKVPYEQDLHRGKS--------------AYSLSNDVDQIRQEIYTNGPVEGAFTVYEDF 206

Query: 206 VAY---------------ATVKIVGWGEENGR-PYWTIVSTF 231
           +AY                 ++I+GWG +NG  PYW + +++
Sbjct: 207 IAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSW 248


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 1/78 (1%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ GI    W +    G+VTGG++ ++TGCQP  FP C H + + +   C+      P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220

Query: 140 HTRCTNDNYGRGFFQDKY 157
           +  C  D Y   +  DKY
Sbjct: 221 YQTCQPD-YAIQYENDKY 237


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 79/194 (40%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +    G+VT     +  N GC   S P C        EP       P P
Sbjct: 186 CDGGYPMYAWRYFVHHGVVTEECDPYFDNIGC---SHPGC--------EP-----GFPTP 229

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C + N  + + Q K Y +N   +  DPH         GP   +F     T Y   
Sbjct: 230 KCVRKCIDKN--QLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSF-----TVYEDF 282

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY    + E++    VK++GWG  +NG  YW + + +   +GD G  KI RG 
Sbjct: 283 AHYKSG-VYK-HITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGT 340

Query: 248 NEAIIESLVNGALP 261
           NE  IE      LP
Sbjct: 341 NECGIEDDAVAGLP 354


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 77/196 (39%), Gaps = 41/196 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 138
           C  G   S W +  + G+VT         C P      C H       P C+  A P P 
Sbjct: 164 CDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKH-------PGCEP-AYPTPV 208

Query: 139 CHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYT 186
           C  +C   N      + F  D YQ+N      DPH         GP   AF     T Y 
Sbjct: 209 CEKKCKVQNQVWQEKKHFSIDAYQVNS-----DPHDIMAEVYKNGPVEVAF-----TVYE 258

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILR 245
                 +G    ++    ++    VK++GWG  + G  YW + + +   +GD G  KI+R
Sbjct: 259 DFAHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIR 316

Query: 246 GRNEAIIESLVNGALP 261
           G+NE  IE  V   +P
Sbjct: 317 GKNECGIEEDVTAGMP 332


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/53 (49%), Positives = 34/53 (64%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
            V + GWG ENG PYW + +++G  +G+KG  KILRG N   IES V   +PK
Sbjct: 228 AVLLCGWGVENGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVPK 280


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 86/210 (40%), Gaps = 38/210 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECK-----TLAT 134
           CS G   + W +V K G V       N  C P          Y +++  CK     TL T
Sbjct: 252 CSGGHLDTAWNYVRKVGTV-------NDECYP----------YISAQNACKIRPSDTLIT 294

Query: 135 PQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
                 T+    N Y  G          + +    H GP           +  R  F   
Sbjct: 295 ANCDLPTKVDRTNMYKMGPAFSLNNETDIMIEIKKH-GPVQAIL------RVHRDFFSYK 347

Query: 194 GRVY----AVSASAEIVAYATVKIVGWGEE-NG---RPYWTIVSTFGEQFGDKGTIKILR 245
             +Y    A SA  E   Y +V+++GWGEE NG     YW  V+++G  +G+ G  +I+R
Sbjct: 348 SGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVR 407

Query: 246 GRNEAIIESLVNGALPKDNYGVEFGEESGE 275
           G+NE  IES V  +LP  +  V+   + GE
Sbjct: 408 GQNECEIESYVLASLPYVHQQVKPMRQVGE 437


>gi|123469339|ref|XP_001317882.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121900627|gb|EAY05659.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 241

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 35/51 (68%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V+++GWG+ENG  YW +++  G+ +G  GT+ I  G NE +IES + GA P
Sbjct: 188 VELIGWGKENGVEYWILLNQHGKNWGINGTMHIKMGSNEGLIESFIYGATP 238


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 81/203 (39%), Gaps = 51/203 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G     W +    G+VT     +  NTGC   S P C        EP       P P
Sbjct: 169 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 212

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSF 181
           KC  +C + N        YG G     Y+IN      DP          GP   AF    
Sbjct: 213 KCERKCVSRNQLWGESKHYGVG----AYRINP-----DPQDIMAEVYKNGPVEVAF---- 259

Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGT 240
            T Y       +G VY      +I  +A VK++GWG  ++G  YW + + +   +GD G 
Sbjct: 260 -TVYEDFAHYKSG-VYKYITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGY 316

Query: 241 IKILRGRNEAIIESLVNGALPKD 263
            KI RG NE  IE  V   LP +
Sbjct: 317 FKIRRGTNECGIEQSVVAGLPSE 339


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 83/200 (41%), Gaps = 37/200 (18%)

Query: 74  WMTIWVCSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKT 131
           WM    C  G     W +  + G+VT     +  + GC   S P C        EP    
Sbjct: 131 WMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGC---SHPGC--------EP---- 175

Query: 132 LATPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFC 182
              P PKC  +C + N  + + + K + +N   +  DPH         GP   AF     
Sbjct: 176 -GFPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAF----- 227

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTI 241
           T Y       +G VY    + +++    VK++GWG  ++G  YW + + +   +GD G  
Sbjct: 228 TVYEDFAHYKSG-VYK-HITGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYF 285

Query: 242 KILRGRNEAIIESLVNGALP 261
           KI RG NE  IE  V   LP
Sbjct: 286 KIRRGTNECGIEEDVVAGLP 305


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 74/200 (37%), Gaps = 63/200 (31%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G +   W W+ K+G+ T         C P          Y +            P C
Sbjct: 121 CEGGYADRVWNWIQKKGITT-------EQCLP----------YVSGSGRV-------PTC 156

Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
            ++C N  N  R F                         W SF +K        NG VYA
Sbjct: 157 PSKCKNGSNIVRSFVSS----------------------WGSFNSKTVMDEVANNGPVYA 194

Query: 199 --------VSASAEIVAYAT--------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
                   ++  + I  + T        V ++GWG ENG PYW + +++G  +G+KG  +
Sbjct: 195 CFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQNSWGSGWGEKGFFR 254

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I RG N+  I+ +    LPK
Sbjct: 255 IRRGTNDCHIDEIFYSGLPK 274


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 77/168 (45%), Gaps = 18/168 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP--QP 137
           C+ G   + W++  ++G+V+GG + S  GC+P    PC H +   + P C   +TP  Q 
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPSCQH 213

Query: 138 KCHTR-----CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
           KC          + N+G   +  +  +  +      + GP   AF     T Y   +   
Sbjct: 214 KCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTN-GPVEGAF-----TVYEDLILYK 267

Query: 193 NGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDK 238
           +G VY      E+  +A ++I+GWG   E+  PYW I +++   +GD 
Sbjct: 268 SG-VYQHEHGKELGGHA-IRILGWGVWGESKVPYWLIGNSWNTDWGDN 313


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 81/203 (39%), Gaps = 51/203 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G     W +    G+VT     +  NTGC   S P C        EP       P P
Sbjct: 191 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 234

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSF 181
           KC  +C + N        YG G     Y+IN      DP          GP   AF    
Sbjct: 235 KCERKCVSRNQLWGESKHYGVG----AYRINP-----DPQDIMAEVYKNGPVEVAF---- 281

Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGT 240
            T Y       +G VY      +I  +A VK++GWG  ++G  YW + + +   +GD G 
Sbjct: 282 -TVYEDFAHYKSG-VYKYITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGY 338

Query: 241 IKILRGRNEAIIESLVNGALPKD 263
            KI RG NE  IE  V   LP +
Sbjct: 339 FKIRRGTNECGIEQSVVAGLPSE 361


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 38/61 (62%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           + E +    +KI+GWG E+G  YW + +++ E +GD+G  KILRG +E  IES +    P
Sbjct: 32  TGEALGGHAIKILGWGNEDGHDYWLVANSWNEDWGDQGFFKILRGVDECGIESQITAGSP 91

Query: 262 K 262
           K
Sbjct: 92  K 92


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 82/194 (42%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 168 CDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 211

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C + N  + + + K Y +N   +  DPH         GP   AF     T Y   
Sbjct: 212 KCVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAF-----TVYEDF 264

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY      E+  +A VK++GWG  E+G  YW + + +  ++GD G  KI RG 
Sbjct: 265 AHYKSG-VYKHITGYELGGHA-VKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGT 322

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   LP
Sbjct: 323 NECGIEEDVTAGLP 336


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/191 (27%), Positives = 74/191 (38%), Gaps = 32/191 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 139 ---CHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
              C T    D          Y +N          H GP           +  R  F  +
Sbjct: 296 ANGCQTPVNVDRDTLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYS 349

Query: 194 GRVYAVSASAEIV--AYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           G VY  +A+       + +VK+VGWGEE NG  YW   +++G  +G+ G  +ILRG NE 
Sbjct: 350 GGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNEC 409

Query: 251 IIESLVNGALP 261
            IE  V  + P
Sbjct: 410 GIEEYVLASWP 420


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 3/79 (3%)

Query: 187 RPLFQTNGRVYAVSASAE--IVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
           R  F  +G +Y  +A++    V + +VK++GWGEE +G  YW   +++G  +G+ G  +I
Sbjct: 298 RDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRI 357

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG NE  IE  V  A P 
Sbjct: 358 LRGSNECGIEEYVLAAWPN 376


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 63/153 (41%), Gaps = 17/153 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  K G+ TGG++ S +GC+P   PPC H    T    C T     P C
Sbjct: 44  CEGGYPIEAWKYWVKTGICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVC 103

Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             +C          + +YG   +     + G+      + GP   A+     T Y    +
Sbjct: 104 TNKCIAAYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTN-GPVEAAY-----TVY-EDFY 156

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRP 223
           Q  G VY  +  AE+  +A V+I+GWG     P
Sbjct: 157 QYTGGVYTHTGGAEVGGHA-VRILGWGVRQQDP 188


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 75/194 (38%), Gaps = 38/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +    G+VT     +    GCQ                P C+ L  P P
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ---------------HPGCEPL-YPTP 207

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
           +C  +C ++N   G  + ++      +   P         +      YT+   + +  VY
Sbjct: 208 QCVKQCKDENQNWGNSK-RFSATAYRITSKP---------YDIMAEVYTKGPVEVDFLVY 257

Query: 198 AVSA----------SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
              A          + + +    VK++GWG ENG  YW + +++   +G+ G  KI RG 
Sbjct: 258 EDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVANSWNTAWGEDGYFKIARGS 317

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   +P
Sbjct: 318 NECSIEEDVVAGMP 331


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/213 (23%), Positives = 77/213 (36%), Gaps = 32/213 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHS---NTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           C+ G     W W  ++G+VTGG   +    T C P   P C H +     P C T   P+
Sbjct: 241 CNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAH-HAKAPFPNCDTDVRPR 299

Query: 137 --PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
             PKC   C    Y               L FD        ++         R +     
Sbjct: 300 KTPKCRKDCEEAAYSEHV-----------LPFDKDVHKASSSYSLRSRDAVKRDMMAHGT 348

Query: 195 RVYAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
              A     + + Y +               +KI+GWG E+G  YW  V+++   +GD G
Sbjct: 349 VTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVNSWNTYWGDSG 408

Query: 240 TIKILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
             KI  G+     E +   A  ++  GV  G++
Sbjct: 409 HFKIEMGQCGVDNEMVAGEAAWQETEGVVNGDK 441


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKY 157
             +C    Y   + QDK+
Sbjct: 218 KQKCQK-GYKTPYEQDKH 234


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 38/61 (62%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           + +++    ++I+GWG ENG PYW + +++   +GD G  KILRG N   IES +   +P
Sbjct: 62  AGDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIP 121

Query: 262 K 262
           +
Sbjct: 122 R 122


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 81/195 (41%), Gaps = 35/195 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 172 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 215

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           KC  +C + N        YG   ++ +   + +      + GP   AF     T Y    
Sbjct: 216 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 269

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY       I  +A VK++GWG  ++G  YW + + +   +GD G  KI RG N
Sbjct: 270 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 327

Query: 249 EAIIESLVNGALPKD 263
           E  IE  V   LP D
Sbjct: 328 ECGIEHGVVAGLPSD 342


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 75/199 (37%), Gaps = 47/199 (23%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  + G+VT          GCQ                P C+  A P P
Sbjct: 165 CDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ---------------HPGCEP-AYPTP 208

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAF--WRSFCT 183
            C  +C   N      + F  D YQ+N      DPH         GP   +F  +  F  
Sbjct: 209 VCEKKCKVQNQVWEEKKHFSIDAYQVNS-----DPHDIMAEVYKNGPVEVSFIIYEDFAH 263

Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIK 242
             +    Q  GR+    A+         K++GWG  + G  YW + + +   +GD G  K
Sbjct: 264 YKSGVYKQITGRMVGGHAA---------KLIGWGTSDAGEDYWLLANQWNRGWGDDGYFK 314

Query: 243 ILRGRNEAIIESLVNGALP 261
           I+RG NE  IE  VN  +P
Sbjct: 315 IIRGTNECGIEGDVNAGMP 333


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 75/190 (39%), Gaps = 31/190 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G++       +  C P          YT S   CK   +   K 
Sbjct: 255 CEGGHLDAAWRYLHKKGVL-------DESCYP----------YTQSRGTCKVRHSGSLKA 297

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----HFGPFWPAFWRSFCTKYTRPLFQTNG 194
           H         R           L    D      H GP           +  R  F  +G
Sbjct: 298 HGCRPAPGVDRDSLYTVGPAYSLSREADIKAEIFHSGPVQATM------RVYRDFFSYSG 351

Query: 195 RVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +Y  +A+       + +VK+VGWGEE NG  YW   +++G  +G++G  +ILRG NE  
Sbjct: 352 GIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECG 411

Query: 252 IESLVNGALP 261
           IE  V  + P
Sbjct: 412 IEDYVLASWP 421


>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
          Length = 215

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 68/154 (44%), Gaps = 25/154 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W    K GLVTGG + S  GC+P   PPC +  Y  +     T +    + 
Sbjct: 75  CYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN-----TCSGQPMES 129

Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
           + RCT   YG           + +D Y +   G+  D  ++GP   +F  +  F      
Sbjct: 130 NHRCTRMCYGNQDLDFDQDHRYTRDHYYLTYRGIQKDVINYGPIEASFDVYDDF------ 183

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
           P +++   +Y  S +A  +   +VK++GWGEE G
Sbjct: 184 PSYKSG--IYVKSENASYLGGHSVKLIGWGEEYG 215


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CDGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 140 HTRC-TNDNYGRGFFQD---KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTN 193
              C T  N  R  F      Y +N         F  GP           +  R  F  +
Sbjct: 296 ANGCETPVNVDRDTFYTVGPAYSLNREADIMAEIFNSGPVQATM------RVNRDFFSYS 349

Query: 194 GRVYAVSASAE--IVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
             VY  +A+       + +VK+VGWGEE NG  YW   +++G  +G+KG  +ILRG NE 
Sbjct: 350 RGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEKGYFRILRGSNEC 409

Query: 251 IIESLVNGALP 261
            IE  V  + P
Sbjct: 410 GIEEYVLASWP 420


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 71/200 (35%), Gaps = 63/200 (31%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G +   W W+ K+G+ T         C P          Y +            P C
Sbjct: 119 CNGGYADRVWNWIQKKGITT-------EQCIP----------YVSGSGRV-------PTC 154

Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
            ++C N  N  R F                         W SF +K        NG VYA
Sbjct: 155 PSKCKNGSNIVRSFVSS----------------------WGSFNSKTVMDEVANNGPVYA 192

Query: 199 V----------------SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
                              +     +  V ++GWG ENG PYW + +++G  +G+KG  +
Sbjct: 193 CFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQNSWGSGWGEKGFFR 252

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I RG N+  I+ +    LPK
Sbjct: 253 IRRGTNDCHIDEIFYSGLPK 272


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 7/89 (7%)

Query: 184 KYTRPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
           K ++  F     VY  S         Y TV+IVGWGEE  NGR   YW + +++G  +G+
Sbjct: 375 KVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGE 434

Query: 238 KGTIKILRGRNEAIIESLVNGALPK-DNY 265
            G  +IL+G NE  IE  V  A+P  DN+
Sbjct: 435 SGYFRILKGTNECQIEDFVVAAMPDIDNF 463


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 3/81 (3%)

Query: 184 KYTRPLFQTNGRVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGT 240
           +  R  F  +G VY  +A+       + +VK+VGWGEE NG  YW   +++G  +G+ G 
Sbjct: 340 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGY 399

Query: 241 IKILRGRNEAIIESLVNGALP 261
            +ILRG NE  IE  V  + P
Sbjct: 400 FRILRGSNECGIEEYVLASWP 420


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 38/61 (62%)

Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            + +++    V+I+GWG ENG PYW + +++   +GD G  KILRG++   IES +   +
Sbjct: 65  VAGDMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGI 124

Query: 261 P 261
           P
Sbjct: 125 P 125


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 78/197 (39%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +  + G+VT     +    GC+    P C        EP     A P P
Sbjct: 125 CDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 168

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
            C  +C   N      + F  + YQ+N      DPH         GP   AF     T Y
Sbjct: 169 VCEKKCKVQNQVWEEKKHFSINAYQVNS-----DPHDIMAEVYNNGPVEVAF-----TVY 218

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G    ++    ++    VK++GWG  + G  YW + + +   +GD G  KI+
Sbjct: 219 EDFAHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKII 276

Query: 245 RGRNEAIIESLVNGALP 261
           RG+NE  IE  V   +P
Sbjct: 277 RGKNECGIEEDVTAGMP 293


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 97/241 (40%), Gaps = 46/241 (19%)

Query: 52  VECTSFRFIAGVKQRCAWLVSRWMTI--WVCSSGISS----STWAWVHKRGLVTG--GAH 103
           VEC   RF   +    +  V+  +    ++C SG +     S W +  ++G+VT     +
Sbjct: 131 VECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPY 190

Query: 104 HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
               GC+    P C        EP  +T     PKC  +C   N      Q  + ++   
Sbjct: 191 FDQVGCK---HPGC--------EPAYRT-----PKCEKKCKVQNEVWKE-QKHFSVDAYR 233

Query: 164 LYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
           ++ +PH         GP   AF     T Y       +G    ++    ++    VK++G
Sbjct: 234 VHSNPHDIMAEVYTNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--VMGGHAVKLIG 286

Query: 216 WGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD-----NYGVEF 269
           WG  + G  YW + + +   +GD G  KI+RG+NE  IE  V   +P       NY   F
Sbjct: 287 WGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDDAF 346

Query: 270 G 270
           G
Sbjct: 347 G 347


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           E+V +A V +VGWG ENG PYW I +++   +GD G  KILRG +E  +ES     +P
Sbjct: 372 EVVNHA-VLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 81/197 (41%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KC  +C + N      + +    Y++N      DPH         GP   AF     T Y
Sbjct: 213 KCVKKCVSGNQVWKKSKHYSVSAYRVNS-----DPHDIMAEVYKNGPVEVAF-----TVY 262

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKIL 244
               +  +G VY      E+  +A VK++GWG  ++G  YW + + +  ++GD G  KI 
Sbjct: 263 EDFAYYKSG-VYKHITGYELGGHA-VKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIR 320

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  IE  V   LP
Sbjct: 321 RGTNECGIEEDVTAGLP 337


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/168 (27%), Positives = 62/168 (36%), Gaps = 32/168 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG +     C+P  FPPC          EC   A   PKC
Sbjct: 43  CEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAK-TPKC 101

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
              C      RG+ +   +        D HFG    A+      K  +     NG V A 
Sbjct: 102 QKTCQ-----RGYLKPYKE--------DKHFGK--SAYRLPNNVKAIQRDIMKNGPVVAG 146

Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTF 231
               E  A+                  VKI+GWG+E G PYW I +++
Sbjct: 147 FIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIANSW 194


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 63/144 (43%), Gaps = 31/144 (21%)

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
           P C + C N  YG  F +D++    L   F   FG           T   +    TNG  
Sbjct: 6   PSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFG----------STSSIKKEIMTNGPT 52

Query: 197 YAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
            A  S   + ++Y +               V+I+GWG E G  YW +++++ E++GD GT
Sbjct: 53  SAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGT 112

Query: 241 IKILRGRNEAIIESLVNGALPKDN 264
            KI++G  +  I+ ++    P  N
Sbjct: 113 FKIVQG--DCGIDDMILAGTPAIN 134


>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 581

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 55/109 (50%), Gaps = 1/109 (0%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +   G +Y  +   +   +A + +VGWG ENG  YW + +++G  +G+KG  +++RG N 
Sbjct: 209 YNYTGGIYVNTTEVDYHNHA-ISVVGWGVENGTKYWIVRNSWGSYWGEKGYFRLVRGINS 267

Query: 250 AIIESLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENGEEE 298
             IES    A+PKD +  +    +    + +   R       +EN +++
Sbjct: 268 LNIESDCAWAVPKDTWTNDVRNTTASNTNSQSNFRQLHDCVRQENNQKD 316


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 71/162 (43%), Gaps = 24/162 (14%)

Query: 103 HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ---PKCHTRCTNDNYGRGFFQDKYQI 159
           H  N G    S+    H+  TT E  C    +     P C  +CTN   G    + K + 
Sbjct: 125 HGCNGGSPLFSWEWVKHSGITTEE--CIPYVSGGGRVPSCPKKCTN---GSAIVRTKAKS 179

Query: 160 NGL--GLYFDPHF---GPFWPAF--WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
            GL  G          GPF  AF  +  F +  +       G++    A         V 
Sbjct: 180 VGLVKGDKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHA---------VM 230

Query: 213 IVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           +VGWG E+G PYW I +++G  +G++G  KILRG+NE  IE+
Sbjct: 231 VVGWGVEDGTPYWLIQNSWGTTWGEQGFFKILRGKNECGIET 272


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 8/97 (8%)

Query: 187 RPLFQTNGRVYAVSASA----EIVAYATVKIVGWGEE----NGRPYWTIVSTFGEQFGDK 238
           R  F     +Y  SA+A    E  AY +V+++GWGEE    +   YW  ++++G+ +G+ 
Sbjct: 342 RDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGEN 401

Query: 239 GTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 275
           G  +ILRG NE  IES V  + P  +  V+   + GE
Sbjct: 402 GRFRILRGSNECDIESYVLASNPYVHEHVQAIRKVGE 438


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 78/186 (41%), Gaps = 35/186 (18%)

Query: 89  WAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           W +    G+VT     +  NTGC   S P C        EP       P PKC  +C ++
Sbjct: 180 WLYFKYHGVVTEECDPYFDNTGC---SHPGC--------EP-----GYPTPKCVRKCVSE 223

Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
           N   G  +  Y ++   +  DP          GP   AF     T Y       +G VY 
Sbjct: 224 NQLWGESK-HYGVSAYRINHDPQDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYK 276

Query: 199 VSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
                +I  +A VK++GWG  ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 277 HITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 335

Query: 258 GALPKD 263
             LP D
Sbjct: 336 AGLPSD 341


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 77/199 (38%), Gaps = 48/199 (24%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +    G+VT     +  + GCQ                P C+ L  P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207

Query: 138 KCHTRCTNDNYGRG----FFQDKYQINGLGLYFDPHF---GPFWPAF--------WRSFC 182
           +C  +C ++N   G    F    Y+I+             GP   +F        ++S  
Sbjct: 208 QCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGV 267

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
            KYT+                + +    VK+VGWG E+G  YW + +++   +G+ G  K
Sbjct: 268 YKYTK---------------GDYMGGHAVKLVGWGTEDGTDYWLVANSWNTAWGEDGYFK 312

Query: 243 ILRGRNEAIIESLVNGALP 261
           I RG NE  IE  V   +P
Sbjct: 313 IARGSNECGIEGDVVAGMP 331


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 77/199 (38%), Gaps = 48/199 (24%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +    G+VT     +  + GCQ                P C+ L  P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207

Query: 138 KCHTRCTNDNYGRG----FFQDKYQINGLGLYFDPHF---GPFWPAF--------WRSFC 182
           +C  +C ++N   G    F    Y+I+             GP   +F        ++S  
Sbjct: 208 QCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGV 267

Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
            KYT+                + +    VK+VGWG E+G  YW + +++   +G+ G  K
Sbjct: 268 YKYTK---------------GDYMGGHAVKLVGWGTEDGTDYWLVANSWNTAWGEDGYFK 312

Query: 243 ILRGRNEAIIESLVNGALP 261
           I RG NE  IE  V   +P
Sbjct: 313 IARGSNECGIEGDVVAGMP 331


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 26/188 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G+V       +  C P +     H +        ++L     + 
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYPYT----QHRDTCKIRHNSRSLKANGCRP 303

Query: 140 HTRCTNDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                 D++   G  +  +K       +Y   H GP           +  R  F  +  V
Sbjct: 304 SANVDRDSFYTVGPAYTLNKESDIMAEIY---HSGPVQATM------RVYRDFFSYSSGV 354

Query: 197 YAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           Y  +A+       + +VK+VGWGEE NG  YW   +++G  +G++G  +ILRG NE  IE
Sbjct: 355 YRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIE 414

Query: 254 SLVNGALP 261
             V  + P
Sbjct: 415 DYVLASWP 422


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 38/60 (63%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +  +V    ++++GWG ENG  YW I +++ E +G+KG  ++ RG NE  IE+ +N  LP
Sbjct: 53  TGRLVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 2/78 (2%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 140 HTRCTNDNYGRGFFQDKY 157
              C    Y   + QDK+
Sbjct: 218 KQTCQK-GYKTPYEQDKH 234


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 26/188 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W ++HK+G+V       +  C P +     H +        ++L     + 
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYPYT----QHRDTCKIRHNSRSLKANGCRP 303

Query: 140 HTRCTNDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
                 D++   G  +  +K       +Y   H GP           +  R  F  +  V
Sbjct: 304 SANVDRDSFYTVGPAYTLNKESDIMAEIY---HSGPVQATM------RVYRDFFSYSSGV 354

Query: 197 YAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           Y  +A+       + +VK+VGWGEE NG  YW   +++G  +G++G  +ILRG NE  IE
Sbjct: 355 YRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIE 414

Query: 254 SLVNGALP 261
             V  + P
Sbjct: 415 DYVLASWP 422


>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
          Length = 91

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ++I+GWG E   PYW + +++  ++GD G  KILRG NE  IE  +   +PK
Sbjct: 39  IRIIGWGVEEDVPYWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90


>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 217

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 1/80 (1%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 138
           C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T    + P 
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189

Query: 139 CHTRCTNDNYGRGFFQDKYQ 158
           C  +C N NY   F  D Y+
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK 209


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 19/163 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W    K+G VTGG++   TGC+P  +PPC H    T    C +   P  + 
Sbjct: 44  CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQN 103

Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
                  +    + +D +           +  G+      H         R   T +   
Sbjct: 104 ANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIKTH------GQLRGGITVF-ED 156

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
               +G VY  +A A +  +A VK++GWG +NG PYW I +++
Sbjct: 157 FEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLIANSW 198


>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 355

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 31/45 (68%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           +VKI+GWG ENG  YW I STFG  +G++GT   LRG N  ++ S
Sbjct: 202 SVKIIGWGVENGTEYWLITSTFGIGWGNQGTAMFLRGVNHLVLPS 246


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 187 RPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
           R  F  +  VY  +A+    A  + +VK+VGWGEE NG  YW   +++G  +G++G  +I
Sbjct: 344 RDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGPWWGERGYFRI 403

Query: 244 LRGRNEAIIESLVNGALP 261
           LRG NE  IE  V  + P
Sbjct: 404 LRGSNECGIEEYVLASWP 421


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 1/74 (1%)

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L +  G +Y  +  A+ + +A + +VGWGEE+G+ YW   +++G  +G+KG  +I+RG N
Sbjct: 204 LMEYKGGIYRDTTGAKTLDHA-ISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGEN 262

Query: 249 EAIIESLVNGALPK 262
              IE+    A+P+
Sbjct: 263 NLGIEADCQWAVPR 276



 Score = 41.6 bits (96), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 199 VSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
           V     IV Y  V++ GWGE E+G  YW   +++G  +G+ G  +++ G ++ +I    N
Sbjct: 496 VDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGWFRMIVGVSKGLITGYCN 555

Query: 258 GALP 261
             +P
Sbjct: 556 WGVP 559


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 187 RPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
           R  F  +G VY  +A+       + +VKIVGWGEE +G  YW   +++G  +G+ G  +I
Sbjct: 343 RDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGYFRI 402

Query: 244 LRGRNEAIIESLVNGALPK 262
           LRG NE  IE  V  + P 
Sbjct: 403 LRGSNECGIEEYVLASWPN 421


>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
          Length = 69

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 33/53 (62%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
            +KI+GWGEE+G PYW   +++   +GD G  K LRG +   IES +   +PK
Sbjct: 17  AIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGIPK 69


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
           Y   ++Q +       AS+    Y +V+++GWG ++  GRP  YW   +++G Q+G+ G 
Sbjct: 368 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 427

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            KILRG N   IES V GA  K
Sbjct: 428 FKILRGENHCEIESFVIGAWGK 449


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 61/157 (38%), Gaps = 21/157 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + W +  + G+VTGG   + TGC P  FP C H    +    C     P P C
Sbjct: 132 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPGYIYPTPSC 191

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-FDPH----------FGPFWPAFWRSFCTKYTRP 188
           +  C    Y + + +DK  + G   Y  D H           GP    F       YT  
Sbjct: 192 YPYC-QAGYDKTYEEDK--VYGKTSYNVDRHEYTIMQEIMKNGPVEAGF-----IVYTDF 243

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYW 225
               +G  + V  S        ++I+GWG ENG  YW
Sbjct: 244 AVYKSGIYHHV--SGRYAGKHAIRIIGWGVENGVNYW 278


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTG------CQPVSFPPCNHANYTTSEP------ 127
           C  G   + W +V K G VTGG  ++ TG      C     P C+H      +P      
Sbjct: 92  CDGGQIITPWTYVAKAGAVTGG-QYNGTGPFGAGLCADWFAPHCHHHGPRGDDPYPAEGD 150

Query: 128 -ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQING---------LGLYFDPHFGPFWPAF 177
             C +  +P+       T       F  DK+   G           +      GP   AF
Sbjct: 151 AGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAF 210

Query: 178 WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
                T Y        G +Y      E   +A VK VGWG ENG  YW + +++   +G+
Sbjct: 211 -----TVY-EDFENYAGGIYHHVTGEEAGGHA-VKFVGWGVENGTKYWKVANSWNPYWGE 263

Query: 238 KGTIKILRGRNEAIIESLVNGA 259
            G  +ILRG NE  IE  V G+
Sbjct: 264 AGYFRILRGSNEGGIEDQVTGS 285


>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 159

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 34/160 (21%)

Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
           S  GC P  FP CNH     S  P C  ++      H   T+ +    + +D ++    G
Sbjct: 4   SADGCWPYPFPKCNHVRSAASRYPACPAVSPSAVGAHQMETSYSL---YIRDLHRAKSFG 60

Query: 164 LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG------------RVYA----VSASAEIVA 207
                      PA  ++      + +F TNG            RVY     V  +     
Sbjct: 61  RL---------PAIPQNI----KQEIF-TNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQG 106

Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
             T+KI+GWG E+G+ YW  V+++ E++GD G IK+  GR
Sbjct: 107 IHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 146


>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
          Length = 349

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
           Y   ++Q +       AS+    Y +V+++GWG ++  GRP  YW   +++G Q+G+ G 
Sbjct: 247 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 306

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            KILRG N   IES V GA  K
Sbjct: 307 FKILRGDNHCEIESFVVGAWGK 328


>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
          Length = 635

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 1/79 (1%)

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           T    + +G ++    +A  V +A + IVGWGEENG P+W + +++G  +G+ G ++++R
Sbjct: 227 TDGFLKYSGGIFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVR 285

Query: 246 GRNEAIIESLVNGALPKDN 264
           G N   +E      +P+D+
Sbjct: 286 GVNNVGVEGECAFGVPRDD 304


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
           Y   ++Q +       AS+    Y +V+++GWG ++  GRP  YW   +++G Q+G+ G 
Sbjct: 364 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 423

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            KILRG N   IES V GA  K
Sbjct: 424 FKILRGDNHCEIESFVIGAWGK 445


>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
           CCMP2712]
          Length = 85

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/58 (44%), Positives = 36/58 (62%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           VY  SA A+ V    V +VGWG ENG  YW + +++G+  GD+G  K+ +G NE  IE
Sbjct: 28  VYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQNSWGKSSGDEGMWKVRKGSNECGIE 85


>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 487

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 6/83 (7%)

Query: 184 KYTRPLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
           K ++  F     VY  S  A      Y TV+IVGWGEE  NGR   YW + +++G  +G+
Sbjct: 375 KVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGE 434

Query: 238 KGTIKILRGRNEAIIESLVNGAL 260
            G  +IL+G NE  IE  V  A+
Sbjct: 435 SGYFRILKGTNECQIEDFVVAAM 457


>gi|301119245|ref|XP_002907350.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262105862|gb|EEY63914.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 710

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 1/79 (1%)

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           T    + +G ++    +A  V +A + IVGWGEENG P+W + +++G  +G+ G ++++R
Sbjct: 227 TDGFLKYSGGIFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVR 285

Query: 246 GRNEAIIESLVNGALPKDN 264
           G N   +E      +P+D+
Sbjct: 286 GVNNVGVEGECAFGVPRDD 304


>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
          Length = 487

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 36/52 (69%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +V   GWGEENG  +W + +++GEQ+G++G  ++ RG +E+ IES+   A P
Sbjct: 415 SVLCYGWGEENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAADP 466


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 8/89 (8%)

Query: 184 KYTRPLFQTNGRVYAVSASAEIV--AYATVKIVGWGEE----NGRP--YWTIVSTFGEQF 235
           K  +  F     +Y  +  AE+    Y +V+I+GWGE+    +G P  YW +V+++G+++
Sbjct: 350 KVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEW 409

Query: 236 GDKGTIKILRGRNEAIIESLVNGALPKDN 264
           G+ G  +I RG NE  IES V     K N
Sbjct: 410 GENGLFRIRRGINECDIESFVVAVWAKTN 438


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
           Y   ++Q +       AS+    Y +V+++GWG ++  GRP  YW   +++G Q+G+ G 
Sbjct: 424 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 483

Query: 241 IKILRGRNEAIIESLVNGA 259
            KILRG N   IES V GA
Sbjct: 484 FKILRGENHCEIESFVIGA 502


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 82/208 (39%), Gaps = 42/208 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 138
           C  G     W +  + G+VT         C P      C H       P C+  A   PK
Sbjct: 162 CDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKH-------PGCEP-AYDTPK 206

Query: 139 CHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           C  +C   N     +++K  + IN   +  DPH         GP   AF     T Y   
Sbjct: 207 CEKKCKVQNQ---VWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAF-----TVYEDF 258

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G    V+    ++    VK++GWG  + G  YW + + +   +GD G  KI+RG+
Sbjct: 259 AHYKSGVYKHVTGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGK 316

Query: 248 NEAIIESLVNGALPK-----DNYGVEFG 270
           NE  IE  V   +P       N+G  FG
Sbjct: 317 NECGIEEEVVAGMPSTKNMAGNHGSAFG 344


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 187 RPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
           R  F  +G +Y  +A+       + +VK+VGWGEE +G  YW   +++G  +G+ G  +I
Sbjct: 344 RDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHGYFRI 403

Query: 244 LRGRNEAIIESLVNGALP 261
           LRG NE  IE  V  + P
Sbjct: 404 LRGSNECGIEEYVLASWP 421


>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
          Length = 224

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 15/92 (16%)

Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYA-----TVKIVGWG--EENGRP 223
           GP + AFW      Y+  +  T G VY  SAS E +A        V +VGWG  +E G+ 
Sbjct: 140 GPVFAAFWV-----YSDFMAYTGG-VY--SASKEALAQGKTGGHAVMMVGWGTDKETGQD 191

Query: 224 YWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           YW + +++ E++GDKG  KI RG +E  IESL
Sbjct: 192 YWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/215 (27%), Positives = 86/215 (40%), Gaps = 28/215 (13%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  SK      +F F A     C W        + C+ G   
Sbjct: 48  SCGSCWAFGAVEAMSDRVCIHSKG---AKNFHFSAENLVSCCWTCG-----FGCNGGFPG 99

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W +   +G+V+GG + S  GC P    PC H    T  P CK      P C  +C  D
Sbjct: 100 AAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPACVKKC-ED 156

Query: 147 NYGRGFFQDKYQ---INGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
            Y   + QD ++      LG   D         GP   AF     T Y   +    G VY
Sbjct: 157 GYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF-----TVYEDFIAYRAG-VY 210

Query: 198 AVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTF 231
              A   +  +A ++I+GWG +NG  PYW + +++
Sbjct: 211 KHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 244


>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
          Length = 244

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
           C  G +   W +    G+VTGG  +SN GCQP    PC+H    +S   C +    Q   
Sbjct: 96  CHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRPCDHYG-DSSMTNCSSFRRTQMSI 154

Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           C  +C N NY   +  D ++ + + +               + P         Y    F 
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQIQQEIMTYGPV----TALMYVYENFM 210

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPY 224
                   S   ++V Y  VK++GWG +++G  Y
Sbjct: 211 GYKEGIYKSTVGDLVGYHHVKLIGWGVDDDGNEY 244


>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
          Length = 561

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 15/80 (18%)

Query: 198 AVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
           A+ A+ E+VAY                 + +VGWGEE+G+ YW + +++G  +G+ G  +
Sbjct: 197 ALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNSWGTYWGENGWFR 256

Query: 243 ILRGRNEAIIESLVNGALPK 262
           I+RG N   IES    A+P+
Sbjct: 257 IVRGTNNLGIESECTWAVPR 276


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 34/49 (69%)

Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           + +VK++GWG ENG  YW   +++G ++G+ G  KILRG NE  IES V
Sbjct: 364 WHSVKLLGWGVENGIKYWLGANSWGTKWGEDGYFKILRGENECNIESYV 412


>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 185

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 18/136 (13%)

Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQ-------INGLGLYFDPHF--GPFWPAFWRSF 181
            +  P P C T CTN  Y +   +D ++       +N         F  GP   +F    
Sbjct: 53  VVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSF---- 108

Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
              Y    +  +G VY V  + E     ++KI+GWG  +GR YW  V+++ E++GD G I
Sbjct: 109 -KMYEDFRYYKSG-VY-VPTTKESSTSHSIKIIGWGGASGREYWLAVNSWNEEWGDHGLI 165

Query: 242 KILRGRN--EAIIESL 255
           K+  G+N  E I+ S+
Sbjct: 166 KMAFGKNRLEKIVLSI 181


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 46/83 (55%), Gaps = 8/83 (9%)

Query: 187 RPLFQTNGRVY---AVSASAEIVA-YATVKIVGWGEENG----RPYWTIVSTFGEQFGDK 238
           R  F     +Y   A S SA+  A Y +V+++GWGEE        YW  V+++G  +G+ 
Sbjct: 340 RDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTWWGEN 399

Query: 239 GTIKILRGRNEAIIESLVNGALP 261
           G  +ILRG NE  IES V  +LP
Sbjct: 400 GRFRILRGSNECEIESYVLASLP 422


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 79/187 (42%), Gaps = 28/187 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W+++ K GLV       +  C P S       N     P    L T    C
Sbjct: 146 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTAN--C 191

Query: 140 HTRCTNDNYGRGFFQDKYQINGLG--LYFDPHFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
                 D   +      Y++      +Y   H GP       +  F T Y R +++    
Sbjct: 192 QLPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFT-YKRGIYR---- 246

Query: 196 VYAVSASAEIVAYATVKIVGWGEENG----RPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            ++  ++ +   Y +V+IVGWGEE      + YW + +++G ++G+ G  +ILRG NE  
Sbjct: 247 -HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECE 305

Query: 252 IESLVNG 258
           IES V G
Sbjct: 306 IESFVLG 312


>gi|67613207|ref|XP_667285.1| preprocathepsin c precursor [Cryptosporidium hominis TU502]
 gi|54658406|gb|EAL37056.1| preprocathepsin c precursor [Cryptosporidium hominis]
          Length = 635

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 21/123 (17%)

Query: 139 CHTRCTNDNYGRGFFQDKYQING---LGLYFDPHFGPFWPAFWRSFCTKYTR----PLFQ 191
           C+  C  D      F+     NG   + ++ D     +    + S    +T+    P  Q
Sbjct: 472 CYGCCDEDRMKEEIFK-----NGPIAVAMHIDTSLLVYENGVYDSIPNDHTKYCDLPNKQ 526

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            NG  Y   A         + IVGWGEENG PYW I +++G  +G+KG  KI RG+N   
Sbjct: 527 LNGWEYTNHA---------IAIVGWGEENGIPYWIIRNSWGANWGNKGYAKIRRGKNIGG 577

Query: 252 IES 254
           IE+
Sbjct: 578 IEN 580


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 189 LFQTNGRVYAVSASAEIVA--YATVKIVGWGEE-------NGRPYWTIVSTFGEQFGDKG 239
            F     VY  S +AE+    Y +V+I+GWGEE           YW + +++G+Q+G+ G
Sbjct: 358 FFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENG 417

Query: 240 TIKILRGRNEAIIESLVNGALPK 262
             +I +G NE  IES V G   K
Sbjct: 418 LFRIQKGTNECEIESFVLGVWAK 440


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 10/83 (12%)

Query: 184 KYTRPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEENGR--------PYWTIVSTFGE 233
           K  +  F     +Y  +A+ E  A  Y +V+I+GWGE+            YW +V+++G+
Sbjct: 355 KVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNLPIKYWLVVNSWGQ 414

Query: 234 QFGDKGTIKILRGRNEAIIESLV 256
           Q+G+ G  +I RG NE  IES V
Sbjct: 415 QWGESGLFRIQRGTNECDIESFV 437


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 187 RPLFQTNGRVYAVSASA----EIVAYATVKIVGWGEE----NGRPYWTIVSTFGEQFGDK 238
           R  F     +Y  SA+A    E  AY +V+++GWGEE    +   YW  V+++G  +G+ 
Sbjct: 451 RDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGEN 510

Query: 239 GTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 275
           G  +ILRG NE  IES V  + P  +  V+     G+
Sbjct: 511 GRFRILRGTNECEIESYVLASNPYVHQHVQTVRNVGD 547


>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
          Length = 562

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 46/74 (62%), Gaps = 1/74 (1%)

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           L +  G +Y  +  A+ + + ++ +VGWGEE+G+ YW   +++G  +G+KG  +I+RG N
Sbjct: 204 LMEYKGGIYRDTTGAKSLDH-SISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGEN 262

Query: 249 EAIIESLVNGALPK 262
              IE+    A+P+
Sbjct: 263 NLGIEADCQWAVPR 276


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 42/68 (61%), Gaps = 7/68 (10%)

Query: 196 VYAVSASAEI--VAYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
           +Y  S SAE+    Y +V+I+GWGEE    G P  YW +V+++G  +G+ G  KI RG N
Sbjct: 426 IYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVVNSWGYNWGENGLFKIQRGTN 485

Query: 249 EAIIESLV 256
           E  IES V
Sbjct: 486 ECEIESYV 493


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 17/161 (10%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G S+  W +    G+ +GG +     C+P +F PC +    T   EC       P C
Sbjct: 44  CNGGYSARAWLYARNSGVCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPAC 103

Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
              C    YG+ + +DK Y  +   +  D           GP   +F       Y     
Sbjct: 104 KKYCQY-GYGKRYEKDKIYAXDAYRVSSDEAAIRAEIFARGPVQASF-----ATYEDFAH 157

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
             +G +Y  +A      +A VKI+GWG ENG   W + +++
Sbjct: 158 YKSG-IYVHTAGKRRGGHA-VKIIGWGVENGTKXWIVANSW 196


>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
          Length = 118

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 1   CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 58

Query: 140 HTRCTNDNYGRGFFQDKY 157
              C    Y   + +DK+
Sbjct: 59  SKTC-EPGYSPSYKEDKH 75


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 203 AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           ++ + +  V IVGWG E+  PYW + +++G  FG  G  KI RG NE  IES +  +L  
Sbjct: 285 SDSIGWHAVIIVGWGVEDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSL-V 343

Query: 263 DNYGVEFGEESGERLSE 279
           +  GV F   SG  +++
Sbjct: 344 NTEGVVFASTSGAAVAK 360


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 10/83 (12%)

Query: 184 KYTRPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEENGR--------PYWTIVSTFGE 233
           K  +  F     +Y  +A+ E  A  Y +V+I+GWGE+            YW +V+++G+
Sbjct: 355 KVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQ 414

Query: 234 QFGDKGTIKILRGRNEAIIESLV 256
           Q+G+ G  +I RG NE  IES V
Sbjct: 415 QWGESGLFRIQRGTNECDIESFV 437


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/36 (58%), Positives = 30/36 (83%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           V +VG+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354


>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
          Length = 458

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 36/47 (76%)

Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           G+GEE+G+ YW + +++GE++G+KG  +I RG +E  IESLV  A+P
Sbjct: 405 GYGEEDGQKYWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYAVP 451


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 83/196 (42%), Gaps = 41/196 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +    G+VT     +   TGC   S P C        EP       P P
Sbjct: 169 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---SHPGC--------EP-----GYPTP 212

Query: 138 KCHTRCTNDN--------YGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           KC  +CT++N        YG+  ++   D YQI    +Y +   GP   AF     T Y 
Sbjct: 213 KCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMA-EVYKN---GPVEVAF-----TVYE 263

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
                 +G VY  + + +++    VK++GWG  ++G  YW + + +   +GD G   I R
Sbjct: 264 DFAHYESG-VYRYT-TGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRR 321

Query: 246 GRNEAIIESLVNGALP 261
           G NE  IE  V   LP
Sbjct: 322 GVNECGIEEGVVAGLP 337


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 76/186 (40%), Gaps = 35/186 (18%)

Query: 89  WAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           W +    G+VT     +  NTGC   S P C        EP       P PKC  +C + 
Sbjct: 4   WLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTPKCERKCVSR 47

Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
           N   G  +  Y +    +  DP          GP   AF     T Y       +G VY 
Sbjct: 48  NQLWGESK-HYGVGAYRINPDPQDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYK 100

Query: 199 VSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
                +I  +A VK++GWG  ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 101 YITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 159

Query: 258 GALPKD 263
             LP +
Sbjct: 160 AGLPSE 165


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 38/59 (64%), Gaps = 6/59 (10%)

Query: 207 AYATVKIVGWGEE----NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
            Y +V+IVGWGEE    NG+P  +W + +++G  +G+ G  +I+RG NE  IES V G 
Sbjct: 414 GYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFVLGV 472


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 82/196 (41%), Gaps = 41/196 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +    G+VT     +   TGC               S P C+    P P
Sbjct: 203 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 246

Query: 138 KCHTRCTNDN--------YGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
           KC  +CT++N        YG+  ++   D YQI    +Y +   GP   AF     T Y 
Sbjct: 247 KCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMA-EVYKN---GPVEVAF-----TVYE 297

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
                 +G VY  + + +++    VK++GWG  ++G  YW + + +   +GD G   I R
Sbjct: 298 DFAHYESG-VYRYT-TGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRR 355

Query: 246 GRNEAIIESLVNGALP 261
           G NE  IE  V   LP
Sbjct: 356 GVNECGIEEGVVAGLP 371


>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 163

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/171 (25%), Positives = 67/171 (39%), Gaps = 25/171 (14%)

Query: 108 GCQPVSFPPCNHANYTTSEPE--------CKTLATPQPKCHTRCTNDNYGRGFFQDKYQI 159
           G QP    PCN A+ T ++P         C       PKC   C N  +   +  D  + 
Sbjct: 1   GRQPWLVQPCN-ASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKA 59

Query: 160 NGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
             +   FD           GP+          +           VY    + + +   +V
Sbjct: 60  KKV-FTFDGCSARKNLRKHGPY------VVTMRVYEDFLAYKSGVYH-HVTGDYLGLLSV 111

Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +++GWG E G+ +W   +++G  +GDKG  KI R  NE  IE+     +PK
Sbjct: 112 RMIGWGLEGGQAFWLFANSWGTSWGDKGFFKIRRFVNERWIENFRYAGVPK 162


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 79/187 (42%), Gaps = 28/187 (14%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W+++ K GLV       +  C P S       N     P    L T    C
Sbjct: 272 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTAN--C 317

Query: 140 HTRCTNDNYGRGFFQDKYQINGLG--LYFDPHFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
                 D   +      Y++      +Y   H GP       +  F T Y R +++    
Sbjct: 318 QLPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFT-YKRGIYR---- 372

Query: 196 VYAVSASAEIVAYATVKIVGWGEENG----RPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            ++  ++ +   Y +V+IVGWGEE      + YW + +++G ++G+ G  +ILRG NE  
Sbjct: 373 -HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECE 431

Query: 252 IESLVNG 258
           IES V G
Sbjct: 432 IESFVLG 438


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 79/193 (40%), Gaps = 35/193 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 169 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 212

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           +C   C + N        YG   ++ K   N +      + GP   +F     T Y    
Sbjct: 213 RCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKN-GPVEVSF-----TVYEDFA 266

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY    + +++    VK++GWG  ++G  YW + + +   +GD G  KI RG N
Sbjct: 267 HYKSG-VYK-HITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTN 324

Query: 249 EAIIESLVNGALP 261
           E  IE  V   LP
Sbjct: 325 ECGIEEDVVAGLP 337


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 79/193 (40%), Gaps = 35/193 (18%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   S W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 170 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 213

Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
           +C   C + N        YG   ++ K   N +      + GP   +F     T Y    
Sbjct: 214 RCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKN-GPVEVSF-----TVYEDFA 267

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
              +G VY    + +++    VK++GWG  ++G  YW + + +   +GD G  KI RG N
Sbjct: 268 HYKSG-VYK-HITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTN 325

Query: 249 EAIIESLVNGALP 261
           E  IE  V   LP
Sbjct: 326 ECGIEEDVVAGLP 338


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           +V+IVGWGE+ G  YW + +++G  +G+ G  +I RG NE+ IES V   L
Sbjct: 366 SVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 4/63 (6%)

Query: 204 EIVAYATVKIVGWGEE--NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           EI  Y +V+++GWGE+   G P  YW   +++G  +G+ GT +ILRG N   IES V GA
Sbjct: 256 EIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGENGTFRILRGENHCEIESFVIGA 315

Query: 260 LPK 262
             K
Sbjct: 316 WGK 318


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 33/46 (71%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           +KI+GWG +NG PYW I +++G ++G+ G  KI RG NE  IE+ V
Sbjct: 165 IKIIGWGTQNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNV 210


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
           Y   ++Q +       AS+    Y +V+++GWG ++  G+P  YW   +++G Q+G+ G 
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409

Query: 241 IKILRGRNEAIIESLVNGALPK 262
            K+LRG N   IES V GA  K
Sbjct: 410 FKVLRGENHCEIESFVIGAWGK 431


>gi|126647906|ref|XP_001388062.1| preprocathepsin c precursor [Cryptosporidium parvum Iowa II]
 gi|126117150|gb|EAZ51250.1| preprocathepsin c precursor, putative [Cryptosporidium parvum Iowa
           II]
          Length = 635

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 40/73 (54%), Gaps = 10/73 (13%)

Query: 183 TKY-TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
           TKY   P  Q NG  Y   A         + IVGWGEENG PYW I +++G  +G KG  
Sbjct: 517 TKYCDLPNKQLNGWEYTNHA---------IAIVGWGEENGIPYWIIRNSWGANWGKKGYA 567

Query: 242 KILRGRNEAIIES 254
           KI RG+N   IE+
Sbjct: 568 KIRRGKNIGGIEN 580


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 13/125 (10%)

Query: 137 PKCHTRCTNDNYGRGFFQDKYQI-----NGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
           P+C ++CT + +    F   Y       N + +    + GP   AF     T Y+  +  
Sbjct: 146 PECMSKCTGEGHAYQKFYGLYLYTVSGENQIKVEIMTN-GPVEAAF-----TVYSDIVHY 199

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G VY  ++  ++  +A VK++GWG E+   YW + +++G  +GD+G  KI RG +E  
Sbjct: 200 KSG-VYHHTSGGKLGGHA-VKVLGWGVEDEEEYWLVANSWGPDWGDQGFFKIKRGSDECG 257

Query: 252 IESLV 256
           IES V
Sbjct: 258 IESRV 262


>gi|260821944|ref|XP_002606363.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
 gi|229291704|gb|EEN62373.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
          Length = 113

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 6/64 (9%)

Query: 207 AYATVKIVGWGEENGRPY------WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            + +V+I+GWG E   PY      WT+ +++G Q+G++G  +I+RG NE  IES V G  
Sbjct: 32  GWHSVRIIGWGVEMSDPYQAPIKYWTVANSWGTQWGEEGYFRIVRGENECQIESFVLGVW 91

Query: 261 PKDN 264
            K N
Sbjct: 92  GKVN 95


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 77/197 (39%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 163 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 206

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KC  +C N N      + +    Y +N      DP          GP   AF     T Y
Sbjct: 207 KCVKKCVNGNQLWETSKHYSVKAYTVNS-----DPQDIMAEVYKNGPVEVAF-----TVY 256

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G    ++  A  +    VK+VGWG  + G  YW + + +   +GD G  KI 
Sbjct: 257 EDFAHYKSGVYKHITGFA--LGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIK 314

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  IE+ V   LP
Sbjct: 315 RGTNECGIENAVTAGLP 331


>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 109

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 3/67 (4%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  ++  E+  +A VKI+GWGEE G+ YW +V+++ E +GD G  KI  G  E  I+  
Sbjct: 45  VYKHTSGKELGGHA-VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--IDDD 101

Query: 256 VNGALPK 262
           + G  PK
Sbjct: 102 LLGGTPK 108


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 77/197 (39%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 168 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 211

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KC  +C N N      + +    Y +N      DP          GP   AF     T Y
Sbjct: 212 KCVKKCVNGNQLWETSKHYSVKAYTVNS-----DPQDIMAEVYKNGPVEVAF-----TVY 261

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G    ++  A  +    VK+VGWG  + G  YW + + +   +GD G  KI 
Sbjct: 262 EDFAHYKSGVYKHITGFA--LGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIK 319

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  IE+ V   LP
Sbjct: 320 RGTNECGIENAVTAGLP 336


>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
          Length = 476

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 35/52 (67%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           + +VGWGEENG PYW + +++G  +G++G  +I+RG+N   IE      +P+
Sbjct: 137 ISVVGWGEENGIPYWIVRNSWGTYWGEEGFFRIVRGKNNLGIEEGCTYGIPR 188



 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
           T+      G V+  S   + +    V++ GWG  EE   PYW + +++G  +G+ G  +I
Sbjct: 396 TQTFLDYTGGVFT-SREGKWLGKHAVEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRI 454

Query: 244 LRGRNEAIIESLVNGALP 261
             G+N   IE +    +P
Sbjct: 455 AMGQNLLNIEQMCTWGVP 472


>gi|325184271|emb|CCA18763.1| cathepsin B putative [Albugo laibachii Nc14]
 gi|325190706|emb|CCA25201.1| cathepsin B putative [Albugo laibachii Nc14]
          Length = 436

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 42/84 (50%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
           V+IVGWGEENG  YW   +++G  +G  G  KI+RG N   IES  +  +P         
Sbjct: 253 VEIVGWGEENGVKYWHARNSWGSFWGMNGFFKIVRGTNNLAIESDCHYVVPDIREEEVVF 312

Query: 271 EESGERLSEEFGVRAESSEEFREN 294
           EE        +G+R    EE  EN
Sbjct: 313 EEHPIYGGSHYGIRPFRPEEALEN 336


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 41/176 (23%), Positives = 69/176 (39%), Gaps = 21/176 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   + + +V K G+VT       + CQP + P C  A     +  C       P C
Sbjct: 136 CEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCPPA-----QQPCMNFVN-TPPC 182

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRS----FCTKYTRPLFQTNGR 195
             +C N +    F QD + +  +     P+          +     C +           
Sbjct: 183 SAKCANSSVN--FQQDLHHLKTV-YAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSG 239

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
           VY   +  ++  +  +KIVG+G  NG PYW   +++   +G+ G   I  G+NE +
Sbjct: 240 VYTHKSGKDLGGHC-IKIVGFGVSNGTPYWICNNSWTTSWGNNGIFWIEAGKNECV 294


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 7/68 (10%)

Query: 196 VYAVSASAEI--VAYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
           +Y  S SAE+    Y +V+I+GWGEE    G P  YW + +++G  +GD G  KI +G N
Sbjct: 368 IYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTN 427

Query: 249 EAIIESLV 256
           E  IES V
Sbjct: 428 ECEIESYV 435


>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 233

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217

Query: 140 HTRC 143
              C
Sbjct: 218 KQTC 221


>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 620

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 34/56 (60%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 265
            V IVGWG ENG  YW + +++G  +G+KG  + LRG N   IE     A+PKD +
Sbjct: 225 VVSIVGWGVENGVKYWIVRNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTW 280


>gi|348690656|gb|EGZ30470.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 647

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 1/79 (1%)

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           T    + +G ++    +A    +A + IVGWGEE+G P+W + +++G  +G+ G ++++R
Sbjct: 228 TDGFLKYSGGIFDDKTNATETDHA-ISIVGWGEEDGVPFWVLRNSWGSFWGEDGWMRLVR 286

Query: 246 GRNEAIIESLVNGALPKDN 264
           G N   +E      +PKD+
Sbjct: 287 GVNNVGVEGECAFGVPKDD 305


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 78/197 (39%), Gaps = 43/197 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W +    G+VT     +  + GC   S P C        EP       P P
Sbjct: 169 CNGGYPISAWRYFVHHGVVTEECDPYFDDIGC---SHPGC--------EP-----GYPTP 212

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KC  +C N N      + +    Y+I+      DP          GP   AF     T Y
Sbjct: 213 KCARKCVNKNQLWKKSKHYGVKPYRIDS-----DPESIMAEIYKNGPVEVAF-----TVY 262

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
                  +G VY       +  +A VK++GWG  E+G  YW + + +   +GD G  KI 
Sbjct: 263 EDFAHYKSG-VYKHITGGMMGGHA-VKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIR 320

Query: 245 RGRNEAIIESLVNGALP 261
           RG NE  IE  V   LP
Sbjct: 321 RGTNECGIEGDVVAGLP 337


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 89/217 (41%), Gaps = 32/217 (14%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  S       +F F A     C W        + C+ G   
Sbjct: 50  SCGSCWAFGAVEAMSDRVCIHSNG---TKNFHFSAENLVSCCWTCG-----FGCNGGFPG 101

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
           + W +   +G+V+GG + SN GC P    PC H    T  P CK      P C  +C   
Sbjct: 102 AAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEG 159

Query: 144 ------TNDNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
                  + ++G+  +  +  ++ +   +Y +   GP   AF     T Y   +    G 
Sbjct: 160 YKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG- 210

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTF 231
           VY   A   +  +A ++I+GWG +NG  PYW + +++
Sbjct: 211 VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 246


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 76/196 (38%), Gaps = 42/196 (21%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +    G+VT     +    GC   + P C    Y T E          P
Sbjct: 171 CEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC---AHPGC----YPTYE---------TP 214

Query: 138 KCHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
           KC  +C +D +   + Q K+  +N   +  +P                YT    +    V
Sbjct: 215 KCEKQCVDDEF---WVQSKHLGVNAYEMSMEPE---------DLMAELYTNGPVEVAFEV 262

Query: 197 YAVSASAEIVAYA----------TVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
           Y   A  +   Y            VK++GWG  ++G  YWTIV+++   +G+ G  +I+R
Sbjct: 263 YEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEDGLFRIVR 322

Query: 246 GRNEAIIESLVNGALP 261
           G +E  IES     LP
Sbjct: 323 GNDECGIESNAVAGLP 338


>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
          Length = 61

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 21/53 (39%), Positives = 35/53 (66%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           + +++    ++I+GWG ENG PYW + +++   +GD G  KILRG++   IES
Sbjct: 7   TGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIES 59


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 73/175 (41%), Gaps = 29/175 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G   ++W WV   G+ T       +G   +  P C H     S  +  T+   +   
Sbjct: 128 CNGGYQENSWTWVLTTGITTESCWPYRSGSGRI--PSCPHRCVNGSVLQRNTINNYRRLD 185

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
            +   ++ Y  G  Q  Y +     Y D  +              Y++ +++        
Sbjct: 186 SSELQDELYNNGPIQVTYVV-----YEDFFY--------------YSKGIYK-------- 218

Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
             S   V    V ++GWG E+G  YW + +++G ++G++G  +ILRG NE  IES
Sbjct: 219 HLSGNKVGGHAVVLMGWGIEDGVKYWLVQNSWGYEWGEQGYFRILRGSNECGIES 273


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 38/63 (60%), Gaps = 5/63 (7%)

Query: 207 AYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
            Y +VKI+GWGEE    G+P  YW   +++G+Q+G+ G  KI RG NE  IE  V  A  
Sbjct: 371 GYHSVKILGWGEETNIYGQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWA 430

Query: 262 KDN 264
           + N
Sbjct: 431 ETN 433


>gi|145546673|ref|XP_001459019.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426842|emb|CAK91622.1| unnamed protein product [Paramecium tetraurelia]
          Length = 476

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 21/141 (14%)

Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG---------LYFDPHFGPFWPAFW 178
           ECK +   + K H R  N  +  G +    ++N +          L F+P F      F+
Sbjct: 340 ECKAV---EKKKHYRVINYRFIGGAYGKSNELNIMEEIHKNGPVVLNFEPSFDFM---FY 393

Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
                  T P +  NG          ++ Y      GWGEENG  YW + +++G+Q+G+ 
Sbjct: 394 VGGVFHSTIPDWIINGLAKPEWVDHSVLCY------GWGEENGVKYWLLQNSWGKQWGEN 447

Query: 239 GTIKILRGRNEAIIESLVNGA 259
           G  ++ RG++E+ IES+   A
Sbjct: 448 GRFRMKRGQDESSIESMAEAA 468


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +ILRG NE  IES V G 
Sbjct: 402 SVKITGWGEETLPDGRTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLGV 456


>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 33/51 (64%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V I+GWG EN  PYW + +++G  +G+ G  +ILRG N   IES  + A+P
Sbjct: 200 VNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIESSCSYAVP 250



 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V++VGWG  E G  YW   + +GE +G+KG  +I+ G N  +IES  +  +P
Sbjct: 509 VEVVGWGRTEEGVEYWIGRNNWGENWGEKGWFRIMMGGNNLLIESSCSWGVP 560


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +ILRG NE  IES V G 
Sbjct: 401 SVKITGWGEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLGV 455


>gi|146163744|ref|XP_001471259.1| cathepsin z [Tetrahymena thermophila]
 gi|146145941|gb|EDK31861.1| cathepsin z [Tetrahymena thermophila SB210]
          Length = 585

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 49/86 (56%), Gaps = 4/86 (4%)

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
           + T+Y R  +   G +Y  ++S     +  +++VGWGEEN   YW I +++G  +G+KG 
Sbjct: 203 YATEYLR--YNYTGGIYNDTSSYPGTNHV-IEVVGWGEENNEKYWIIRNSWGSYWGEKGF 259

Query: 241 IKILRGRNEAIIESL-VNGALPKDNY 265
            + LRG N   IES   N A+P D +
Sbjct: 260 YRQLRGVNMLNIESSNCNWAVPLDTW 285


>gi|294952601|ref|XP_002787371.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239902343|gb|EER19167.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 744

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 57/140 (40%), Gaps = 22/140 (15%)

Query: 108 GCQPVSFPPCNHANYTTSE-PECKTLA-TPQPKCHTRCTNDNYGRGFFQDKYQING---- 161
           GC P  F  CNH     +E P+CK  A  P P C T CTN  Y R   +D ++  G    
Sbjct: 494 GCWPYPFQKCNHVPTEKTEYPKCKDAAHPPLPPCRTTCTNKAYKRSLKKDVHRAKGWRKV 553

Query: 162 -------LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
                      FD   GP + AF      +Y +        VY V  + E  ++  +KI+
Sbjct: 554 LNNAQSVKQEIFD--NGPVFSAFKMYEDFRYYK------SGVY-VPTTEEFHSFHLIKII 604

Query: 215 GWGEENGRPYWTIVSTFGEQ 234
           GWG         +VS   E+
Sbjct: 605 GWGVHPDAQDLGVVSLLNEE 624


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 82/208 (39%), Gaps = 42/208 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  + G+VT     +    GC+    P C        EP     A P P
Sbjct: 46  CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 89

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
            C  +C   N  + + + K + +N   +  DPH         GP   AF     T Y   
Sbjct: 90  VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 142

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY       +  +A VK++GWG  + G  YW + + +   +GD G  KI+RG 
Sbjct: 143 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 200

Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
           NE  IE  V   +P       NY   FG
Sbjct: 201 NECGIEEDVVAGMPSTKNMVRNYDSAFG 228


>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
          Length = 326

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG+KG  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEKGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 78/211 (36%), Gaps = 41/211 (19%)

Query: 27  SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
           SC    A      ++  VC  SK      +F F A     C W        + C+ G   
Sbjct: 52  SCGSCWAFGAVEAMSDRVCIHSK---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 103

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W +   +G+V+GG + SN GC P    PC H    T  P CK      PKC  +C  D
Sbjct: 104 AAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGP-CKE-GGKTPKCVKKC-ED 160

Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
            Y   + QD             H G    A+  S      R    TNG V  A +   + 
Sbjct: 161 GYKVPYAQDL------------HHGK--SAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDF 206

Query: 206 VAY---------------ATVKIVGWGEENG 221
           +AY                 ++I+GWG +NG
Sbjct: 207 IAYRAGVYKHVAGKALGGHAIRILGWGVQNG 237


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 79/194 (40%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212

Query: 138 KCHTRCTNDNYGRGFFQDKY-QINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C   N  + + + KY  +N   +  DP+         GP   AF     T Y   
Sbjct: 213 KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAF-----TVYEDF 265

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY     +++  +A VK++GWG  + G  YW I + +   +GD G   I RG 
Sbjct: 266 AHYKSG-VYKHITGSQLGGHA-VKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGT 323

Query: 248 NEAIIESLVNGALP 261
           NE  IE  V   LP
Sbjct: 324 NECGIEEDVTAGLP 337


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 81/208 (38%), Gaps = 42/208 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  + G+VT     +    GC+                P C+  A P P
Sbjct: 170 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 213

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
            C  +C   N  + + + K + +N   +  DPH         GP   AF     T Y   
Sbjct: 214 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 266

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY       +  +A VK++GWG  + G  YW + + +   +GD G  KI+RG 
Sbjct: 267 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 324

Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
           NE  IE  V   +P       NY   FG
Sbjct: 325 NECGIEEDVVAGMPSTKNMVRNYDSAFG 352


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 80/209 (38%), Gaps = 32/209 (15%)

Query: 26  LSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
           +S I   AV+    ++  +C  S  K  VE ++   I+   + C            C  G
Sbjct: 35  ISFINKHAVSAVGAMSDRICIQSGGKQSVELSAIDLISCC-ENCGS---------GCDGG 84

Query: 84  ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC 143
                W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C  +C
Sbjct: 85  FPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQCKRKC 143

Query: 144 TNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPLFQTN 193
               Y   +  DK+   G+ +    +          +GP   A+   F            
Sbjct: 144 -QKGYTTPYEHDKHY-GGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDFLNYK 195

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGR 222
             +Y  + +   V    V+I+GWG EN R
Sbjct: 196 SGIYRYT-TGSFVGEHYVRIIGWGIENER 223


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 71/168 (42%), Gaps = 33/168 (19%)

Query: 87  STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
           + W +    G+ +GG ++S+ GCQP S     +A  +    EC    T +          
Sbjct: 153 NAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEAS----ECVKFYTLETNVAQ----- 203

Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIV 206
                  Q +   NG  + +   F  F        C K        +G  Y    S + V
Sbjct: 204 ------IQMEILTNGPVMAYYNVFEDF-------ACHK--------SGVYYY--KSGKFV 240

Query: 207 AYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAIIE 253
              +VK++GWG E G PYW I +++G ++G+ G   K+ RG NE  IE
Sbjct: 241 GRHSVKVIGWGTEEGIPYWLIANSWGSEWGELGGFFKMRRGTNECWIE 288


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 77/209 (36%), Gaps = 53/209 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W ++ +RG+VT         C P   P    A  +    + +++   + + 
Sbjct: 75  CAGGRLDGAWWYLRRRGVVT-------EDCYPYRPPQQTPAELSRCMMQSRSVGRGKRQA 127

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             RC N N    +  D YQ                P +  S   K      Q NG V A+
Sbjct: 128 TQRCPNTN---NYQNDIYQST--------------PPYRLSTSEKEIMKEIQDNGPVQAI 170

Query: 200 SASAEI-------------VAYA-----------TVKIVGWGEENG-----RPYWTIVST 230
               E              V++            +VKI GWGEE       R YW   ++
Sbjct: 171 MEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRKYWIAANS 230

Query: 231 FGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +G+ +G+ G  +I RG NE  IE+ V G 
Sbjct: 231 WGKNWGENGYFRIARGENECEIEAFVIGV 259


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 77/191 (40%), Gaps = 22/191 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPV-SFPPCNHANY------TTSEPECKTL 132
           C+ G  +  W +   +GLV+GG + S+ GC+   S  PC H  +      T   P+C   
Sbjct: 164 CNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMT 223

Query: 133 ATPQPKCHTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             P     T   + +YG   +   D  +     +Y +      +  +      K+    +
Sbjct: 224 CEPG---QTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKE--Y 278

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
           Q          + E+     + I+G   EN   YW + + +   +GD G  KILRG++  
Sbjct: 279 Q--------GVTGEMXGGHAICILGCKVENSTSYWLVANXWNRDWGDNGFFKILRGQDHY 330

Query: 251 IIESLVNGALP 261
            IES V   +P
Sbjct: 331 GIESEVVAEIP 341


>gi|124487938|gb|ABN12052.1| cathepsin B endopeptidase-like protein [Maconellicoccus hirsutus]
          Length = 66

 Score = 50.4 bits (119), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 2/61 (3%)

Query: 211 VKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVE 268
           ++I+GWG  ++   PYW + +++   +GD G  KI RG NE  IE  +N  +PK N  + 
Sbjct: 6   IRILGWGVCKKTNAPYWLVANSWNTDWGDHGYFKIKRGSNECGIEDSINAGIPKLNKDLR 65

Query: 269 F 269
           F
Sbjct: 66  F 66


>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 35/50 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V I G+G EN  PYWTI +++GEQ+G+ G  +++RG+N   +  LV+ A+
Sbjct: 410 VLITGYGIENNLPYWTIKNSWGEQWGENGYFQLMRGKNICGVSDLVSSAI 459


>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
 gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
          Length = 815

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 40/66 (60%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +Y  +A +  +    V+I+G+G E   P+W +++++G+ +G+ G  ++LRGRN   IE L
Sbjct: 572 LYTTTAGSPEIGNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIEEL 631

Query: 256 VNGALP 261
             G  P
Sbjct: 632 PVGMDP 637


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 41/68 (60%), Gaps = 7/68 (10%)

Query: 196 VYAVSASAEI--VAYATVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
           VY  S SAE+    Y +++I+GWGEE    G P  YW + +++G  +G+ G  +I RG N
Sbjct: 368 VYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQRGTN 427

Query: 249 EAIIESLV 256
           E  IES V
Sbjct: 428 ECEIESYV 435


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +ILRG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLGV 455


>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
          Length = 182

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 67/150 (44%), Gaps = 24/150 (16%)

Query: 89  WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 47  WKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 100

Query: 149 GRGFFQDKYQ------INGLGLYFD--PHFGPFWPAF--WRSFCTKYTRPLFQTNGRVYA 198
           G+   Q++Y+      +N +         +GP   +F  +  F       ++++   +Y 
Sbjct: 101 GKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASFDVYDDFS------VYKSG--IYR 152

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIV 228
            +  A+     ++KI+GWG++NG PYW  V
Sbjct: 153 KTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 182


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/54 (50%), Positives = 36/54 (66%), Gaps = 5/54 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
           +VKI GWGEE   +G+   YWT  +++G  +G+KG  +ILRG NE  IES V G
Sbjct: 399 SVKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVVG 452


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 75/195 (38%), Gaps = 40/195 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   S W +  +RG+VT     +  N GC           N+   EP     + P P
Sbjct: 163 CNGGFPLSAWRYFSRRGVVTDECDPYFDNDGC-----------NHPGCEP-----SYPTP 206

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTR 187
           +C   C ++   R      Y  N   +  DP+         GP   +F  +  F    T 
Sbjct: 207 RCVKNCKDNQ--RWSHSKHYSANAYRIKSDPYNIMAEVFNNGPVEVSFSVYEDFAHYETG 264

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                 GR     A         VK++GWG  ++G  YW I +++   +G+ G  KI RG
Sbjct: 265 VYKHVQGRYLGGHA---------VKLIGWGTTDDGIDYWLIANSWNTAWGEGGYFKIARG 315

Query: 247 RNEAIIESLVNGALP 261
            NE  IE      +P
Sbjct: 316 VNECGIERDPVAGMP 330


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 37/54 (68%), Gaps = 1/54 (1%)

Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 263
           VK+VGWG  ++G  YW++V+++   +G+ GT +ILRG++E  IES     LP +
Sbjct: 285 VKLVGWGTTDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECGIESNAVAGLPSN 338


>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 35/50 (70%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +V   GWGEENG  YW + +++G+Q+G+ G  ++ RG++E+ IES+   A
Sbjct: 433 SVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 482


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 81/208 (38%), Gaps = 42/208 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W +  + G+VT     +    GC+                P C+  A P P
Sbjct: 215 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 258

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
            C  +C   N  + + + K + +N   +  DPH         GP   AF     T Y   
Sbjct: 259 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 311

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY       +  +A VK++GWG  + G  YW + + +   +GD G  KI+RG 
Sbjct: 312 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 369

Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
           NE  IE  V   +P       NY   FG
Sbjct: 370 NECGIEEDVVAGMPSTKNMVRNYDSAFG 397


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 62/148 (41%), Gaps = 9/148 (6%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 38  CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 96

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
             +C  D    G+ +DK + N     +            R    +  T Y   L  ++G 
Sbjct: 97  VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 153

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRP 223
           VY  +  A +  +A V+I+GWGE    P
Sbjct: 154 VYFHALGAPMSGHA-VRILGWGELGNVP 180


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 52/97 (53%), Gaps = 10/97 (10%)

Query: 196 VYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE- 253
           VY  S++ ++ ++A V++VGWG   +G  YW   +++G  +GDKG  KI RG +EA  E 
Sbjct: 211 VYIKSSNTQVESHA-VRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE 269

Query: 254 -----SLVNGALPKDNYGVE--FGEESGERLSEEFGV 283
                +    ++P   YG+E  FG  S   L   F +
Sbjct: 270 GFITVTADTASVPTSQYGLEYQFGGNSSTFLKPSFLI 306


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 78/184 (42%), Gaps = 21/184 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G + +   ++ K+G+V+         C P + P C  A     +P    + TPQ  C
Sbjct: 136 CQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTCAPA----QQPCLNFVDTPQ--C 182

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRS----FCTKYTRPLFQTNGR 195
             +C+N +Y   + QD + I+G+    +P           +     C +           
Sbjct: 183 VEKCSNASYT--YAQDLHFIDGV-YSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSG 239

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +   ++  +  VK++GWG +N   YW   +++   +G++G   I  G NE  IES 
Sbjct: 240 VYQHTTGKDLGGHC-VKMIGWGTQNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIESD 298

Query: 256 VNGA 259
           V  A
Sbjct: 299 VVAA 302


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 41/68 (60%), Gaps = 7/68 (10%)

Query: 196 VYAVSASAEI--VAYATVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
           VY  S SAE+    Y +V+I+GWGEE    G P  YW + +++G  +G+ G  +I +G N
Sbjct: 368 VYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQKGTN 427

Query: 249 EAIIESLV 256
           E  IES V
Sbjct: 428 ECEIESYV 435


>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
          Length = 559

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 33/53 (62%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
            + +VGWGEENG  YW   +++G  +G++G  +I RG N   IES    A+PK
Sbjct: 225 AISVVGWGEENGEKYWIGRNSWGNYWGEEGWFRIARGINNLAIESECQWAVPK 277



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 182 CTKYTRPLF-QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
           C+   R  F   +G VY  S S+ +VA   V+I GWG ENGRPYW   +++GE +G++G 
Sbjct: 477 CSMTVRESFLDYHGGVYE-SDSSPMVAGHIVEIAGWGVENGRPYWIGRNSWGEYWGEEGW 535

Query: 241 IKILRGRNEAI 251
            +I   ++  I
Sbjct: 536 FRIDMEKDSGI 546


>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 236

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 1/64 (1%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  +RGLV+GG + S+ GC+P S PPC H +   + P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEGGSTPRC 208

Query: 140 HTRC 143
              C
Sbjct: 209 SRHC 212


>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
          Length = 322

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%)

Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +++GWGEENG PYW  ++++G +FG  G  K+  G N A  ES +    P 
Sbjct: 213 RVIGWGEENGTPYWLALNSWGTEFGMDGAFKVPMGENIAGFESQLLSVKPN 263


>gi|403223101|dbj|BAM41232.1| cysteine proteinase [Theileria orientalis strain Shintoku]
          Length = 489

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 31/44 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V I+GWGE +G  YW + +++G+ +GDKG  K+ RGRN   +ES
Sbjct: 417 VAIIGWGESDGFKYWLVRNSWGKDWGDKGFFKLTRGRNAFGVES 460


>gi|123377855|ref|XP_001298125.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121878571|gb|EAX85195.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 135

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 40/71 (56%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +  +G   +V +  E      V I GWG+E   P+W I++++G  +G  G++K LRG N 
Sbjct: 64  YYKSGVYQSVLSEEESSFQHAVVIYGWGKEKETPFWWILNSYGPNWGINGSMKFLRGSNH 123

Query: 250 AIIESLVNGAL 260
             IE+ V+ AL
Sbjct: 124 CNIETHVSSAL 134


>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 238

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 3/86 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH---SNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
           CS G   ++W ++H  G+V+G       +  GC P +FP C H    +    C       
Sbjct: 133 CSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQKESDYKPCAKELYDT 192

Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGL 162
           P C + C N  YG  F +D++    L
Sbjct: 193 PSCSSSCPNAKYGTAFDKDRHYTESL 218


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +K+VGWG  +G  YWTIV+++ E +G  G + I RG +E  IES V    PK
Sbjct: 259 IKVVGWGILDGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPK 310


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 61/155 (39%), Gaps = 27/155 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +  + GLVTGG+  + +GC+   FP CNH       P C     P P C
Sbjct: 38  CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPAC 96

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAF--WRSFCTKYT 186
           +  C  D     +  DK +      Y  P+            GP   AF  +  F    +
Sbjct: 97  NKTC--DTPEVNYILDKTKAK--SSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYES 152

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
              F + GR+    A         ++++GWGEENG
Sbjct: 153 GVYFHSFGRMIGGHA---------IRMLGWGEENG 178


>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
          Length = 242

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 36/50 (72%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V I G+G ENG PYWTI +++GE++G+ G  +++RG++   +  LV+ A+
Sbjct: 191 VLITGYGIENGLPYWTIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAI 240


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 79/190 (41%), Gaps = 29/190 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G   S W ++ K+G+VT         C+P + P C  A     +P    + TP   C
Sbjct: 144 CEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTCPPA----QQPCLNFVNTPN--C 190

Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQ 191
             +C + N    + QDK      Y IN +          GP    F     + Y   L  
Sbjct: 191 VKQCES-NSTLIYSQDKHKMAKIYSINSVEAIMQEISTNGPVEACF-----SVYEDFLGY 244

Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
            +G VY    + + +    VKI G+G  NG  YW++ +++   +GD G   I RG +E  
Sbjct: 245 KSG-VYQ-HTTGKFLGGHCVKIFGYGTLNGVNYWSVANSWTTSWGDNGIFLIKRGSDECG 302

Query: 252 IESLVNGALP 261
           IE  V   +P
Sbjct: 303 IEDEVVAGIP 312


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 4/63 (6%)

Query: 204 EIVAYATVKIVGWGEENGR----PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           ++  Y +V+I+GWGE+        YW   +++G ++G+ G  +ILRG N   IES V GA
Sbjct: 369 KVQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGA 428

Query: 260 LPK 262
             K
Sbjct: 429 WGK 431


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 10/76 (13%)

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGTIKI 243
           P     G  YA S       Y +V+I+GWG ++  G P  YW   +++GE++G+ G  +I
Sbjct: 479 PYANDKGPAYARSG------YHSVRILGWGVDHSTGVPIKYWLCANSWGEEWGENGLFRI 532

Query: 244 LRGRNEAIIESLVNGA 259
           LRG N   IES + GA
Sbjct: 533 LRGENHCDIESFIIGA 548


>gi|294898471|ref|XP_002776250.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239883121|gb|EER08066.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 219

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 7/91 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTL 132
           C+ G       ++   G+VTG          S  GC P   P CNHA+   S+ P+C + 
Sbjct: 85  CNRGNLIEGLNFMKNHGIVTGNEFKPADQLASADGCWPYPLPKCNHASSAASQYPKCPSE 144

Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
           A  QP C T C N++Y     QD ++    G
Sbjct: 145 ALSQPACQTECINESYKTSLQQDLHRAKSWG 175


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 30/40 (75%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
            V ++GWG E+G PYW + +++G  +G+KG  KI+RG+NE
Sbjct: 228 AVLLIGWGVEDGVPYWLLQNSWGPAWGEKGHFKIIRGKNE 267


>gi|66814230|ref|XP_641294.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|60469326|gb|EAL67320.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 291

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 64/150 (42%), Gaps = 21/150 (14%)

Query: 117 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--GPFW 174
           C + N+  S P     A P            Y   F ++  Q+NG        F  GP  
Sbjct: 158 CKNCNFDLSNPTADCFAQP-----------TYTTYFVEEHGQVNGSVAMMQEIFARGPIA 206

Query: 175 PAFWRSFC-TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
                +     YT  +F +     +V ++ EI     + I+GWG ENG  YW   +++G 
Sbjct: 207 CGMEVTDAFESYTSGVFTS-----SVGSTGEI--NHEISIIGWGTENGVDYWIGRNSWGT 259

Query: 234 QFGDKGTIKILRGRNEAIIESLVNGALPKD 263
            FG+ G  +I RG +   IES  + A+PK+
Sbjct: 260 YFGELGFFRIQRGIDLLSIESACDWAVPKN 289


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 29/46 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           +KI+GWG E G  YW + +++   +G  GT KILRG NE  IE  V
Sbjct: 264 IKIIGWGVEGGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDV 309


>gi|402588459|gb|EJW82392.1| papain family cysteine protease containing protein [Wuchereria
           bancrofti]
          Length = 323

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 8/115 (6%)

Query: 148 YGRGFFQDKYQIN---GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
           YG  F    YQIN       +F  +  P   A   +F  +Y    F  +G +      + 
Sbjct: 212 YGEIFIDKLYQINPDPNAMAWFVANVAPI--ALNLAFPKRYK---FYKSGILPDTDECST 266

Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +      +++G+G ENG+ YW + +++GE +GD+G  KI RG N   +E+ V  A
Sbjct: 267 MEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKIERGINACKVETYVASA 321


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
           VK++GWG  ++G  YWTIV+++   +G+ G  +I RG NE  IES     LP D 
Sbjct: 279 VKLIGWGTTDDGVDYWTIVNSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDK 333


>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
          Length = 500

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 35/52 (67%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +V   GWGEE+G  +W + +++G Q+G+ G+ ++ RG +E+ IES+   A P
Sbjct: 427 SVLCYGWGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAADP 478


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 424


>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
          Length = 326

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 468 SVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 522


>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
 gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 370

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 30/36 (83%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           V +VG+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 67/166 (40%), Gaps = 22/166 (13%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G S   W +  ++G+VTGG +++   C+P    PC +        EC  LA   P+C
Sbjct: 43  CQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLAD-TPRC 101

Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
             RC         ++ +YGR  +Q    +  +      + GP    F     T Y     
Sbjct: 102 KRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRN-GPVVAGF-----TVY-EDFA 154

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGR----PYWTIVSTFG 232
              G +Y  ++  +   +A VK++GWG E       PYW      G
Sbjct: 155 HYKGGIYKHTSGKKTGGHA-VKVIGWGSEQKGSEKIPYWXHCXLHG 199


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 400 SVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 454


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 388 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 442


>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455


>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
          Length = 326

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
          Length = 326

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 424


>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
          Length = 326

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
          Length = 259

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 208 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 259


>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 204

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +  +  + +  ++I+GWG E   PYW   +++ +++G+ G +K+ RG     IES 
Sbjct: 137 VYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCANSWSKEWGENGYVKVRRGVQAGYIESY 196

Query: 256 VNGALPK 262
           V   +PK
Sbjct: 197 VRAPIPK 203


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLGV 455


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455


>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
 gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)

Query: 185 YTRPLFQTNGRVYAVSASA-EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
           Y   L+ ++G VY  SA+  + +A   V+I+GWG ENG  YW + + +G+ +G +G I I
Sbjct: 184 YQDFLYYSSG-VYQHSANLRQPIAKFVVRIIGWGVENGVKYWIVPNIWGKTWGMQGYIWI 242

Query: 244 LRGRNEAIIE 253
            RG NE+ IE
Sbjct: 243 RRGNNESNIE 252


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455


>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
           [Loxodonta africana]
          Length = 468

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 402 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 456


>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 229

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           VY  +  +  + +  ++I+GWG E   PYW   +++ +++G+ G +K+ RG     IES 
Sbjct: 162 VYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCANSWSKEWGENGYVKVRRGVQAGYIESY 221

Query: 256 VNGALPK 262
           V   +PK
Sbjct: 222 VRAPIPK 228


>gi|290998718|ref|XP_002681927.1| predicted protein [Naegleria gruberi]
 gi|284095553|gb|EFC49183.1| predicted protein [Naegleria gruberi]
          Length = 303

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 29/36 (80%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           + +VGWGEENG+ YW + +++GE +G++G  +I+RG
Sbjct: 248 ISVVGWGEENGKKYWIVRNSWGEPYGEQGFFRIIRG 283


>gi|449663703|ref|XP_002169139.2| PREDICTED: uncharacterized protein LOC100198320 [Hydra
            magnipapillata]
          Length = 1092

 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 182  CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
            C + T   + +   VY      E+  +A V I+G+G EN +PYW I +++G+ +GD G +
Sbjct: 960  CARKTFKFYSSG--VYDDPKCTEVTDHAVV-IIGYGVENNKPYWLIKNSWGKLWGDNGYM 1016

Query: 242  KILRGRNEAIIESLVNGAL 260
            KI    N  +   L NGAL
Sbjct: 1017 KI--DMNNNLCGVLTNGAL 1033


>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
           [Loxodonta africana]
          Length = 437

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 371 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 425


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 350


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 280 SVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 334


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 75/192 (39%), Gaps = 33/192 (17%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
           +C  +C +DN      + +    Y +N             GP   +F     T Y     
Sbjct: 215 RCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSF-----TVYEDFAH 269

Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
             +G VY     + I  +A VK++GWG  N G  YW + + +   +GD G   I RG NE
Sbjct: 270 YKSG-VYKHITGSNIGGHA-VKLIGWGTSNEGEDYWLMANQWNRGWGDDGYFMIRRGTNE 327

Query: 250 AIIESLVNGALP 261
             IE      LP
Sbjct: 328 CGIEDEPVAGLP 339


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 362 SVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 416


>gi|358255491|dbj|GAA57187.1| cathepsin L [Clonorchis sinensis]
          Length = 368

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 34/46 (73%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           ++ +VG+GEENG PYW I +++GE +G+KG +++ RG N   + S+
Sbjct: 317 SMVVVGYGEENGTPYWIIKNSWGEHWGEKGYLRLRRGVNMCGVASV 362


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 69/158 (43%), Gaps = 18/158 (11%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W +  + G+V+         CQP  FPPC H   +T    C ++    P C
Sbjct: 65  CNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPC-SVEYDTPFC 116

Query: 140 HTRCTNDNYGRGFF-QDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGRV 196
           +  CTN      +  +  Y ++G   Y    F  GPF  AF     T Y   +  ++G  
Sbjct: 117 NITCTNTIPPIKYKGRISYSLSGEEDYKRELFLYGPFEVAF-----TVYEDFVAYSDGVY 171

Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
              S +A  +    V++VGWG  NG PYW I +++  +
Sbjct: 172 KHFSGNA--LGGHAVRLVGWGNLNGTPYWKIANSWNHE 207


>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Nomascus leucogenys]
          Length = 436

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424


>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Nomascus leucogenys]
          Length = 467

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|159488843|ref|XP_001702410.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
 gi|158271078|gb|EDO96905.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
          Length = 382

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           +  NG +Y    S +      V++VGWGEE+G  YW + +++G  +G++G  ++ RG N 
Sbjct: 242 WHYNGGIYK-DTSGDTELDHDVEVVGWGEEDGEKYWIVRNSWGTYWGERGFFRVRRGDNS 300

Query: 250 AIIES 254
             +ES
Sbjct: 301 LQLES 305


>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
          Length = 232

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A+ K
Sbjct: 181 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 232


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 75/196 (38%), Gaps = 30/196 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAH-HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
           CS G     W ++ +RG+VT   +  ++   QP + P   H+  T       T   P P+
Sbjct: 269 CSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQ 328

Query: 139 CHTRCTNDNYG-----------RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
            H    ND Y            +   ++  +   +    + H   F           Y  
Sbjct: 329 THA---NDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFL----------YKS 375

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-----GRPYWTIVSTFGEQFGDKGTIK 242
            +++            +     +VKI GWGEE       + YWT  +++G  +G+ G  +
Sbjct: 376 GIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAANSWGRAWGEDGHFR 435

Query: 243 ILRGRNEAIIESLVNG 258
           I RG NE  +ES V G
Sbjct: 436 IARGVNECEVESFVVG 451


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 80/194 (41%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C +DN  + + + K Y ++   +  +P          GP   +F     T Y   
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 267

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY     + I  +A VK++GWG  + G  YW + + +   +GD G   I RG 
Sbjct: 268 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325

Query: 248 NEAIIESLVNGALP 261
           NE  IE      LP
Sbjct: 326 NECGIEDEPVAGLP 339


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 396 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 450


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 406 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 460


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424


>gi|290974021|ref|XP_002669745.1| predicted protein [Naegleria gruberi]
 gi|284083296|gb|EFC37001.1| predicted protein [Naegleria gruberi]
          Length = 335

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 204 EIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +I ++++ KI+GWG  E+  PYW  V  FG  +G+ G   +LRG +E  IES    ALP
Sbjct: 272 DIGSFSSTKIIGWGVAEDQTPYWICVFEFGTDWGNNGMFWMLRGADECGIESSAWSALP 330


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 370 SVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 80/194 (41%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C +DN  + + + K Y ++   +  +P          GP   +F     T Y   
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 267

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY     + I  +A VK++GWG  + G  YW + + +   +GD G   I RG 
Sbjct: 268 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325

Query: 248 NEAIIESLVNGALP 261
           NE  IE      LP
Sbjct: 326 NECGIEDEPVAGLP 339


>gi|301609080|ref|XP_002934105.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
          Length = 334

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 29/38 (76%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V IVG+ +ENG+ YW + +++GE FGDKG IK+ R RN
Sbjct: 283 VLIVGYSKENGQYYWLVKNSWGEYFGDKGYIKMARKRN 320


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 401 SVKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 6/77 (7%)

Query: 189 LFQTNGRVYAVSA--SAEIVAYATVKIVGWG----EENGRPYWTIVSTFGEQFGDKGTIK 242
            F   G VY  S   S +   Y +V+IVGWG    + N   YW + +++G  +G+ G  +
Sbjct: 358 FFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFR 417

Query: 243 ILRGRNEAIIESLVNGA 259
           I+RG NE+ IE  V  A
Sbjct: 418 IVRGENESDIEKFVLAA 434


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 35/50 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V I G+G ENG PYWTI +++G+Q+G+ G  +++ G++   +  LV+ A+
Sbjct: 421 VLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAI 470


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 28/38 (73%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V  VG+G ENG PYW I +++GE +GDKG  K+ RG+N
Sbjct: 311 VLAVGYGVENGTPYWLIKNSWGESWGDKGYFKMERGKN 348


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLGV 350


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 52/185 (28%), Positives = 76/185 (41%), Gaps = 43/185 (23%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212

Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
           KC  +C + N      + +    Y++N      DPH         GP   AF     T Y
Sbjct: 213 KCVKKCVSGNQVWKKSKHYSVSAYRVNS-----DPHDIMAEVYKNGPVEVAF-----TVY 262

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKIL 244
               +  +G VY      E+  +A VK++GWG  ++G  YW + + +  ++GD G  KI 
Sbjct: 263 EDFAYYKSG-VYKHITGYELGGHA-VKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIR 320

Query: 245 RGRNE 249
           RG NE
Sbjct: 321 RGTNE 325


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 35/50 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V I G+G ENG PYWTI +++G+Q+G+ G  +++ G++   +  LV+ A+
Sbjct: 386 VLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAI 435


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLV+GG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206

Query: 140 HTRCTNDNYGRGFFQDKY 157
             +C +  Y   +  DK+
Sbjct: 207 ILQCES-GYTPSYKADKH 223


>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
          Length = 163

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 32/70 (45%), Gaps = 5/70 (7%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W W  K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 38  CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93

Query: 140 HTRCTNDNYG 149
           H RCT   YG
Sbjct: 94  H-RCTRMCYG 102


>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
           griseus]
 gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
          Length = 465

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 400 SVKITGWGEEKLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIESFVLGV 454


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEENGRP-----YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE GR      YW   +++G  +G+ G  +I RG NE  IE+ + G 
Sbjct: 417 SVKITGWGEERGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGV 471


>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
          Length = 464

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 34/52 (65%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +V   GWGEE G  +W + +++G+Q+G+ G  ++ RG +E+ IES+   + P
Sbjct: 373 SVLCYGWGEEEGVKFWMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDP 424


>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
 gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 29/35 (82%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350


>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 29/35 (82%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355


>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
          Length = 372

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 29/35 (82%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S+ W +    GLV+GG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 44  CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 102

Query: 140 HTRCTNDNYGRGFFQDKY 157
             +C +  Y   +  DK+
Sbjct: 103 ILQCES-GYTPSYKADKH 119


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 77/209 (36%), Gaps = 53/209 (25%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G     W ++ +RG+VT         C P   P    A       + +++   + + 
Sbjct: 272 CTGGRIDGAWWFLRRRGVVT-------EDCYPYRPPQQTPAELGRCMMQSRSVGRGKRQA 324

Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
             RC N N    +  D YQ                P +  S   K      Q NG V A+
Sbjct: 325 TQRCPNTN---NYQNDIYQST--------------PPYRLSTNEKEIMKEIQDNGPVQAI 367

Query: 200 SASAEI-------------VAYA-----------TVKIVGWGEENG-----RPYWTIVST 230
               E              V++            +VKI GWGEE       R YW   ++
Sbjct: 368 MEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRKYWIAANS 427

Query: 231 FGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +G+ +G++G  +I RG NE  IE+ V G 
Sbjct: 428 WGKNWGEEGYFRIARGENECEIEAFVIGV 456


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 6/82 (7%)

Query: 184 KYTRPLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
           K +R  F     VY  S  AS     Y +V+IVGWGEE   G+   YW   +++G  +G+
Sbjct: 353 KVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGE 412

Query: 238 KGTIKILRGRNEAIIESLVNGA 259
            G  +IL+G +E  IE  V  A
Sbjct: 413 NGYFRILKGVDECEIEDFVIAA 434


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 256 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 310


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +V+IVGWGE+    YW + +++G  +G+KG  +I RG +   IES V   LP
Sbjct: 350 SVRIVGWGEDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401


>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
          Length = 156

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 2/78 (2%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  +RGLV+GG + S+ GC   + PPC H +   S P C       P+C
Sbjct: 63  CNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEH-HVNGSRPPCTGEGGETPRC 121

Query: 140 HTRCTNDNYGRGFFQDKY 157
              C    Y   + +DK+
Sbjct: 122 SRHC-EPGYSPSYKEDKH 138


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 91/244 (37%), Gaps = 61/244 (25%)

Query: 5   TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
           T + IRD S             SC    AVA A+ ++   C    +       R  AG  
Sbjct: 12  TITEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C  +       + C+ G     W +    G+V+         CQP  FP C H   ++
Sbjct: 55  MSCCDVCG-----YGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102

Query: 125 SEPECKTLATPQPKCHTRCT-----------NDNY---GRGFFQDKYQINGLGLYFDPHF 170
               C       P C++ CT           N +Y   G   F+ +  +NG         
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKVPLIKYRGNTSYLLSGEESFKRELLLNG--------- 152

Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVST 230
            PF  +F     + Y   L  T G VY   A   +  +A V+IVGWGE NG PYW I ++
Sbjct: 153 -PFEVSF-----SVYADFLAYTGG-VYKHVAGTFLGGHA-VRIVGWGELNGEPYWKIANS 204

Query: 231 FGEQ 234
           +  +
Sbjct: 205 WNHE 208


>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
          Length = 218

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 152 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 206


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 55/153 (35%), Gaps = 23/153 (15%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C  G     W +    G+VTGG+     GC+P  FP C H +     P C     P PKC
Sbjct: 38  CDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 96

Query: 140 HTRC-----------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
              C           T  N      Q +  I    L   P           +F      P
Sbjct: 97  VKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGP--------VEATFEVHEDFP 148

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
            +++    +A   S   V    ++I+GWGEENG
Sbjct: 149 EYKSGIYFHAWGGS---VGGHAIRILGWGEENG 178


>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
          Length = 218

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V G 
Sbjct: 152 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 206


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 31/52 (59%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  +GWG E+  PYW   +++G  +G+KG  KILRG N   IE+ V G   K
Sbjct: 229 VLCIGWGVEDNTPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQVYGPQMK 280


>gi|340509339|gb|EGR34889.1| nucleotide binding protein, putative [Ichthyophthirius multifiliis]
          Length = 732

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/48 (52%), Positives = 35/48 (72%), Gaps = 3/48 (6%)

Query: 210 TVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
           +V  VGWGE+  NG+ YW + +++GE +G+KG  KI RG NEA IES+
Sbjct: 665 SVLCVGWGEDDINGK-YWIVQNSWGESWGEKGYFKIARGNNEASIESM 711


>gi|58617822|gb|AAW80530.1| cathepsin L-like cysteine protease [Leishmania infantum]
          Length = 234

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 38/56 (67%)

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           AV AS+ +   + V +VG+ +  G PYW I +++GE +G+KG ++++ GRN  +++
Sbjct: 127 AVDASSFMSYQSGVLLVGYNKTGGVPYWVIKNSWGEDWGEKGYVRVVMGRNACLLK 182


>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
          Length = 385

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 28/39 (71%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           V +VG+GEENG PYW I +++G  +G+ G +KILR  N 
Sbjct: 334 VLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNN 372


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 28/39 (71%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           V +VG+GEENG PYW I +++G  +G+ G +KILR  N 
Sbjct: 322 VLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNN 360


>gi|301122543|ref|XP_002908998.1| cathepsin B, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
 gi|262099760|gb|EEY57812.1| cathepsin B, cysteine protease family C01A, putative [Phytophthora
           infestans T30-4]
          Length = 384

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 31/44 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V+IVGWGEE+G  YW I +++G  +G  G  KI+RG+N   IE+
Sbjct: 228 VEIVGWGEEDGVKYWHIRNSWGTYWGMNGFFKIVRGKNNLGIEA 271


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 91/244 (37%), Gaps = 61/244 (25%)

Query: 5   TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
           T + IRD S             SC    AVA A+ ++   C    +       R  AG  
Sbjct: 12  TITEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C  +       + C+ G     W +    G+V+         CQP  FP C H   ++
Sbjct: 55  MSCCDVCG-----YGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102

Query: 125 SEPECKTLATPQPKCHTRCT-----------NDNY---GRGFFQDKYQINGLGLYFDPHF 170
               C       P C++ CT           N +Y   G   F+ +  +NG         
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKVPLIKYRGNTSYLLSGEESFKRELLLNG--------- 152

Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVST 230
            PF  +F     + Y   L  T G VY   A   +  +A V+IVGWGE NG PYW I ++
Sbjct: 153 -PFEVSF-----SVYADFLAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANS 204

Query: 231 FGEQ 234
           +  +
Sbjct: 205 WNHE 208


>gi|440801087|gb|ELR22112.1| papain family cysteine protease subfamily protein, partial
           [Acanthamoeba castellanii str. Neff]
          Length = 557

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 39/61 (63%), Gaps = 1/61 (1%)

Query: 200 SASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
           S++++ V    + +VGWG + NG  YW I +++  Q+GDKG  ++ RG N+A IE  V+ 
Sbjct: 275 SSASDYVGGHAIAVVGWGTDVNGVDYWLIENSWSTQWGDKGYYRMKRGVNQAGIEGYVSA 334

Query: 259 A 259
           A
Sbjct: 335 A 335


>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
 gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
          Length = 488

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 29/44 (65%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V +VGWGEENG PYW + +++G  +G  G  KI RG +E   ES
Sbjct: 435 VLLVGWGEENGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCES 478


>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
 gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
          Length = 467

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IE+ V G 
Sbjct: 401 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 455


>gi|320162754|gb|EFW39653.1| papain family cysteine protease [Capsaspora owczarzaki ATCC 30864]
          Length = 589

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 77/180 (42%), Gaps = 29/180 (16%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGC--QPVSFPPCNHANYTTSEPECKTLATPQP 137
           C+ G   + +AW+   G+      H +TG   +  + P  ++    T EP  K  A P  
Sbjct: 424 CNGGDPLAAYAWIAVNGI------HDDTGTWYEAKNLPCTDYYKCHTCEPSGKCNAVPN- 476

Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
                C N  +G   F    +I G        F  GP       +     T  L    G 
Sbjct: 477 -----CLN--FGVAQFG---EIVGEAAMKAEIFARGPV------AVTIAVTTDLINYTGG 520

Query: 196 VYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V+  +  A I    +V + GWG +N G PYWTIV+++G  +G+ G  +I+RG N   IES
Sbjct: 521 VFHDTTGA-IGDDHSVMLTGWGVDNSGTPYWTIVNSWGTYWGETGAARIVRGVNNLGIES 579


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212

Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
           KC  +C   N  + R      Y +    +  DP          GP   AF     T +  
Sbjct: 213 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAF-----TVFED 264

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                +G    ++ SA  +    VK++GWG  + G  YW + + +   +GD G  KI RG
Sbjct: 265 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 322

Query: 247 RNEAIIESLVNGALP 261
            NE  IE  V   LP
Sbjct: 323 TNECGIEDDVTAGLP 337


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214

Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
           KC  +C   N  + R      Y +    +  DP          GP   AF     T +  
Sbjct: 215 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAF-----TVFED 266

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                +G    ++ SA  +    VK++GWG  + G  YW + + +   +GD G  KI RG
Sbjct: 267 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 324

Query: 247 RNEAIIESLVNGALP 261
            NE  IE  V   LP
Sbjct: 325 TNECGIEDDVTAGLP 339


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 29/46 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           +KIVGWG EN   YW + +++G  +G  G  KI RG NE  IE+ V
Sbjct: 180 IKIVGWGVENNVKYWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214

Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
           KC  +C   N  + R      Y +    +  DP          GP   AF     T +  
Sbjct: 215 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAF-----TVFED 266

Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
                +G    ++ SA  +    VK++GWG  + G  YW + + +   +GD G  KI RG
Sbjct: 267 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 324

Query: 247 RNEAIIESLVNGALP 261
            NE  IE  V   LP
Sbjct: 325 TNECGIEDDVTAGLP 339


>gi|170579559|ref|XP_001894882.1| cathepsin F-like cysteine proteinase [Brugia malayi]
 gi|158598358|gb|EDP36268.1| cathepsin F-like cysteine proteinase, putative [Brugia malayi]
          Length = 137

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 35/50 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V I G+G E+  PYWTI +++GEQ+G+ G  +++RG++   +  LV+ A+
Sbjct: 86  VLITGYGIEDNLPYWTIKNSWGEQWGENGYFRLMRGKDICGVSDLVSSAI 135


>gi|294935201|ref|XP_002781340.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239891890|gb|EER13135.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 77

 Score = 47.4 bits (111), Expect = 0.009,   Method: Composition-based stats.
 Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
           T  I+GWG E G  YW +++++ E +GD GT KI +G  +  I+  V G+LP  N
Sbjct: 25  TSLIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 77


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 6/81 (7%)

Query: 185 YTRPLFQTNGRVYAVSAS-AEIVAYATVKIVGWGEE-----NGRPYWTIVSTFGEQFGDK 238
           Y R +++   + +  S S ++   + +VKIVGWG +     N   YW   +++G  +G++
Sbjct: 361 YNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNWGEQ 420

Query: 239 GTIKILRGRNEAIIESLVNGA 259
           G  +I+RG NE  IES V G 
Sbjct: 421 GMFRIVRGVNECEIESFVLGV 441


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IE+ V G 
Sbjct: 400 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 454


>gi|188501543|gb|ACD54672.1| cysteine protease [Adineta vaga]
          Length = 333

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 35/59 (59%)

Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
           AV    +I  Y  ++IVG+G E G+PYW   ++ G+ +G++G  +I R +N   I  LV
Sbjct: 241 AVDYVVKINKYYELQIVGYGVERGKPYWICKNSLGQNWGEEGYFRIARDKNMCRIAELV 299


>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A  K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A  K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A  K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IE+ V G 
Sbjct: 349 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 403


>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 34/52 (65%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           +V   GWGEE+G  +W + +++G Q+G+ G  ++ RG +E+ IES+   + P
Sbjct: 399 SVLCYGWGEEDGVKFWMLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDP 450


>gi|312068028|ref|XP_003137021.1| papain family cysteine protease containing protein [Loa loa]
 gi|307767820|gb|EFO27054.1| papain family cysteine protease containing protein [Loa loa]
          Length = 332

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 32/48 (66%)

Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +++G+G ENG+ YW I +++GE +GD G  KI RG N   +E+ V  A
Sbjct: 283 EVIGYGTENGKKYWLIKNSWGEWWGDHGFFKIERGINACQVETYVASA 330


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 11/78 (14%)

Query: 196 VYAVSASAEIVA-------YATVKIVGWGEE----NGRPYWTIVSTFGEQFGDKGTIKIL 244
           VY+ +A   IV        Y +VKI+GWGE+    N   YW + +++G  +G+ G  +I 
Sbjct: 456 VYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWIVQNSWGANWGEGGYFRIR 515

Query: 245 RGRNEAIIESLVNGALPK 262
           +G NE  IE ++  A P+
Sbjct: 516 KGVNECGIEEMILAAWPQ 533


>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
          Length = 599

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 33/53 (62%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +V  VGWGE     YW + +++GE +G+KG  KI RG +E+ IES+   A  K
Sbjct: 532 SVLCVGWGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIESMGERAFIK 584


>gi|324105223|gb|ADY18374.1| cathepsin B [Glycera tridactyla]
          Length = 117

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 83  GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
           G   S W +    G+VTGG ++++ GC+P + P C H +   + P C +   P P+C  +
Sbjct: 1   GFPRSAWEYFKVTGIVTGGQYNTHEGCRPYTIPKCEH-HVNGTLPPCSSTIKPTPRCERK 59

Query: 143 C 143
           C
Sbjct: 60  C 60


>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V  VG+G +NG+PYW + +++GE FG++G  +I RG     I S+V  A  K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
          Length = 237

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W++  K+GLV+GG ++S+ GC P + PPC H +   S P C T     P+C
Sbjct: 144 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 201

Query: 140 HTR 142
           + +
Sbjct: 202 NKK 204


>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 29/38 (76%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           T+KI+GWG E+G+ YW  V+++ E++GD G IK+  GR
Sbjct: 80  TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 117


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 29/45 (64%)

Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           +E G PYW IV+++GE FG  G + I RG NE  IES V   +PK
Sbjct: 320 KEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           V IVGWG E+  PYW + +++G  +G+ G  KILRG +    ES V    P+
Sbjct: 229 VLIVGWGVEDEVPYWLVQNSWGTDWGENGFFKILRGSDHCECESNVTAGYPE 280


>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 29/38 (76%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           T+KI+GWG E+G+ YW  V+++ E++GD G IK+  GR
Sbjct: 80  TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 117


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           VKI+GWG ENG  YW I +++G  FG +G  KI+RG
Sbjct: 184 VKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIVRG 219


>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
          Length = 228

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 86  SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     P+C   C  
Sbjct: 135 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 191

Query: 146 DNYGRGFFQDKY 157
             Y   + +DK+
Sbjct: 192 PGYSPSYKEDKH 203


>gi|348513412|ref|XP_003444236.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 328

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 2/64 (3%)

Query: 186 TRPLFQTNGR-VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
           +RP F    R VY  +   + V + ++ +VG+G E G+ YW + +++G QFG++G IK+ 
Sbjct: 252 SRPQFHFYHRGVYMDNTCTQKVNHGSL-VVGYGREKGQDYWLVKNSWGVQFGEEGYIKMA 310

Query: 245 RGRN 248
           R RN
Sbjct: 311 RNRN 314


>gi|348676075|gb|EGZ15893.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 383

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 31/44 (70%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V+IVGWGEE+G  YW + +++G  +G  G  KI+RG+N   IE+
Sbjct: 224 VEIVGWGEEDGVKYWHVRNSWGTYWGMNGFFKIVRGKNNLGIEA 267


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IE+ V G 
Sbjct: 401 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGV 455


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 36/54 (66%), Gaps = 1/54 (1%)

Query: 208 YATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           Y +V+IVG+G  + G+ YW + + +G  +G+ G  +I+RG+NE  IE  V GA+
Sbjct: 245 YLSVEIVGYGTSDEGQDYWIVKNYWGSNWGEDGYFRIVRGQNECQIEEAVYGAI 298


>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 118

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 29/38 (76%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           T+KI+GWG E+G+ YW  V+++ E++GD G IK+  GR
Sbjct: 68  TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 105


>gi|308163070|gb|EFO65432.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 97

 Score = 46.6 bits (109), Expect = 0.013,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 4/77 (5%)

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKI 243
           R      G VY      +I ++A V+I+G+G   +E+  PYW + ++ G  +G+ G   I
Sbjct: 18  RDFLYYRGGVYRHVYGVQISSHA-VEIIGYGTTDDEDRVPYWIVKNSLGPNWGEDGYFNI 76

Query: 244 LRGRNEAIIESLVNGAL 260
           +RG NE  IES V+  L
Sbjct: 77  VRGSNECDIESAVHSGL 93


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 20/147 (13%)

Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWP 175
           S P C+  A   PKC  +C   N  + + + K + +N   +  DP+         GP   
Sbjct: 53  SHPGCEP-AYQTPKCVRKCVKGN--QIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEV 109

Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQ 234
           AF     T Y       +G VY     +++  +A VK++GWG  + G  YW I + +   
Sbjct: 110 AF-----TVYEDFAHYKSG-VYKHITGSQLGGHA-VKLIGWGTTDEGEDYWLIANQWNRS 162

Query: 235 FGDKGTIKILRGRNEAIIESLVNGALP 261
           +GD G   I RG NE  IE  V   LP
Sbjct: 163 WGDDGYFMIRRGTNECGIEEDVTAGLP 189


>gi|195455847|ref|XP_002074892.1| GK22908 [Drosophila willistoni]
 gi|194170977|gb|EDW85878.1| GK22908 [Drosophila willistoni]
          Length = 381

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 30/38 (78%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V +VG+G ENG+ YWTI +++GE +G+ G  +++RG+N
Sbjct: 331 VLVVGYGSENGQDYWTIKNSWGENWGESGYFRLIRGQN 368


>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 105

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 2/52 (3%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
           VKI+GWGE++G+ YW  V+++ E +GD G  KI  G N  I + L+ G  PK
Sbjct: 55  VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGGT-PK 104


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   +G+   YWT  +++G  +G+ G  +I+RG NE  IES V G 
Sbjct: 400 SVKITGWGEEIQPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVVGV 454


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 73/190 (38%), Gaps = 39/190 (20%)

Query: 70  LVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC 129
           LVS   T   C+ G     WAW    G+ T       +G   V                 
Sbjct: 117 LVSCDTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSGRV----------------- 159

Query: 130 KTLATPQPKCHTRCTNDNYGRGFFQDK----YQINGLGLYFDPH-FGPFWPAFWRSFCTK 184
                  P C  +C N   G    ++K     ++N   +  + +  GP   AF     T 
Sbjct: 160 -------PACPAKCVN---GSAIVRNKSVSYKKLNAQQMMEELYENGPISVAF-----TV 204

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
           Y   +   +G VY V  +  I     V  VGWG E+  PYW   +++G  +G+KG  KIL
Sbjct: 205 YYDFMNYKSG-VY-VHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNSWGPAWGEKGHFKIL 262

Query: 245 RGRNEAIIES 254
           RG N   IE+
Sbjct: 263 RGSNHCGIEN 272


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 196 VYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           VY      E+  +A VK++GWG  E+G  YW + + +   +GD G  KI RG NE  IE 
Sbjct: 37  VYKHVTGDELGGHA-VKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECDIED 95

Query: 255 LVNGALP 261
            V   +P
Sbjct: 96  EVVAGMP 102


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 73/190 (38%), Gaps = 39/190 (20%)

Query: 70  LVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC 129
           LVS   T   C+ G     WAW    G+ T       +G   V                 
Sbjct: 117 LVSCDTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSGRV----------------- 159

Query: 130 KTLATPQPKCHTRCTNDNYGRGFFQDK----YQINGLGLYFDPH-FGPFWPAFWRSFCTK 184
                  P C  +C N   G    ++K     ++N   +  + +  GP   AF     T 
Sbjct: 160 -------PACPAKCVN---GSAIVRNKSVSYKKLNAQQMMEELYENGPISVAF-----TV 204

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
           Y   +   +G VY V  +  I     V  VGWG E+  PYW   +++G  +G+KG  KIL
Sbjct: 205 YYDFMNYKSG-VY-VHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNSWGPAWGEKGHFKIL 262

Query: 245 RGRNEAIIES 254
           RG N   IE+
Sbjct: 263 RGSNHCGIEN 272


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE   NGR   +W   +++G  +G+ G+ +ILRG NE  IES V G 
Sbjct: 255 SVKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVGV 309


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 79/194 (40%), Gaps = 37/194 (19%)

Query: 80  CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
           C  G   + W +    G+VT     +  +TGC   S P C        EP     A P P
Sbjct: 174 CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC---SHPGC--------EP-----AYPTP 217

Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
           KC  +C + N  + + Q K Y ++   +  +P          GP   +F     T Y   
Sbjct: 218 KCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 270

Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
               +G VY     + I  +A VK++GWG  + G  YW + + +   +GD G   I RG 
Sbjct: 271 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGT 328

Query: 248 NEAIIESLVNGALP 261
           NE  IE      LP
Sbjct: 329 NECGIEDEPVAGLP 342


>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.016,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 2/60 (3%)

Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           S   +   +V+I+GWG E G  YW +++++ E +GD GT KI +G  +  I  +V GA P
Sbjct: 37  SGTFMGVHSVEIIGWGIEKGVDYWLVMNSWNEDWGDNGTFKIAQG--DCGINDMVLGAPP 94


>gi|170579333|ref|XP_001894785.1| Papain family cysteine protease containing protein [Brugia malayi]
 gi|158598509|gb|EDP36387.1| Papain family cysteine protease containing protein [Brugia malayi]
          Length = 324

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 8/115 (6%)

Query: 148 YGRGFFQDKYQIN---GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
           YG  F    YQI+       +F  +  P   A   +F  +Y    F  +G +      + 
Sbjct: 213 YGEIFINKLYQIDPDPNAMAWFVANVAPI--ALNLAFPKRYK---FYKSGVLPDTDECST 267

Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +      +++G+G ENG+ YW + +++GE +GD+G  K+ RG N   +E+ V  A
Sbjct: 268 MEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKMERGVNACKVETYVASA 322


>gi|403343435|gb|EJY71046.1| Papain family cysteine protease containing protein [Oxytricha
           trifallax]
          Length = 619

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 24/102 (23%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
           G +Y  +   + + +  V +VG+G ENG  +W + +++G  +G+ G ++++RG N   IE
Sbjct: 220 GGIYQDTTGDQNIVH-DVSVVGFGVENGTKFWVVRNSWGSHYGENGFVRVIRGVNNIAIE 278

Query: 254 SLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENG 295
           +    A P D +      ++ +    +       ++++R+NG
Sbjct: 279 TDCAWATPVDTWTNRVPHKTTDAEKND-----PKNDKYRKNG 315


>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
          Length = 462

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 2/53 (3%)

Query: 211 VKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V +VG+G +  NG  YW + +++G  +G+KG  +ILRG +E  IES+   A P
Sbjct: 406 VLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRILRGTDECAIESIAMAATP 458


>gi|303277733|ref|XP_003058160.1| cathepsin [Micromonas pusilla CCMP1545]
 gi|226460817|gb|EEH58111.1| cathepsin [Micromonas pusilla CCMP1545]
          Length = 583

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 24/33 (72%)

Query: 213 IVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
           +VGWG ENG  YW + +T+GE FG+KG  K+ R
Sbjct: 419 VVGWGVENGMKYWLVRNTYGEDFGEKGYFKLER 451


>gi|384249023|gb|EIE22506.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
          Length = 404

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
           V++ GWGEE+G P+W + +++G  +G+ G  +I RG N   +E   +    +  + +E  
Sbjct: 258 VEVTGWGEEHGVPFWIVRNSWGTFWGEMGFFRIERGINSLFLED-SDCWYAEPEHEMEDE 316

Query: 271 EESGERLSEEFGVRAESSEEFRE 293
            E GE +   +GV    SE+  E
Sbjct: 317 VEDGELVGSMYGVLDAKSEQGSE 339


>gi|281209002|gb|EFA83177.1| cathepsin Z precursor [Polysphondylium pallidum PN500]
          Length = 309

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 13/59 (22%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEF 269
           V IVGWGEENG  YW + +++G  +G++G  +I+RG              P +N G+E 
Sbjct: 249 VSIVGWGEENGESYWIVRNSWGMYYGEQGFFRIVRGS-------------PFENLGIEL 294


>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
          Length = 210

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 140 H 140
           +
Sbjct: 208 N 208


>gi|221505681|gb|EEE31326.1| cathepsin L, putative [Toxoplasma gondii VEG]
          Length = 733

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)

Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V IVGWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 656 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704


>gi|237838179|ref|XP_002368387.1| cathepsin C [Toxoplasma gondii ME49]
 gi|211966051|gb|EEB01247.1| cathepsin C [Toxoplasma gondii ME49]
 gi|221484340|gb|EEE22636.1| cathepsin C, putative [Toxoplasma gondii GT1]
          Length = 733

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)

Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V IVGWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 656 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704


>gi|70919569|gb|AAZ15654.1| cathepsin C1 [Toxoplasma gondii]
          Length = 730

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)

Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
           V IVGWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 653 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 701


>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 149

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 32/45 (71%)

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
           V  + ++V   T+KI+GWG E+G+ YW  ++++ E++GD G IK+
Sbjct: 79  VHTTGDLVGSHTLKIIGWGVESGQEYWLAMNSWNEEWGDHGLIKM 123


>gi|407399825|gb|EKF28451.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
           marinkellei]
          Length = 257

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 36/50 (72%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           V +VG+ +    PYWTI +++G+Q+G++G I+I +G N+ +++  V+ A+
Sbjct: 119 VLLVGYNDSAPVPYWTIKNSWGKQWGEEGYIRIAKGSNQCLVKDRVSSAV 168


>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
          Length = 426

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 55/200 (27%), Positives = 79/200 (39%), Gaps = 60/200 (30%)

Query: 91  WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 145
           WV++ GLVTGG      GC+P SF      PC+ A +  +E E +T       C  RC N
Sbjct: 223 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 269

Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFW--PAFWRSFCTKYTRPLFQTNGR-- 195
             Y + + +DK      Y +    +   P        P     F  K T  L  T  R  
Sbjct: 270 IYYQQKYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNV 329

Query: 196 ------VYAVSASA-------------------------EIVAYATVKIVGWGE-ENGRP 223
                 +Y  +  A                          IV +  V+++GWGE ++G+ 
Sbjct: 330 IKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESDDGQH 389

Query: 224 YWTIVSTFGEQFGDKGTIKI 243
           YW  V++FG  +GD G  KI
Sbjct: 390 YWLAVNSFGNHWGDNGIFKI 409


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 28/40 (70%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
            VK++GWGEE G PYW + +++ + +G+ G  ++ RG N+
Sbjct: 53  AVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSND 92


>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
          Length = 446

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           EI  +A V +VG+G + G  YW + +++G+ +G++G  +ILRG +E  IES+     P
Sbjct: 388 EITNHA-VLLVGYGADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444


>gi|37903264|gb|AAO64475.1| cathepsin K [Fundulus heteroclitus]
          Length = 191

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V  VG+G E G  YW I +++GE FGD+G IK+ R RN
Sbjct: 140 VLAVGYGTEKGEDYWLIKNSWGEHFGDEGYIKMARNRN 177


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           F + G  Y  S  ++ + +  V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 263 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARNR 319


>gi|58332124|ref|NP_001011214.1| cathepsin S, gene 2 precursor [Xenopus (Silurana) tropicalis]
 gi|56556518|gb|AAH87770.1| cathepsin S [Xenopus (Silurana) tropicalis]
          Length = 332

 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 28/39 (71%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           +V +VG+G +NG  YW + +++G  +GDKG IK+ R RN
Sbjct: 280 SVLVVGYGTDNGNDYWLVKNSWGAGYGDKGYIKMARNRN 318


>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
 gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 261

 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
           C+ G  S  W +  +RGLV+GG + S+ GC+P S PPC H    T  P
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPP 197


>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
          Length = 371

 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 28/38 (73%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V +VG+G +NG PYW I +++GE +G+ G ++ILR  N
Sbjct: 320 VLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRNHN 357


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score = 45.4 bits (106), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 5/55 (9%)

Query: 210 TVKIVGWGEE-----NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           +VKI GWGEE     + + YW   +++G+ +G+ G  +I RG NE  IE+ V G 
Sbjct: 452 SVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVVGV 506


>gi|308462797|ref|XP_003093679.1| hypothetical protein CRE_29188 [Caenorhabditis remanei]
 gi|308249543|gb|EFO93495.1| hypothetical protein CRE_29188 [Caenorhabditis remanei]
          Length = 353

 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 8/83 (9%)

Query: 168 PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA--EIVAYATVKIVGWGEENGRPYW 225
           P FGP       +F     + +      +Y+ S +   + +A  T+ I+G+G +NG+P+W
Sbjct: 263 PVFGPV------AFGMPVPKSIMYYKSGIYSPSPADCNQPIAAHTMSIIGYGIDNGKPFW 316

Query: 226 TIVSTFGEQFGDKGTIKILRGRN 248
           T+ +++G ++G+ G +++ RG N
Sbjct: 317 TVKNSWGPRWGENGYMRMARGSN 339


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 5/80 (6%)

Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG-----RPYWTIVSTFGEQFGDKG 239
           Y   +F+     Y   +     A  +V+I GWGEE       R YW   +++G+ +G+ G
Sbjct: 372 YKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDG 431

Query: 240 TIKILRGRNEAIIESLVNGA 259
             +I RG NE  IE+ V G 
Sbjct: 432 YFRIARGVNECDIETFVIGV 451


>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
            griseus]
          Length = 1632

 Score = 45.4 bits (106), Expect = 0.032,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 32/49 (65%)

Query: 214  VGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
            VG+GE++G PYW + +++G  +GDKG   I RG+N   + +  +  +P+
Sbjct: 1583 VGYGEKDGIPYWIVKNSWGTNWGDKGYFLIERGKNMCGLAACASYPIPQ 1631


>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 88

 Score = 45.4 bits (106), Expect = 0.033,   Method: Composition-based stats.
 Identities = 17/36 (47%), Positives = 28/36 (77%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           V+I+GWG E G  YW +++++ E++GD GT KI++G
Sbjct: 37  VEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG 72


>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
          Length = 345

 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 28/38 (73%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
           V +VG+GEE G PYW + +++G  +G+ G IKILR RN
Sbjct: 295 VLLVGYGEERGVPYWIVKNSWGPGWGEGGYIKILRNRN 332


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           F + G  Y  S  ++ + +  V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 267 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNR 323


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
           V  VG+G ENG PYWT+ +++G  FG+ G  +I RG     I  LV+ A
Sbjct: 552 VLTVGYGTENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTCGINRLVSTA 600


>gi|37903252|gb|AAO64474.1| cathepsin F [Fundulus heteroclitus]
          Length = 166

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 32/51 (62%)

Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
            V +VG+GE NG P+W I +++GE +G++G   + RG N   I  + + A+
Sbjct: 114 AVLLVGYGERNGTPFWAIKNSWGEDYGEQGYYYLYRGSNACGINKMCSSAV 164


>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
          Length = 320

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 29/39 (74%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           V +VG+G ENG  YW + +++ E +G+ G +K+LRG+NE
Sbjct: 270 VLVVGYGSENGVNYWLVKNSWAEDWGESGYLKLLRGQNE 308


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 4/68 (5%)

Query: 196 VYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
           VY     ++I ++A V+I+G+G   +E+  PYW + ++ G  +G++G   I+RG NE  I
Sbjct: 235 VYRHVYGSQISSHA-VEIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDI 293

Query: 253 ESLVNGAL 260
           ES V   L
Sbjct: 294 ESAVYSGL 301


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
           F + G  Y  S  ++ + +  V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 267 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNR 323


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 90/234 (38%), Gaps = 41/234 (17%)

Query: 5   TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
           T + IRD S             SC    AVA A+ ++   C    +       R  AG  
Sbjct: 12  TVTEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54

Query: 65  QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
             C  +       + C+ G     W +    G+V+         CQP  FP C H   ++
Sbjct: 55  MSCCDVCG-----FGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102

Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRS 180
               C       P C++ CT+       ++    Y ++G   +       GPF  +F   
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKIPLIKYRGNTSYVLSGEEPFKRELILNGPFEVSF--- 158

Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
             + Y   +  T G VY   A   +  +A V+IVGWGE NG PYW I +++  +
Sbjct: 159 --SVYADFVAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANSWNHE 208


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 28/39 (71%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           V +VG+G  NG+ YW + +++G  FG+ G  ++LRG+NE
Sbjct: 272 VLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGKNE 310


>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
          Length = 334

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 37/54 (68%), Gaps = 1/54 (1%)

Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
           VY   + ++ + +A V  VG+G  NG+ YW + +++G +FGDKG I+++R +N+
Sbjct: 269 VYDDPSCSQTINHA-VLAVGYGTLNGQDYWLVKNSWGVKFGDKGYIRMVRNKND 321


>gi|403355703|gb|EJY77438.1| Papain family cysteine protease containing protein [Oxytricha
           trifallax]
          Length = 617

 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 34/55 (61%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 265
           + IVG+G ENG  YW + +++G  +G+ G  +++RG N   +ES    A+P D +
Sbjct: 232 ISIVGYGVENGTKYWVVRNSWGTSWGESGFARVIRGINNLNLESDCAYAVPVDTW 286



 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
           T  +    G +++   +  I+ +  + + GWG  E     YW + +++G  +G+KG +KI
Sbjct: 528 TDKMHDYMGGIFSEKKAVPIINH-IISVAGWGLDEATNTEYWIVRNSWGTYWGEKGWMKI 586

Query: 244 LRGRNEAIIESLVNGALP 261
               +   IE+  NGA+P
Sbjct: 587 KMHSDNLAIETDCNGAIP 604


>gi|218139209|gb|ACK57788.1| cathepsin C [Litopenaeus vannamei]
          Length = 451

 Score = 45.1 bits (105), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 36/53 (67%), Gaps = 2/53 (3%)

Query: 211 VKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V +VG+GE+   G  YW++ +++GE++G+ G  +I RG +E  IES+   A+P
Sbjct: 397 VLLVGYGEDEATGEKYWSVKNSWGEEWGEDGYFRIRRGVDECAIESMAVEAVP 449


>gi|294893015|ref|XP_002774310.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239879603|gb|EER06126.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 81

 Score = 45.1 bits (105), Expect = 0.042,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 31/48 (64%)

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
           V  +  +V   ++KI+GWG E+G+ YW  V+++ E+ GD G IK+  G
Sbjct: 20  VHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSWNEESGDHGMIKLAVG 67


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 5/56 (8%)

Query: 210 TVKIVGWG---EENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
           +VK++GWG   ++NG+   +W   +++G+ +G+ G  +ILRG+NE  IE L+   L
Sbjct: 411 SVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466


>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
 gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
          Length = 205

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
           C+ G  S+ W +    GLVTGG + S+ GC+P S PPC H    T  P
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPP 195


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKI 243
           R      G VY      +I ++A V+I+G+G   +E   PYW + ++ G  +G++G   I
Sbjct: 252 RDFLYYRGGVYKHVYGIQISSHA-VEIIGYGTTDDEERIPYWIVKNSLGPNWGEEGYFNI 310

Query: 244 LRGRNEAIIESLVNGAL 260
           +RG NE  IES V   L
Sbjct: 311 VRGSNECDIESAVYSGL 327


>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
 gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
          Length = 207

 Score = 45.1 bits (105), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
           C+ G  S+ W +    GLVTGG + S+ GC+P S PPC H    T  P
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPP 195


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score = 45.1 bits (105), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 35/63 (55%)

Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
           V A+ + +    +KI+GWG E+   YW   +++G  +G +G  KI RG +E  IE  +  
Sbjct: 247 VHATGKQLGGHAIKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAA 306

Query: 259 ALP 261
            LP
Sbjct: 307 GLP 309


>gi|77744608|gb|ABB02268.1| cathepsin B [Ovis aries]
          Length = 76

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 26/40 (65%)

Query: 80  CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH 119
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H
Sbjct: 34  CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH 73


>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
          Length = 232

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%)

Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
           V  VG+GEENG+PYW + +++G Q+G  G   I RG+N   + +  +  +P
Sbjct: 180 VLAVGYGEENGKPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 230


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.134    0.438 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,266,296,325
Number of Sequences: 23463169
Number of extensions: 230236207
Number of successful extensions: 453287
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2343
Number of HSP's successfully gapped in prelim test: 439
Number of HSP's that attempted gapping in prelim test: 449832
Number of HSP's gapped (non-prelim): 3097
length of query: 298
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 157
effective length of database: 9,050,888,538
effective search space: 1420989500466
effective search space used: 1420989500466
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)