BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy1911
         (342 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
          Length = 125

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 66  KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
           KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE
Sbjct: 19  KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 78

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
           NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN
Sbjct: 79  NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 116



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/24 (100%), Positives = 24/24 (100%)

Query: 317 NCYNPSYESTYRFDLKKGKKAHMV 340
           NCYNPSYESTYRFDLKKGKKAHMV
Sbjct: 1   NCYNPSYESTYRFDLKKGKKAHMV 24


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 81/155 (52%), Positives = 108/155 (69%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW +C SLR I DQ NCGSCWAVSVA A +DRLCIASN  + G IS++ ++
Sbjct: 92  VPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELM 151

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG+P  AW F   +G+VTGGDY+S +GCQPY +APCEHH++G   NC+ 
Sbjct: 152 SCCSYCGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHMEGSKPNCSA 211

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C+  C + S    Y+ D +KGK A++V
Sbjct: 212 SPTEPTPACETTCTHGS-SLAYQKDRQKGKSAYLV 245



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 52/97 (53%), Positives = 64/97 (65%), Gaps = 4/97 (4%)

Query: 66  KKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
           K A++VP      Q  I+++GP+VA F VY DF  YKSGVY +H      G HAV+V+GW
Sbjct: 240 KSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGW 299

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G +N +PYWLV NSW+  WGD G FKI RG NE D E
Sbjct: 300 GEQNGLPYWLVQNSWDYDWGDKGLFKIARG-NECDFE 335


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/192 (46%), Positives = 115/192 (59%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
           G N  D++  +  R+     +   L  M  Q  +GL  P+NFDARE+WP CP+L+ I DQ
Sbjct: 43  GHNFRDVDYSYVKRLCGTFLKGPKLPVM-VQYTEGLKLPKNFDAREQWPNCPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI SN   + +IS+Q ++ C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
            +G+VTGG YNS  GC+PYT+ PCEHHV G    CT  G   TP C   C  P Y   Y+
Sbjct: 162 TDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPNCDMKC-EPGYSPLYK 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 EDKHFGKTSYSV 231



 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 73/99 (73%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K ++ VP  +   M +++++GP+ A F+VY DFL YKSGVYQH  G ++G HA+++L
Sbjct: 223 HFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKIL 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN +PYWL ANSWN  WGD+G FKILRGE+   IE
Sbjct: 283 GWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 90/192 (46%), Positives = 114/192 (59%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
           G N  D++  +  R+     +   L  M  Q A  L  P NFDARE+WP CP+L+ I DQ
Sbjct: 43  GHNFHDVDYSYVKRLCGTLLKGPRLPVM-VQYADDLKLPTNFDAREQWPNCPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI SN   + +ISAQ ++ C   C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLLTCCDGCGMGCNGGYPSAAWDFWS 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
            +G+VTGG YNS  GC+PYT+ PCEHHV G    CT  G   TP C  +C  P Y  +Y+
Sbjct: 162 SDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPNCDMSC-EPGYSPSYK 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 QDKHFGKTSYSV 231



 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 56/108 (51%), Positives = 77/108 (71%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+ K ++ VP  + + M+++Y++GP+   F+VY DFL YKSGVYQH  G +
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWG EN +PYWL ANSWN  WGD+G FKILRGE+   IE
Sbjct: 274 LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 89/192 (46%), Positives = 116/192 (60%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
           G N  D++ G+   +     +   L  M  Q+A G  LP+ FDARE+WPECP+L+ I DQ
Sbjct: 43  GHNFHDVDYGYVKNLCGTLLKGPKLPIM-VQSAGGMKLPKQFDAREQWPECPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI + G  + +IS+Q ++ C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRICIHTKGKVSVEISSQDLLTCCDSCGMGCNGGYPANAWEFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+VTGG YNS  GC+PYT+ PCEHHV G    CT  G   TPEC   C    Y  +Y+
Sbjct: 162 EQGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGG-DTPECVTQC-EAGYTPSYQ 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 KDKHYGKTSYGV 231



 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 56/108 (51%), Positives = 70/108 (64%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ VP      Q  IY++GP+   F VY DF  YKSGVYQH  G +
Sbjct: 214 YTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA++++GWG EN +PYWL ANSWN  WGD+G FKILRG N   IE
Sbjct: 274 LGGHAIKMIGWGEENGVPYWLCANSWNTDWGDNGFFKILRGSNHCGIE 321


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 86/192 (44%), Positives = 115/192 (59%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
           G N  D++  +  ++     +   L  M  Q  +GL  P+NFDARE+WP CP+L+ I DQ
Sbjct: 43  GHNFRDVDYSYVKKLCGTFLKGPKLPVM-VQYTEGLKLPKNFDAREQWPNCPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI S+   + +IS+Q ++ C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDFWA 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+VTGG YNS  GC+PYT+ PCEHHV G    C+  G   TP C   C  P Y  +Y+
Sbjct: 162 TEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGG-DTPNCDMKC-EPGYSPSYK 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 QDKHFGKTSYSV 231



 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 55/108 (50%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+ K ++ VP  + + M +++++GP+   F+VY DFL YKSGVYQH  G  
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSP 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWG EN +PYWL ANSWN  WGD+G FKILRGE+   IE
Sbjct: 274 VGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIE 321


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 87/192 (45%), Positives = 115/192 (59%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
           G N  +++  +  R+     +   L  M  Q A GL  P  FDARE+WPECP+L+ I DQ
Sbjct: 43  GHNFHNVDYSYVRRLCGTMLKGPKLPIM-VQYAGGLKLPAEFDAREQWPECPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI S G  + +IS++ ++ C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDSCGMGCNGGYPSSAWDFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+V+GG YNS  GC+PYT++PCEHHV G    CT  G   TPEC   C    Y  +Y+
Sbjct: 162 KEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGG-DTPECISRC-EAGYSPSYK 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 QDKHYGKSSYSV 231



 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 55/108 (50%), Positives = 69/108 (63%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ V         +I ++GP+   F+VY DF+ YKSGVYQH  G  
Sbjct: 214 YSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSV 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA++VLGWG E+ IPYWL ANSWN  WGD+G FKILRG N   IE
Sbjct: 274 LGGHAIKVLGWGEEDGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIE 321


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP+CP+L  I DQ +CGSCWA     A++DR C  SNG      S++ ++
Sbjct: 83  LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSSEDLL 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G    C+ 
Sbjct: 143 SCCPICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMPCS- 201

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C++NC N  Y   Y+ D + GK  + V
Sbjct: 202 -GDTKTPKCQKNCEN-GYNVMYKKDKRYGKHVYSV 234



 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/81 (64%), Positives = 65/81 (80%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y++GP+   F+VYAD L YKSGVY+H  GD++G HA+++LGWGVEND  YWLVANSWN
Sbjct: 244 ELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNKYWLVANSWN 303

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGEN   IE
Sbjct: 304 TDWGDNGFFKILRGENHCGIE 324


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 78/155 (50%), Positives = 102/155 (65%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFD+RE+WP CP+L+ I DQ +CGSCWA   A AISDRLCI SNG  + +IS++ ++
Sbjct: 79  LPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PYT+ PCEHHV G    CT 
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C   C    Y  +Y+ D   GK ++ V
Sbjct: 199 EGG-DTPQCILQC-ESGYTPSYKADKHYGKSSYSV 231



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 5/92 (5%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ VP      Q  IY++GP+   F+VY DFL YK+GVYQH  G +
Sbjct: 214 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
           +G HA++   W  E       + +S  D WGD
Sbjct: 274 VGGHAIK--SWLGEEVCSLLALCHSDTD-WGD 302


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+RE+WP CP++R I DQ +CGSCWA     A+SDR+CIAS G    + SA+ +V
Sbjct: 83  LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLV 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W H G+V+GG + S  GCQPY +APCEHHV G   +C  
Sbjct: 143 SCCHTCGFGCNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEG 202

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C + SY   Y  D + G K++ +
Sbjct: 203 EGG-KTPKCVKKCQD-SYTVPYAKDKRYGSKSYSI 235



 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 67/98 (68%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y  K++ +PR      ++I  +GP+   F+VY D L YK GVYQH  G  +G HA+R+LG
Sbjct: 228 YGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILG 287

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGVEN+  YWL+ANSWN  WGD+G FKILRGE+   IE
Sbjct: 288 WGVENNTKYWLIANSWNSDWGDNGFFKILRGEDHLGIE 325


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR +WP CPS+  I DQS+CGSCWA     A+SDR+CI S G     +SA+++V
Sbjct: 94  LPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAENLV 153

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+P  AW +W + G+VTG  YN+  GCQPY   PCEHHV GPL +C  
Sbjct: 154 SCCSSCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHVIGPLPSCD- 212

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G ++TP CK NC  P Y   Y  D   G+K + +
Sbjct: 213 -GDVETPSCKTNC-QPGYNIPYEKDKWYGEKVYRI 245



 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 52/87 (59%), Positives = 63/87 (72%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 253 MLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 312

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD G FKI+RG+NE  IE   N
Sbjct: 313 WNSDWGDKGYFKIVRGKNECGIESDVN 339


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 108/175 (61%), Gaps = 9/175 (5%)

Query: 172 EDDDLETMGCQNAK-----GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISD 226
           ED+   T+  +  K     GLP NFD R+KWP+CP+L  + DQ +CGSCWA     A++D
Sbjct: 63  EDEHFATLPIKTHKIDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTD 122

Query: 227 RLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ 285
           R+C  SNG      SA+ +++C P C  GC+GG P+LAW +W H G+V+GG YNS +GC+
Sbjct: 123 RVCTYSNGTKHFHFSAEDLLSCCPICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCR 182

Query: 286 PYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           PY + PCEHHV G    C+  G  KTP+C + C    Y+  Y+ D + GK  + V
Sbjct: 183 PYEIPPCEHHVPGNRMPCS--GDTKTPKCTKKC-ESGYDVNYKQDKQYGKHVYTV 234



 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 65/81 (80%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+   F+VY+D L YKSGVY+H  GD++G HAV++LGWGVEND  YWL+ANSWN
Sbjct: 244 ELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNKYWLIANSWN 303

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 304 SDWGDNGFFKILRGEDHCGIE 324


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 99/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKWP C S+  I DQSNCGSCWA   A AISDR+CIAS G    +IS + +V
Sbjct: 88  LPESFDAREKWPYCSSIAEIRDQSNCGSCWAFGAAGAISDRICIASGGKHQPRISPEDLV 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GC GG+P  AW +W  NG+VTG  YN+ + C+PY+  PCEHHV GP + CT 
Sbjct: 148 DCCADCGMGCQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCT- 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C + C  P Y  TY  D   G KA+ +
Sbjct: 207 -GDPTTPQCVKKC-QPEYPKTYENDKWYGLKAYSI 239



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/87 (57%), Positives = 58/87 (66%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR +  +GPL   F VYADF  Y SGVY+H  G  +G HAVR++GWGVE+   YWL+ANS
Sbjct: 247 MRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD G FKI RG NE  IE   N
Sbjct: 307 WNTDWGDGGYFKIRRGVNECGIESDAN 333


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 78/156 (50%), Positives = 102/156 (65%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+WP CP+L+ I DQ +CGSCWA   A AISDR+CI SN   + +IS++ ++
Sbjct: 79  LPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PYT+APCEHHV G   +CT 
Sbjct: 139 TCCMSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEHHVNGSRPSCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            G   TP+C   C    Y  +Y+ D   GK ++ VL
Sbjct: 199 EGG-DTPQCITKC-EAGYTPSYKEDKHFGKTSYTVL 232



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/108 (48%), Positives = 70/108 (64%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+ K ++ V       Q  I+++GP+   F VY DF+ YKSGVYQH  G +
Sbjct: 214 YTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWGVE+ +PYWL ANSWN  WGD+G FK LRG +   IE
Sbjct: 274 VGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIE 321


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 78/156 (50%), Positives = 101/156 (64%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+RE+WP CP+L+ I DQ +CGSCWA   + AISDRLCI SN   + +ISA+ ++
Sbjct: 79  LPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKVSVEISAEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PYT+ PCEHHV G    CT 
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            G   TP+C   C    Y  +YR D   GK ++ VL
Sbjct: 199 EGG-DTPQCLSQC-EAGYTPSYREDKHYGKTSYSVL 232



 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 56/108 (51%), Positives = 71/108 (65%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ V    A  Q  IY++GP+   F+VY DF+ YKSGVYQH  G +
Sbjct: 214 YTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA++VLGWG EN +PYWL ANSWN  WGD+G FK LRG +   IE
Sbjct: 274 VGGHAIKVLGWGEENGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIE 321


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP+CP+L  I DQ +CGSCWA     A++DR+CI SN       SA+ +V
Sbjct: 44  LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 103

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G    C  
Sbjct: 104 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN- 162

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C++ C + SY   ++ D + GK  + V
Sbjct: 163 -GDTKTPKCEKTCES-SYTVPFKKDKRYGKHVYSV 195



 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 64/85 (75%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N   +++++GP+   F+VY+D L YKSGVYQH  G+++G HA+++LGWGVEN   YWL+A
Sbjct: 201 NIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGVENGSKYWLIA 260

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN  WGD+G  KILRGE+   IE
Sbjct: 261 NSWNSDWGDNGFLKILRGEDHCGIE 285


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 77/155 (49%), Positives = 102/155 (65%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR +WP+CP+L+ + DQ +CGSCWA   A AISDRLCI SNG    +ISA+ ++
Sbjct: 79  LPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+P  AW FW  +G+V+GG Y+S  GC+PY++APCEHHV G    CT 
Sbjct: 139 SCCDSCGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRPPCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C + C    Y   Y  D   GK ++ V
Sbjct: 199 EGG-DTPQCTKKC-EAGYTPGYTQDKHYGKLSYSV 231



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 58/114 (50%), Positives = 72/114 (63%), Gaps = 2/114 (1%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQ 105
           KK +  Y P      HY K ++ V       Q  IY++GP+   F+VY DFL YK+GVYQ
Sbjct: 208 KKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQ 267

Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           H  G ++G HA++VLGWG EN  PYWL ANSWN  WGD+G FKILRG +   IE
Sbjct: 268 HVTGSAVGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIE 321


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 113/192 (58%), Gaps = 6/192 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGL--PRNFDAREKWPECPSLRHIADQ 209
           G N  D++  +  R+     +   L  M  Q A GL  P  FD+RE+WPECP+L+ I DQ
Sbjct: 43  GHNFRDVDYSYVRRLCGTMLKGPKLPIM-VQYAGGLKLPAQFDSREQWPECPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI S    + +IS++ ++ C   C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLLTCCDACGMGCNGGYPSAAWDFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+V+GG YNS  GC+PYT+ PCEHHV G   +C+  G   TP+C  +C    Y  TY 
Sbjct: 162 KEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCSGEGG-DTPKCVHSC-EAGYSPTYT 219

Query: 329 FDLKKGKKAHMV 340
            D   GK ++ V
Sbjct: 220 KDKHYGKSSYSV 231



 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 53/108 (49%), Positives = 69/108 (63%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY K ++ V         +I ++GP+   F VY DF+ YKSGVYQH  G +
Sbjct: 214 YSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA++VLGWG E+ +PYWL ANSWN  WG++G FKILRG +   IE
Sbjct: 274 LGGHAIKVLGWGEEDGVPYWLCANSWNTDWGENGFFKILRGSDHCGIE 321


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 83/191 (43%), Positives = 110/191 (57%), Gaps = 4/191 (2%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R ++ +DI        + N      L T    +   LP+ FDAR+ WP CPS+  I DQS
Sbjct: 56  RFKSVSDIRRMLGALPDPNGGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEIRDQS 115

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G     +SA+++VAC  +C  GCNGG+P  AW +W  
Sbjct: 116 SCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKR 175

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
           +G+VTG  YN+ +GCQPY   PCEHHV GP  +C   G ++TP+CK  C  P Y   Y  
Sbjct: 176 SGIVTGDLYNTTDGCQPYEFPPCEHHVVGPRPSCG--GDVETPKCKTTC-QPGYNIPYNK 232

Query: 330 DLKKGKKAHMV 340
           D   GK  + V
Sbjct: 233 DKWYGKTVYRV 243



 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 53/90 (58%), Positives = 65/90 (72%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN +PYWL+ANS
Sbjct: 251 MKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANS 310

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           WN  WGD+G FKI+RG NE  IE   N  +
Sbjct: 311 WNSDWGDNGYFKIIRGRNECGIESDVNAGI 340


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 78/155 (50%), Positives = 100/155 (64%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR++WP CP+L+ I DQ +CGSCWA   A AISDR+CI SNG    +IS++ ++
Sbjct: 79  LPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNGKVNVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW FW   G+V+GG Y S  GC+PYT+APCEHHV G    CT 
Sbjct: 139 TCCDSCGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEHHVNGSRPPCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TPEC + C    Y  +Y  D   GK ++ V
Sbjct: 199 EGG-DTPECVRQC-ESGYTPSYIQDKHYGKTSYSV 231



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 58/108 (53%), Positives = 72/108 (66%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+ I   HY K ++ VP      Q  IY++GP+   F+VY DFL YK+GVYQH  G +
Sbjct: 214 YTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSA 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA++VLGWG EN  PYWL ANSWN  WGD+G FKILRG +   IE
Sbjct: 274 VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIE 321


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD RE+WP CP+L+ I DQ NCGSCWA   A AISDR+CI S G  + +ISA+ ++
Sbjct: 79  LPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GC GG+P  AW FW + G+VTGG ++S+ GC+PYTLAPCEHHV G    C  
Sbjct: 139 TCCDECGMGCFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEHHVNGSRPPCQ- 197

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+++TP+C   C N  Y  +Y  D   G++++ +
Sbjct: 198 -GEVETPKCVTQCNN-GYSLSYPKDKHFGQRSYSI 230



 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 56/99 (56%), Positives = 73/99 (73%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ ++++ +P  +   M ++Y++GP+ A FSVYADFL YK+GVYQH  GD +G HAV++L
Sbjct: 222 HFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKIL 281

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN  PYWLVANSWN  WGD G FKI RG +E  IE
Sbjct: 282 GWGEENGTPYWLVANSWNSDWGDKGFFKIKRGNDECGIE 320


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  160 bits (405), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 76/157 (48%), Positives = 104/157 (66%), Gaps = 3/157 (1%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           + LP NFDAR +WP CP+++ + DQ +CGSCWA     A+SDR+CIASNG    +ISA+ 
Sbjct: 93  RDLPTNFDARTQWPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAEISAED 152

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           ++AC  +C  GC GG+P  AWR++   G+VTGG YNS +GCQPY +  C+HHV G LQ C
Sbjct: 153 LLACCSSCGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQPC 212

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               + KTP+C + C   +Y  TY+ D   GK ++ V
Sbjct: 213 P-KEEAKTPKCSKKC-EANYNVTYKDDKHYGKNSYSV 247



 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/123 (48%), Positives = 77/123 (62%), Gaps = 1/123 (0%)

Query: 38  KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADF 96
           K++ K  K  KK +  Y  T     HY K ++ V      M +I  +GP+ A F+VY DF
Sbjct: 214 KEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSVEKIMTEIMTNGPVEAAFTVYEDF 273

Query: 97  LQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
           L YKSGVYQH  G  +G HAV++LGWG +N  PYW+VANSWN  WG+ G F ILRG++E 
Sbjct: 274 LSYKSGVYQHRTGQELGGHAVKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDEC 333

Query: 157 DIE 159
            IE
Sbjct: 334 GIE 336


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 85/189 (44%), Positives = 113/189 (59%), Gaps = 19/189 (10%)

Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
           D+     NR  A  +EDD+           +P +FDAR  WP C S+RHI DQ+NCGSCW
Sbjct: 72  DLRFVNQNRKPAVENEDDE--------GDDIPESFDARTHWPNCTSIRHIRDQANCGSCW 123

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
           AVS A+A+SDR+CI SNG     IS+   V+C  +C +GC+GGWP LA+ F+ + G VTG
Sbjct: 124 AVSTASALSDRICIESNGETQMHISSIDFVSCCESCGYGCDGGWPILAFDFYTYEGAVTG 183

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK----LKTPECKQNCYNPSYESTYRFDL 331
           GDY S++GC+PY   PC HH      N T  G+     KTP+C++ C   SY+  Y  D 
Sbjct: 184 GDYGSKDGCRPYPFHPCGHH-----GNDTYYGECPKGAKTPKCRRRC-QRSYKKAYYMDK 237

Query: 332 KKGKKAHMV 340
             G+ A+ V
Sbjct: 238 SYGEDAYEV 246



 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 82/125 (65%), Gaps = 7/125 (5%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYA 94
           K  K +++ ++  KK  Y+  S     Y + A+ VP       R+I ++GP+V  F+VY 
Sbjct: 217 KTPKCRRRCQRSYKKAYYMDKS-----YGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYE 271

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DF  YK G+Y+H  G + G HA++++GWGVEND+PYWL+ANSW++ WG+ G F+++RG N
Sbjct: 272 DFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGIN 331

Query: 155 EADIE 159
           E  IE
Sbjct: 332 ECGIE 336


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 85/189 (44%), Positives = 113/189 (59%), Gaps = 19/189 (10%)

Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
           D+     NR  A  +EDD+           +P +FDAR  WP C S+RHI DQ+NCGSCW
Sbjct: 72  DLRFVNQNRKPAVENEDDE--------GDDIPESFDARTHWPNCTSIRHIRDQANCGSCW 123

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
           AVS A+A+SDR+CI SNG     IS+   V+C  +C +GC+GGWP LA+ F+ + G VTG
Sbjct: 124 AVSTASALSDRICIESNGETQMHISSIDFVSCCESCSYGCDGGWPILAFDFYTYEGAVTG 183

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK----LKTPECKQNCYNPSYESTYRFDL 331
           GDY S++GC+PY   PC HH      N T  G+     KTP+C++ C   SY+  Y  D 
Sbjct: 184 GDYGSKDGCRPYPFHPCGHH-----GNDTYYGECPKGAKTPKCRRRC-QRSYKKAYYMDK 237

Query: 332 KKGKKAHMV 340
             G+ A+ V
Sbjct: 238 SYGEDAYEV 246



 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 82/125 (65%), Gaps = 7/125 (5%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYA 94
           K  K +++ ++  KK  Y+  S     Y + A+ VP       R+I ++GP+V  F+VY 
Sbjct: 217 KTPKCRRRCQRSYKKAYYMDKS-----YGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYE 271

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DF  YK G+Y+H  G + G HA++++GWGVEND+PYWL+ANSW++ WG+ G F+++RG N
Sbjct: 272 DFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGIN 331

Query: 155 EADIE 159
           E  IE
Sbjct: 332 ECGIE 336


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 75/158 (47%), Positives = 103/158 (65%), Gaps = 5/158 (3%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
            + LP +FDARE WP CPS+R I DQ +CGSCWA   A A+SDR+CI +N      ISA+
Sbjct: 80  TEALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNV--NISAE 137

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           ++++C  +C +GCNGG+P  AW++W   G+V+GG Y S  GCQPY + PCEHHV G  Q 
Sbjct: 138 NLLSCCYSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQP 197

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C   G  +TP+C + C N +Y   Y  DL  G+ ++ +
Sbjct: 198 CAEGG--RTPKCHRTCENENYSVPYDKDLSFGRSSYSI 233



 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+ A FSVY+DF+  KSGVY+H  G  +G HA+R+LGWGVE   PYWLVANSWN
Sbjct: 243 EIMDNGPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGVEKGTPYWLVANSWN 302

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD GTFKILRG +   IE
Sbjct: 303 TDWGDKGTFKILRGSDHCGIE 323


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 83/191 (43%), Positives = 109/191 (57%), Gaps = 4/191 (2%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R ++ +DI        + N      L T    +   LP+ FDAR+ WP CPS+  I DQS
Sbjct: 56  RFKSVSDIRRMLGALPDPNGGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQS 115

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G     +SA+++VAC  +C  GCNGG+P  AW +W  
Sbjct: 116 SCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCSSCGMGCNGGFPHSAWSYWKR 175

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
           +G+VTG  YN  +GCQPY   PCEHHV GP  +C   G ++TP+CK  C  P Y   Y  
Sbjct: 176 SGIVTGDLYNPTDGCQPYEFPPCEHHVVGPRPSCE--GDVETPKCKTTC-QPGYNIPYNK 232

Query: 330 DLKKGKKAHMV 340
           D   GK  + V
Sbjct: 233 DKWYGKTVYRV 243



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 54/90 (60%), Positives = 65/90 (72%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ EHGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN +PYWL+ANS
Sbjct: 251 MKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANS 310

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           WN  WGD+G FKI+RG NE  IE   N  +
Sbjct: 311 WNSDWGDNGYFKIIRGRNECGIESDVNAGI 340


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 74/151 (49%), Positives = 95/151 (62%), Gaps = 8/151 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR C  SNG      SA+ ++
Sbjct: 84  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C  
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 202

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
            G  KTP+C++ C     ES Y  D +K K+
Sbjct: 203 -GDSKTPKCEKTC-----ESNYNVDYRKDKR 227



 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+   F+VY+D L YK+GVY+H  GD++G HAV++LGWGVEN   YWL+ANSWN
Sbjct: 245 ELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIANSWN 304

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 305 SDWGDNGFFKILRGEDHCGIE 325


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 74/151 (49%), Positives = 95/151 (62%), Gaps = 8/151 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR C  SNG      SA+ ++
Sbjct: 84  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C  
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 202

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
            G  KTP+C++ C     ES Y  D +K K+
Sbjct: 203 -GDSKTPKCEKTC-----ESNYNVDYRKDKR 227



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+   F+VY+D L YK+GVY+H  GD++G HAV++LGWGVEN   YWL+ANSWN
Sbjct: 245 ELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIANSWN 304

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 305 SDWGDNGFFKILRGEDHCGIE 325


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 74/160 (46%), Positives = 105/160 (65%), Gaps = 4/160 (2%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           ++   LP++FD+R+KW  CPS+R I DQ +CGSCW+     +I+DR+CI SNG     IS
Sbjct: 77  EDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGAVESITDRICIHSNGKVKVHIS 136

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+ ++ C  +C  GCNGG+   AW +W +NG+VTGG Y+S +GCQPY +  CEHHV+GP 
Sbjct: 137 AEDLMTCCTSCGMGCNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPF 196

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           + C    +L TP+C Q C  P Y  T+  D   GKK++ +
Sbjct: 197 KACG--KELPTPKCSQKC-QPGYNKTFNQDKHFGKKSYSI 233



 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 70/99 (70%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK++ +        ++I  +GP+ A F+VYADF  YKSGVYQH  G  +G HAV++L
Sbjct: 225 HFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKIL 284

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN+ PYWL+ANSWN  WGD G FKI+RG++E  IE
Sbjct: 285 GWGTENNTPYWLIANSWNPTWGDKGYFKIIRGKDECGIE 323


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 78/155 (50%), Positives = 99/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKWP C S+  I DQS CGSCWA   A A+SDR+CI S G     ISA+ ++
Sbjct: 85  LPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDLL 144

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG P  AW +W  +G+VTGG Y + +GC+PY+LAPCEHH +G L NCT 
Sbjct: 145 DCCDSCGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCT- 203

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G + TP+C   C    Y   Y+ D   GKK + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQDDKHFGKKVYSI 236



 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 59/109 (54%), Positives = 73/109 (66%), Gaps = 2/109 (1%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK + +       Q  I+++GP+ A F V ADFL YKSGVYQH+  D IG HA+R+L
Sbjct: 228 HFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRIL 287

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
           GWG EN  PYWL ANSWN+ WGDHG FKILRG++E  IE   N  +  N
Sbjct: 288 GWGTENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAGIPKN 336


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR +WP CP+++ I DQ +CGSCWA   A AISDR CI SNG  + +ISA+ ++
Sbjct: 79  LPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEISAEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W  +G+VTGG Y S  GC+PY++APCEHHV G    CT 
Sbjct: 139 SCCDACGMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEHHVNGTRPPCT- 197

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+  TP+C   C N  Y  +Y+ D + GK+ + V
Sbjct: 198 -GEGDTPKCVSEC-NAGYTPSYKKDKRFGKQTYSV 230



 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+      + K+ + VP      M ++Y++GP+ A FSVY DFL YK+GVYQH  G  
Sbjct: 213 YTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQM 272

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWG EN+ PYWLVANSWN  WGD+G FKILRG++E  IE
Sbjct: 273 LGGHAIKILGWGKENNTPYWLVANSWNTDWGDNGFFKILRGKDECGIE 320


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 100/155 (64%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+RE+WP CP++  I DQ +CGSCWA   A A+SDR CI SNG    +ISA+ ++
Sbjct: 83  LPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDLL 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW +W   G+VTGG YNS  GCQPYT+A CEHH +G L  C  
Sbjct: 143 TCCDSCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGD 202

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  + TP+C   C    Y  +YR D   GKK++ +
Sbjct: 203 I--VDTPQCVHMC-EKGYNVSYRADKYFGKKSYSI 234



 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 62/81 (76%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A F+VYADF+ YKSGVY+H  G+ +G HAVR+LGWG E+  PYWLVANSWN
Sbjct: 244 EISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGTESGTPYWLVANSWN 303

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKILRG +E  IE
Sbjct: 304 TDWGDKGYFKILRGSDECGIE 324


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 100/155 (64%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP CP++R I DQ +CGSCWA     A+SDR+CI S G    ++SA+ +V
Sbjct: 83  LPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLV 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GCQPY ++PCEHHV G    C  
Sbjct: 143 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCN- 201

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+ KTP+C + C   SY   Y  D   GK ++ +
Sbjct: 202 -GEGKTPKCVKKC-QASYNVPYAKDKFFGKSSYSI 234



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++++ +GP+   F+VY D L YK GVYQH  G  +G HA+R+LGWGVEND  +WL+ANSW
Sbjct: 243 KELFTNGPVEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTKFWLIANSW 302

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FKILRG +   IE
Sbjct: 303 NSDWGDNGYFKILRGSDHLGIE 324


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 59  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
            G+VTG  YN+  GCQPY   PCEHH  GPL  C   G ++TP CK+ C   YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235



 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/58 (60%), Positives = 45/58 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+A
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIA 311


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 89/212 (41%), Positives = 113/212 (53%), Gaps = 19/212 (8%)

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGF-----NNRVEANSSEDDDLETMGCQNAKGLPR 189
           NS N  W  H  F       E    MG      N R+   S ED D+E         +P 
Sbjct: 45  NSLNTTWKAHRNFGNDIPLREIKKLMGVRRSLENFRLPEKSMEDIDIE---------IPE 95

Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
            FD RE+WPECP+L+ I DQ +CGSCWA     A+SDR+CI S G      SA+ ++ C 
Sbjct: 96  EFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCC 155

Query: 250 PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGK 308
            +C +GCNGG P  AW +W   G+V+GG YNS +GCQPY + PCEHHV G  + C   G+
Sbjct: 156 SSCGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC---GE 212

Query: 309 LKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             TP C + C    Y+  Y  D   GK A+ V
Sbjct: 213 GDTPRCVKRC-EEGYDVPYGKDRHFGKSAYAV 243



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 53/103 (51%), Positives = 71/103 (68%), Gaps = 2/103 (1%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K A+ VP       +++  +GP  A  +VY DFL Y++GVYQH  G ++G HAVR+L
Sbjct: 235 HFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLL 294

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
           GWGVE+  PYWL+ANSWN  WGD+G F+ILRG++E  IE   N
Sbjct: 295 GWGVEDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIESDIN 337


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 95/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR C  SNG      SA+ ++
Sbjct: 87  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C  
Sbjct: 147 SCCPVCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCN- 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C   SY   Y  D + GK  + V
Sbjct: 206 -GDSKTPKCHKTC-ESSYNVDYHKDKRYGKHVYSV 238



 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y++GP+   F+VY+D L YK+GVY+H  G+++G HA+++LGWGVEN   YWL+ANSWN
Sbjct: 248 ELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYWLIANSWN 307

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 308 SDWGDNGFFKILRGEDHCGIE 328


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 105/155 (67%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR+ WP C S+  I DQ +CGSCWA     A+SDR+CI SNG     +SA+++V
Sbjct: 81  LPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLV 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC+GG+P  AW +W + G+V+GG+Y S++GCQPY++APCEHHV GP   C+ 
Sbjct: 141 SCCDSCGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEHHVPGPRPACS- 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+  TP+C+  C   S  S Y  DL  G+ A+ +
Sbjct: 200 -GEGSTPDCRNQCDKRSGIS-YDKDLYYGESAYSL 232



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 49/82 (59%), Positives = 64/82 (78%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+ A F+VY D + YK GVYQH  G  +G HA+++LGWGVEND PYWLVANSWN
Sbjct: 242 EILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTPYWLVANSWN 301

Query: 139 DHWGDHGTFKILRGENEADIEM 160
             WG++G FKILRG++E  IE+
Sbjct: 302 TDWGNNGFFKILRGKDECGIEI 323


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD R+KWPEC +L  I DQ +CGSCWA     A++DR+CI SN       SA+ +V
Sbjct: 43  LPEIFDPRDKWPECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 102

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG+YNS +GC+PY + PCEHHV G    C  
Sbjct: 103 SCCPICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCN- 161

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C++NC   SY   ++ D + GK  + V
Sbjct: 162 -GDTKTPKCQKNC-ESSYNVPFKKDKRYGKHVYSV 194



 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 44/80 (55%), Positives = 65/80 (81%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+ A F+VY+D L YK+GVY+H  G+++G HA++++GWGVEN+  YWL+ANSWN
Sbjct: 204 ELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIANSWN 263

Query: 139 DHWGDHGTFKILRGENEADI 158
             WGD+G FKILRGE+   I
Sbjct: 264 SDWGDNGFFKILRGEDHCGI 283


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 74/158 (46%), Positives = 104/158 (65%), Gaps = 5/158 (3%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
            + +P  FDARE WP CPS+R I DQ +CGSCWA   A A+SDR+CI ++      ISA+
Sbjct: 79  TESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNV--NISAE 136

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           ++++C   C +GCNGG+P  AWRFW + G+V+GG Y S +GCQPY + PCEHHV G  + 
Sbjct: 137 NLLSCCYTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKP 196

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C   G  +TP+C + C N +Y  +Y  DL  G+ ++ +
Sbjct: 197 CAEGG--RTPKCHKTCDNKNYPISYEKDLSFGRSSYSI 232



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
            I  +GP+ A FSVY+DF+ YKSGVY+H  G  +G HA+R+LGWG+E   PYWLVANSWN
Sbjct: 242 DIMTNGPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGMEKGTPYWLVANSWN 301

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+GTFKILRG +   IE
Sbjct: 302 TDWGDNGTFKILRGSDHCGIE 322


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  I DQ +CGSCWA     A++DR C  SNG      SA+ ++
Sbjct: 82  LPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 141

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P  AW +W H G+V+GG+YNS +GC PY + PCEHHV G    C  
Sbjct: 142 SCCPVCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIPCN- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+  TP+C ++C    Y ++Y+ D K GK  + V
Sbjct: 201 -GETSTPKCHRSC-RKEYTNSYKSDKKYGKHVYSV 233



 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I+++GP+   F+VYAD L YKSGVY+H  G+++G HA++++GWGVEN   YWL+ANSWN
Sbjct: 243 EIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGVENGNKYWLIANSWN 302

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 73/147 (49%), Positives = 96/147 (65%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+RE+WP CP++R I DQ +CGSCWA     A+SDR+C+AS G    + SA+ +V
Sbjct: 85  LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCVASGGKIHFRFSAEDLV 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG + S  GCQPY +APCEHHV G   +C  
Sbjct: 145 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEHHVNGTRPSCEG 204

Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
            G  KTP+C + C   YN  Y+   RF
Sbjct: 205 EGG-KTPKCVKKCQESYNVPYQKDKRF 230



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/82 (58%), Positives = 59/82 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+   F+VY D L YK GVYQH  G  +G HA+R+LGWGVEN   YWL+ANSW
Sbjct: 246 KEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENGTKYWLIANSW 305

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FKILRGE+   IE
Sbjct: 306 NSDWGDNGFFKILRGEDHLGIE 327


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKW  C S+  I DQS CGSCWA   A A+SDR+CI S G     ISA+ ++
Sbjct: 85  LPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDLL 144

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW +W  +G+VTGG Y + +GC+PY+LAPCEHH +G L NCT 
Sbjct: 145 DCCDSCGAGCNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCT- 203

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G + TP+C   C    Y   Y+ D   G+K + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQDDKHFGRKVYSI 236



 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 57/91 (62%), Positives = 70/91 (76%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I+++GP+ A F+VYADFL YKSGVYQH  GD +G HA+R+LGWG EN  PYWLVANSWN
Sbjct: 246 EIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGTENGTPYWLVANSWN 305

Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
           + WGDHG FKILRG++E  IE   N  +  N
Sbjct: 306 EDWGDHGYFKILRGKDECGIEDDINAGIPKN 336


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 59  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
            G+VTG  YN+  GCQPY   PCEHH  GPL  C   G ++TP CK+ C   YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/87 (60%), Positives = 66/87 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD+G FKI+RG+NE  IE   N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR+C  SNG      SA+ ++
Sbjct: 82  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLL 141

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C+ 
Sbjct: 142 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLPCS- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C    Y+  Y+ D   GK  + V
Sbjct: 201 -GDTKTPKCVKEC-ESGYKVPYKQDKHYGKHVYSV 233



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y++GP+   F+VYAD L YKSGVY+H  GD++G HA++++GWGVEN   YWL+ANSWN
Sbjct: 243 ELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGVENGNKYWLIANSWN 302

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 6/179 (3%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 59  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
            G+VTG  YN+  GCQPY   PCEHH  GPL  C   G ++TP CK+ C   YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235



 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/90 (58%), Positives = 67/90 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           WN  WGD+G FKI+RG+NE  IE   N  +
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVNAGI 343


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 95/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR C  SNG      SA+ ++
Sbjct: 87  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C  
Sbjct: 147 SCCPVCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCN- 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C   SY   Y  D + GK  + V
Sbjct: 206 -GDSKTPKCHKTC-EASYSVDYHKDKRYGKHVYSV 238



 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 63/81 (77%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+   F+VY+D L YK+GVY+H  G+++G HA+++LGWGVEN   Y L+ANSWN
Sbjct: 248 ELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYRLIANSWN 307

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 308 SDWGDNGFFKILRGEDHCGIE 328


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 93/155 (60%), Gaps = 9/155 (5%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
            +   LP +FDAR  WP CP++  I DQ +CGSCWA     A+SDR+CI SNG      S
Sbjct: 85  DDGDDLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFS 144

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+ +V+C   C +GCNGG+P  AW +W H G+V+GG YNS EGC+PY + PCEHHV G  
Sbjct: 145 AEDLVSCCHTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCEHHVNGTR 204

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
             C      +TP CK  C     ES+Y  D  K K
Sbjct: 205 PPCK---NGRTPSCKHQC-----ESSYSVDYAKDK 231



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 71/111 (63%), Gaps = 6/111 (5%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           H+  K++ +   PR    R+I  +GP+   F+VY D + YKSGVY+H  G  +G HA+R+
Sbjct: 232 HFGSKSYSIRRNPR-EIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRI 290

Query: 120 LGWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
           LGWGV  D  +PYWL+ NSWN  WGD+G F+I+RGE+   IE   +  + A
Sbjct: 291 LGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISAGLPA 341


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 80/193 (41%), Positives = 112/193 (58%), Gaps = 10/193 (5%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN 211
           G+N  ++++ +   +         L  +       LP  FDAR++WP CP+++ I DQ +
Sbjct: 43  GQNFYNVDLSYVQGLCGTLQNKPTLPELEHPAGVKLPDTFDARQQWPNCPTIQDIRDQGS 102

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHN 270
           CGSCWA   A AISDRLCI SN   T +ISA+ +++C   C  GC GG+P  AW +W  +
Sbjct: 103 CGSCWAFGAAEAISDRLCIHSNAKITVEISAEDLLSCCEECGMGCFGGYPSAAWEYWAKS 162

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYESTY 327
           G+VTGG Y S +GC+PY++ PCEHHV G    C   G+  TP+C+  C   Y P+YE   
Sbjct: 163 GLVTGGLYGSNKGCRPYSIPPCEHHVNGTRPPCQ--GEGDTPKCQTKCIDGYTPAYEKDK 220

Query: 328 RFDLKKGKKAHMV 340
            F    GKK + V
Sbjct: 221 YF----GKKTYSV 229



 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 58/108 (53%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P      ++ KK + VP  +   M ++Y++GP+ A FSVY DFL YKSGVYQH  GD 
Sbjct: 212 YTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDM 271

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWG EN+ PYWL ANSWN  WG+ G FKILRG +E  IE
Sbjct: 272 LGGHAIKILGWGKENNTPYWLAANSWNTDWGNQGFFKILRGGDECGIE 319


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 98/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  + DQ +CGSCWA     A++DR+C  SNG      SA+ ++
Sbjct: 82  LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 141

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC+PY + PCEHHV G    C+ 
Sbjct: 142 SCCPICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLPCS- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C + +Y   Y+ D   GK  + V
Sbjct: 201 -GDTKTPKCIKKCED-NYNVAYKQDKHYGKHIYSV 233



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y++GP+   F+VYAD L YKSGVY+H  GD++G HA++++GWGVEN   YWL+ANSWN
Sbjct: 243 ELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNKYWLIANSWN 302

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRGE+   IE
Sbjct: 303 SDWGDNGFFKILRGEDHCGIE 323


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 98/155 (63%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR +WP CP+L+ I DQ +CGSCWA   A AISDR+CI SN   + +IS++ ++
Sbjct: 79  LPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW FW   G+VTGG Y+S  GC+PYT+ PCEHHV G    CT 
Sbjct: 139 TCCESCGMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C   C    Y  +Y+ D   GK ++ V
Sbjct: 199 EGG-DTPQCINQC-ESGYTPSYKKDKHYGKTSYSV 231



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/108 (50%), Positives = 69/108 (63%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ V         +IY++GP+   F VY DF  YKSGVYQH  G  
Sbjct: 214 YTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSL 273

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           IG HA+++LGWGVE+ +PYWL ANSWN  WGD+G FKILRG +   IE
Sbjct: 274 IGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGYFKILRGSDHCGIE 321


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 77/157 (49%), Positives = 97/157 (61%), Gaps = 4/157 (2%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP +FDAREKW  C S+  I DQS CGSCWA     A+SDR+CI S G     ISA+ 
Sbjct: 82  KDLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAED 141

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           ++ C  +C  GCNGG+P  AW F+  +G+VTGG Y + +GCQPY   PCEHH  GPL NC
Sbjct: 142 LLTCCDSCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEHHTVGPLPNC 201

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           T  G   TP+C ++C    YE +Y  D    KK + +
Sbjct: 202 T--GIKPTPQCVRDC-RKGYEKSYSEDKHYAKKVYTL 235



 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 56/106 (52%), Positives = 74/106 (69%), Gaps = 2/106 (1%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY KK + +         +I+++GP+ A F+VYADF+ YKSGVYQ +  D++G HA+R+L
Sbjct: 227 HYAKKVYTLSADETQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDALGGHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           GWG EN +PYWLVANSWN+ WGD G FKILRG +E  IE   N  +
Sbjct: 287 GWGTENGVPYWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAGI 332


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/159 (46%), Positives = 102/159 (64%), Gaps = 3/159 (1%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +A  +P +FDAR++WP CP++R I DQ +CGSCWA     A+SDR+CI S G    ++SA
Sbjct: 76  DAMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSA 135

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
             +V+C  +C  GCNGG+P  AW +W + G+V+GG + S +GC+PY +APCEHHV G   
Sbjct: 136 DDLVSCCYSCGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEHHVNGTRP 195

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            CT     KTP CKQ C    Y   Y+ D   GK+A+ +
Sbjct: 196 PCTGDDN-KTPSCKQQC-EKGYNVPYKKDKNFGKEAYSI 232



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 64/94 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+   F VY D L YK GVYQH  G+++G HA+R+LGWG E   PYWL+ANSW
Sbjct: 241 KEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGTEKGTPYWLIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEANSS 171
           N  WGD+GTFKILRGE+   IE      +  +SS
Sbjct: 301 NSDWGDNGTFKILRGEDHCGIESSIVAGIPKDSS 334


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 103/156 (66%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 71  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 130

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 131 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 190

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C ++C  P Y  TY+ D   G  ++ V
Sbjct: 191 --GEGDTPKCSKSC-EPGYSPTYKQDKHYGYDSYSV 223



 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVYADFL YKSGVYQH  G+ 
Sbjct: 206 YSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEM 265

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLV NSWN  WGD+G FKILRG++   IE
Sbjct: 266 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 313


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 77/170 (45%), Positives = 107/170 (62%), Gaps = 13/170 (7%)

Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIAS 232
           DDD++         LP NFDARE+WP+CP+++ I DQ +CGSCWA     AISDR+C+ +
Sbjct: 77  DDDMK---------LPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHT 127

Query: 233 NGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           NGY T ++SA+ +++C    C  GCNGG+P  AW++W   G+V+GG Y+S  GC+PY++ 
Sbjct: 128 NGYITIEVSAEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIP 187

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           PCEHHV G    CT  G   TP+C + C    Y   Y+ D   G  A+ V
Sbjct: 188 PCEHHVNGSRPACTGEGG-DTPKCNKKC-EAGYSPDYKDDKHYGTTAYNV 235



 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 65/117 (55%), Positives = 78/117 (66%), Gaps = 2/117 (1%)

Query: 45  KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSG 102
           K  KK +  Y P      HY   A+ VP      M +IY++GP+   F VYADFLQYKSG
Sbjct: 209 KCNKKCEAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSG 268

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           VYQH  GD +G HA+RVLGWGVE+ +PYWL ANSWN  WGD+G FKILRG++   IE
Sbjct: 269 VYQHVTGDMLGGHAIRVLGWGVEDGVPYWLAANSWNTDWGDNGFFKILRGKDHCGIE 325


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 77/170 (45%), Positives = 100/170 (58%), Gaps = 6/170 (3%)

Query: 174 DDLETMG--CQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA 231
           D  E +G   Q    +P+ FDAREKWP CP++  I DQ +CGSCWA     A+SDR+CI 
Sbjct: 73  DKQEVLGYLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCWAFGAVEAMSDRVCIH 132

Query: 232 SNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           SNG    + SA  +V+C   C +GCNGG+P  AW +W   G+V+GG Y S+ GC+PY +A
Sbjct: 133 SNGNVNFRFSADDLVSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPYEIA 192

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           PCEHHV G    C      KTP+C+  C    Y   Y  D   G K++ V
Sbjct: 193 PCEHHVNGTRAPCNH--DSKTPKCQHQC-EAGYNVEYSKDKHFGSKSYSV 239



 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R   +   +I  +GP+   F+VY D + YKSGVYQH  G  +G HA+R+L
Sbjct: 231 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRIL 290

Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  + ++PYWL+ANSWND WGD G F+ILRGE+   IE
Sbjct: 291 GWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIE 331


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 96/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+WP CP++R + DQ +CGSCWA     A+SDR+CI SNG      SA+++V
Sbjct: 91  LPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLV 150

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S  GC PY +APCEHHV G    C  
Sbjct: 151 SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 210

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP C + C    Y+  Y  DL  GK A+ +
Sbjct: 211 GG--KTPTCVKKC-EEGYKVPYAQDLHHGKSAYSI 242



 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 82/134 (61%), Gaps = 2/134 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
           K+  K     KK ++   +P +  L H      +    + +RQ IY +GP+   F+VY D
Sbjct: 209 KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 268

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDHWGDHGTFKILRGEN 154
           F+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANSWN  WG  G FKILRG +
Sbjct: 269 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSD 328

Query: 155 EADIEMGFNNRVEA 168
           E  IE   N  + A
Sbjct: 329 ECGIEGQINAGLPA 342


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 79/193 (40%), Positives = 115/193 (59%), Gaps = 8/193 (4%)

Query: 152 GENEADIEMGFNNRVEAN--SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
           G N  +++MG+  R+              M  ++ K LP +FDARE+WP+CP+++ I DQ
Sbjct: 44  GHNFYNVDMGYLKRLCGTFLGGPKPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQ 102

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFW 267
            +CGSCWA     AISDR+CI +N + + ++SA+ ++ C  +    GCNGG+P  AW FW
Sbjct: 103 GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFW 162

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              G+V+GG Y S  GC+PY++ PCEHHV G    CT  G+  TP+C + C  P Y  TY
Sbjct: 163 TRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTY 219

Query: 328 RFDLKKGKKAHMV 340
           + D   G  ++ V
Sbjct: 220 KQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 103/156 (66%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 92  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 151

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 152 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 211

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C ++C  P Y  TY+ D   G  ++ V
Sbjct: 212 --GEGDTPKCSKSC-EPGYTPTYKQDKHYGYNSYSV 244



 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/108 (54%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 227 YTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 286

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLV NSWN  WGD+G FKILRG++   IE
Sbjct: 287 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 334


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 104/160 (65%), Gaps = 6/160 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           NA+ +P  FD+R +W  CP+++ + DQ +CGSCWA + A A+SDR C+ASNG     +S+
Sbjct: 76  NAQDIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSS 135

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           ++++AC   C  GC+GG+P+ AW +W  +G+VTGG Y S +GCQPY +APCEHH+ G   
Sbjct: 136 ENLMACCETCGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEHHINGSRP 195

Query: 302 NCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            C   GK++ TP CK+ C    Y  T+  D    K A+ V
Sbjct: 196 AC---GKIEPTPRCKKTC-ESGYNVTFNKDKHYAKSAYSV 231



 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K A+ V         +I  +GP+ A F+VYADF  YKSGVYQH  G  +G HAV+++
Sbjct: 223 HYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMI 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG+E   PYWL+ANSWN  WGD G FKILRG++E  IE
Sbjct: 283 GWGMEGSTPYWLIANSWNSDWGDMGFFKILRGQDECGIE 321


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 79/193 (40%), Positives = 115/193 (59%), Gaps = 8/193 (4%)

Query: 152 GENEADIEMGFNNRVEAN--SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
           G N  +++MG+  R+              M  ++ K LP +FDARE+WP+CP+++ I DQ
Sbjct: 44  GHNFYNVDMGYLKRLCGTFLGGPKPPQRVMFTEDLK-LPASFDAREQWPQCPTIKEIRDQ 102

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFW 267
            +CGSCWA     AISDR+CI +N + + ++SA+ ++ C  +    GCNGG+P  AW FW
Sbjct: 103 GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFW 162

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              G+V+GG Y S  GC+PY++ PCEHHV G    CT  G+  TP+C + C  P Y  TY
Sbjct: 163 TRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTY 219

Query: 328 RFDLKKGKKAHMV 340
           + D   G  ++ V
Sbjct: 220 KQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     A+SDRLCI +NG+   ++SA+ ++
Sbjct: 80  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEVSAEDLL 139

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  P C  GCNGG+P  AW++W   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 SCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEHHVNGTRPKCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 GEGG-DTPKCSKTC-EPGYSPSYKEDKYYGYSSYSV 233



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     +Y   ++ VP      M +IY++GP+ A FSV++DFL YKSGVY+H  G+ 
Sbjct: 216 YSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVYKHVAGEV 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWG EN +PYWLV NSWN  WGD+G FKILRGE+   IE
Sbjct: 276 LGGHAIRILGWGKENGVPYWLVGNSWNVDWGDNGFFKILRGEDHCGIE 323


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 78/179 (43%), Positives = 108/179 (60%), Gaps = 6/179 (3%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 59  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 118

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 119 SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
            G+VTG  YN+  GCQPY   PCEH+  GPL  C   G ++TP CK+ C   YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/87 (60%), Positives = 66/87 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD+G FKI+RG+NE  IE   N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 98/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE WP CP++R + DQ +CGSCWA     A+SDR+CI S G      SA+++V
Sbjct: 89  LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S+ GC PY +APCEHHV G    C  
Sbjct: 149 SCCRTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKE 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP C + C +  Y+  Y  DL +GK A+ +
Sbjct: 209 GG--KTPACVKKCED-GYKVPYAQDLHRGKSAYSL 240



 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 67/92 (72%), Gaps = 1/92 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
           ++IY +GP+   F+VY DF+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 249 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 308

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
           WN  WG  G FKILRG +E  IE   N  + A
Sbjct: 309 WNSDWGSDGFFKILRGSDECGIEGQINAGLPA 340


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 72/165 (43%), Positives = 106/165 (64%), Gaps = 13/165 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR++WP CPS+  I DQ +CGSCWA     A+SDR+CI SNG     +SA+++V
Sbjct: 81  IPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLV 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC+GG+P  AW +W + G+V+GG+Y S++GCQPY++APCEHHV G    C+ 
Sbjct: 141 SCCDSCGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEHHVPGSRPACS- 199

Query: 306 LGKLKTPECKQNC-------YNPSY---ESTYRFDLKKGKKAHMV 340
            G   TP+C+  C       Y+  +   E+ Y  D  K  +A ++
Sbjct: 200 -GGGDTPDCRNQCDEGSGISYDQDHYYGETVYTLDEAKQIQAEIL 243



 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+ A F+VY D L YK GVYQH  G+++G HA+++LGWGVEND PYWLVANSWN
Sbjct: 241 EILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTPYWLVANSWN 300

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G FKILRG +E  IE
Sbjct: 301 TDWGNNGFFKILRGSDECGIE 321


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232



 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 95/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R+KWP CP+L  I DQ +CGSCWA     A++DR+C  S+G      SA+ ++
Sbjct: 84  LPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAEDLL 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C  GCNGG P LAW +W H G+V+GG YNS +GC PY + PCEHHV G    C  
Sbjct: 144 SCCPICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLPCN- 202

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C++ C    Y   ++ D   GK  + V
Sbjct: 203 -GDTKTPKCQKTC-EAGYNVPFKKDKHYGKHVYSV 235



 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K  + V     N   +++++GP+   F+VY+D L YKSGVYQH  G ++G HAV++L
Sbjct: 227 HYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL+ANSWN  WGD+G FKILRGE+   IE
Sbjct: 287 GWGVENGSKYWLIANSWNSDWGDNGFFKILRGEDHCGIE 325


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 64  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 123

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 124 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 183

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 184 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 216



 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 199 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 258

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 259 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 306


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHYGCNSYSV 232



 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 57/83 (68%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+ A FSV++DFLQYKSGVYQH  G+ +G HAVR+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVENDTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGDHG FKILRG +   IE
Sbjct: 300 WNTDWGDHGFFKILRGRDHCGIE 322


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 78/162 (48%), Positives = 102/162 (62%), Gaps = 5/162 (3%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A+ +P +FDARE+WP C S+ +I DQS+CGSCWAV+ A  ISDR CIASNG     ISA+
Sbjct: 72  AENIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISAE 131

Query: 244 HIVACTP---NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
            +++C     NC  GC GG+P  AWR+W HNG+VTGG Y SQ GC+PY++APC   V G 
Sbjct: 132 DLLSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGV 191

Query: 300 LQNCTLLGKLKTPECKQNCYNPS-YESTYRFDLKKGKKAHMV 340
                   ++ TPEC + C + S Y   Y  D   G  A+ +
Sbjct: 192 TWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAI 233



 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ + +  A  Q  I  +GP+   F VY+DF QYKSG+Y+H  G  +G HAV++L
Sbjct: 225 HYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKIL 284

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN  PYWL ANSWN +WG+ G F+I RG NE  IE
Sbjct: 285 GWGVENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIE 323


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 78/179 (43%), Positives = 107/179 (59%), Gaps = 6/179 (3%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 59  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQS 118

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGS WA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 119 SCGSYWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 178

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
            G+VTG  YN+  GCQPY   PCEHH  GPL  C   G ++TP CK+ C   YN SYE+
Sbjct: 179 QGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCD--GDVETPPCKRTCQAGYNVSYEN 235



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/87 (60%), Positives = 66/87 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 254 MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 313

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD+G FKI+RG+NE  IE   N
Sbjct: 314 WNTDWGDNGYFKIIRGKNECGIESDVN 340


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 96/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+WP CP++R + DQ +CGSCWA     A+SDR+CI SNG      SA+++V
Sbjct: 26  LPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLV 85

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S  GC PY +APCEHHV G    C  
Sbjct: 86  SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 145

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP C + C    Y+  Y  DL  GK A+ +
Sbjct: 146 GG--KTPTCVKKC-EEGYKVPYAQDLHHGKSAYSI 177



 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 66/104 (63%), Gaps = 2/104 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
           K+  K     KK ++   +P +  L H      +    + +RQ IY +GP+   F+VY D
Sbjct: 144 KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 203

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWN 138
           F+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANSWN
Sbjct: 204 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP    FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 3   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 62

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 63  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 122

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 123 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 155



 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 138 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 197

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 198 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 245


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/157 (45%), Positives = 91/157 (57%), Gaps = 5/157 (3%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP  FD+ + WP CP++R I DQ +CGSCWA     A+SDR+CI SN       SA  
Sbjct: 86  KDLPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADD 145

Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +V C   C +GCNGG+P  AW +W   G+V+GG YNS EGC+PY + PCEHHV GP   C
Sbjct: 146 LVTCCHTCGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEHHVDGPRPPC 205

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                  TP CK  C  P+Y   Y  D   G  ++ +
Sbjct: 206 H---SGSTPHCKHQC-QPNYSVDYEKDKHFGASSYSI 238



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 65/90 (72%), Gaps = 3/90 (3%)

Query: 72  PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIP 129
           PR N  R+I  +GP+   F+VY D + YK+GVYQH  G  +G HA+R++GWGV  E+ +P
Sbjct: 242 PR-NIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVP 300

Query: 130 YWLVANSWNDHWGDHGTFKILRGENEADIE 159
           YWL+ANSWN  WGD+G F+ILRG++   IE
Sbjct: 301 YWLIANSWNTDWGDNGFFRILRGKDHCGIE 330


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 103/160 (64%), Gaps = 6/160 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           NA+ +P  FD+R +W  CP+++ + DQ +CGSCWA++   A+SDR+C+AS G     ISA
Sbjct: 76  NAQDIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISA 135

Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           + + +C  +C  GCNGG+P+ AW +W  +G+VTGG Y S +GCQPY + PCEHH+ G   
Sbjct: 136 EDLNSCCKSCGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEHHINGSRP 195

Query: 302 NCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            C   GKL+ TP CK++C    Y  T+  D    K A+ V
Sbjct: 196 AC---GKLEPTPRCKKSC-ESGYNVTFAKDKHYAKTAYSV 231



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K A+ V         +I  +GP+ A F+VYADF  YKSGVYQH  G  +G HAV+++
Sbjct: 223 HYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMI 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E   PYWL+ANSWN  WG+ G FKILRG++E  IE
Sbjct: 283 GWGTEGSTPYWLIANSWNTDWGNMGFFKILRGQDECGIE 321


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 2   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 61

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 62  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 121

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 122 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 154



 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 137 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 196

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 197 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 244


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 100/156 (64%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FD+R  WP CP++  I DQ +CGSCWA      +SDR CI S G      S++++V
Sbjct: 80  MPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLV 139

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP   C+ 
Sbjct: 140 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSE 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            G   TP+C + C N  Y   Y  DL  G KA+ ++
Sbjct: 200 GG--GTPKCVKRCEN-GYTVDYESDLHHGGKAYSIM 232



 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   F+VY DFL YKSGVYQH  G  +G HA+R+LGWG EN  PYWL ANSWN
Sbjct: 241 EIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWGEENGTPYWLCANSWN 300

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRG +   IE
Sbjct: 301 TDWGDNGLFKILRGSDHCGIE 321


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKEDKHYGCSSYSV 232



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+ A F+VY+DFL YKSGVYQH  G+ +G HAVR+LGWGVE+  PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG +   IE
Sbjct: 300 WNTDWGDNGFFKILRGRDHCGIE 322


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 99/156 (63%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+R  WP CP++  I DQ +CGSCWA      +SDR CI S G      SA+++V
Sbjct: 79  LPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLV 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP   C+ 
Sbjct: 139 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSE 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            G   TP+C + C    Y   Y  DL  G KA+ ++
Sbjct: 199 GG--GTPKCAKTC-EKGYIVDYESDLHHGGKAYSIM 231



 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   F+VY DFL YKSGVYQH  G  +G HA+RVLGWG EN  PYWL ANSWN
Sbjct: 240 EIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCANSWN 299

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRG +   IE
Sbjct: 300 TDWGDNGLFKILRGSDHCGIE 320


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 1   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 61  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 121 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 153



 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 136 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 195

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 196 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 243


>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 280

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 72/154 (46%), Positives = 105/154 (68%), Gaps = 2/154 (1%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
             GLP NFDAR++WP CPS+ HI +Q NC S +A+SVA+A++DR+CI SN      +SAQ
Sbjct: 60  TNGLPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSAQ 119

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGPLQ 301
            I++C   C +GC+GG    +W F+  +G V+GGDYNS +GCQPY + PC+  + + P  
Sbjct: 120 QIISCCYLCGYGCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRH 179

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           +CT   + +TP C+  C NP+Y S+++ D+ KGK
Sbjct: 180 SCTTYNREETPACEIKCNNPNYYSSFKTDIYKGK 213



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 57  TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---FGDSIG 113
           +S     Y  K + V    AM++I+++GP+   F +Y D + YKSGVYQ++   +GD   
Sbjct: 203 SSFKTDIYKGKYYQVYPFMAMKEIFDNGPITTQFYMYRDLIDYKSGVYQYDEGFYGDFFT 262

Query: 114 LHAVRVLGWGVENDIP 129
           +   +++GWG EN  P
Sbjct: 263 VQGXKIIGWGEENGDP 278


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 99/156 (63%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FD+R  WP CP++  I DQ +CGSCWA      +SDR CI S G      SA+++V
Sbjct: 79  MPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLV 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP   C+ 
Sbjct: 139 SCCHLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSE 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            G   TP+C + C    Y   Y  DL  G KA+ ++
Sbjct: 199 GG--GTPKCAKTC-EKGYIVDYESDLHHGGKAYSIM 231



 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 58/81 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   F+VY DFL YKSGVYQH  G  +G HA+RVLGWG EN  PYWL ANSWN
Sbjct: 240 EIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCANSWN 299

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRG +   IE
Sbjct: 300 TDWGDNGLFKILRGSDHCGIE 320


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 98/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE WP CP++R + DQ +CGSCWA     A+SDR+CI S G      SA+++V
Sbjct: 24  LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 83

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S+ GC PY +APCEHHV G    C  
Sbjct: 84  SCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKE 143

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP C + C +  Y+  Y  DL +GK A+ +
Sbjct: 144 GG--KTPACVKKCED-GYKVPYAQDLHRGKSAYSL 175



 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 36/62 (58%), Positives = 51/62 (82%), Gaps = 1/62 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
           ++IY +GP+   F+VY DF+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 184 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 243

Query: 137 WN 138
           WN
Sbjct: 244 WN 245


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 98/155 (63%), Gaps = 5/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+R+KWP CP++  I DQ +CGSCWA     A+SDRLCI SN      +SA++++
Sbjct: 84  IPENFDSRQKWPHCPTISLIRDQGSCGSCWAFGAVEAMSDRLCIHSNKIV--NVSAENLL 141

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG+P  AW FW   G+V+GG Y S +GCQPY +APCEHH  G    C+ 
Sbjct: 142 SCCYSCGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGTRPPCS- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  +TP+C   C N  Y   Y  D   G+ ++ V
Sbjct: 201 -GGGRTPKCHTFCENEDYSLPYEKDKSFGRSSYSV 234



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/81 (64%), Positives = 63/81 (77%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A FSVY+DFL YKSGVY+H  G  +G HA+R+LGWGVEN  PYWLVANSWN
Sbjct: 244 EIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGVENGTPYWLVANSWN 303

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+GTFKIL+G +   IE
Sbjct: 304 TDWGDNGTFKILKGSDHCGIE 324


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 73/142 (51%), Positives = 95/142 (66%), Gaps = 7/142 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+W  CP+++ I DQ +CGSCWA     A+SDRLCI +NG+   ++SA+ ++
Sbjct: 80  LPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEVSAEDLL 139

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEHHVNGSRPQCT 199

Query: 305 LLGKLKTPECKQNC---YNPSY 323
             G+  TP+C ++C   Y+PSY
Sbjct: 200 --GEGDTPKCTKSCEAGYSPSY 219



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWGVEN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVENSVPYWLVANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGE+   IE
Sbjct: 300 WNVDWGDNGLFKILRGEDHCGIE 322


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 54/97 (55%), Positives = 67/97 (69%), Gaps = 2/97 (2%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFK 148
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FK
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 311


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 72/143 (50%), Positives = 90/143 (62%), Gaps = 2/143 (1%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +A  LP+NFDAR KWP CPS+  I DQS CGSCWA     A+SDRLCI SNG F   +SA
Sbjct: 82  DAMRLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSA 141

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
             +++C  NC +GC+GG+P +AW +WG +G+VTGG      GC+ Y    CEHHVQG   
Sbjct: 142 VDLLSCCENCGYGCSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYP 201

Query: 302 NCTLLGKLKTPECKQNCYNPSYE 324
            C       TPEC Q+C  P  +
Sbjct: 202 PCP-HQYYPTPECVQHCDTPGID 223



 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 48/83 (57%), Positives = 62/83 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I   GP+ A+F+VY DFLQYK GVY H++G  +  HA+R+LGWG E D+PYWL+ANS
Sbjct: 245 MKEIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGDVPYWLIANS 304

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG+ G  K LRG NE  IE
Sbjct: 305 WNEDWGEKGYMKFLRGLNECGIE 327


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 98/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKW  C S+  I DQS CGSCWA   A A+SDR+CI S G     ISA+ ++
Sbjct: 85  LPESFDAREKWSHCASINLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDLL 144

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GC+GG+P  AW +W  +G+V+ G Y + +GC+PY+LAPCEHH +G L NCT 
Sbjct: 145 DCCDSCGAGCDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCT- 203

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G + TP+C   C    Y   Y+ D   GKK + +
Sbjct: 204 -GTVPTPKCVHLC-RKGYGKDYQHDKHFGKKVYSI 236



 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 60/106 (56%), Positives = 76/106 (71%), Gaps = 2/106 (1%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK + +       Q  I+++GP+ A F+VYADFL YKSGVYQH+ GD +G HA+R+L
Sbjct: 228 HFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRIL 287

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           GWG EN  PYWLVANSWN+ WGDHG FKILRG++E  IE   N  +
Sbjct: 288 GWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGI 333


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 102/156 (65%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ +
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHFGCSSYSI 232



 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+VY+DFLQYKSGVYQH  GD +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 23/34 (67%), Positives = 26/34 (76%)

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           N  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 246 NGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 279


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 101/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ +
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHFGCSSYSI 232



 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+VY+DFLQYKSGVYQH  GD +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 99/150 (66%), Gaps = 4/150 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+ + ++SA+ ++
Sbjct: 80  LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
             G   TP+C + C  P Y  +Y+ D   G
Sbjct: 200 GEGG-DTPKCSKIC-EPGYSPSYKEDKHYG 227



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I+++GP+ A F+VY+DFLQYKSGVYQH  GD +G HAVR+LGWGVEN  PYWLV NS
Sbjct: 241 MAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGNS 300

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 301 WNTDWGDNGFFKILRGQDHCGIE 323


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG  
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HV G    CT  G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 70/157 (44%), Positives = 98/157 (62%), Gaps = 4/157 (2%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           + +P  FDARE+WP+CP+++ I DQ +CGSCWA     A+SDR+CI S G     +SA++
Sbjct: 87  QAIPEAFDAREQWPDCPTIQEIRDQGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAEN 146

Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +V+C   C +GCNGG+P  AW  W   G+VTGG++NS +GCQPY +  CEHH  G    C
Sbjct: 147 LVSCCYTCGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPC 206

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  G   TP+C + C +  Y   Y  DL  G  ++ V
Sbjct: 207 SEGG--GTPKCLKTCED-GYTVDYTQDLHYGASSYSV 240



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+    +VY DF  YKSGVYQH  G ++G HA+R+LGWGVE  +PYWL+ANSWN
Sbjct: 250 EIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVEEGVPYWLIANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G  K+LRG++   IE
Sbjct: 310 TDWGDNGYIKLLRGKDHCGIE 330


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 97/155 (62%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE WP CP++R + DQ +CGSCWA     A+SDR+CI S G      SA+++V
Sbjct: 28  LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S  GC PY +APCEHHV G    C  
Sbjct: 88  SCCWTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKE 147

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C +  Y+  Y  DL +GK A+ +
Sbjct: 148 GG--KTPKCVKKCED-GYKVPYEQDLHRGKSAYSL 179



 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 37/65 (56%), Positives = 52/65 (80%), Gaps = 1/65 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANS 136
           ++IY +GP+   F+VY DF+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANS
Sbjct: 188 QEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANS 247

Query: 137 WNDHW 141
           WN  W
Sbjct: 248 WNTDW 252


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 98/150 (65%), Gaps = 5/150 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
             G+  TP+C + C  P Y  +Y+ D   G
Sbjct: 200 --GEGDTPKCSKFC-EPGYTPSYKEDKHYG 226



 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+ A F+VY+DFL YKSGVYQH  G+ +G HAVR+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG +   IE
Sbjct: 300 WNTDWGDNGFFKILRGRDHCGIE 322


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 97/155 (62%), Gaps = 5/155 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           +   +P +FDARE WPEC S+   I DQ++CGSCWA   A A+SDR+CI SN      IS
Sbjct: 80  DVNAIPESFDAREAWPECASIIGDIRDQASCGSCWAFGAAEAMSDRICIHSNATVKVSIS 139

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
            + +  C   C  GCNGGWP  AW +W   G+VTGG Y +++GC+ YT+ PCEHH +G L
Sbjct: 140 TEDLNTCCYECGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEHHTEGDL 199

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
             C  +  + TP+CK+ C +   +  Y+ DL+KG 
Sbjct: 200 PACGDI--VPTPQCKKEC-DAGVDIEYKSDLRKGS 231



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 60/81 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A F VY DFL YKSGVYQ   G+  G HA+++LGWGVE+  PYWL ANSWN
Sbjct: 245 EIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTPYWLAANSWN 304

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WGD G FKILRG+NE  IE
Sbjct: 305 EDWGDKGYFKILRGQNECGIE 325


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P NFD+R +WP CP++  I DQ +CGSCWA     A+SDR+CI S G    ++S++ +V
Sbjct: 88  FPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLV 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG + S +GCQPY +APCEHHV G   +C  
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEHHVNGSRPSCEG 207

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C   SY   Y  D   GK ++ +
Sbjct: 208 EGG-KTPKCVKKC-QASYNVPYAKDKMYGKSSYSI 240



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+   F+VY D L YK GVY H  G  +G HA+R+LGWGVE+   YWL+ANSW
Sbjct: 249 KEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGVEDGTKYWLIANSW 308

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FKILRGE+   IE
Sbjct: 309 NSDWGDNGFFKILRGEDHLGIE 330


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI SNG      SA  +V
Sbjct: 88  IPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLV 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C  
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEK 207

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               KTP C+  C   SY+  Y+ D   G +A+ +
Sbjct: 208 EYG-KTPRCQHKC-QASYKVDYKTDKHFGSRAYSI 240



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  +A+ + +   +   +I  HGP+   F+VY D + YK GVY+H  G  +G HA+R++
Sbjct: 232 HFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRII 291

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVE DIPYWLVANSWN  WG++G FKILRG++   IE
Sbjct: 292 GWGVEKDIPYWLVANSWNTDWGNNGFFKILRGKDHCGIE 330


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 79/192 (41%), Positives = 110/192 (57%), Gaps = 5/192 (2%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDL-ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           G N  D++M +  R+         L + +       LP NFDARE WP CP+++ I DQ 
Sbjct: 44  GHNFYDVDMSYVKRLCGTLLNGPKLPQRVHLAEEMDLPENFDARENWPNCPTIKEIRDQG 103

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWG 268
           +CGSCWA     AISDR+CI +NG    ++SA+ ++ C    C  GCNGG+P  AW FW 
Sbjct: 104 SCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWT 163

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+V+GG Y+S  GC+PY++ PCEHHV G    C   G  +TP+C + C  P Y  +Y+
Sbjct: 164 KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCKGEGG-ETPKCSKTC-EPGYSPSYK 221

Query: 329 FDLKKGKKAHMV 340
            D   G  ++ V
Sbjct: 222 EDKHYGYSSYGV 233



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 62/125 (49%), Positives = 79/125 (63%), Gaps = 2/125 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
           K +  +  K  K  +  Y P+     HY   ++ VP      M +IY++GP+   FSVY 
Sbjct: 199 KGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPVEGAFSVYT 258

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWL ANSWN  WGD+G FKILRG++
Sbjct: 259 DFLVYKSGVYQHVTGEEVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQD 318

Query: 155 EADIE 159
              IE
Sbjct: 319 HCGIE 323


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 98/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FD+R  W  CP++  I DQ +CGSCWA      ++DR CI SNG      SA+++V
Sbjct: 79  IPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENLV 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  A+++W H+G+V+GG +NS +GCQPY +APCEHHV GP   C  
Sbjct: 139 SCCHLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQGCQPYEIAPCEHHVSGPRPKCAE 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C +NC   +Y   Y  DL  G K + V
Sbjct: 199 GG--STPKCHKNC-ESNYVVDYESDLHHGSKHYSV 230



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 57/81 (70%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
            I  +GP+   F+VY DFL YKSGVYQH  G  +G HA+RVLGWG E+  PYWL ANSWN
Sbjct: 240 DIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWGEEDGTPYWLCANSWN 299

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G FKILRG +   IE
Sbjct: 300 TDWGDNGYFKILRGSDHCGIE 320


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 70/150 (46%), Positives = 93/150 (62%), Gaps = 9/150 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAREKW  CP++  I DQ +CGSCWA     A+SDR+CI S G     +SA  +V
Sbjct: 88  IPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSADDLV 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG++ SQ+GC+PY + PCEHHV G    C+ 
Sbjct: 148 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEHHVNGTRPPCS- 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
                TP C+  C     ES+Y+ D KK K
Sbjct: 207 --SGSTPRCQHVC-----ESSYKVDYKKDK 229



 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 62/87 (71%), Gaps = 2/87 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
           +  ++I  +GP+   F+VY D + YKSGVY+H  G  +G HA+R+LGWGV  D  IPYWL
Sbjct: 244 DIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWL 303

Query: 133 VANSWNDHWGDHGTFKILRGENEADIE 159
           +ANSWN  WGD+G F+I+RG++   IE
Sbjct: 304 IANSWNTDWGDNGFFRIVRGKDHCGIE 330


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 71/147 (48%), Positives = 94/147 (63%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD R++WP CP+L+ I DQ +CGSCWA   A AISDR+CI SN   + +IS++ ++
Sbjct: 79  LPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+P  AW FW   G+VTGG Y+S  GC+PY++ PCEHHV G    CT 
Sbjct: 139 SCCDSCGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCTG 198

Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
             +  TP+C   C   Y P Y+    F
Sbjct: 199 E-EGDTPQCSNQCETGYTPGYKQDKHF 224



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K ++ +P      M ++ ++GP+   F+VY DFL YKSGVYQH  G ++G HA++VL
Sbjct: 223 HFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVL 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E   PYWL ANSWN  WG++G FKILRG++   IE
Sbjct: 283 GWGEEGGTPYWLAANSWNTDWGENGFFKILRGKDHCGIE 321


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 102/156 (65%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR++WP CP+++ I DQ +CGSCWA     A+SDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AW++W   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 SCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPQCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 GEGG-DTPKCSKTC-EPGYSPSYKEDKHFGYDSYSV 233



 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/83 (57%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YK+GVY+H  G+ +G HA+R+LGWG EN +PYWLV NS
Sbjct: 241 MAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGNS 300

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G FKI+RGE+   IE
Sbjct: 301 WNVDWGDSGFFKIVRGEDHCGIE 323


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 95/156 (60%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ WP CP++  I DQ +CGSCWA     A+SDRLCI SN       SA  +V
Sbjct: 92  VPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLV 151

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C  
Sbjct: 152 SCCHTCGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCD- 210

Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+  KTP C+  C   SY+  Y+ D   G K++ V
Sbjct: 211 -GEHGKTPSCRHEC-QKSYDVDYKTDKHFGSKSYSV 244



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R   +  ++I ++GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 236 HFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRIL 295

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN  PYWL+ANSWN  WG++G FK+LRGE+   IE
Sbjct: 296 GWGVENKTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIE 334


>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
          Length = 301

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 7/156 (4%)

Query: 183 NAKGLPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           N   +P +FDARE WPEC S+   I DQ++CGSCWA     A+SDR+CI S+     +IS
Sbjct: 80  NLDAIPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRIS 139

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+ +  C  +C  GCNGGWP LAW +W   G+VTGG Y   EGC+ Y++ PC+HHV G L
Sbjct: 140 AEDLNDCCYDCGDGCNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNL 199

Query: 301 QNCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGK 335
             C   G + +TP CK++C + S +  Y+ DL++G 
Sbjct: 200 GPC---GDIQRTPACKKSCDSTS-DLEYKSDLRRGS 231


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 71/150 (47%), Positives = 100/150 (66%), Gaps = 5/150 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+RE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+ + ++SA+ ++
Sbjct: 80  LPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
             G+  TP+C + C  P Y  +Y+ D   G
Sbjct: 200 --GEGDTPKCSKIC-EPGYTPSYKEDKHYG 226



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+ A FSVY+DFL YKSGVYQH  G+ +G HAVR+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 96/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE WP CP++R + DQ +CGSCWA     A+SDR+CI S G      SA+++V
Sbjct: 28  LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S  GC PY +APCEHHV G    C  
Sbjct: 88  SCCWTCGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGPCKE 147

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  KTP+C + C +  Y+  Y  DL  GK A+ +
Sbjct: 148 GG--KTPKCVKKCED-GYKVPYAQDLHHGKSAYSL 179



 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
           K+  K  K  KK +    +P +  L H      +    + +RQ IY +GP+   F+VY D
Sbjct: 146 KEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYED 205

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
           F+ Y++GVY+H  G ++G HA+R+LGWGV+N
Sbjct: 206 FIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 74/158 (46%), Positives = 97/158 (61%), Gaps = 5/158 (3%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP +FD+R +WP CP+L+ + DQ  CGSCWA     A+SDR+CI S G     ISA+ 
Sbjct: 87  KDLPASFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENVHISAED 146

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + +C   C  GC GG+P  AW ++  +G+VTGG YNS +GCQPYT+  C+HHV G LQ C
Sbjct: 147 LTSCCRTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQPC 206

Query: 304 TL-LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  +G   TP+CK  C    Y  TY  D   G  A+ V
Sbjct: 207 SKDIG--PTPKCKHTC-EAGYNVTYEKDKHYGMSAYSV 241



 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 54/98 (55%), Positives = 66/98 (67%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           HY   A+ V      M +I  +GP+   F+VYADF QYKSGVY+H  G  +G HA+++LG
Sbjct: 233 HYGMSAYSVHGVEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG EN   YWLVANSWN  WGD G FKILRG++E  IE
Sbjct: 293 WGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIE 330


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI S G    ++SA+ ++
Sbjct: 80  LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKDDKHFGCSSYSV 232



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G FKILRG++   IE
Sbjct: 300 WNTDWGDKGFFKILRGQDHCGIE 322


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI S G    ++SA+ ++
Sbjct: 80  LPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDML 139

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPSYKDDKHFGCSSYSV 232



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEND PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G FKILRG++   IE
Sbjct: 300 WNTDWGDKGFFKILRGQDHCGIE 322


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 89/155 (57%), Gaps = 5/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR  WP+CP++  I DQ +CGSCWA     A+SDR+CI SN       SA  +V
Sbjct: 86  LPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W H G+V+GG Y S+EGC+PY + PCEHHV G    C  
Sbjct: 146 SCCHTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPYEVEPCEHHVNGTRPPCH- 204

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C   C    Y   Y  D   G KA+ V
Sbjct: 205 --SGSTPRCMHKC-ESGYSVDYAKDKHFGAKAYSV 236



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  KA+ V R   +  R+I  +GP+   F+VY D + YK+GVYQH  G  +G HA+R+L
Sbjct: 228 HFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRIL 287

Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  +N +PYWL+ NSWN  WGD+G F+ILRGE+   IE
Sbjct: 288 GWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIE 328


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 100/156 (64%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP CP+++ I DQ +CGSCWA     AISDR+C+ SNG    ++SA+ ++
Sbjct: 81  LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAEDLL 140

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 141 SCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACT 200

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              +  TP C++ C    Y + Y+ D   G  ++ V
Sbjct: 201 GE-EGDTPTCRKKC-EEGYSTQYKDDKNYGSTSYSV 234



 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 56/99 (56%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y   ++ VP      M +IY++GP+   FSVY DFL YKSGVYQH  G+ +G HA+R+L
Sbjct: 226 NYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRIL 285

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN I YWL ANSWN  WGD+G FK LRG+N   IE
Sbjct: 286 GWGVENGIRYWLAANSWNIDWGDNGFFKFLRGKNHCGIE 324


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 95/155 (61%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FD+R +WP CP++  I DQ +CGSCWA     A+SDR+CI SNG      SA  +V
Sbjct: 88  IPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLV 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C  
Sbjct: 148 SCCHTCGFGCNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEK 207

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               KTP C+  C   SY+  Y+ D   G +A+ +
Sbjct: 208 EYG-KTPRCQHKC-QASYKVDYKTDKHFGSRAYSI 240



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   F+VY D + YK GVY+H  G  +G HA+R++GWGVE D PYWL+ANSWN
Sbjct: 250 EIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDTPYWLIANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G FKILRG++   IE
Sbjct: 310 TDWGNNGFFKILRGKDHCGIE 330


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 99/156 (63%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGLQNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    C+
Sbjct: 140 TCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCS 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP+C + C  P Y  +Y+ D   G   + V
Sbjct: 200 GEGG-DTPKCSKIC-EPGYSPSYKEDKHFGCDTYSV 233



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 73/108 (67%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+    + VP      M +IY++GP+ A FSVY+DFL YKSGVYQH  G+ 
Sbjct: 216 YSPSYKEDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEM 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HAVR+LGWGVEN  PYWLV NSWN  WGD+G FKILRG +   IE
Sbjct: 276 VGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIE 323


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 98/153 (64%), Gaps = 8/153 (5%)

Query: 182 QNAKGLPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
           ++ + LP NFDARE+WPEC SL   I DQSNCGSCWAVS A+  SDRLCIA+ G     +
Sbjct: 85  ESNEALPENFDARERWPECSSLLGSIKDQSNCGSCWAVSAASVFSDRLCIATGGAVARNL 144

Query: 241 SAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
           SA+ +  C   C  GC+GG P+ AW F+  +G+VTGGDY S++GCQPY++ PC     G 
Sbjct: 145 SAEQLNTCCYRCGNGCDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPC-----GK 199

Query: 300 LQNCTLLGKLKTPECK-QNCYNPSYESTYRFDL 331
            +N  +     TP+C  + C N +Y   YR DL
Sbjct: 200 GRNTCIEDDPDTPDCSIKTCTNSNYSKNYRADL 232



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY    + + R   + M+ +Y++GP+ A F VY DF+ YKSGVY +  G   G HA+++L
Sbjct: 233 HYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKIL 292

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV++   YWL ANSW+  WG++G F+ILRG NE  IE
Sbjct: 293 GWGVDDGTKYWLCANSWSRSWGENGLFRILRGNNECHIE 331


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI +N + + ++SA+ ++
Sbjct: 80  LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C       GCNGG+P  AW F    G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 232



 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 72/148 (48%), Positives = 95/148 (64%), Gaps = 6/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG    ++SA+ ++
Sbjct: 103 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSAEDLL 162

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 163 TCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 222

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G   TP+C + C   Y PSY+    F
Sbjct: 223 GEGG-STPKCSRICEAGYTPSYKEDKHF 249



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/108 (54%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+   ++ VP      M +IY++GP+ A FSVY+DFL YKSGVYQH  G+ 
Sbjct: 239 YTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEM 298

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HAVR+LGWGVE+  PYWLV NSWN  WGD G FKILRG++   IE
Sbjct: 299 MGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIE 346


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 103/155 (66%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR++WP+C +++ I DQ +CGSCWA   A AISDRLCI S    + +ISA+ ++
Sbjct: 77  LPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAEDLL 136

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+P  AW FW   G+VTGG   S+ GC+PY++APCEHHV G    C  
Sbjct: 137 SCCDECGMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQ- 195

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  +TP+C++ C +  Y ++Y  D   GK+++ +
Sbjct: 196 -GTQETPKCEKKCID-GYLTSYLKDKHFGKRSYSL 228



 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 83/124 (66%), Gaps = 2/124 (1%)

Query: 38  KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYAD 95
           +  ++  K +KK    YL + +   H+ K+++ +P  +   M ++Y++GP+ A F+VYAD
Sbjct: 195 QGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYAD 254

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           FL YK+GVYQH  G+ +G HA+++LGWG E+  PYWL ANSWN  WGD G FKI RG +E
Sbjct: 255 FLLYKTGVYQHVTGEVLGGHAIKILGWGEESGTPYWLAANSWNGDWGDKGFFKIKRGNDE 314

Query: 156 ADIE 159
             IE
Sbjct: 315 CGIE 318


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 6/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+KW +CPS+  I DQ +CGSCWA+    A+SDR C++        ISA++++
Sbjct: 82  IPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCVSFQENV--HISAENLM 139

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GC GG+ Q AW +W  +G+VTGG Y S EGCQPY +  C HH  GP +NCT 
Sbjct: 140 TCCKFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCT- 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+ KTP+C++ C    Y ++Y  DL  G+KA+ V
Sbjct: 199 -GEGKTPQCERTC-RSGYTTSYEADLHYGEKAYAV 231



 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 73/99 (73%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY +KA+ V R   A++ +I  +GP+   F+VY+DF  YKSGVYQH  G ++G HA+R+L
Sbjct: 223 HYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRIL 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN +PYWL+ANSWN  WGD G FK++RG+++  IE
Sbjct: 283 GWGTENGVPYWLIANSWNPSWGDKGYFKMIRGKDDCGIE 321


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 69/147 (46%), Positives = 97/147 (65%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDARE W EC   +  + DQS+CGSCWAV+ A+A+SDR CIAS G     +SA+++
Sbjct: 79  VPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENL 138

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C  +C +GC GG+P +AW +W   G+ TGG Y S++GCQPY+L PCEHH +G    C+
Sbjct: 139 LSCCDSCGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCS 198

Query: 305 LLGKLKTPECKQNCYNPS--YESTYRF 329
            L    TP CK  C + +  Y+S   F
Sbjct: 199 TL-DYDTPSCKHKCDDSALNYKSELTF 224



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 51/85 (60%), Positives = 64/85 (75%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N  ++I  +GP+ A F VY+DF+ YKSGVYQH  G+ +G HAVR+LGWG E+ +PYWLVA
Sbjct: 237 NIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEESGVPYWLVA 296

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN+ WGD G FKI RG NE+  E
Sbjct: 297 NSWNEDWGDKGLFKIRRGNNESGFE 321


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 100/156 (64%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+R++WP CP++  I DQ +CGSCWA     AISDR+C+ +N   + ++SA+ ++
Sbjct: 80  LPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AWR+W   G+V+GG Y+S  GC+PYT+ PCEHHV G    CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G  +TP C ++C  P Y  +Y+ D   G  ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 73/108 (67%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY   ++ VPR     M +IY++GP+   F VY DFL YKSGVYQH  G+ 
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQ 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWL ANSWN  WGD+G FKILRGE+   IE
Sbjct: 276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIE 323


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 84/203 (41%), Positives = 116/203 (57%), Gaps = 19/203 (9%)

Query: 148 KILRGENEADIE--------MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPE 199
           ++ + E  ADIE          F NR   N   +DD E  G +    +P +FDAR  WP 
Sbjct: 25  QLFKAEPRADIEHLRRKVMKSKFINR--NNKPREDDTEIDGSK----IPDSFDARVTWPH 78

Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA-CTPNCWGCNGG 258
           CPS+ +I DQS CGSCWA S A  +SDR+CIAS+G+   ++SA  I++ CT   +GC+GG
Sbjct: 79  CPSISYIRDQSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCCTDGGYGCDGG 138

Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCTLLGKLKTPECKQN 317
           WP  AW+++   GVVTGG Y +++ C+PY + PC  H       NCT   ++ TP+CK  
Sbjct: 139 WPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNCTQ--EIDTPDCKTT 196

Query: 318 CYNPSYESTYRFDLKKGKKAHMV 340
           C    Y  +Y  D   GK A+ V
Sbjct: 197 C-QAGYPISYDDDKTYGKTAYSV 218



 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 47/84 (55%), Positives = 62/84 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+VA F+VY DF  YK+G+Y+H  G   G HAVR+LGWG +  +PYWLVANSW
Sbjct: 227 KEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQGGVPYWLVANSW 286

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG++G F+ILRG +E  IE G
Sbjct: 287 NTDWGENGYFRILRGSDECGIEDG 310


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 73/157 (46%), Positives = 92/157 (58%), Gaps = 4/157 (2%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP  FDAR +WP+CPSL+ + DQ  CGSCWA     A +DRLCI S G     +SA+ 
Sbjct: 84  KDLPDTFDARTQWPDCPSLKEVRDQGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAED 143

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + +C   C  GCNGG+ + AW +   +G+VTGG YNS +GC PY +  C+HHV G LQ C
Sbjct: 144 LTSCCRTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQPC 203

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              G   TP CK+ C    Y +TY  D    K  H V
Sbjct: 204 K--GDGPTPRCKKEC-ESGYNNTYSKDEHHAKTVHAV 237



 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 65/98 (66%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRC-NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           H+ K  H V      M +I  +GP+ A F+VY+DF  YKSGVY+H  G  +G HA++ LG
Sbjct: 229 HHAKTVHAVEGVEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E+   YWLVANSWN  WGD+G FKILRG +E  IE
Sbjct: 289 WGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIE 326


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD R++WP C +L  I DQ +CGSCWA     +ISDR+CI S G  + +ISA+ ++
Sbjct: 75  LPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC+GG+P  AW +W  +G+VTGG YNS  GC+PY++APCEHHV G    C+ 
Sbjct: 135 SCCDQCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEHHVNGTRPPCS- 193

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+  TP+C   C  P Y   Y+ D   G K + V
Sbjct: 194 -GEQDTPKCTGVCI-PKYSVPYKQDKHFGSKVYNV 226



 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 55/99 (55%), Positives = 70/99 (70%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K + VP  +   M ++Y +GP+ A F+VY DF  YKSGVYQH  G ++G HAV++L
Sbjct: 218 HFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKIL 277

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN  P+WLVANSWN  WGD+G FKILRG +E  IE
Sbjct: 278 GWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIE 316


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 97/155 (62%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR++WP CP++R I DQ +CGSCWA     A+SDR+CI SNG      S+  +V
Sbjct: 77  IPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDLV 136

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GCNGG+P  AW +W   G+V+GG Y +++GC+PY + PCEHH  G    C  
Sbjct: 137 SCCWTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRPACD- 195

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             +  TP+C ++C   +Y+  Y  DL  G KA+ +
Sbjct: 196 ASEGNTPKCAKSC-ESNYKINYSNDLHFGSKAYSI 229



 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 51/84 (60%), Positives = 63/84 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   FSVYADF+ YK+GVYQH  G  +G HA+R+ GWGVEN+ PYWL+ANSWN
Sbjct: 239 EILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIANSWN 298

Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
             WGD GTFKILRG +   IE G 
Sbjct: 299 TDWGDSGTFKILRGSDHCGIESGI 322


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 101/158 (63%), Gaps = 4/158 (2%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A  LP +FDAR++W  CP+++ + DQ +CGSCWA     A+SDR+CIAS G     IS++
Sbjct: 119 AVNLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSE 178

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            +++C  +C  GCNGG+P  AW ++   G+V+GG Y + +GC+PY++APCEHHV G    
Sbjct: 179 DLLSCCSSCGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP 238

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C+  G+  TP+C++ C    Y+  Y  D   G  A+ V
Sbjct: 239 CS--GEGPTPKCERTC-EKGYKVKYEDDKNFGYTAYSV 273



 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I  +GP+   F+VYADF  YKSGVYQH  G  +G HA+RVLGWGVE+  PYWLVANS
Sbjct: 281 MTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVEDGTPYWLVANS 340

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG+NE  IE
Sbjct: 341 WNSDWGDNGFFKILRGQNECGIE 363


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 74/157 (47%), Positives = 100/157 (63%), Gaps = 6/157 (3%)

Query: 187 LPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FD+R +W   CPS++ + DQ+NCGSCWA     A++DR CIAS G  T  ISA+ +
Sbjct: 89  LPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDL 148

Query: 246 VAC-TPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + C T  C  GCNGG+P  AW +W + G+VTGG Y+S +GCQPY+LA CEHH  GP + C
Sbjct: 149 LTCCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKPC 208

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             +  + TP CK++C    Y  TY  D   G  ++ V
Sbjct: 209 GDI--VPTPACKRSCRQ-GYNVTYPNDKHFGASSYGV 242



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 60/81 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A F+VY+DFL YKSGVYQH  G  +G HA++++GWGV++   YW+VANSWN
Sbjct: 251 EIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQDGTDYWIVANSWN 310

Query: 139 DHWGDHGTFKILRGENEADIE 159
           D WG+ G F I +G +E  IE
Sbjct: 311 DSWGNDGFFWIKKGTDECGIE 331


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 99/155 (63%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFD R +WP CP+L+ + DQ +CGSCWA   A AISDR+CI SN   + +IS++ ++
Sbjct: 79  LPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+P  A  FW   G+V+GG Y+S  GC+PY++ PCEHHV G    C  
Sbjct: 139 SCCESCGMGCNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEHHVNGTRPPCKG 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             +  TP+C   C  P Y   Y+ D   GK+++ V
Sbjct: 199 E-EGDTPQCTNQC-EPGYTPGYKQDKHFGKRSYSV 231



 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 72/99 (72%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K+++ VP      M+++Y++GP+   F+VY DFL YKSGVY+H  G ++G HA++VL
Sbjct: 223 HFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVL 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E  IPYWL ANSWN  WG++G FKI+RGE+   IE
Sbjct: 283 GWGEEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIE 321


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 97/155 (62%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE+WP CP+++ + DQ  CGSCWA     A+SDR CI S G     ISA+ ++
Sbjct: 4   VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GCNGG+P+ AW  W   G+VTGG Y+S +GCQPY +A C+HHV G L+ C  
Sbjct: 64  SCCETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKPCK- 122

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C++ C    Y  +Y  D   G+ A+ V
Sbjct: 123 -GDSPTPKCERKC-EAGYNVSYSDDKHFGQSAYSV 155



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/102 (52%), Positives = 68/102 (66%), Gaps = 2/102 (1%)

Query: 63  HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ + A+ V    A   ++I  +GP+   F+VYADF  YKSGVYQH  G ++G HA+++L
Sbjct: 147 HFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAIKIL 206

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           GWG EN  PYWLVANSWN  WGD G FKI RG +E  IE G 
Sbjct: 207 GWGEENGTPYWLVANSWNSDWGDEGFFKIKRGNDECGIESGI 248


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG  
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HV G    CT  G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 95/158 (60%), Gaps = 5/158 (3%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP  FD+R +WP CP+L+ + DQ  CGSCWA     A+SDR+CI S G     ISA+ 
Sbjct: 87  KDLPATFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENTHISAED 146

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + +C   C  GC GG+P  AW ++  +G+VTGG YNS +GC PYT+  C+HHV G LQ C
Sbjct: 147 LTSCCRTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPC 206

Query: 304 T-LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  +G   TP+CK  C    Y  TY  D   G  A+ V
Sbjct: 207 SKSIG--PTPKCKHTC-EAGYNVTYEKDKHYGSSAYSV 241



 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 54/98 (55%), Positives = 66/98 (67%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           HY   A+ V      M +I  +GP+   F+VYADF QYKSGVY+H  G  +G HA+++LG
Sbjct: 233 HYGSSAYSVHGVEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG EN   YWLVANSWN  WGD G FKILRG++E  IE
Sbjct: 293 WGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIE 330


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 71/153 (46%), Positives = 95/153 (62%), Gaps = 3/153 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR KW EC S+ HI +Q NC + WA+SV +AI+DR+CI S    T   S Q ++
Sbjct: 87  MPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKML 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+   AW++W   G+VTGGDY S EGCQP+ + PC H V        +
Sbjct: 147 SCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYM 206

Query: 306 LGKLK--TPECKQNCYNPSYESTYRFDLKKGKK 336
            GK K  TP+C  NCYNP+Y   +  D+ KG +
Sbjct: 207 CGKYKSETPQCTLNCYNPNYSKPFLKDISKGIR 239



 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 56/91 (61%), Gaps = 2/91 (2%)

Query: 74  CNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           C+ M   ++ +HGP  AI  VY DFL YKSG+YQH  G  +G   V+V+GWGV   + YW
Sbjct: 244 CSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYW 303

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           L ANSW   WGD G FKI RG NE   E  F
Sbjct: 304 LAANSWGTSWGDKGFFKIRRGYNECLFEDYF 334


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG  
Sbjct: 2   ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 61

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 62  NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 121

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HV G    CT  G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 122 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 164



 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 172 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 231

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 232 WNVDWGDNGFFKILRGENHCGIE 254


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 100/166 (60%), Gaps = 5/166 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG  
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HV G    CT  G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 232



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNVDWGDNGFFKILRGENHCGIE 322


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 99/156 (63%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP+CP+++ I DQ +CGS WA     AISDR+CI +N + + ++SA+ ++
Sbjct: 7   LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCE HV G    CT
Sbjct: 67  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 127 --GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 159



 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 142 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 201

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 202 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 249


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 73/148 (49%), Positives = 92/148 (62%), Gaps = 7/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 80  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G+  TP C ++C   Y+PSY+    F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225



 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 73/148 (49%), Positives = 92/148 (62%), Gaps = 7/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 80  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G+  TP C ++C   Y+PSY+    F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225



 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 51/83 (61%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++ P+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWGV N +PYWL ANS
Sbjct: 240 MAEIYKNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVGNGVPYWLAANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 100/156 (64%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+R +WP CP++  I DQ +CGSCWA     AISDR+C+ +N   + ++SA+ ++
Sbjct: 80  LPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AWR+W   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G  +TP C ++C  P Y  +Y+ D   G  ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/108 (54%), Positives = 73/108 (67%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY   ++ VPR     M +IY++GP+   F VY DFL YKSGVYQH  G+ 
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQ 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGV+N  PYWL ANSWN  WGD+G FKILRGE+   IE
Sbjct: 276 VGGHAIRLLGWGVDNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIE 323


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 14/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ--ISAQH 244
           +P  FDAR+KWP+CP++  + DQ  CGSCWA     A+SDR CI+    F  Q  ISA++
Sbjct: 86  IPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCIS----FKEQVNISAEN 141

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWG----HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
           +++C   C  GC+GG+P  AWR W     + G+VTGG Y+S  GCQPYT+  C+HH  GP
Sbjct: 142 LLSCCETCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGP 201

Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
            +NC+  G   TP CK++C + SY+ +YR D   GK
Sbjct: 202 YENCS--GSQSTPSCKRSCIS-SYDKSYRSDKHYGK 234



 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 60/81 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   FSVYADF  Y SGVYQH  G  +G HA+++LGWG EN +PYWLVANSWN
Sbjct: 249 EIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGTENGVPYWLVANSWN 308

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKI+RG++E  IE
Sbjct: 309 PSWGDSGFFKIIRGKDECGIE 329


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 97/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG    ++SA+ ++
Sbjct: 7   LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 66

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 67  TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 126

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 127 --GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 159



 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 167 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 226

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 227 WNADWGDNGFFKILRGENHCGIE 249


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 92/155 (59%), Gaps = 5/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ WP CP++  I DQ +CGSCWA     A+SDR+CI S G     +SA  +V
Sbjct: 86  IPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C+ 
Sbjct: 146 SCCHICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCS- 204

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C+  C   SY   Y  D   G K++ V
Sbjct: 205 --HGSTPSCQHKC-QASYSVEYAKDKNFGSKSYSV 236



 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 61/84 (72%), Gaps = 2/84 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
           ++I  +GP+   F+VY D + YKSGVYQH  G  +G HA+R+LGWGV  E+ +PYWL+ N
Sbjct: 245 QEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGN 304

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SWN  WGD+G F+ILRG++   IE
Sbjct: 305 SWNTDWGDNGFFRILRGQDHCGIE 328


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 92/155 (59%), Gaps = 5/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ WP CP++  I DQ +CGSCWA     A+SDR+CI S G     +SA  +V
Sbjct: 86  IPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C+ 
Sbjct: 146 SCCHICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPCS- 204

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C+  C   SY   Y  D   G K++ V
Sbjct: 205 --HGSTPSCQHKC-QASYSVEYAKDKNFGSKSYSV 236



 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 37/68 (54%), Positives = 49/68 (72%), Gaps = 2/68 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
           ++I  +GP+   F+VY D + YKSGVYQH  G  +G HA+R+LGWGV  E+ +PYWL+ N
Sbjct: 245 QEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGN 304

Query: 136 SWNDHWGD 143
           SWN  WGD
Sbjct: 305 SWNTDWGD 312


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 97/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG    ++SA+ ++
Sbjct: 1   LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 61  TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT 120

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 121 --GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 153



 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 161 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 220

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 221 WNADWGDNGFFKILRGENHCGIE 243


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 69/182 (37%), Positives = 101/182 (55%), Gaps = 8/182 (4%)

Query: 160 MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
           +   ++   N    DD++         +P +FD+R +WP CPS++ I DQS+CGSCWA  
Sbjct: 72  VAIPSKYRVNEVTHDDIDD------SAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWAFG 125

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
            A A++DR+CIAS G     +SA  +++C   C +GC+GG+P  AW +W   G+V+GG Y
Sbjct: 126 AAEAMTDRICIASKGAIQFTVSADDLLSCCDECGFGCDGGFPYAAWNYWVEKGIVSGGSY 185

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            S+ GC+PY   PCEHH  G   +        T  C+  C    Y + Y  D + G KA+
Sbjct: 186 TSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKC-QSGYATAYTNDKRYGAKAY 244

Query: 339 MV 340
            V
Sbjct: 245 TV 246



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 66/100 (66%), Gaps = 2/100 (2%)

Query: 64  YFKKAHMVP-RCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y  KA+ V  R  A+ ++I  HGP+   + VY DF  Y  G+Y+H  G  +G HAV+++G
Sbjct: 239 YGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIG 298

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
           WG EN IPYW+ +NSWN  WG++G F+ILRG +E  IE G
Sbjct: 299 WGTENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESG 338


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 70/142 (49%), Positives = 93/142 (65%), Gaps = 6/142 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+R +WP+CP+++ I DQ +CGSCWA     AISDR+CI S G    +ISA+ ++
Sbjct: 13  LPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLL 72

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C +GCNGG+P  AW FW   G+V+GG + S  GC+PYT+ PCEHHV G   +CT
Sbjct: 73  SCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNGSRPSCT 132

Query: 305 LLGKLKTPECKQNC---YNPSY 323
              +  TP+C   C   Y PSY
Sbjct: 133 GE-EGDTPKCVMQCEAGYTPSY 153



 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQI--YEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     H+   ++ V    A  QI  Y++GP+   F+VY DFLQYKSGVY+H  GD+
Sbjct: 149 YTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGVYKHVTGDA 208

Query: 112 IGLHAVRVLGWGVEN 126
           +G HA+R+LGWGVE+
Sbjct: 209 VGGHAIRILGWGVES 223


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/156 (49%), Positives = 96/156 (61%), Gaps = 7/156 (4%)

Query: 183 NAKGLPRNFDAREKWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           NA+ +P +FDARE WP+C P + +I DQS CGSCWA     A+SDR+CI SN      IS
Sbjct: 80  NAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKVNIS 139

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+  + C   C  GCNGG P +AW  W  NG+VTGG+Y    GC+ Y+ APCEHHV G L
Sbjct: 140 AEDPLDCCTICGMGCNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEHHVDGDL 199

Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGK 335
             C   G  K TP+CK+ C + S   TY+ DL  G 
Sbjct: 200 PPC---GPTKPTPDCKKECDSGS-SLTYQNDLTHGS 231



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 63/81 (77%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A FSVY DFL YKSGVYQH  G+  G HA+++LGWGVEND PYWLVANSWN
Sbjct: 245 EIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVANSWN 304

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WGD G FKILRG NE  IE
Sbjct: 305 EDWGDKGYFKILRGSNECGIE 325


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 93/155 (60%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY ++PCEHHV G    C  
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C   C + SY   Y  D   G K++ V
Sbjct: 207 GG--GTPKCSHVCQS-SYTVDYAKDKHFGSKSYSV 238



 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R       +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289

Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  D  IPYWL+ NSWN  WGDHG F+ILRG++   IE
Sbjct: 290 GWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 96/155 (61%), Gaps = 6/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD R KWP+C SL  I DQ+NCGSCWA   A A++DR+CIA  G     ISA+ I 
Sbjct: 87  LPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDRICIAGKGNI--HISAEDIN 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GCNGG+P  AW ++   GVV+GG Y + EGC PY+L  C+HH  G  Q C  
Sbjct: 145 DCCKSCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPCPA 204

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  + TP+C++ C    Y  +Y  D  +GKK++ V
Sbjct: 205 V--VPTPKCEKKCLT-GYPKSYSNDKTRGKKSYGV 236



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 62/83 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ ++GP+ A F VY+DFL YK+GVY+H  G   G HAV+++G+G E+   YWLVANS
Sbjct: 243 MQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTESGQDYWLVANS 302

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WGD G FKI +G++E  IE
Sbjct: 303 WNEDWGDKGFFKIAKGKDECGIE 325


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+RE WP+C S+R+I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 105 IPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 164

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG +Y +  GC+PY   PCEHH +    +   
Sbjct: 165 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCP 224

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C     + TY  D   G  A+ V
Sbjct: 225 HDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGV 259



 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/84 (54%), Positives = 56/84 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+EN IPYW  ANSW
Sbjct: 268 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGIENGIPYWTCANSW 327

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 328 NTDWGEDGFFRILRGVDECGIESG 351


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 72/157 (45%), Positives = 96/157 (61%), Gaps = 3/157 (1%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP  FDAR +W  CP+++ I DQ +CGSCWA     ++SDR+CI SNG     ISA+ 
Sbjct: 112 KDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAED 171

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + +C  +C  GCNGG+   AW ++  +G+VTGG YNS +GCQPYT+  C+HHV G LQ C
Sbjct: 172 LTSCCRSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQPC 231

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +   +  TP CK  C    Y  +Y  D   G  A+ V
Sbjct: 232 SKK-EEHTPVCKHEC-ESGYNVSYTKDKHYGATAYSV 266



 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 65/98 (66%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           HY   A+ V      M +I  +GP+   F+VYADF QYKSGVY+H  G  +G HA++++G
Sbjct: 258 HYGATAYSVRGVQQIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMG 317

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E    YWLVANSWN  WG+ GTFKILRG +E  IE
Sbjct: 318 WGTEGGDDYWLVANSWNPDWGNQGTFKILRGRDECGIE 355


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/184 (40%), Positives = 108/184 (58%), Gaps = 7/184 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           G N A+ ++ +  R+         L+   G  +   LP +FD+R  WP CP++R I DQ 
Sbjct: 44  GHNFANADVHYVKRLCGTHLNGPQLQKRFGFADDLDLPDSFDSRAAWPNCPTIREIRDQG 103

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQLAWRFWG 268
           +CGSCWA     AISDR+C+ +NG    ++SA+ +++C    C  GCNGG+P  AWRFW 
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWT 163

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
             G+V+GG Y+S  GC+PY++ PCEHHV G   +C    +  TP+C + C   Y P+Y S
Sbjct: 164 ETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPSCKGE-EGDTPKCMKTCEEGYTPAYGS 222

Query: 326 TYRF 329
              F
Sbjct: 223 DKHF 226



 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 59/125 (47%), Positives = 76/125 (60%), Gaps = 2/125 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
           K ++    K  K  +  Y P      H+   ++ VP      M  IY++GP+   F VYA
Sbjct: 199 KGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYA 258

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DF  YKSGVYQH  G+ +G HA+++LGWGVEN  PYWL ANSWN  WGD+G FKILRG++
Sbjct: 259 DFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 155 EADIE 159
              IE
Sbjct: 319 HCGIE 323


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 66/159 (41%), Positives = 102/159 (64%), Gaps = 6/159 (3%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           ++  G+P +FDAR +WP CPS+  I DQ++CGSCWA +V  +ISDR+CIA++   T + S
Sbjct: 88  EDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRVCIATDANKTAEFS 147

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
            + I+ C   C +GC+GG+P  AW ++   GVVTGG Y ++  C+PY ++PC +H     
Sbjct: 148 VEDILTCCDECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETF 207

Query: 301 -QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            +NCT    + TP CK +C    Y  +Y+ D  +G+K++
Sbjct: 208 YRNCT---GVSTPSCKTSC-QKGYPVSYKDDKTRGRKSY 242



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 62/82 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I +HGPLVA FSVY DF+ YK G+Y++  G   G HAVR+LGWGVEN++ YW++ANSW
Sbjct: 253 KDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVKYWIIANSW 312

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG+ G F+++RG N+  IE
Sbjct: 313 NTDWGEDGFFRMVRGINDCGIE 334


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 109/192 (56%), Gaps = 11/192 (5%)

Query: 150 LRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQ 209
           L G +E   +    ++ E     DD++      +   LP +FDAR +W  CP++  I +Q
Sbjct: 58  LMGVHEESYKYPLPDKQEVLGESDDEI------SLADLPVDFDARLRWTSCPTISEIREQ 111

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA++  + +SDRLCI SNG    ++S   +++C   C + C GG+P  AW +W 
Sbjct: 112 GSCGSCWAIATTSVMSDRLCIGSNGVMNFRLSGLDMLSCCAICGFACQGGYPGAAWAYWA 171

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
             G+V+GGDY SQ+GCQPYT+ PC+H   G    CT+ G ++   C+  C  PSY+  ++
Sbjct: 172 RKGLVSGGDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGGVR---CQHLC-EPSYKVDFQ 227

Query: 329 FDLKKGKKAHMV 340
            D     K + +
Sbjct: 228 RDKNFASKVYSI 239



 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/84 (58%), Positives = 60/84 (71%), Gaps = 2/84 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVAN 135
           ++I  +GP+ AI +VY DFL YK+GVY H  G+ +G HAVR+LGWGV     +PYWLVAN
Sbjct: 248 KEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVAN 307

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WGD+G F I RGEN  DIE
Sbjct: 308 SWGSDWGDNGFFHIFRGENHCDIE 331


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY ++PCEHHV G    C  
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  +TP+C   C +  Y   Y  D   G K++ V
Sbjct: 207 GG--RTPKCSHVCQS-GYTVDYAKDKHFGSKSYSV 238



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R       +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 230 HFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289

Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  E  IPYWL+ NSWN  WGDHG F+ILRG++   IE
Sbjct: 290 GWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  IPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY ++PCEHHV G    C  
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C   C   SY   Y  D   G K++ V
Sbjct: 207 GG--ATPKCSHVC-QSSYTVDYAKDKHFGSKSYSV 238



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R   +   +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 230 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289

Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  D  IPYWL+ NSWN  WGD G F+ILRG++   IE
Sbjct: 290 GWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQDHCGIE 330


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 98/156 (62%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD R++WP CP++  I DQ +CGSCWA     AISDR+C+ +N   + ++SA+ ++
Sbjct: 80  LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AWR+W   G+V+GG Y+S  GC+ YT+ PCEHHV G    CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G  +TP C ++C  P Y  +Y+ D   G  ++ V
Sbjct: 200 GEGG-ETPRCSRHC-EPGYSPSYKEDKHYGITSYGV 233



 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 59/108 (54%), Positives = 71/108 (65%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY   ++ VPR     M +IY++GP+   F VY DFL YKSGVYQH  G+ 
Sbjct: 216 YSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQ 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWL ANSWN  WG  G FKILRGE+   IE
Sbjct: 276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIE 323


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 72/148 (48%), Positives = 92/148 (62%), Gaps = 7/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 80  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG Y+S  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G+  TP C ++C   Y+PSY+    F
Sbjct: 200 --GEGDTPRCNKSCEAGYSPSYKEDKHF 225



 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 52/83 (62%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+L WGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 77  LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 136

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY ++PCEHHV G    C  
Sbjct: 137 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAH 196

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G  +TP+C   C +  Y   Y  D   G K++ V
Sbjct: 197 GG--RTPKCSHVCQS-GYTVDYAKDKHFGSKSYSV 228



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R       +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 220 HFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 279

Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  E  IPYWL+ NSWN  WGDHG F+ILRG++   IE
Sbjct: 280 GWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 320


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +WP CPS+  I DQS+CGSCWAVS A  ISDR+CIAS G     ISA  I 
Sbjct: 97  IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVSISADDIN 156

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC       GCNGG+P  AWR +  NG VTGG Y  + GC+PY   PCEHHV G      
Sbjct: 157 ACCGMACGNGCNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                 T +C+++C    Y  TY+ DL  G+ A+ V
Sbjct: 217 PSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAV 251



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/102 (49%), Positives = 68/102 (66%), Gaps = 2/102 (1%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ + A+ V +      ++I  +GP+   F+VYADF  Y  GVY H  G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKML 302

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           GWGV+N  PYWL ANSWN+ WG++G F+I+RG NE  IE G 
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIEHGV 344


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 109/199 (54%), Gaps = 11/199 (5%)

Query: 145 GTFKILRGENEADI--EMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS 202
           G  K   G +E DI  +MG    V      D  L        K +P  FDAR +WP+CP+
Sbjct: 38  GVNKRFEGLSEVDIRRQMG----VLQGGPLDIKLPEKDITPLKDVPDMFDARMQWPDCPT 93

Query: 203 LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQ 261
           ++ I DQ  CGSCWA     ++SDR CI  N   +  ISA+ ++AC   C  GCNGG+  
Sbjct: 94  IKEIRDQGACGSCWAFGAVESMSDRFCIHFNQ--SAHISAEDLMACCETCGMGCNGGYLG 151

Query: 262 LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
            AWR++ H G+VTGG YNS+EGCQPY +A C+HHV G  Q C    +  TP C + C   
Sbjct: 152 AAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQPCA-SKEEHTPRCSKTC-EA 209

Query: 322 SYESTYRFDLKKGKKAHMV 340
            Y+ ++  D   G  A+ V
Sbjct: 210 GYDVSFEKDKHFGASAYSV 228



 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   F+VYADF  YKSGVYQH  G  +G HA+R+LGWG EN  PYWLVANSWN
Sbjct: 238 EIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGTENGTPYWLVANSWN 297

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WG  G FKI+RG+++  IE
Sbjct: 298 EDWGAMGYFKIIRGKDDCGIE 318


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 73/166 (43%), Positives = 99/166 (59%), Gaps = 5/166 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGS WA     A+SDR+CI +NG  
Sbjct: 53  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRV 112

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 113 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 172

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HV G    CT  G+  TP+C + C    Y ++Y+ D   G  ++ V
Sbjct: 173 HVNGARPPCT--GEGDTPKCNKMC-EAGYSTSYKEDKHYGYTSYSV 215



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 223 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 282

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 283 WNADWGDNGFFKILRGENHCGIE 305


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 95/167 (56%), Gaps = 6/167 (3%)

Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIAS---NG 234
           T+     + LP NFDARE WP+CP++R I DQ +CGSCWA     AISDR CI S     
Sbjct: 109 TLDVSALRVLPENFDAREHWPDCPTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKP 168

Query: 235 YFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE 293
                ++A  +++C   C  GCNGG+P  AW +W H G+VTGG+Y+S EGC PY +  C+
Sbjct: 169 RVIAHLAADDVLSCCTECGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPYPIKACD 228

Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           HHV G L  C       TP C + C    Y+  +  D   G+ A+ V
Sbjct: 229 HHVNGTLGPCDKTIP-PTPRCVRMC-RKGYDVDFMDDKHYGRHAYSV 273



 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 56/99 (56%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ VP      Q  I  +GP+ A F+VY DFL YKSGVYQ +   ++G HA+R+L
Sbjct: 265 HYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLL 324

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN +PYWL ANSWN  WGD G FKILRG +E  IE
Sbjct: 325 GWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 363


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 108/184 (58%), Gaps = 7/184 (3%)

Query: 146 TFKILRGENEADIEMGFNNRVEANSSEDDDL-ETMGCQNAKGLPRNFDAREKWPECPSLR 204
           T  +  G N  +++M +  ++         L E     +   LP +FD+R++WP CP++ 
Sbjct: 28  TLVVRAGHNFHNVDMSYLKKLCGTYLHGPKLPERFAFADDVELPDSFDSRKQWPSCPTIN 87

Query: 205 HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQL 262
            I DQ +CGSCWA     AISDR+C+ +NG    +ISA+ +++C    C  GCNGG+P  
Sbjct: 88  EIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCGFECGMGCNGGYPSG 147

Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---Y 319
           AW++W   G+V+GG Y+S  GC+PY++ PCEHH  G    C+  G  +TPEC + C   Y
Sbjct: 148 AWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPPCSGEGG-ETPECVKKCEDGY 206

Query: 320 NPSY 323
            P+Y
Sbjct: 207 TPAY 210



 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 57/114 (50%), Positives = 75/114 (65%), Gaps = 2/114 (1%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
           KK +  Y P      HY   ++ +PR     M +IY++GP+   F VY+DFL YKSGVYQ
Sbjct: 200 KKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQ 259

Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           H  G+ +G HA+R+LGWGV+N  PYWL ANSWN  WG+ G F+ILRG++   IE
Sbjct: 260 HVSGEEVGGHAIRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIE 313


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 74/159 (46%), Positives = 99/159 (62%), Gaps = 11/159 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR KWP C S++HI DQ+NCGSCWAVS A+ +SDR+CIAS       IS+   V
Sbjct: 94  IPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFV 153

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC GGWP  A+ ++ + GVVTGGDY S+ GC+PY   PC HH      N T 
Sbjct: 154 SCCDSCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHH-----GNETY 208

Query: 306 LGKL----KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G+      TPEC + C    Y+++YR D   G+  + V
Sbjct: 209 YGECPKEESTPECVKQC-QKGYKNSYRRDKTWGEDYYEV 246



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 59/82 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I   GP+V+ F+VY DF  Y  G+Y+H  G + G HA++++GWG E ++PYW++ANSW
Sbjct: 255 REIMRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGTEKNVPYWIIANSW 314

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           ++ WG+ G F+++RG N   IE
Sbjct: 315 HNDWGEKGFFRMVRGTNHCGIE 336


>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 236

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 68/142 (47%), Positives = 94/142 (66%), Gaps = 6/142 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+R +WP CP++  I DQ +CGSCWA     AISDR+C+ +N   + ++SA+ ++
Sbjct: 80  LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AWR+W   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSY 323
             G   TP C ++C   Y+PSY
Sbjct: 200 GEGG-STPRCSRHCEPGYSPSY 220


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 92/155 (59%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY ++PCEHHV G    C  
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCA- 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C   C + SY   Y  D   G K++ V
Sbjct: 206 -NGSGTPKCSHVCQS-SYTVDYAKDKHFGSKSYSV 238



 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 65/101 (64%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R       +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRIL 289

Query: 121 GWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  +  IPYWL+ NSWN  WGDHG F+ILRG++   IE
Sbjct: 290 GWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 72/149 (48%), Positives = 99/149 (66%), Gaps = 2/149 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR KW  CPS+  + DQ NC S +A+SVA+A+SDR+CI SNG    ++SAQ I+
Sbjct: 53  LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQIL 112

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG    +W F+  +G+V+GG+Y S EGCQPYT+ PC+ H +  ++N   
Sbjct: 113 SCCYLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQ-HTETAVENACS 171

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKG 334
              L TPECK  CYNP Y + Y  D  +G
Sbjct: 172 NKTLFTPECKVQCYNPDYGTRYVKDNHQG 200



 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 65/91 (71%)

Query: 69  HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
           + VP   AM++IYE+GP+ A F +Y DF+ Y+SGVY +N G  +   AV++LGWG EN  
Sbjct: 203 YRVPAYTAMKEIYENGPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEENGT 262

Query: 129 PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           PYWL ANS+N +WGD+G  KILRG NE  IE
Sbjct: 263 PYWLAANSFNTYWGDNGFVKILRGANECYIE 293


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 74/176 (42%), Positives = 102/176 (57%), Gaps = 24/176 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+RE+WP CP+L+ I DQ +CGSCWA   + A+SDR+CI SN   + ++SAQ ++
Sbjct: 79  LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQ---------------------EGC 284
            C  +C  GCNGG+P  AW FW  +G+V+GG Y+S                       GC
Sbjct: 139 TCCNSCGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGC 198

Query: 285 QPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +PYT+ PCEHHV G   +C+  G   TPEC   C    Y  +Y+ D   GK ++ V
Sbjct: 199 RPYTIPPCEHHVNGSRPSCSGEGG-DTPECIFRC-EAGYSPSYKQDKHFGKTSYSV 252



 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 48/82 (58%), Positives = 63/82 (76%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++IY++GP+   F+VY DF+ YKSGVYQH  G ++G HA+++LGWG EN +PYWL ANSW
Sbjct: 261 QEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWGEENGVPYWLCANSW 320

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FKILRG +   IE
Sbjct: 321 NTDWGDNGFFKILRGADHCGIE 342


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 109/184 (59%), Gaps = 7/184 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           G N A+ ++ +  R+     +   L+   G  +   LP +FD+R  WP CP++R I DQ 
Sbjct: 44  GHNFANADLHYVKRLCGTLLKGPQLQKRFGFADGLELPDSFDSRAAWPNCPTIREIRDQG 103

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN--CWGCNGGWPQLAWRFWG 268
           +CGSCWA     AISDR+C+ +NG    ++SA+ +++C  +    GCNGG+P  AW+FW 
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMGCNGGYPSGAWQFWT 163

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
             G+V+GG Y+S  GC+PY++ PCEHHV G    C    +  TP+C + C   Y+P+Y +
Sbjct: 164 ETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACKGE-EGDTPKCVKQCEEGYSPAYGT 222

Query: 326 TYRF 329
              F
Sbjct: 223 DKHF 226



 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/125 (47%), Positives = 78/125 (62%), Gaps = 2/125 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
           K ++    K  K+ +  Y P      H+   ++ VP      M +IY++GP+   F VYA
Sbjct: 199 KGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYA 258

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DF  YKSGVYQH  G+ +G HA+++LGWGVEN  PYWL ANSWN  WGD+G FKILRG++
Sbjct: 259 DFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 155 EADIE 159
              IE
Sbjct: 319 HCGIE 323


>gi|60600065|gb|AAX26576.1| unknown [Schistosoma japonicum]
          Length = 190

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 71/165 (43%), Positives = 99/165 (60%), Gaps = 3/165 (1%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           R +  +DI        + N  + + L T        LP++FDAR++W  CPS+  I DQS
Sbjct: 28  RFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQS 87

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGH 269
           +CGSCWA     A+SDR+CI S G +   +SA+++V+C  +C  GCNGG+P  AW +W +
Sbjct: 88  SCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSSCGMGCNGGFPHSAWLYWKN 147

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPEC 314
            G+VTG  YN+  GCQPY   PCEH+  GPL  C   G ++TP C
Sbjct: 148 QGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLPVCD--GDVETPPC 190


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE WP+C S+R+I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 106 IPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 165

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG +Y +  GC+PY   PCEHH +    +   
Sbjct: 166 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCP 225

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C     + TY  D   G  A+ V
Sbjct: 226 HDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGV 260



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 56/85 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+E+ IPYW  ANSW
Sbjct: 269 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSW 328

Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
           N  WG+ G F+ILRG +E  IE G 
Sbjct: 329 NTDWGEDGFFRILRGVDECGIESGV 353


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 69/150 (46%), Positives = 88/150 (58%), Gaps = 8/150 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++WP CP++  I DQ  CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  IPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG+P  AW +W   G+V+GG Y S +GC+PY +APCEHHV G    C  
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEHHVNGTRPPCGH 206

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
            G   TP+C   C     ES Y  D  K K
Sbjct: 207 GG--GTPKCSHVC-----ESGYTVDYAKDK 229



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  K++ V R   +   +I  +GP+   F+VY D + YK GVYQH  G  +G HA+R+L
Sbjct: 230 HFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRIL 289

Query: 121 GWGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV  E  IPYWL+ NSWN  WGD+G F+ILRG++   IE
Sbjct: 290 GWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQDHCGIE 330


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 95/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAREKW +C SL++I DQS+CGSCWA     A+SDR+CIASNG     +SA  ++
Sbjct: 121 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 180

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC+GG P  AW++W   G+VTG ++  ++GC+PY   PCEHH          
Sbjct: 181 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 240

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C +   E TY  D   G+ A+ V
Sbjct: 241 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGV 275



 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 50/113 (44%), Positives = 66/113 (58%), Gaps = 14/113 (12%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+   F VY DFL Y  G+Y H  G   G HAV++LGWGVE  +PYWLVANSW
Sbjct: 284 KEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSW 343

Query: 138 NDHWGDHGTFKILRGENEADIEMG--------------FNNRVEANSSEDDDL 176
           N  WG+ G F+I+RG +E  IE                ++ R   ++ EDDD+
Sbjct: 344 NTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKKYHRRYRLDNDEDDDI 396


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 98/156 (62%), Gaps = 4/156 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+R +WP CP++R I DQ +CGSCWA     ++SDR+C+ S G    ++SA+ ++
Sbjct: 13  LPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLL 72

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AW++W   G+V+GG Y S  GC+PYT+ PCEHHV G   +C+
Sbjct: 73  SCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEHHVNGSRPSCS 132

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP+C Q C +  Y   Y  D   G+ A+ V
Sbjct: 133 GEGG-DTPKCVQKC-DSGYTPAYEKDKIYGQSAYSV 166



 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/66 (50%), Positives = 49/66 (74%), Gaps = 2/66 (3%)

Query: 64  YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y + A+ VP    + M +IY+ GP+   F+VY DFL YKSGVYQH+ G+++G HA+++LG
Sbjct: 159 YGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILG 218

Query: 122 WGVEND 127
           WG+EN+
Sbjct: 219 WGIENN 224


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 108/195 (55%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEHH +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQKC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G++++ V+
Sbjct: 229 YEQDKHYGEESYNVI 243



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 31/43 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           ++I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++
Sbjct: 251 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 92/156 (58%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +WP CPS+  I DQS+CGSCWAVS A  ISDR+CIASNG     ISA  I 
Sbjct: 97  IPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLSISADDIN 156

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC       GCNGG+P  AWR +   G VTGG Y  + GC+PY   PCEHHV G      
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPC 216

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                 T +C+++C    Y  TY  DL  G+ A+ V
Sbjct: 217 PSNMYPTDKCERSC-QAGYALTYTQDLHFGQSAYAV 251



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/102 (50%), Positives = 67/102 (65%), Gaps = 2/102 (1%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ + A+ V +      ++I  HGP+   FSVY DF  Y  GVY H  G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKML 302

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           GWGV+N  PYWL ANSWN+ WG++G F+I+RG NE  IE G 
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGV 344


>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
 gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
          Length = 207

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 97/156 (62%), Gaps = 4/156 (2%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
           G N  +++  +  R+     +   L TM  Q A    LP  FD R++WP CP+L+ I DQ
Sbjct: 43  GPNFHNVDYSYVKRLCGTLLKGPKLPTM-VQYAGDVELPDTFDPRQQWPNCPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI SN   + +IS++ +++C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
             G+VTGG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 162 TEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 197


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 107/195 (54%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEHH +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G +++ V+
Sbjct: 229 YEQDKHYGDESYNVI 243


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 94/155 (60%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KWP CPS+  I DQS+C SCWAVS A+AI+DR+CI SNG    ++SA  IV
Sbjct: 63  LPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 122

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG P ++W +W   GVVTGG   +  GC PY    C H V  P      
Sbjct: 123 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 182

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C+   Y  TY  D  KGK ++ V
Sbjct: 183 RDIYPTPKCEKKCHA-GYNKTYEQDKVKGKSSYNV 216



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 41/57 (71%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           + M +I ++GP+  IF ++ DFL YKSG+Y +  G  +G HA+RV+GWGVEN + YW
Sbjct: 222 DIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVNYW 278


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 95/156 (60%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+RE WP+CPS+  + DQ +CGSCWA   + AISDR CI SN  FT  +S++ ++
Sbjct: 95  LPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDLL 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C       GCNGG+PQ AW +W  NG+V+GG Y+   GCQPY + PCEHH +G    CT
Sbjct: 155 SCCGYVCGNGCNGGFPQAAWEYWVQNGLVSGGLYHGT-GCQPYAIEPCEHHTEGDRPPCT 213

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              +  TP+C   C +  Y   +  D   G  A+ +
Sbjct: 214 GE-EGTTPKCSHKCVD-GYTGNFAQDKHYGSVAYRI 247



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 68/111 (61%), Gaps = 2/111 (1%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ +P      M +IY++GP+   F VY DF  YKSGVY H+ G ++G HA+RVL
Sbjct: 239 HYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVL 298

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS 171
           GWG EN   YWL  NSWN  WG++G FKI RG NE  IE      + A+ S
Sbjct: 299 GWGEENGEKYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIPASES 349


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/194 (39%), Positives = 107/194 (55%), Gaps = 13/194 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM  N R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGFPGVAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEHH +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMV 340
           Y  D   G +++ V
Sbjct: 229 YEQDKHYGDESYNV 242



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 RDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+++RG +E  IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 67/136 (49%), Positives = 84/136 (61%), Gaps = 2/136 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR+ WP C S+  I DQS+CGSCWA     A+SDRLCI SNG F   +SA  ++
Sbjct: 86  LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC GG+P +AW +W  +G+VTGG      GC+ Y    CEHHVQG    C  
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCP- 204

Query: 306 LGKLKTPECKQNCYNP 321
                TPEC Q C  P
Sbjct: 205 RELYPTPECVQQCDTP 220



 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 87/182 (47%), Gaps = 26/182 (14%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M++I   GP+ AIF++Y DFL+Y SGVY H  G  +  HAVR+LGWG   ++PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIA 302

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMG----CQNAKGLPRN 190
           NSWN+ WG+ G  K LRG NE  I             EDD    +G    C   K +P  
Sbjct: 303 NSWNEDWGEEGYMKFLRGYNECGI-------------EDDVTAVLGNAWSCPAIKVVPSK 349

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           F          SL      + CG           SDR   +   +    +S   +V+  P
Sbjct: 350 FI---------SLMKELYANACGCFRVYYTLLLYSDRYGSSYKIWLPTDVSGDLVVSFYP 400

Query: 251 NC 252
           +C
Sbjct: 401 DC 402


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/148 (48%), Positives = 91/148 (61%), Gaps = 7/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 80  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G+  T  C ++C   Y+PSY+    F
Sbjct: 200 --GEGDTHRCNKSCEAGYSPSYKEDKHF 225



 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWGVEN +PYWL ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 67/136 (49%), Positives = 85/136 (62%), Gaps = 2/136 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR KWP C S+  I DQS+CGSCWA     A+SDRLCI SNG F   +SA  ++
Sbjct: 86  LPKNFDARSKWPHCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLL 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC GG+P +AW +W  +G+VTGG      GC+ Y    C+HHVQG    C  
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQGHYPPCPR 205

Query: 306 LGKLKTPECKQNCYNP 321
                TPEC Q+C  P
Sbjct: 206 Q-IYPTPECVQDCDTP 220



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 40/71 (56%), Positives = 54/71 (76%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M++I   GP+ A+F+VY DFLQYKS VY H +G  +  HA+R+LGWG E D+PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEEGDVPYWLIA 302

Query: 135 NSWNDHWGDHG 145
           NSWN+ WG+ G
Sbjct: 303 NSWNEDWGEKG 313


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 95/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAREKW +C SL++I DQS+CGSCWA     A+SDR+CIASNG     +SA  ++
Sbjct: 80  IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 139

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC+GG P  AW++W   G+VTG ++  ++GC+PY   PCEHH          
Sbjct: 140 SCCKSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCK 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C +   E TY  D   G+ A+ V
Sbjct: 200 HDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGV 234



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 56/82 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+   F VY DFL Y  G+Y H  G   G HAV++LGWGVE  +PYWLVANSW
Sbjct: 243 KEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSW 302

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG+ G F+I+RG +E  IE
Sbjct: 303 NTDWGEDGFFRIIRGIDECGIE 324


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 102/171 (59%), Gaps = 5/171 (2%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D + +  + +  +P +FDARE+WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70  DEDIVATEVSDAIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129

Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
                +S++ +++C    +    GC GG+P  AW++WG +G+VTGG Y SQ GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIA 189

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           PC   V G            TP+C   C  N +Y + Y  D   G  A+ V
Sbjct: 190 PCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAV 240



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   F+VY DF QY +GVY H  G S+G HAV++LGWGV+N  PYWLVANSWN
Sbjct: 250 EILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
            +WG+ G F+I+RG NE  IE
Sbjct: 310 INWGEKGYFRIIRGLNECGIE 330


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 100/164 (60%), Gaps = 5/164 (3%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           + A  +P ++D R+ WP+C S+ +I DQS+CGSCWAV+ A AISDR CIASNG     +S
Sbjct: 68  ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127

Query: 242 AQHIVACTP---NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
           A+ I+ C     NC  GC GG+P  AWR+W  NG+VTGG + SQ GC+PY++APC   + 
Sbjct: 128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETID 187

Query: 298 GPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           G       +    TP+C+ +C  N SY   Y  D   G  A+ +
Sbjct: 188 GVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAI 231



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ + R       +I  HGP+   F VY DF  YK+G+Y H  G  +G HAV++L
Sbjct: 223 HFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKML 282

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV+N  PYWL ANSWN  WG+ G F+ILRG +E  IE
Sbjct: 283 GWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIE 321


>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
 gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
          Length = 205

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 97/156 (62%), Gaps = 4/156 (2%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG--LPRNFDAREKWPECPSLRHIADQ 209
           G N  +++  +  R+     +   L TM  Q A    LP  FD R++WP CP+L+ I DQ
Sbjct: 43  GPNFHNVDYSYVKRLCGTLLKGPKLPTM-VQYAGDVELPDTFDPRQQWPNCPTLKEIRDQ 101

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWG 268
            +CGSCWA   A AISDR+CI SN   + +IS++ +++C  +C  GCNGG+P  AW FW 
Sbjct: 102 GSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDSCGMGCNGGYPSAAWDFWT 161

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
             G+VTGG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 162 TEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 197


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 91/156 (58%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +WP CPS+  I DQS+CGSCWAVS A  ISDR+CIASNG     ISA  I 
Sbjct: 97  VPDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISADDIN 156

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC       GCNGG+P  AWR +   G VTGG Y  + GC+PY   PCEHHV G      
Sbjct: 157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHYKPC 216

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                 T +C+ +C    Y  TY  DL  G+ A+ V
Sbjct: 217 PSNMYPTDKCEHSC-QAGYPLTYTQDLHFGQSAYAV 251



 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 50/102 (49%), Positives = 67/102 (65%), Gaps = 2/102 (1%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ + A+ V +      ++I  HGP+   F+VY DF  Y  GVY H  G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKML 302

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           GWGV+N  PYWL ANSWN+ WG++G F+I+RG NE  IE G 
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGV 344


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 101/171 (59%), Gaps = 5/171 (2%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D + +  + +  +P  FDARE+WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70  DEDIVATEVSDAIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129

Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
                +S++ +++C    +    GC GG+P  AW++WG +G+VTGG Y SQ GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIA 189

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           PC   V G            TP+C   C  N +Y + Y  D   G  A+ V
Sbjct: 190 PCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAV 240



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   F+VY DF QY +GVY H  G S+G HAV++LGWGV+N  PYWLVANSWN
Sbjct: 250 EILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
            +WG+ G F+I+RG NE  IE
Sbjct: 310 INWGEKGYFRIIRGLNECGIE 330


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 97/156 (62%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R+ W +C S+ +I DQS+CGSCWAV+ A  ISDRLCIASNG     +SA+ ++
Sbjct: 78  IPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLL 137

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC+GG+P  AWR+W   G+V+GG Y SQ GC+PY++APC   V G       
Sbjct: 138 SCCTSCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCP 197

Query: 306 LGKLKTPECKQNCYN-PSYESTYRFDLKKGKKAHMV 340
             +  TPEC  +C +  SY   Y  D   G  A+ V
Sbjct: 198 AQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPV 233



 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/99 (56%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V R  A  Q  I +HGP+ A F VY+DF +YKSG+Y H  G  +G HAV++L
Sbjct: 225 HYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKIL 284

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWLVANSWN +WG+ G F+ILRG NE  IE
Sbjct: 285 GWGVENGTKYWLVANSWNINWGEKGYFRILRGRNECGIE 323


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 72/163 (44%), Positives = 100/163 (61%), Gaps = 9/163 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD RE +P+C SL+ + DQSNCGSCWA     AISDR+CIAS      +IS+++++
Sbjct: 86  LPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLL 145

Query: 247 AC---TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY-----NSQEGCQPYTLAPCEHHVQ 297
           +C   T  C  GCNGG+   AW ++   G+V+G  Y     NS+  CQPY+  PC HHVQ
Sbjct: 146 SCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQ 205

Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           G  Q CT L +  TP+C   C +   +++Y  DL KG  ++ V
Sbjct: 206 GEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSV 248



 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 62/84 (73%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY++G   A F+VY+DFL Y SGVYQ+  G  +G HA+++LGWGVEN  PYWL ANSWN
Sbjct: 258 EIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGVENGTPYWLCANSWN 317

Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
             WG++G FKILRG NE  IE G 
Sbjct: 318 SSWGENGFFKILRGSNECGIESGM 341


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+ WPEC SLR+I DQS+CGSCWAV+   A+SDR+CI S G     +SA  ++
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 180

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG P  AW++W  +G+VTG DY +  GC+PY   PCEHH          
Sbjct: 181 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 240

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C + C + +Y  +Y+ D   G++A+ V
Sbjct: 241 HDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNV 274



 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 58/88 (65%), Gaps = 3/88 (3%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I   GP+ A F VY DFL Y SG+Y+H  G   G HAV++LGWG++  + YWL ANSW
Sbjct: 283 KEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 342

Query: 138 NDHWGD---HGTFKILRGENEADIEMGF 162
           N+ WG+    G F+ILRG +E  IE G 
Sbjct: 343 NNDWGEDVFSGYFRILRGADECGIESGI 370


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 66/147 (44%), Positives = 95/147 (64%), Gaps = 6/147 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR++WP C ++  I DQ +CGSCWA     A+SDRLCI SNG     +SA++++
Sbjct: 82  LPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNGKLQVHLSAENLL 141

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG P+ AW +W   G+V+GG+Y S++GCQPY++APCEH + G    C  
Sbjct: 142 SCCDSCGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSSPACG- 200

Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
            G   TP+CK+ C   Y+  Y+  + +
Sbjct: 201 -GVTDTPKCKKQCEKGYSIPYDKAFYY 226



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 51/109 (46%), Positives = 72/109 (66%), Gaps = 7/109 (6%)

Query: 58  SIPLS---HYFKKAHMVPRCNAMR---QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           SIP     +Y +  + +P  +A +   +I ++GP+VA F VY D   YK GVYQH  G+ 
Sbjct: 217 SIPYDKAFYYGQPGYAIPN-DAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEF 275

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           +G H +++ GWG+EN  PYWLVANSWN  WG++G FKI RG++E  IE+
Sbjct: 276 LGGHVIKIFGWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEI 324


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 67/136 (49%), Positives = 84/136 (61%), Gaps = 2/136 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR+ WP C S+  I DQS+CGSCWA     A+SDRLCI SNG F   +SA  ++
Sbjct: 86  LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC GG+P +AW +W  +G+VTGG      GC+ Y    CEHHVQG    C  
Sbjct: 146 SCCKDCGFGCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQGHYPPCP- 204

Query: 306 LGKLKTPECKQNCYNP 321
                TPEC Q C  P
Sbjct: 205 RELYPTPECVQQCDTP 220



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 60/85 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M++I   GP+ AIF++Y DFL+Y SGVY H  G  +  HAVR+LGWG   ++PYWL+A
Sbjct: 243 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIA 302

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN+ WG+ G  K LRG NE  IE
Sbjct: 303 NSWNEDWGEEGYMKFLRGYNECGIE 327


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/184 (39%), Positives = 106/184 (57%), Gaps = 7/184 (3%)

Query: 152 GENEADIEMGFNNRVEANSSEDDDLE-TMGCQNAKGLPRNFDAREKWPECPSLRHIADQS 210
           G N A+ ++ +  R+         L+   G  +   LP +FD+R  WP CP++R + DQ 
Sbjct: 44  GHNFANADLHYVKRLCGTHLNGPQLQKRFGFADGMELPDSFDSRAAWPNCPTIREVRDQG 103

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGCNGGWPQLAWRFWG 268
           +CGSCWA     AISDR+C+ +NG    ++SA+ +++C    C  GCNGG+P  AW+FW 
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMGCNGGYPSGAWKFWT 163

Query: 269 HNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
             G+V+GG Y+S  GC+PY++ PCEHHV G    C    +  TP+C + C   Y P Y S
Sbjct: 164 ETGLVSGGLYDSHLGCRPYSIPPCEHHVNGSRPACKGE-EGDTPKCVKQCEDGYAPVYGS 222

Query: 326 TYRF 329
              F
Sbjct: 223 DKHF 226



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/125 (47%), Positives = 78/125 (62%), Gaps = 2/125 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYA 94
           K ++    K  K+ +  Y P      H+   ++ VP      M +IY++GP+   F VYA
Sbjct: 199 KGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYA 258

Query: 95  DFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           DF  YKSGVYQH  G+ +G HA+++LGWGVEN  PYWL ANSWN  WGD+G FKILRG++
Sbjct: 259 DFPMYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 155 EADIE 159
              IE
Sbjct: 319 HCGIE 323


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/141 (50%), Positives = 93/141 (65%), Gaps = 5/141 (3%)

Query: 182 QNAKG--LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
           +N KG  +P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG     
Sbjct: 83  KNDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVH 142

Query: 240 ISAQHIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
           +SA  I++C  N C +GCNGGWP  A+ ++   G VTGGDY +  GC+PY   PC HH +
Sbjct: 143 VSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGK 202

Query: 298 GPLQNCTLLGKLKTPECKQNC 318
                     +  TP+C + C
Sbjct: 203 DTYYG-ECPNEATTPKCVRKC 222



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 2/96 (2%)

Query: 66  KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           K A+ VP       R+I ++GP+V  F+VY DF  YK G+Y+H  G + G HA++++GWG
Sbjct: 238 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 297

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            E  +PYWL+ANSW++ WG++G F+ILRG N   IE
Sbjct: 298 KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 333


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 67/146 (45%), Positives = 92/146 (63%), Gaps = 5/146 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR+ WP+CP++  I DQ  CGSCWA     A+SDR+CI S G    +ISA  ++
Sbjct: 84  LPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLL 143

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C      +GCNGG P+ AWR+W  +G+V+GG Y S  GC+PY + PCEHH  G   +C 
Sbjct: 144 SCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEHHTSGNRPDCK 203

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
             G  KTP+C++ C   S++  Y+ D
Sbjct: 204 --GNSKTPKCQRQCVE-SFDGKYQAD 226



 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 54/92 (58%), Positives = 63/92 (68%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M +I  +GP+ A F VYADFL YKSGVYQH  G  +G HAV++LGWG EN +PYWL A
Sbjct: 242 DIMNEILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCA 301

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           NSWN  WGD G FKILRG N   IE   N  +
Sbjct: 302 NSWNTDWGDGGFFKILRGYNHCKIEADINAGI 333


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/157 (47%), Positives = 91/157 (57%), Gaps = 3/157 (1%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
            +P +FD+R  WP CPS+  I DQS+CGSCWAVS A  ISDR+CIASN      ISA  I
Sbjct: 96  AVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDI 155

Query: 246 VACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
            AC       GCNGG+P  AWR +   G VTGG Y  + GC+PY   PCEHHV G     
Sbjct: 156 NACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKP 215

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                  T +C+++C    Y  TY+ DL  G+ A+ V
Sbjct: 216 CPSNMYPTDKCERSC-QAGYALTYQQDLHFGQSAYAV 251



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/95 (50%), Positives = 65/95 (68%), Gaps = 2/95 (2%)

Query: 63  HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ + A+ V +  A   ++I  HGP+   F+VY DF  Y  GVY H  G S+G HAV++L
Sbjct: 243 HFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKML 302

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           GWGV+N  PYWL ANSWN+ WG++G F+I+RG NE
Sbjct: 303 GWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNE 337


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 96/156 (61%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +PR FDAR+ +  C   +  + DQ NC S WAV+VA+  SDRLCIASNG FT  +SAQ++
Sbjct: 26  IPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQFTDNLSAQNL 85

Query: 246 VAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C      GC+GG    AW      G+VTGG+++S EGCQPY + PC H+  G L+NC+
Sbjct: 86  LSCGDEEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYGNGNLKNCS 145

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 146 SLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMT 181



 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I  +GP+ A   VY +F+ YK G+Y+   G+ IG H V+++GWGV+ D   YWL  NS
Sbjct: 191 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWLAMNS 250

Query: 137 WNDHWGDHG 145
           WN +WG +G
Sbjct: 251 WNSNWGTNG 259


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 96/155 (61%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+ WPEC SLR+I DQS+CGSCWAV+   A+SDR+CI S G     +SA  ++
Sbjct: 77  IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 136

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG P  AW++W  +G+VTG DY +  GC+PY   PCEHH          
Sbjct: 137 SCCKTCGFGCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEPCK 196

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C + C + +Y  +Y+ D   G++A+ V
Sbjct: 197 HDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNV 230



 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 41/78 (52%), Positives = 55/78 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I   GP+ A F VY DFL Y SG+Y+H  G   G HAV++LGWG++  + YWL ANSW
Sbjct: 239 KEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 298

Query: 138 NDHWGDHGTFKILRGENE 155
           N+ WG+ G F+ILRG +E
Sbjct: 299 NNDWGEDGYFRILRGADE 316


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 71/148 (47%), Positives = 91/148 (61%), Gaps = 7/148 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 80  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 139

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG Y+S  GC PYT+ PCEHHV G    CT
Sbjct: 140 TCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNC---YNPSYESTYRF 329
             G+  T  C ++C   Y+PSY+    F
Sbjct: 200 --GEGDTRRCNKSCEAGYSPSYKEDKHF 225



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 51/83 (61%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+L WGVEN +PYW  ANS
Sbjct: 240 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAANS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 300 WNLDWGDNGFFKILRGENHCGIE 322


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 106/195 (54%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P +AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGVAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEHH +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACG-TKIYKTPQCKQKC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKNYGDQRYNVI 243



 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 62/82 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+++RG +E  IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 67/134 (50%), Positives = 87/134 (64%), Gaps = 5/134 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD  EKWPECPSL+ I DQS CGSCWA   A A +DRLCIAS G    ++S Q ++
Sbjct: 69  LPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLL 128

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGGWP +AW ++   GV TGG+Y S++ C  Y    C+HHV+G    C  
Sbjct: 129 TCCESCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDHHVEGKYPPC-- 186

Query: 306 LGKLK-TPECKQNC 318
            G+ + TPEC + C
Sbjct: 187 -GETQPTPECVEKC 199



 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 72/99 (72%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+F +A+ VP    A++ ++  +GP+   FSVY DF+ YKSG+YQH  G  +G HAV+++
Sbjct: 212 HFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLV 271

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVE+ + YW +ANSWN+ WG++G F+I+ G+NE  IE
Sbjct: 272 GWGVEDGVEYWKIANSWNEDWGENGYFRIIAGKNECGIE 310


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE WP+C S++ I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 120 IPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLL 179

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG ++ +  GC+PY   PCEHH +    +   
Sbjct: 180 SCCRSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCP 239

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C     + TY  D   G  A+ V
Sbjct: 240 HDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGV 274



 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 46/84 (54%), Positives = 57/84 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+E+ IPYW VANSW
Sbjct: 283 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSW 342

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 343 NTDWGEDGFFRILRGVDECGIESG 366


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 73/157 (46%), Positives = 93/157 (59%), Gaps = 10/157 (6%)

Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
           +  + LP +FDAR  +P C   +RHI DQS+CGSCWA  V  A +DRLCI SNG FT  +
Sbjct: 137 EELQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELL 196

Query: 241 SAQHIVACTPNCWGCNGGWPQLAWRFWGHN-GVVTGGDYNSQ------EGCQPYTLAPCE 293
           SA  + AC P+ +GC+GG P LAW  W HN G+ TGGDY ++      +GC PY   PC 
Sbjct: 197 SAGEMNACAPS-FGCDGGIPSLAWS-WVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCA 254

Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           HHV             +TP C + C+NP Y +T R D
Sbjct: 255 HHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDD 291



 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 67/125 (53%), Gaps = 7/125 (5%)

Query: 43  KKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNA-MRQIYEHGPLV------AIFSVYAD 95
           K     +  R +L  S+P  +    A    R +  +  IY   P V      A F VY D
Sbjct: 283 KYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVYED 342

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           FL Y+SGVY+H  G  +G HAV+++GWG E    YWLV NSWN+ WGD+G FKI  G  E
Sbjct: 343 FLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE 402

Query: 156 ADIEM 160
            D ++
Sbjct: 403 IDDDL 407


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+ WP+C S++ I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG +Y +  GC+PY   PCEHH +    +   
Sbjct: 165 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 224

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C +   + TY  D   G  A+ V
Sbjct: 225 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 259



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 57/84 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+++ IPYW VANSW
Sbjct: 268 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 327

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 328 NTDWGEDGFFRILRGVDECGIESG 351


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+ WP+C S++ I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 104 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 163

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG +Y +  GC+PY   PCEHH +    +   
Sbjct: 164 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 223

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C +   + TY  D   G  A+ V
Sbjct: 224 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 258



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 57/84 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+++ IPYW VANSW
Sbjct: 267 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 326

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 327 NTDWGEDGFFRILRGVDECGIESG 350


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 97/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR +WP CPS+ +I DQS CGSCWA   A A+SDR+CIAS+G  T ++SA  I+
Sbjct: 94  IPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDIL 153

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ-NCT 304
           +C  +C  GC+GG+P  AW ++   GVVTGG Y +++ C+PY + PC HH       NCT
Sbjct: 154 SCCYDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCT 213

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            +    TP+C   C    Y  +Y  D   GK ++ +
Sbjct: 214 QIA--DTPDCVTTC-QAGYPISYDDDKTFGKDSYTI 246



 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 55/82 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A F VY DF  Y  G+Y+H  G   G HAVR+LGWG E    YWLVANSW
Sbjct: 255 KEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEEKGTAYWLVANSW 314

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG++G F+ILRG NE  IE
Sbjct: 315 NTDWGENGYFRILRGSNECGIE 336


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+ WP+C S++ I DQS+CGSCWA     A+SDR+CIAS+G     +SA  ++
Sbjct: 95  IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 154

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG P  AWR+W  +G+VTG +Y +  GC+PY   PCEHH +    +   
Sbjct: 155 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCP 214

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C +   + TY  D   G  A+ V
Sbjct: 215 HDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 249



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 57/84 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  HGPL   F VY DFL Y  GVY H  G   G HAV+++GWG+++ IPYW VANSW
Sbjct: 258 KELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSW 317

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 318 NTDWGEDGFFRILRGVDECGIESG 341


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 70/145 (48%), Positives = 86/145 (59%), Gaps = 4/145 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE W  C S+  I DQS CGSC A     A+SDR+CI + G     ISAQ ++
Sbjct: 25  LPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNISAQDLL 84

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GC GG+P  AW ++   G+VTGG Y + +GCQPY   PCEHH +GPL NCT 
Sbjct: 85  TCCHQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEHHTKGPLPNCT- 143

Query: 306 LGKLKTPECKQNCYNPSYESTYRFD 330
                TP+C Q C    YE +Y  D
Sbjct: 144 -DTKPTPKCLQVC-RKGYEKSYSED 166



 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 46/89 (51%), Positives = 56/89 (62%), Gaps = 4/89 (4%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQ-HNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +IY++GP+ A FSVY DFL YKSGVYQ H++      H  + LGW ++     WLVANSW
Sbjct: 186 EIYKNGPVEADFSVYTDFLAYKSGVYQRHSYELWEARH--QNLGWALKRR-SVWLVANSW 242

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRV 166
           N  WGD G FKI RG NE  IE   N  +
Sbjct: 243 NQDWGDKGYFKIRRGNNECGIENDINAGI 271


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 93/155 (60%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW  CPS+  I DQS+C SCWAVS A+AI+DR+CI SNG    ++SA  IV
Sbjct: 86  LPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG P ++W +W   GVVTGG   +  GC PY    C H V  P      
Sbjct: 146 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C+   Y  TY  D  KGK ++ V
Sbjct: 206 RDIYPTPKCEKKCH-AGYNKTYEQDKVKGKSSYNV 239



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 62/87 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I ++GP+  IF ++ DFL YKSG+Y +  G  +G HA+RV+GWGVEN + YWL+ANS
Sbjct: 247 MMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN+ WG+ G F++ RG NE  IE   N
Sbjct: 307 WNEGWGEKGYFRMRRGNNECGIEARIN 333


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 93/155 (60%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW  CPS+  I DQS+C SCWAVS A+AI+DR+CI SNG    ++SA  IV
Sbjct: 86  LPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG P ++W +W   GVVTGG   +  GC PY    C H V  P      
Sbjct: 146 SCCAYCGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCP 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C+   Y  TY  D  KGK ++ V
Sbjct: 206 RDIYPTPKCEKKCH-AGYNKTYEQDKVKGKSSYNV 239



 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 63/89 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M +I ++GP+  IF ++ DFL YKSG+Y +  G  +G HA+RV+GWGVEN + YWL+A
Sbjct: 245 DIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIA 304

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFN 163
           NSWN+ WG+ G F++ RG NE  IE   N
Sbjct: 305 NSWNEGWGEKGYFRMRRGNNECGIEARIN 333


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 6/157 (3%)

Query: 187 LPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  FD+RE+W   CPS + I DQ+ CGSCWA     +++DR+CIAS G     ISAQ +
Sbjct: 88  LPTAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDL 147

Query: 246 VACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           + C    C  GC+GG+P  AW ++   G+VTGG+YNS +GCQPY+L  C+HHV G    C
Sbjct: 148 MTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDHHVSGQYPAC 207

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  G+  TP CK++C    Y +TY  D   G  A+ V
Sbjct: 208 S--GEGPTPACKKSC-EAGYNNTYSNDKHFGATAYSV 241



 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   F+VY D L YKSGVYQH  G  +G HA++++GWGVE+ + YW VANSWN
Sbjct: 251 EIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVESGVDYWWVANSWN 310

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WGD+G FKI +G +E  IE
Sbjct: 311 NDWGDNGFFKIKKGVDECGIE 331


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR KW  C S+  I DQS CGSCWA     A+SDR+CI S G     ISA+ ++
Sbjct: 85  LPESFDARAKWSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLL 144

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GC GG+P  AW  W   G+V+GG Y + +GC+PY+LAPCE+H +  + NC  
Sbjct: 145 DCCDTCGHGCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNCIP 204

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +  + TPEC  +C    Y+  Y+ D   G+K + +
Sbjct: 205 I--VHTPECVHHC-RKGYDKDYQEDKHFGQKVYSI 236



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ +K + + R     Q  I+ +GP+ A F VY DFL YKSGVYQ +  D  G+HA+R+L
Sbjct: 228 HFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRIL 287

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN  PYWL ANSWN++WGD G FKILR  NE  IE
Sbjct: 288 GWGTENGTPYWLAANSWNENWGDKGYFKILRRTNECGIE 326


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/258 (36%), Positives = 128/258 (49%), Gaps = 42/258 (16%)

Query: 92  VYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKIL 150
           V+ DFLQ+ +G+Y H  G+  G  +V  LGWG+ E  IP        N  W   G  K+ 
Sbjct: 3   VFDDFLQHTTGIYVHLAGNKQGHLSVGTLGWGMFEELIPK-------NSFW-TAGIPKVS 54

Query: 151 RGENEADIE-----MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRH 205
           R    + +      +GFN+     S E+ DL              FDARE+WPEC S+  
Sbjct: 55  RSFMLSTLVKDPEIIGFNDLGPTFSPENSDLSPF-----------FDARERWPECSSIPL 103

Query: 206 IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW----GCNGGWPQ 261
           I D S C S WA + A ++SDRLCI S G     +SAQ +++C         GC GG P 
Sbjct: 104 INDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCTGVLSCGEGCAGGNPL 163

Query: 262 LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQNCTLLGKLKTPECKQN 317
            AW++W  +G+ TGG Y SQ GC+PY++APC   +      P  N T    L TP C++ 
Sbjct: 164 KAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTT----LPTPTCEKK 219

Query: 318 CYNPSYESTYRFDLKKGK 335
           C     +  Y  DL K +
Sbjct: 220 C-----KPGYPVDLDKDR 232



 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 61/99 (61%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY      +P  +      +  +GP+ A   +Y DFLQY +G+Y H  G+  G  +VR+L
Sbjct: 233 HYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRIL 292

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG+   +PYWL+ANSW   WG++GTF++LRG NE  +E
Sbjct: 293 GWGMFEGVPYWLLANSWGKEWGENGTFRVLRGVNECGLE 331


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R+KWP C S+  I DQS CGSCWA     A++DR+CI S G  + ++SA  ++
Sbjct: 27  IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLI 86

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG+P +AW +W   G+VTGG   +  GCQPY    CEHH +G    C  
Sbjct: 87  SCCEDCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGT 146

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               KTP+CKQ C    Y++ Y  D   G++++ V
Sbjct: 147 K-IYKTPQCKQTC-QKGYKTPYEQDKHYGEESYNV 179



 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 188 RDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 247

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 248 NEDWGEKGLFRIVRGRDECSIE 269


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 68/134 (50%), Positives = 89/134 (66%), Gaps = 3/134 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG     +SA  I+
Sbjct: 2   IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  N C +GCNGGWP  A+ ++   G VTGGDY +  GC+PY   PC HH +       
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120

Query: 305 LLGKLKTPECKQNC 318
              +  TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134



 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 47/96 (48%), Positives = 67/96 (69%), Gaps = 2/96 (2%)

Query: 66  KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           K A+ VP       R+I ++GP+V  F+VY DF  YK G+Y+H  G + G HA++++GWG
Sbjct: 150 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 209

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            EN +PYWL+ANSW++ WG++G F+ILRG N   IE
Sbjct: 210 KENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 245


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 68/134 (50%), Positives = 89/134 (66%), Gaps = 3/134 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR KWP+C SL+HI DQ+NCGSCWAVS A+A+SDR+CIASNG     +SA  I+
Sbjct: 2   IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  N C +GCNGGWP  A+ ++   G VTGGDY +  GC+PY   PC HH +       
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120

Query: 305 LLGKLKTPECKQNC 318
              +  TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 66/96 (68%), Gaps = 2/96 (2%)

Query: 66  KKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           K A+ VP       R+I ++GP+V  F+VY DF  YK G+Y+H  G + G HA++++GWG
Sbjct: 150 KDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG 209

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            E  +PYWL+ANSW++ WG++G F+ILRG N   IE
Sbjct: 210 KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIE 245


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 96/155 (61%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ WP C S+  I DQ +CGSCWA     A+SDR+CI SNG     +SA++++
Sbjct: 81  VPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GC GG  + AW +W   G+V+GG+Y S++GCQPY++APCEH + G    C  
Sbjct: 141 SCCDSCGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACE- 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+CK+ C    Y   Y  DL  G+  + +
Sbjct: 200 -GVRDTPKCKKQC-EKGYGIPYGDDLCYGQPGYTI 232



 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+VA   VY D   YK+GVYQH  G+ +G H +++LGWGVEND PYWLVANSWN
Sbjct: 242 EILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTPYWLVANSWN 301

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G FKILRG +E  IE
Sbjct: 302 TDWGNNGFFKILRGSDECGIE 322


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 64/144 (44%), Positives = 95/144 (65%), Gaps = 7/144 (4%)

Query: 193 AREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-N 251
           +RE+WP+CP+++ I DQ +CGSCWA     A+SDR+CI S G    ++SA+ +++C    
Sbjct: 1   SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLE 60

Query: 252 CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLK 310
           C  GCNGG+P  AW FW ++G+V+GG Y S  GC+PY+++PCEHHV G    C+  G+++
Sbjct: 61  CGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNGSRPKCS--GEIE 118

Query: 311 TPECKQNC---YNPSYESTYRFDL 331
           TP C + C   Y+P Y     + L
Sbjct: 119 TPRCSRRCEAGYSPKYSEDKHYGL 142



 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 30/50 (60%), Positives = 38/50 (76%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
           M +IY++GP+ A   V+ DFL YKSGVYQH  G SIG HA+++LGWG EN
Sbjct: 155 MTEIYKNGPVEAALEVFKDFLLYKSGVYQHKTGGSIGGHAIKILGWGEEN 204


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 69/147 (46%), Positives = 89/147 (60%), Gaps = 6/147 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD  EKWPECPSL+ I DQS CGSCWA   A A +DRLCIAS G    ++S Q ++
Sbjct: 69  LPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLL 128

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GC+GGW  +AWR++   GV TGG+Y S++ C  Y+   CEHH +G    C  
Sbjct: 129 TCCDSCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGE 188

Query: 306 LGKLKTPECKQNC---YNPSYESTYRF 329
               +TPEC + C   Y   YE    F
Sbjct: 189 --SQETPECVKQCQEGYPVEYEKDKHF 213



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 72/101 (71%), Gaps = 2/101 (1%)

Query: 63  HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+F +A+ V    +A++ ++  +GPL   F VY DFL YKSG+YQH  G  +G HAV+++
Sbjct: 212 HFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLV 271

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
           GWGVE+ I YW +ANSWN+ WG++G F+I+ G+ E  IE+G
Sbjct: 272 GWGVEDGIEYWKIANSWNEDWGENGYFRIVAGKGECGIEVG 312


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 96/156 (61%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  FDAR++W   CPSL  + DQ  CGSCWA   A A++DR+CIA+ G    +IS + +
Sbjct: 78  LPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAEAMTDRICIATKGKNQVRISTEDL 137

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           + C  +C +GCNGG+PQ AW F+   G+VTGG YNS +GCQPY +  C+HHV      C 
Sbjct: 138 LTCCDSCGFGCNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCN 197

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G L TP+C++ C    Y  TY+ D   G  ++ +
Sbjct: 198 --GSLPTPKCEKVC-EKGYNITYKNDKHYGVTSYSI 230



 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR+I  +GP+ A F+V+ADF  YKSGVYQH  G+ +G HA+++LGWGVEN+ PYWLVANS
Sbjct: 238 MREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENNTPYWLVANS 297

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG +E  IE
Sbjct: 298 WNPSWGDNGFFKILRGSDECGIE 320


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 67/158 (42%), Positives = 94/158 (59%), Gaps = 4/158 (2%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
            K +P  FDARE+WP CP +  I DQ NCGSCWAVS A+ ++DR CI + G    + S++
Sbjct: 73  TKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSE 132

Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           ++ AC   C   C GG    A+  W   G V+GG +NS EGCQPY++  CEHH++GP   
Sbjct: 133 NVAACCTECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEHHIEGPRPP 192

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C   G +    C + C+   Y  TY  DL+ G +A+++
Sbjct: 193 CE--GDMPELVCSETCHE-EYGKTYEEDLEYGLEAYVL 227



 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 54/98 (55%), Positives = 67/98 (68%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y  +A+++P+       +I  +GP+ A F+VY DFL YKSGVYQH  G   G HAVRV+G
Sbjct: 220 YGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIG 279

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E   PYWLVANSWN  WGD+G FKILRG +E + E
Sbjct: 280 WGEEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFE 317


>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
          Length = 255

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 70/170 (41%), Positives = 100/170 (58%), Gaps = 9/170 (5%)

Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIA 231
           D++ ET+       +PR FDAR+ +  C   +  + DQ NC S WAV+VA+  +DRLCIA
Sbjct: 19  DNNYETV-------IPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIA 71

Query: 232 SNGYFTGQISAQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           SNG FT  +SAQ++++C      GC+GG    AW      G+VTGG+Y+S EGCQPY   
Sbjct: 72  SNGQFTDNLSAQNLMSCGNEEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNR 131

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           PC+H+    L NC+ L + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 132 PCDHYGDSSLTNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMT 181



 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 29/65 (44%), Positives = 44/65 (67%), Gaps = 1/65 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I  +GP+ A+  VY +F+ YK G+Y+   G+ IG H V+++GWGV+ D   YWL  NS
Sbjct: 191 QEIMTYGPVTALMYVYENFMGYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWLAMNS 250

Query: 137 WNDHW 141
           WN +W
Sbjct: 251 WNSNW 255


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 7/150 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++WP CP++  I DQSNCGSCWA     AISDR+CIA++G     IS+  ++
Sbjct: 102 IPVEFDSRKQWPYCPTIGEIRDQSNCGSCWAFGAVEAISDRICIATDGRQKPHISSTDLL 161

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG P  AW FW   G+VTGG+Y + +GC+PY  APC HH  G    C+ 
Sbjct: 162 SCCKICGFGCQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGPCSH 221

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
             +  TP CK+ C     +STY+    K K
Sbjct: 222 DLE-PTPVCKKAC-----QSTYKIQYNKDK 245



 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/82 (59%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++  +GP+   F VY DFL YK+GVYQH+ G  +G HAVR+LGWG EN +PYWL+ANSW
Sbjct: 263 KELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEENGVPYWLLANSW 322

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD G FKI RG NE  IE
Sbjct: 323 NTEWGDKGFFKIYRGRNECGIE 344


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 74/154 (48%), Positives = 106/154 (68%), Gaps = 2/154 (1%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
             GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SNG     +SAQ
Sbjct: 58  TSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQ 117

Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQ 301
            I++C   C  GC+GG    +W ++  +G V+GGDYNS +GCQPYT+ PC+  + + P  
Sbjct: 118 QIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGH 177

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           +CT   + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 178 SCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 211



 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%), Gaps = 8/112 (7%)

Query: 50  KKRLYLP---TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH 106
           +K+ Y P   TS     Y  K + +    AM+ I+++GP+   F +Y D + YKSGVYQ+
Sbjct: 191 EKKCYNPNYYTSFRTDIYKGKYYKLSPYMAMKDIFDNGPITTQFYMYRDLVDYKSGVYQY 250

Query: 107 N----FGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           +    F D   +H+V++ GWG EN +PYWLVANS+   WG +GTFKI RG +
Sbjct: 251 DEQSDF-DFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGND 301


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/134 (48%), Positives = 86/134 (64%), Gaps = 4/134 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++WP CPS+ +I DQ +CGSCWA     A+SDR CI SNG    +ISA+ ++
Sbjct: 112 IPNQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLL 171

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AW++W  +G+VTGG Y S+ GC PY + PCEHHV G    C+
Sbjct: 172 SCCGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEHHVPGDRPKCS 231

Query: 305 LLGKLKTPECKQNC 318
             G   TP C   C
Sbjct: 232 EGG--GTPSCVSKC 243



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 50/81 (61%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  HGP+   F+VYADF  YKSGVY+H  G  +G HA+R+LGWG EN + YWLVANSWN
Sbjct: 274 EIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWGSENGVAYWLVANSWN 333

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKILRG +E  IE
Sbjct: 334 TDWGDKGYFKILRGSDECGIE 354


>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
          Length = 210

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/142 (47%), Positives = 89/142 (62%), Gaps = 4/142 (2%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E +G      LP +FDARE+W  CP++  I DQ +CGSCWA     A+SDR+CI +NG  
Sbjct: 70  ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129

Query: 237 TGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEH
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 295 HVQGPLQNCTLLGKLKTPECKQ 316
           HV G    CT  G+  TP+C +
Sbjct: 190 HVNGSRPPCT--GEGDTPKCNK 209


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 5/163 (3%)

Query: 179 MGCQNAKG--LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           +G +N +G  +P +FDAR  WP C SL HI DQ+NCGSCWAVS A A+SDR+CI++NG  
Sbjct: 84  VGDENDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWAVSTAAALSDRICISTNGTK 143

Query: 237 TGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
              ISA  I+ C   C +GC GGWP  AW +    G VTGG   ++  C+ +   PC HH
Sbjct: 144 QVNISATDILTCCYKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHH 203

Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
                      G+ +TP+C+ +C  P Y+++Y  D  +GK A+
Sbjct: 204 GNETYYG-ECGGRARTPKCRTSC-TPGYKNSYSDDKIRGKDAY 244



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 50/64 (78%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I ++GP+VA F+VYADF  YK G+Y+H  G + G HAV+V+GWG E D+PYW+V NSW
Sbjct: 255 REIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGDVPYWIVKNSW 314

Query: 138 NDHW 141
           ++ W
Sbjct: 315 HNDW 318


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 64/140 (45%), Positives = 88/140 (62%), Gaps = 2/140 (1%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           ++K +P++FDAR  WP CPS+  I DQS+CGSCWA     A+SDRLCI S+G F   +SA
Sbjct: 82  DSKLIPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSA 141

Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
             +++C  +C  GC+GG+P +AW FW  +G+VTGG      GC+PY    C+HH QG   
Sbjct: 142 VDLLSCCKDCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYP 201

Query: 302 NCTLLGKLKTPECKQNCYNP 321
            C       TP+C ++C  P
Sbjct: 202 PCPRR-IYPTPKCVKHCDTP 220



 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 62/83 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F V+ DF +YKSG+Y H +G S+G HA+R+LGWG EN +PYWL+ANS
Sbjct: 245 MKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGVPYWLIANS 304

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG+ G  + LRG NE  IE
Sbjct: 305 WNEDWGEKGYLRFLRGHNECGIE 327


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 59/111 (53%), Positives = 78/111 (70%), Gaps = 1/111 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FD+R  WPECPSL  I DQS+CGSCWAV    A++DR+CIAS G     ISA  ++
Sbjct: 95  IPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLL 154

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
           +C   C +GC+GG P  AW +W  NG+VTG +Y S+ GC+PY   PCEHH+
Sbjct: 155 SCCDECGFGCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHI 205



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 63/99 (63%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY    + V +   +  ++I  +GP+   F VY DF  Y SG+Y+H  GD +G HAV++L
Sbjct: 240 HYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKML 299

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN   YW+ ANSWN  WG++G F+ILRG +E  IE
Sbjct: 300 GWGTENGTDYWICANSWNSDWGENGFFRILRGVDECQIE 338


>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 217

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/154 (48%), Positives = 106/154 (68%), Gaps = 2/154 (1%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
             GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SNG     +SAQ
Sbjct: 58  TSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQ 117

Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQ 301
            I++C   C  GC+GG    +W ++  +G V+GGDYNS +GCQPYT+ PC+  + + P  
Sbjct: 118 QIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGH 177

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           +CT   + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 178 SCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 211


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +PR FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 64  IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C      GC+GG    AW    + G+VTGG+++S EGCQPY   PC+H+    L NC+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 183

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 184 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 219



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I  +GP+ A   VY +F+ YK G+Y+   G+ IG H V+++GWGV+ D   YWL  NS
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 288

Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
           WN +WG+ G FKILRG N   IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +PR FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 64  IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C      GC+GG    AW    + G+VTGG+++S EGCQPY   PC+H+    L NC+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 183

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 184 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 219



 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I  HGP+ A   VY +F+ YK G+Y+   G+ IG H V+++GWGV+ D   YWL  NS
Sbjct: 229 QEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 288

Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
           WN +WG+ G FKILRG N   IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 71/164 (43%), Positives = 99/164 (60%), Gaps = 11/164 (6%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
            +   +P +FDAR  WP C SL HI DQ++CGSCWAVS A+A+SDR+CIAS G     +S
Sbjct: 86  DDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAVSTASALSDRICIASKGAKQVYVS 145

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A  I++C  +C  GC+GG+   A++F+   G VTGGDY +++ C+PY   PC HH     
Sbjct: 146 ATDILSCCHSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHH----- 200

Query: 301 QNCTLLGKL----KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            N T  G+      TPEC + C    YE+ Y  D  +G+ A+ +
Sbjct: 201 GNETYYGECPEDGSTPECVRKC-QEGYETEYHEDRVRGEDAYRL 243



 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+VA F V+ DF  Y+ G+Y H  G   G HAV+++GWG E+ +PYW++ANSW
Sbjct: 253 KEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGTEHGVPYWIIANSW 312

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG+ G F+++RG N+  IE
Sbjct: 313 HSDWGEDGYFRMVRGINDCGIE 334


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 90/155 (58%), Gaps = 5/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+RE+WP CP++R I DQ +CGSCWA     A+SDR+CI S G    ++SA+ ++
Sbjct: 83  LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLL 142

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  NC  GC+GG P   W+ W   G+V+GG + S +GC+PYT+ PC H   G    C  
Sbjct: 143 TCCTNCGHGCDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSPCK- 201

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C + C  P Y   Y  D   GK  + +
Sbjct: 202 --DSITPKCIKKCL-PGYNVPYAKDKSFGKSTYSI 233



 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 49/82 (59%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+ +GP+ A F+V+ DF  YK G+YQH  G+  G HAVR+LGWGVEN   YWL ANSW
Sbjct: 242 KEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGVENGTKYWLAANSW 301

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FKILRG N  DIE
Sbjct: 302 NSDWGDNGYFKILRGSNHVDIE 323


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 71/171 (41%), Positives = 100/171 (58%), Gaps = 5/171 (2%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D + +  +    +P +FDAR++WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 71  DEDIVATEVFDAIPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 130

Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
                +S+Q +++C         GC GG+P  AW++W  +G+VTGG Y SQ GC+PY++A
Sbjct: 131 AVNTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIA 190

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           PC   V G            TP+C + C  N +Y + Y  D   G  A+ V
Sbjct: 191 PCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAV 241



 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+   F+VY DF QY +GVY H  G S+G HAV++LGWGV+N  PYWLVANSWN
Sbjct: 251 EILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGVDNGTPYWLVANSWN 310

Query: 139 DHWGDHGTFKILRGENEADIE 159
            +WG+ G F+I+RG NE  IE
Sbjct: 311 VNWGEKGYFRIIRGLNECGIE 331


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R +W  C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G     ISAQ +V
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GGWP  A+RF    GVVTGGDYN++  C+PY + PC HH          
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 209

Query: 306 LGKLKTPECKQNC---YNPSYES 325
           +G   TP CK+ C   Y  SY S
Sbjct: 210 VGMADTPRCKRRCLLGYPKSYPS 232



 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
           K++  L  P S P   Y+KKA+ +        + I ++GP+VA ++VY DF  Y+SG+Y+
Sbjct: 219 KRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYK 278

Query: 106 HNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNR 165
           H  G   GLHAV+V+GWG E   PYW+VANSW+D WG++G F++ RG N+     GF  R
Sbjct: 279 HKAGRKTGLHAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDC----GFEER 334

Query: 166 VEANSSE 172
           + A S +
Sbjct: 335 MAAGSVQ 341


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   AISDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG+P  AW +W  +G+VTGG   +  GCQPY    CEHH  G   +C  
Sbjct: 150 SCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+CK+ C    Y + Y  D   G  A  V+
Sbjct: 209 DKMYKTPQCKRKC-QKGYTTPYEHDKHYGGIAINVI 243



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 40/82 (48%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   ++ DFL YKSG+Y++  G  +G H VR++GWG+EN   YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P+ FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 64  IPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNL 123

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C  +   GC+GG    AW F    G+VTGG Y+S EGCQPY   PC+H+    L NC+
Sbjct: 124 MSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNCS 183

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C+  C N +Y+  Y  DL K    +M 
Sbjct: 184 SLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMT 219



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANS 136
           ++I  +GP+ A   VY +F+ YK GVY+   G+ IG H V+++GWGV E  I YWL  NS
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAMNS 288

Query: 137 WNDHWGDHGTFKILRGENEADIEM 160
           WN +WG+ G FKILRG N   IE+
Sbjct: 289 WNSNWGNDGLFKILRGYNFCSIEL 312


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +PR FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 26  IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 85

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C      GC+GG    AW    + G+VTGG+++S EGCQPY   PC+H+    L NC+
Sbjct: 86  MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHYGDSRLTNCS 145

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 146 SLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMT 181



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I  +GP+ A   VY +F+ YK G+Y+   G+ IG H V+++GWGV+ D   YWL  NS
Sbjct: 191 QEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNS 250

Query: 137 WNDHWGDHG 145
           WN +WG+ G
Sbjct: 251 WNSNWGNDG 259


>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 254

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 71/154 (46%), Positives = 105/154 (68%), Gaps = 2/154 (1%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
             GLP NFD+R+KWP CPS+ HI +Q NC S +AV+ A+A SDR+CI SN      +SAQ
Sbjct: 60  TNGLPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSAQ 119

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGPLQ 301
            I++C   C +GC+GG    +W F+  +G V+GG+YNS +GCQPYT+ PC+  + + P  
Sbjct: 120 QIISCCYLCGYGCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGH 179

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           +CT   + +TP C++ C NP+Y +++R D+ +GK
Sbjct: 180 SCTTFNREETPTCEKKCNNPNYYTSFRADIYRGK 213



 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 31/51 (60%)

Query: 57  TSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN 107
           TS     Y  K + V    AM++I+++GP+   F +Y D + YKSGVYQ++
Sbjct: 203 TSFRADIYRGKYYKVSPYMAMKEIFDNGPITTQFYMYRDLVDYKSGVYQYD 253


>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
          Length = 237

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 67/132 (50%), Positives = 83/132 (62%), Gaps = 4/132 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE+W  CP++  I DQ +CGSCWA     AISDR CI +NG    ++SA+ ++
Sbjct: 74  LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLL 133

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C    C  GCNGG+P  AW FW   G+V+GG YNS  GC PYT+ PCEHHV G    CT
Sbjct: 134 TCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 193

Query: 305 LLGKLKTPECKQ 316
             G+  TP C +
Sbjct: 194 --GEGDTPRCNK 203


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 66/145 (45%), Positives = 91/145 (62%), Gaps = 6/145 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDARE WP C SL+ I DQS+CGSCWA     A+SDR+CI S+      +SA+ + 
Sbjct: 84  VPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVSAEDLN 143

Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ-GPLQ 301
           +C    +    GC+GG+    W +W  +G+VTGG YNS +GC+ Y+L PCEHHV+ G   
Sbjct: 144 SCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRP 203

Query: 302 NCTLLGKLKTPECKQNCYNPSYEST 326
            C+ L    TPEC ++CY  S + T
Sbjct: 204 QCSSL-NFDTPECVRSCYESSLDYT 227



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/82 (58%), Positives = 59/82 (71%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVENDIPYWLVANSW 137
           +I ++GP+ A F+VY DFL YKSGVYQ    D S+G HA++VLGWGVE    YWL+ANSW
Sbjct: 248 EILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKYWLIANSW 307

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD+G FK LRG +   IE
Sbjct: 308 NTDWGDNGYFKFLRGVDHCGIE 329


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P  AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGDGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+++RG +E  IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/196 (39%), Positives = 109/196 (55%), Gaps = 15/196 (7%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM +  R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILLGGGKEDAEMKWKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGS WAVS   A+SDR+CI S G  + ++SA  +++C  NC  GC+GG+P  AW +
Sbjct: 111 DQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYES 325
           W  +G+VTGG   +  GCQPY    CEHH  G   +C    K+ KTP+CK+ C    Y +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTT 227

Query: 326 TYRFDLKKGKKAHMVL 341
            Y  D   G  +  V+
Sbjct: 228 PYEHDKHYGGISINVI 243



 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 40/82 (48%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   ++ DFL YKSG+Y++  G  +G H VR++GWG+EN   YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 2/156 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P+ FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 28  IPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNL 87

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           ++C  +   GC+GG    AW F    G+VTGG Y+S EGCQPY   PC+H+    L NC+
Sbjct: 88  MSCGDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYGDSSLTNCS 147

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            L + +   C+  C N +Y+  Y  DL K    +M 
Sbjct: 148 SLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMT 183



 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 46/69 (66%), Gaps = 1/69 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANS 136
           ++I  +GP+ A   VY +F+ YK GVY+   G+ IG H V+++GWGV E  I YWL  NS
Sbjct: 193 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAMNS 252

Query: 137 WNDHWGDHG 145
           WN +WG +G
Sbjct: 253 WNSNWGTNG 261


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/196 (39%), Positives = 109/196 (55%), Gaps = 15/196 (7%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM +  R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILLGGGKEDAEMKWKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGS WAVS   A+SDR+CI S G  + ++SA  +++C  NC  GC+GG+P  AW +
Sbjct: 111 DQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYES 325
           W  +G+VTGG   +  GCQPY    CEHH  G   +C    K+ KTP+CK+ C    Y +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTT 227

Query: 326 TYRFDLKKGKKAHMVL 341
            Y  D   G  +  V+
Sbjct: 228 PYEHDKHYGGISINVI 243



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 57/81 (70%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A   ++ DFL YKSG+Y++  G  +G H VR++GWG+EN   YWL AN+WN
Sbjct: 252 EIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTWN 311

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WG+ G F+I+RG NE  IE
Sbjct: 312 EDWGEKGYFRIVRGRNECSIE 332


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 66/150 (44%), Positives = 93/150 (62%), Gaps = 4/150 (2%)

Query: 192 DAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN 251
           D+RE+WP+CPS+  I DQ +CGSCWA     A+SDR CI SNG    ++S + +++C  +
Sbjct: 1   DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60

Query: 252 C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLK 310
           C  GC+GG+P  AW FW   G+ TGG +NS  GCQPY +  CEHH  G    C+ +  + 
Sbjct: 61  CGMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEHHTTGDRPPCSDI--VD 118

Query: 311 TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           TP+C   C    Y ++YR D   GKK++ +
Sbjct: 119 TPKCVHLC-EKGYNTSYRDDKHFGKKSYSI 147



 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 57/99 (57%), Positives = 73/99 (73%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK++ +       Q  I+++GP+   FSVY+DF+ YKSGVYQH+ G+S+G HA+RVL
Sbjct: 139 HFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVL 198

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG END+PYWL ANSWN  WGD G FKILRG +E  IE
Sbjct: 199 GWGYENDVPYWLCANSWNTDWGDKGYFKILRGSDECGIE 237


>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
          Length = 244

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 99/169 (58%), Gaps = 3/169 (1%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASN 233
           D +T+       +P+ FDAR  +  C + +  + DQ NC S WAV+VA+  +DRLCIA+ 
Sbjct: 13  DRKTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATG 72

Query: 234 GYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           G FT  +SAQ++++C  +    GC+GG    AW F   NG+VTGG++NS EGCQPY   P
Sbjct: 73  GKFTDNLSAQNLMSCGDSEKFVGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRP 132

Query: 292 CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C+H+    + NC+   + +   C++ C N +Y+  Y  DL K    +M 
Sbjct: 133 CDHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMT 181



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 36/50 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND 127
           ++I  +GP+ A+  VY +F+ YK G+Y+   GD +G H V+++GWGV++D
Sbjct: 191 QEIMTYGPVTALMYVYENFMGYKEGIYKSTVGDLVGYHHVKLIGWGVDDD 240


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 67/161 (41%), Positives = 91/161 (56%), Gaps = 5/161 (3%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           Q    +P +FD+RE+WP C S++ I DQS CGSCWA +     SDR+CIASN      IS
Sbjct: 82  QTNDPIPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSIS 141

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           ++ ++ C   C  GC GG+P  AW++    GV TGG Y     C+PY   PC+HHV G  
Sbjct: 142 SEDLLECCATCGNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDHHVVGQY 201

Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             C   G +K TP+C + C +   E TY+ DL    K + +
Sbjct: 202 PPC---GPIKPTPKCVKQCNSQYTEKTYQQDLHHPSKVYQL 239



 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 68/101 (67%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVR 118
           H+  K + +P  NA    R+I  HGP+ A F V +DFL YKSGVY  +      G H+V+
Sbjct: 231 HHPSKVYQLPN-NAEAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVK 289

Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           ++GWGVE   PYWL+ANSWN+ WG++G FK+LRG+NE  IE
Sbjct: 290 IIGWGVEQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIE 330


>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
 gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 261

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 59/120 (49%), Positives = 82/120 (68%), Gaps = 2/120 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFD+R +WP CP++  I DQ +CGSCWA     AISDR+C+ +N   + ++SA+ ++
Sbjct: 80  LPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C    C  GCNGG+P  AWR+W   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 SCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPPCT 199


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +W +CP++R I DQ +CGSCWA     A+SDR CI S       ++A  ++
Sbjct: 89  IPDHFDSRHRWHDCPTIREIRDQGSCGSCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVL 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GCNGG+P  AW +W H G+VTGG+Y+S EGC PY +  C+HHV G L  C  
Sbjct: 149 SCCMSCGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDEGCMPYPIKACDHHVNGTLGPCD- 207

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C + C    Y   +  D   GKK++ V
Sbjct: 208 KSIPPTPRCVRMC-RKGYNVDFADDKHYGKKSYSV 241



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/99 (55%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY KK++ VP        +I  +GP+ A F+VYADF  YKSGVYQ +   ++G HA+R+L
Sbjct: 233 HYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLL 292

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVE  +PYWL ANSWN  WGD G FKILRG +E  IE
Sbjct: 293 GWGVEKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 331


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 94/155 (60%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+ WPEC SLR++ DQS+CGSCWAV+   A+SDR+CI S G     +SA  ++
Sbjct: 123 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLL 182

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG P  AW++W   G+VTG +Y +  GC+PY   PCEHH          
Sbjct: 183 SCCKTCGFGCFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCK 242

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C + C + +Y  +Y+ D   G++ + V
Sbjct: 243 HDLYPTPKCVKKC-DKNYGKSYKADKYYGEQVYNV 276



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 57/85 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I   GP+ A F VY DFL Y  G+Y+H  G   G HAV+VLGWG++  +PYWL ANSW
Sbjct: 285 KEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAANSW 344

Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
           N  WG+ G F+ILRG NE  IE G 
Sbjct: 345 NTDWGEDGYFRILRGVNECGIESGI 369


>gi|552159|gb|AAA29434.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 240

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R +W  C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G     ISAQ +V
Sbjct: 95  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 154

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GGWP  A+RF    GVVTGGDYN++  C+PY + PC HH          
Sbjct: 155 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 213

Query: 306 LGKLKTPECKQNC---YNPSYES 325
           +G   TP CK+ C   Y  SY S
Sbjct: 214 VGMADTPRCKRRCLLGYPKSYPS 236


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 68/171 (39%), Positives = 102/171 (59%), Gaps = 5/171 (2%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D + +  + +  +P +FDAR++WP C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70  DEDIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129

Query: 235 YFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
                +S++ +++C    +    GC GG+P  AW++W  +G+VTGG Y +Q GC+PY++A
Sbjct: 130 AVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIA 189

Query: 291 PCEHHVQGPLQNCTLLGKLKTPECKQNCYNP-SYESTYRFDLKKGKKAHMV 340
           PC   V G            TP+C  +C +  +Y + Y  D   G  A+ V
Sbjct: 190 PCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAV 240



 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+   F+VY DF QY +GVY H  G S+G HAV++LGWGV+N  PYWLVANSWN
Sbjct: 250 EILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG+ G F+I+RG NE  IE
Sbjct: 310 VAWGEKGYFRIIRGLNECGIE 330


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+R+ W  C S+R I DQS+CGSCWA     ++SDR+CI S G  + ++SA +++
Sbjct: 92  LPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLL 151

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG P +AW +W   G+VTGG   +  GCQPY    C HH      +   
Sbjct: 152 SCCSRCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCE 211

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +    TPEC Q C  P Y   Y  D   GK ++ V
Sbjct: 212 VKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYV 245



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 33/49 (67%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           + M++I  +GP+ A F VY DFL YK+GVY++  G  +G HA+R+   G
Sbjct: 251 SIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIRITWLG 299


>gi|552158|gb|AAA29433.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 236

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/143 (50%), Positives = 88/143 (61%), Gaps = 5/143 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R +W  C SL HI DQ+NCGSCWAVS A A+SDR+CIAS G     ISAQ +V
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GGWP  A+RF    GVVTGGDYN++  C+PY + PC HH          
Sbjct: 151 SCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYG-EC 209

Query: 306 LGKLKTPECKQNC---YNPSYES 325
           +G   TP CK+ C   Y  SY S
Sbjct: 210 VGMADTPRCKRRCLLGYPKSYPS 232


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 23  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 72

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P  AW +
Sbjct: 73  DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDY 132

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 133 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 190

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 191 YEQDKHYGDQRYNVI 205



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 213 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 272

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 273 NEDWGEKGLFRIVRGRDECSIE 294


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT
Sbjct: 61  TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 121 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 153



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 161 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 220

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 221 WNTDWGDNGFFKILRGQDHCGIE 243


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+W  CP+++ I DQ +CGSCWA     +ISDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVNVEVSAEDML 139

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C ++C  P Y S+Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKSC-EPGYSSSYKEDKHYGYSSYSV 232



 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/99 (57%), Positives = 71/99 (71%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ VP      M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+L
Sbjct: 224 HYGYSSYSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 283

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 284 GWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 322


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/170 (43%), Positives = 96/170 (56%), Gaps = 12/170 (7%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI--AS 232
           DL ++G      LP NFD+RE WPEC ++  I DQ +CGSCWA     A+SDR CI   S
Sbjct: 85  DLSSLG-----PLPENFDSRENWPECTTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPS 139

Query: 233 NGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
            G     +SA  +++C   C  GCNGG+P  AW FW   G+VTGG+Y+S +GC PY +  
Sbjct: 140 GGPKRVHLSADDLLSCCRTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKA 199

Query: 292 CEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C+HHV G L  C    K+  TP C   C    Y+  Y  D   GK ++ V
Sbjct: 200 CDHHVNGTLGPCD--KKIPPTPRCVHMC-RKGYDVDYHDDKHYGKSSYSV 246



 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/99 (55%), Positives = 70/99 (70%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K ++ VP      Q  I  +GP+ A F+VY+DF+ YKSGVYQ +  +++G HA+R+L
Sbjct: 238 HYGKSSYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLL 297

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN +PYWL ANSWN  WGD G FKILRG +E  IE
Sbjct: 298 GWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIE 336


>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
          Length = 228

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 69/146 (47%), Positives = 94/146 (64%), Gaps = 5/146 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+   ++SA+ ++
Sbjct: 59  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDML 118

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 119 TCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACT 178

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
             G+  TP C + C  P Y  +Y+ D
Sbjct: 179 --GEGDTPRCSKTC-EPGYSPSYKED 201


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+R+ W  C S+R I DQS+CGSCWA     ++SDR+CI S G  + ++SA +++
Sbjct: 92  LPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLL 151

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GCNGG P +AW +W   G+VTGG   +  GCQPY    C HH      +   
Sbjct: 152 SCCSRCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCE 211

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +    TPEC Q C  P Y   Y  D   GK ++ V
Sbjct: 212 VKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYV 245



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y K ++ V     + M++I  +GP+ A F V+ DFL YK+GVY++  G  +G HA+R++
Sbjct: 237 YYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRII 296

Query: 121 GWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV   N  PYWL ANSWN  WGD G FKILRG NE  IE
Sbjct: 297 GWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIE 337


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 71/159 (44%), Positives = 97/159 (61%), Gaps = 11/159 (6%)

Query: 187 LPRNFDARE--KWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           LP +FD R+  KWP C  SL H+ DQ +CGSCWA   A A++DR+CIASNG     +SA+
Sbjct: 214 LPTSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAE 273

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            + +C  +C  GC GG+P  AW ++   G+VTGGD+NS +GC PY L  C+HHV G  Q 
Sbjct: 274 DLTSCCDSCGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP 333

Query: 303 CTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C   G ++ TP C  +C N    +T+  D   G  ++ V
Sbjct: 334 C---GDIQPTPACANSCQN---NATWSSDKHFGASSYSV 366



 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 50/86 (58%), Positives = 65/86 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY +GP+ A + VYADF+ YKSGVYQH  GD +G HAV+++GWGV+   PYW+VANS
Sbjct: 374 MTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVANS 433

Query: 137 WNDHWGDHGTFKILRGENEADIEMGF 162
           WN+ WG++G F ILRG +E  IE G 
Sbjct: 434 WNNDWGNNGFFNILRGSDECGIEDGI 459


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 67/161 (41%), Positives = 92/161 (57%), Gaps = 6/161 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           NA  +P  FDARE+WP C S++ I DQS CGSCWA +     SDR+CIASN      IS+
Sbjct: 84  NADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISS 143

Query: 243 QHIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           + ++ C  + C  GC GG+P  AW +    GV TGG Y     C+PY   PC+HHV G  
Sbjct: 144 EDLLECCADYCGMGCKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQY 203

Query: 301 QNCTLLGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           Q C   G ++ TP+C + C +   ++TY  DL    + + +
Sbjct: 204 QPC---GPIQPTPQCVKECNSEYTQNTYEKDLHFASQTYSI 241



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 58/83 (69%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANS 136
           R+I  HGP+ A F V ADFL YKSGVY  N      G H+V+++GWG E + PYWL+ANS
Sbjct: 250 REIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIANS 309

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG+ G F++LRG NE  IE
Sbjct: 310 WNEDWGEKGLFRMLRGRNECGIE 332


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 90/155 (58%), Gaps = 1/155 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG     +SA+ ++
Sbjct: 81  IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC +GC GG+P  AW++   +G  TGG Y +Q GC+PY+LAPC   V         
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACP 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C   C N +Y   Y+ D   G  A+ V
Sbjct: 201 TDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAV 235



 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +  A  Q  I  HGP+ A F+VY DF QYKSGVY H  G+ +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG +N  PYWLVANSWN +WG++G F+I+RG NE  IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKWP C S+  I DQS+C SCWAV  A+A++DR+CI SNG    ++SA  +V
Sbjct: 86  LPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C P C +GC GG+P +AW +W  +G+V+GG   +  GC PY    C H  + P      
Sbjct: 146 SCCPYCGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCP 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C++ C    Y  T   D  KGK ++ V
Sbjct: 206 RELYATPKCEKQC-QAGYSKTSEEDKIKGKSSYNV 239



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 59/89 (66%), Gaps = 2/89 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M +I  +GP+  I+ ++ DF  YKSG+YQ+  G  +G H +  +GWGVEN + YWL A
Sbjct: 245 DIMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGVENGVKYWLAA 302

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFN 163
           NSWN+ WG++G F+I RG NE  IE   N
Sbjct: 303 NSWNEGWGENGYFRIRRGTNECGIESRIN 331


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 98/156 (62%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR++WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 80  LPESFDARKQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 232



 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 240 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 299

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 300 WNTDWGDNGFFKILRGQDHCGIE 322


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 94/156 (60%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI SNG    ++SA+ ++
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60

Query: 247 ACTPNCWGCNGGW--PQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C     G       P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT
Sbjct: 61  TCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  +Y+ D   G  ++ V
Sbjct: 121 --GEGDTPKCSKTC-EPGYSPSYKEDKHFGCSSYSV 153



 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLVANS
Sbjct: 161 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVANS 220

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 221 WNTDWGDNGFFKILRGQDHCGIE 243


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 69/146 (47%), Positives = 94/146 (64%), Gaps = 5/146 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+   ++SA+ ++
Sbjct: 80  LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDML 139

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEHHVNGSRPACT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFD 330
             G+  TP C + C  P Y  +Y+ D
Sbjct: 200 --GEGDTPRCSKTC-EPGYSPSYKED 222



 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 52/81 (64%), Positives = 64/81 (79%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY++GP+   F+VY+DFL YKSGVYQH  GD +G HA+R+LGWG EN +PYWLVANSWN
Sbjct: 242 EIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEENGVPYWLVANSWN 301

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKILRG++   IE
Sbjct: 302 TDWGDKGFFKILRGQDHCGIE 322


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 94/156 (60%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 64  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 123

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG+P  AW +W  +G+VTGG   +  GCQPY    CEHH +G   +C  
Sbjct: 124 SCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHSKGKYPSCGD 183

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+CK+ C    Y++ Y  D   G  +  V+
Sbjct: 184 K-MYKTPQCKRKC-QKGYKTPYEHDKHYGGISINVI 217



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 39/82 (47%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   ++ DFL YKSG+Y++  G  +G H VR++GWG+EN   YWL AN+W
Sbjct: 225 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAANTW 284

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  +E
Sbjct: 285 NEDWGEKGYFRIVRGRNECSVE 306


>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 233

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 70/176 (39%), Positives = 97/176 (55%), Gaps = 12/176 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C  GC GG+P  AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C++ S
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTCHSIS 225


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ W  C S+++I DQS+CGSCWA     A+SDR+CIASN      +SA  ++
Sbjct: 121 IPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLL 180

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG P  AW++W  +G+VTG ++ + +GC+PY   PCEHH      +   
Sbjct: 181 SCCRTCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCR 240

Query: 306 LGKLKTPECKQNCYNPSY-ESTYRFDLKKGKKAHMV 340
                TP+C + C  PSY E  Y  D   G+ A+ V
Sbjct: 241 HDLYPTPKCSKKCV-PSYKEKNYDDDRFYGRTAYGV 275



 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 56/84 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+   F VY DFL Y  G+Y H  G   G HAV+++GWG++   PYWL+ANSW
Sbjct: 284 KEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSW 343

Query: 138 NDHWGDHGTFKILRGENEADIEMG 161
           N  WG+ G F+ILRG +E  IE G
Sbjct: 344 NTDWGEEGFFRILRGVDECGIESG 367


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 70/159 (44%), Positives = 94/159 (59%), Gaps = 5/159 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R+ + +C S+ +I DQS+CGSCWAV+ A AISDR CIASNG     +SA+ I+
Sbjct: 81  IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140

Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            C    +    GC GG+P  AW++W  NG+VTGG Y SQ GC+PY++APC   V G    
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 200

Query: 303 CTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
                   TP+C  +C  N SY   Y  D   G  A+ V
Sbjct: 201 KCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAV 239



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V R       +I ++GP+   F+VYADF QYKSGVY H  G  +G HAV++L
Sbjct: 231 HYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLL 290

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGV+N  PYWL ANSWN +WG++G F+ILRG NE  IE
Sbjct: 291 GWGVDNGTPYWLAANSWNTNWGENGYFRILRGVNECGIE 329


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 67/159 (42%), Positives = 97/159 (61%), Gaps = 8/159 (5%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A  +P  FDAR++W  C ++  I DQ NCGSCWA S + A +DRLCIASNG F   +SA+
Sbjct: 81  AGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAE 140

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           H+ +C   C  GC GG+P  AWR++  +G+VTGG++NS EGCQPY   PC  +     Q+
Sbjct: 141 HVTSCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQS 200

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
                  K  +C++ C+  +   +YR D +  +++  VL
Sbjct: 201 ------EKNHKCQKKCFGNT-SISYRGDRRYVERSPYVL 232



 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 75/123 (60%), Gaps = 5/123 (4%)

Query: 42  KKKKKKKKKKRLYLPTSIPLS---HYFKKA-HMVPRCNAMRQIYEHGPLVAIFSVYADFL 97
           + +K  K +K+ +  TSI       Y +++ +++   N    I  +GP+ + F VY DF+
Sbjct: 199 QSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSFDVYDDFI 258

Query: 98  QYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
            YKSGVY  +   + +G H+V+ +GWGVE ++ YWL+ NSWN+ WGD G FKI RG NE 
Sbjct: 259 SYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNEC 318

Query: 157 DIE 159
            +E
Sbjct: 319 QVE 321


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 73/180 (40%), Positives = 105/180 (58%), Gaps = 13/180 (7%)

Query: 171 SEDDDLE-----TMGCQNAK-GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
           SED++L      T+  QN    +P +FD+R+KW +C S+ +I DQS CG CWA +   A+
Sbjct: 68  SEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFAAVEAM 127

Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           SDR+CI S G  + ++SA  +++C   C  GC GG+P  AW +W   G+VTG    +  G
Sbjct: 128 SDRICIQSKGKKSVELSAVDLLSCCTECGLGCQGGFPGAAWDYWVEEGIVTGSSKENHTG 187

Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGK--LKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           CQPY    CEHH +G    C   G+   KTP+C+Q C    Y++ Y+ D   GK ++ VL
Sbjct: 188 CQPYPFPKCEHHTKGKYPAC---GEKIYKTPKCQQKC-QKGYKTPYKKDKYYGKLSYNVL 243



 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 49/92 (53%), Positives = 65/92 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F+VY+DFL YKSG+Y+H  G  IG HAVR++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
           N+ WG+ G F+ILRG++   IE      +  N
Sbjct: 311 NEDWGEKGYFRILRGKDVCGIESAVTAGLPHN 342


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 99/156 (63%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE+W  CP+++ I DQ +CGSCWA     +ISDR+CI +NG+ + ++SA+ ++
Sbjct: 80  LPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVSVEVSAEDLL 139

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT
Sbjct: 140 TCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPACT 199

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+  TP+C + C  P Y  TY+ D   G  ++ +
Sbjct: 200 --GEGDTPKCSKTC-EPGYSPTYKEDKHFGYTSYSL 232



 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 58/108 (53%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     H+   ++ +P      M +IY++GP+   FSVY+DFL YKSGVYQH  GD 
Sbjct: 215 YSPTYKEDKHFGYTSYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDM 274

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWG EN +PYWLVANSWN  WGD G F+ILRG++   IE
Sbjct: 275 MGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIE 322


>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
          Length = 246

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 93/147 (63%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR++W  C ++  + DQ NCGSCWA   ++A +DRLC+A++G F   +S + I 
Sbjct: 68  IPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNELLSPEEIA 127

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW+++  +G+VTGG+Y S EGC+PY + PC+HH QG   +C+ 
Sbjct: 128 FCCHTCGFGCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGN-NSCSD 186

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
               K   C + CY   +  Y   +RF
Sbjct: 187 KPMEKNHRCTRMCYGDQDLDYNDDHRF 213


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 94/156 (60%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG     +SA+ ++
Sbjct: 81  IPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
           +C  NC +GC+GG+P  AW++   +G  TGG Y +Q GC+PY+LAPC   V      +C 
Sbjct: 141 SCCSNCGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCP 200

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP C   C N  Y + Y+ D   G  A+ V
Sbjct: 201 DDG-YNTPACVNKCTNTKYNTAYKDDKHFGSTAYAV 235



 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +  A  Q  I  HGP+ A F+VY DF QYKSGVY H  G  +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG +N  PYWLVANSWN +WG++G F+I+RG NE  IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 71/164 (43%), Positives = 94/164 (57%), Gaps = 8/164 (4%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI----ASNGYFT 237
           +N K LP +FD R+KWP C +L  I DQ +CGSCWA   A A+SDRLCI     S     
Sbjct: 91  ENYKSLPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVM 150

Query: 238 GQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
            ++SA  +++C  +C  GCNGG+P  AW FW H G+V+GG Y ++  C+ Y + PCEHHV
Sbjct: 151 VRLSADDLLSCCRDCGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHV 210

Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G    C   G   TP+CK  C    Y+  Y+ D     K + V
Sbjct: 211 NGTRPPCE--GDAPTPKCKNVCQE-EYKVPYKKDKHYAVKVYSV 251



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++  HGP+ A F VYADF  YKSGVYQH  G  +G HA++++GWG E+ +PYWL ANSWN
Sbjct: 261 ELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWGEEDGVPYWLCANSWN 320

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG+ G FKILRG+N   IE
Sbjct: 321 TDWGEGGFFKILRGKNHCGIE 341


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KW +C S+ +I DQS CGSCWA +   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W  +G+VTG    +  GCQPY    CEHH  G    C  
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+C Q C    Y++ Y+ D   G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 64/87 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+   F+V++DFL YKSG+Y++  G  IG HAVR++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGVEKKTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNN 164
           N+ WG+ G F++LRG++E  IE    +
Sbjct: 311 NEDWGEKGYFRMLRGKDECGIESAVTS 337


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/178 (37%), Positives = 96/178 (53%), Gaps = 20/178 (11%)

Query: 169 NSSEDD-----DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           N + DD     D ET      + +P+NFDAR  WP+C S+R I +Q +CGSCWA      
Sbjct: 56  NGARDDPAFFTDTETKNVTIPEQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVET 115

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           +SDRLCIASN     + SAQ ++AC   C  GC GG+   AW++W  +G+V+GGD+N+ +
Sbjct: 116 MSDRLCIASNATKKFEFSAQDLLACCKECGHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQ 175

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           GC PY++                     TP C   C NP Y+  Y  D + G +++ +
Sbjct: 176 GCHPYSV--------------QAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGARSYRI 219



 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I   GP+ A + VY DF  Y++GVYQH  G+  G H+V++LGWG EN   YWLVANSW 
Sbjct: 229 EIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWGRENGTDYWLVANSWG 288

Query: 139 DHWGD-HGTFKILRGENEADIE 159
             WG   G FK LRGEN  DIE
Sbjct: 289 RDWGRLGGFFKFLRGENHCDIE 310


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/178 (41%), Positives = 97/178 (54%), Gaps = 2/178 (1%)

Query: 165 RVEANSSEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           R E  +    D+E +    N   +P  FDAR +WP C S+ +I DQS+CGSCWA + A A
Sbjct: 58  RTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
            SDR CIASNG     +SA+ +++C  NC +GC GG+P  AW++   +G  TGG Y +Q 
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQF 177

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           GC+PY+LAPC   V              TP C   C N +Y   Y  D   G  A+ V
Sbjct: 178 GCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAV 235



 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +       +I  HGP+ A F+VY DF QYK+GVY H  G  +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG +N  PYWLVANSWN +WG++G F+I+RG NE  IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KW +C S+ +I DQS CGSCWA +   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFTAVEAMSDRICIESKGKKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W  +G+VTG    +  GCQPY    CEHH  G    C  
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+C Q C    Y++ Y+ D   G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 64/82 (78%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F+V++DFL YKSG+Y++  G  IG HAVR++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+ILRG++E  IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 8/156 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR++W  C ++  I DQ NCGSCWA S + A +DRLCIASNG F   +SA+H+ 
Sbjct: 84  IPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVT 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AWR++  +G+VTGG++NS EGCQPY   PC  +     Q+   
Sbjct: 144 SCCYRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQS--- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               K  +C++ C+  +   +YR D +  +++  VL
Sbjct: 201 ---EKNHKCQKKCFGNT-SISYRGDRRYVERSPYVL 232



 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 74/123 (60%), Gaps = 5/123 (4%)

Query: 42  KKKKKKKKKKRLYLPTSIPLS---HYFKKA-HMVPRCNAMRQIYEHGPLVAIFSVYADFL 97
           + +K  K +K+ +  TSI       Y +++ +++   N    I  +GP+ + F VY DF+
Sbjct: 199 QSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSFDVYDDFI 258

Query: 98  QYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
            YKSGVY  +   + +G H+V+ +GWGVE ++ YWL+ NSWN  WGD G FKI RG NE 
Sbjct: 259 SYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNEC 318

Query: 157 DIE 159
            +E
Sbjct: 319 QVE 321


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KW +C S+ +I DQS CGSCWA +   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W  +G+VTG    +  GCQPY    CEHH  G    C  
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+C Q C    Y++ Y+ D   G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYKKDKYYGRMSYNVL 243



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 64/82 (78%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F+V++DFL YKSG+Y++  G  IG HAVR++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+ILRG++E  IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 89/155 (57%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR++WP CP++  I DQS+CGSCWA     A+SDRLCI +NG FT +ISA  ++
Sbjct: 80  IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG+P +AW FW   G+VTGG   +  GC+ Y    C HH       C+ 
Sbjct: 140 SCCGYCGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSH 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C Q C  P  ++ Y  D  +    + V
Sbjct: 200 R-IYDTPNCVQKCDTP--DTDYATDKTRANITYNV 231



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 50/83 (60%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F VY DFL YKSGVY H+ G  +G HA+R+LGWG EN + YWL+ANS
Sbjct: 239 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 298

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WND WG+ G FK+LRG+NE  IE
Sbjct: 299 WNDGWGEDGCFKMLRGKNECGIE 321


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG     +SA+ ++
Sbjct: 81  IPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
           +C  NC +GC GG+P  AW++   +G  TGG Y SQ GC+PY+LAPC   V      +C 
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCP 200

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G   TP C   C N +Y   Y+ D   G  A+ V
Sbjct: 201 QDG-YNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAV 235



 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +  A  Q  I  HGP+ A F+VY DF QYKSGVY H  G  +G HA+R+L
Sbjct: 227 HFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG +N  PYWLVANSWN +WG++G F+I+RG NE  IE
Sbjct: 287 GWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 98/160 (61%), Gaps = 13/160 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R+ W  C +  +I DQ+NCGSCWAVS A AISDR+CIAS       ISA  I+
Sbjct: 87  IPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  P C  GC GGWP  AW+++ ++GVV+GG+Y +++ C+PY + PC HH      N T
Sbjct: 146 TCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH-----GNDT 200

Query: 305 LLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+ +    TP CK+ C  P     YR D + GK A++V
Sbjct: 201 YYGECRGTAPTPPCKRKC-RPGVRKMYRIDKRYGKDAYIV 239



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 69/98 (70%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y K A++V +       +I ++GP+VA F+VY DF  YKSG+Y+H  G+  G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG EN+  +WL+ANSW++ WG+ G F+I+RG N+  IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIE 329


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/161 (44%), Positives = 94/161 (58%), Gaps = 13/161 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAREKW +C S+R I DQS+CGSCWAVS A  +SDR CI S+G     +SA  I+
Sbjct: 95  IPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATDIL 154

Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  T    GC GG+P  AWR++  +GV TGG Y  ++ C+PY   PC HH     +N  
Sbjct: 155 SCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHH-----RNEI 209

Query: 305 LLGK-----LKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+       TP+C Q+C    Y S Y  D   GK A+ +
Sbjct: 210 YYGECPKEIFPTPQCTQSC-QAGYASDYEDDKIYGKSAYAL 249



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 64/99 (64%), Gaps = 3/99 (3%)

Query: 64  YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y K A+ +P       R+I  +GP+ A F VY DF +Y+SG+Y H  G   G HAV+++G
Sbjct: 242 YGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIG 301

Query: 122 WGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGV++D   YWL ANSWN  WG++G F+I+RG +   IE
Sbjct: 302 WGVDDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIE 340


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 92/147 (62%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FDAR+KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  + + G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 148 FCCSSCGYGCNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
             + K   C + CY   +  Y   +RF
Sbjct: 207 KPREKNHRCTRTCYGNQDLDYNDDHRF 233



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           + + ++ +   +  + +  +GP+ A F +Y DF  YKSGVY  +   S +G HAV+++GW
Sbjct: 233 FTRDSYYLTYSSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E+ + YWL+ NSWN+ WGD+G FKI RG NE  I+
Sbjct: 293 GEEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGID 329


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 98/160 (61%), Gaps = 13/160 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P ++D R+ W  C +  +I DQ+NCGSCWAVS A AISDR+CIAS       ISA  I+
Sbjct: 87  IPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  P C  GC GGWP  AW+++ ++GVV+GG+Y +++ C+PY + PC HH      N T
Sbjct: 146 TCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHH-----GNDT 200

Query: 305 LLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G+ +    TP CK+ C  P     YR D + GK A++V
Sbjct: 201 YYGECRGTAPTPPCKRKC-RPGVRKMYRIDKRYGKDAYIV 239



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 68/98 (69%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y K A++V +       +I  +GP+VA F+VY DF  YKSG+Y+H  G+  G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG EN+  +WL+ANSW++ WG+ G F+I+RG N+  IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIE 329


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG    +W +W  +G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCENCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 88/155 (56%), Gaps = 4/155 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR++WP CP++  I DQS+CGSCWA     A+SDRLCI +NG FT +ISA  ++
Sbjct: 80  IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC GG+P  AW FW   G+VTGG   +  GC+ Y    C HH       C+ 
Sbjct: 140 SCCGYCGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSH 199

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C Q C  P  ++ Y  D  +    + V
Sbjct: 200 R-IYDTPNCVQKCDTP--DTDYATDKTRANITYNV 231



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 50/83 (60%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F VY DFL YKSGVY H+ G  +G HA+R+LGWG EN + YWL+ANS
Sbjct: 239 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 298

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WND WG+ G FK+LRG+NE  IE
Sbjct: 299 WNDGWGEDGYFKMLRGKNECGIE 321


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 57  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 116

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG    +W +W  +G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 117 SCCKNCGSGCDGGVTGYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 175

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 176 -DKLYKTPQCKQTCQKGYNTSYE 197



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 63/99 (63%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HG + A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 201 HYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 260

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 261 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 299


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG    +W +W  +G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTSYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 65/151 (43%), Positives = 88/151 (58%), Gaps = 8/151 (5%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAR  +P C   + HI DQS CGSCWA  V  A +DRLC+ SNG FT  +SA  +
Sbjct: 60  LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 119

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGP 299
            AC P+ +GC+GG+P  AW +    G+ TGGDY ++      +GC PY   PC HH+   
Sbjct: 120 NACAPS-YGCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDT 178

Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
                  G  +TP C + C+NP Y ++ + D
Sbjct: 179 KYPKCPKGSYETPNCVEQCHNPKYSTSLKND 209



 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 52/118 (44%), Positives = 65/118 (55%), Gaps = 8/118 (6%)

Query: 43  KKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSG 102
           K     K  R Y+  S P  +           NA   I   GP+ A + VY DFL YKSG
Sbjct: 201 KYSTSLKNDRHYMLESSPYQYSVN--------NAKNAIRTDGPVSASYLVYEDFLAYKSG 252

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           VY+H  G  +G HAV+++GWG EN   YWLV NSWN+ WGDHG FKI  G  + D ++
Sbjct: 253 VYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGDHGLFKIALGNCQIDDDL 310


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 86/155 (55%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKWP C S+R I DQS+CGSCWAV+   A+SDR+CI SNG    ++SA  +V
Sbjct: 76  LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 135

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG P  AW +W  NG+VTGG   +  GC PY    C H       N   
Sbjct: 136 SCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCP 195

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C   C    Y+ TY  D   GK ++ V
Sbjct: 196 RYTYPTPSCYPYC-QAGYDKTYEKDKVYGKTSYNV 229



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 59/83 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I ++GP+ A F VY DF  YKSG+Y H  G   G HA+R++GWGVEN + YWL ANS
Sbjct: 237 MEEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVKYWLTANS 296

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WG++G F+ILRG +E  IE
Sbjct: 297 WNVGWGENGYFRILRGTDECRIE 319


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG    +W +W  +G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKNCGSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTSYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 311 NEDWGEKGYFRIVRGRDECLIE 332


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KW +C S+ +I DQS CGSCWA +   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSSFDSRKKWRQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W  +G+VTG    +  GCQPY    CEHH  G    C  
Sbjct: 150 SCCTECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEHHTTGKYPECG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               KTP+C Q C    Y++ Y  D   G+ ++ VL
Sbjct: 209 EKIYKTPKCHQKC-QKGYKTPYGKDKYYGRMSYNVL 243



 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 64/82 (78%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F+V++DFL YKSG+Y++  G  IG HAVR++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+ILRG++E  IE
Sbjct: 311 NEDWGEKGYFRILRGKDECGIE 332


>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
          Length = 236

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/159 (44%), Positives = 92/159 (57%), Gaps = 12/159 (7%)

Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           N +    SED D       N   +PR FDAR KW  C ++  + DQ NCGSCWAV+ ++A
Sbjct: 64  NNMNLYKSEDADY------NNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSA 117

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
            +DRLC+A+N  F   +SA+ I  C   C +GCNGG+P  AW+ +   G+VTGGDY S E
Sbjct: 118 FADRLCVATNADFNELLSAEEITFCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGE 177

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTP--ECKQNCY 319
           GC+PY + PC +  QG   N T  GK       C + CY
Sbjct: 178 GCEPYRVPPCPNDDQG---NNTCAGKPMESNHRCTRMCY 213


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 103/182 (56%), Gaps = 20/182 (10%)

Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
           R+     ED D E         +P ++D R+ W  C +  +I DQ+NCGSCWAVS A AI
Sbjct: 72  RLNLMVKEDPDPEV-------DIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAI 123

Query: 225 SDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           SDR+CIAS       ISA  I+ C  P C  GC GGWP  AW+++ ++GVV+GG+Y ++ 
Sbjct: 124 SDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKG 183

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLK----TPECKQNCYNPSYESTYRFDLKKGKKAH 338
            C+PY + PC HH      N T  G+ +    TP CK+ C  P     YR D + GK A+
Sbjct: 184 VCRPYPIHPCGHH-----GNDTYYGECRGTAPTPPCKKEC-RPGVRKVYRIDKRYGKDAY 237

Query: 339 MV 340
           +V
Sbjct: 238 IV 239



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 68/98 (69%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y K A++V +       +I  +GP+VA F+VY DF  YKSG+Y+H  G+  G HAV+++G
Sbjct: 232 YGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG EN+  +WL+ANSW++ WG+ G F+I+RG N+  IE
Sbjct: 292 WGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIE 329


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
          Length = 228

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 65/145 (44%), Positives = 86/145 (59%), Gaps = 1/145 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG     +SA+ ++
Sbjct: 81  IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC +GC GG+P  AW++   +G  TGG Y +Q GC+PY+LAPC   V         
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACP 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFD 330
                TP C   C N +Y   Y+ D
Sbjct: 201 TDGYDTPACVNKCTNSNYNVAYKDD 225


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 77/179 (43%), Positives = 110/179 (61%), Gaps = 5/179 (2%)

Query: 160 MGFN-NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
           MG N + ++ N + D +   +  + ++ LP  FD+R +WP CP++R I DQ +CG+CWA 
Sbjct: 23  MGINYSELKPNVTPDLEPPFVVSKISENLPDEFDSRVRWPNCPTIREIRDQGSCGACWAF 82

Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGD 277
           + A A+SDR+CI S+       SA ++++C  +C  GC G    LAW  W  +G+V+GG 
Sbjct: 83  AAAEAMSDRVCIHSSQTKHFHFSALNLLSCCDSCEKGCLGCDHHLAWDHWVKHGIVSGGS 142

Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
           Y S+EGCQPY L PCEHH  GP +NCT  G   TP C + C  P Y+ +Y  DL  GK+
Sbjct: 143 YGSKEGCQPYHLPPCEHHRAGPRRNCTKYG--PTPSCARVC-QPDYKISYEDDLHFGKQ 198



 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 56/83 (67%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +I+ +GP+ A  + Y DF  Y+SG+Y H  G  +  HAV+++GWG +   + PYWLVANS
Sbjct: 213 EIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANS 272

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           +N  WG++G FKI RG NE  IE
Sbjct: 273 FNTDWGEYGFFKIKRGVNECGIE 295


>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
          Length = 215

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 64/136 (47%), Positives = 88/136 (64%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWAV+ ++A +DRLC+A++G F   +SA+ I 
Sbjct: 70  IPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGDFNQLLSAEEIT 129

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW  +  +G+VTGGDY S+EGC+PY + PC +   G   N T 
Sbjct: 130 FCCHTCGFGCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVPPCPYDESG---NNTC 186

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 187 AGKPMEKNHRCTRMCY 202


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 87/155 (56%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R  W +C  +  I DQ  CGSCWA   A AISDR+CIAS G      +A+ ++
Sbjct: 94  IPATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVL 153

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GCNGG+P  A  ++   G+VTGG Y +++ CQPYTL  CEHHV G    CT 
Sbjct: 154 SCCLTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEHHVPGDRPPCTE 213

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G   TP+C   C        Y+ D   G KA+ V
Sbjct: 214 GG--GTPKCSHQCIPDYTTKAYKDDKVHGHKAYSV 246



 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 50/95 (52%), Positives = 64/95 (67%), Gaps = 2/95 (2%)

Query: 67  KAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           KA+ VP       ++I  +GP+ A F+VY+DF  YKSGVY+H  G  +G HA++++GWG 
Sbjct: 242 KAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGT 301

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           E    YWL+ NSWN  WGD GTFKILRG NE  IE
Sbjct: 302 EGGDDYWLINNSWNSDWGDKGTFKILRGSNECGIE 336


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 91/143 (63%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG+   +W +W  +G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKNCGSGCDGGYFLPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL +TP+CKQ C   YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCENCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
          Length = 274

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 91/147 (61%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR +W  C ++  + DQ +CGSCWA++ ++A +DRLC+A+NG F   +SA+ I 
Sbjct: 84  IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+++  +G+VTGG+Y S EGC+PY + PC    +G   +C  
Sbjct: 144 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGK-SSCAG 202

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
               K   C + CY   +  Y   +RF
Sbjct: 203 KPIEKNHRCTRMCYGNQDLDYNEDHRF 229


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 65/149 (43%), Positives = 86/149 (57%), Gaps = 5/149 (3%)

Query: 177 ETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYF 236
           E++G +N   +P+ FDARE+WP CP++  I DQS+CGSCWA     A+SDRLCI SNG F
Sbjct: 73  ESLGDEN---IPKTFDAREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTF 129

Query: 237 TGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
           T  +S+  +V+C   C +GC GG+P  AW FW   G+VTGG      GC+ Y    C HH
Sbjct: 130 TKSLSSIDLVSCCGYCGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHH 189

Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYE 324
                  C       TP+C   C  P+ +
Sbjct: 190 GSKKYPPCPHR-IYDTPKCVPKCDTPNID 217



 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 49/83 (59%), Positives = 62/83 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F VY DF  YK GVY H+ G+ IG HA+R+LGWG EN  PYWL+ANS
Sbjct: 239 MKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEENGTPYWLIANS 298

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG+ G FK+LRG+NE  IE
Sbjct: 299 WNEGWGEDGYFKMLRGKNECGIE 321


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 102/210 (48%), Gaps = 46/210 (21%)

Query: 149 ILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIA 207
           ++RG N+  I+ G+                   +  + LP +FDAR  +P C   + HI 
Sbjct: 516 LMRGSNDKAIKKGY-----------------AIEELQDLPTDFDARTAFPNCSKVIGHIR 558

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
           DQS CGSCWA  V  A +DRLCI SNG FT  +SA  + AC P+  GCNGG+P  AW + 
Sbjct: 559 DQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAPS-HGCNGGFPNSAWSWV 617

Query: 268 GHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ------------------GPLQNC 303
              G+ TGGDY ++      +GC PY   PC HH+                      +  
Sbjct: 618 HDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVSCSGESPPATAETA 677

Query: 304 TLLG---KLKTPECKQNCYNPSYESTYRFD 330
           T++      +TP C + C+NP Y +T R D
Sbjct: 678 TVIAYQNSYETPNCAEQCHNPKYTTTLRDD 707



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 40/67 (59%), Positives = 50/67 (74%)

Query: 86  LVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
           + A FSVY DFL YKSGVY+H  G+ +G HAV+++GWG E+   YW+V NSWN+ WGDHG
Sbjct: 749 VSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWNEDWGDHG 808

Query: 146 TFKILRG 152
            FKI  G
Sbjct: 809 LFKIALG 815


>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 207

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 86/129 (66%), Gaps = 5/129 (3%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           SED++ + +       +PR FDAR+KW  C ++  I DQ NCGSCWA++ ++A +DRLC+
Sbjct: 76  SEDENYDNL----LGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCV 131

Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
           ASNG F   +SA+ +  C   C +GCNGG+P  AW  +  +G+VTGGDY S+EGC+PY +
Sbjct: 132 ASNGNFNQLLSAEELTFCCHKCGFGCNGGYPIKAWERFMKHGLVTGGDYKSREGCEPYRV 191

Query: 290 APCEHHVQG 298
            PC +   G
Sbjct: 192 PPCPYDELG 200


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 68/151 (45%), Positives = 86/151 (56%), Gaps = 8/151 (5%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAR  +P C   + HI DQS CGSCWA  V  A +DRLCI S+G FT  +SA  +
Sbjct: 21  LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEM 80

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGP 299
            AC P+  GCNGG+P  AW +    G+ TGGDY ++      +GC PY   PC HHV   
Sbjct: 81  NACAPS-HGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDS 139

Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
                     +TP C + C+NP Y +T R D
Sbjct: 140 KYPKCPKDSYETPNCAEQCHNPKYTTTLRDD 170



 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 44/78 (56%), Positives = 55/78 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +A   I   GP+ A F+VY DFL YKSGVY+H  G+ +G HAV+++GWG E+   YWLV 
Sbjct: 186 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWLVV 245

Query: 135 NSWNDHWGDHGTFKILRG 152
           NSWN+ WGDHG FKI  G
Sbjct: 246 NSWNEDWGDHGLFKIALG 263


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 10/158 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++W +CP++R I DQ  CGSCWA     ++SDR CI S       ++A  ++
Sbjct: 88  IPAQFDSRQQWQDCPTIREIRDQGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDVL 147

Query: 247 ACTPNCWGC----NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           +C   CWGC    NGG+P  AW +W   G+VTGG+Y++ EGC PY +  C+HHV G L  
Sbjct: 148 SC---CWGCGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGP 204

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C       TP+C + C    Y   ++ D   GK ++ V
Sbjct: 205 CGQ--DPPTPKCVRLC-RKGYNIDFKDDKHYGKSSYSV 239



 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K ++ V         +I ++GP+   F+VYADF  YKSGVY+ +  D++G HA+R+L
Sbjct: 231 HYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRIL 290

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN +P+WLVANSWN  WGD G FKILRG NE  IE
Sbjct: 291 GWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIE 329


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 92/155 (59%), Gaps = 13/155 (8%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P NFDAR +WP+C S++ I +QSNCGSCWA S A  ISDR CIASNG     IS   ++ 
Sbjct: 84  PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143

Query: 248 CTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           C   +C  GC+GG+P  A+++W   GVVTGGDY    GC+PY + PC         NC  
Sbjct: 144 CCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGT-GCKPYPIRPCNS------DNCV- 195

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              L+TP C+ +C  P Y +TY  D   G  A+ V
Sbjct: 196 --NLQTPPCRLSC-QPGYRTTYTNDKNYGNSAYPV 227



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y   A+ VPR  A  Q  IY +GP+VA F VY DF +YKSG+Y+H  G S G HAV+++
Sbjct: 219 NYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLI 278

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E   PYWL  NSW   WG+ GTF+ILRG +E  IE
Sbjct: 279 GWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIE 317


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230



 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR KW  C ++  + DQ NCGSCWAV+ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 86  IPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEIT 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+ +   G+VTGGDY S EGC+PY + PC +  QG   N T 
Sbjct: 146 FCCHTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQG---NNTC 202

Query: 306 LGKLKTP--ECKQNCY 319
            GK       C + CY
Sbjct: 203 AGKPMESNHRCTRMCY 218



 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y +  + +   +  + +  +GP+ A F VY DF  YKSGVY  +   S +G HAV+++GW
Sbjct: 231 YTRDYYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGW 290

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWN+ WGDHG FKI RG NE  ++
Sbjct: 291 GEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVD 327


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 91/147 (61%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR +W  C ++  + DQ +CGSCWA++ ++A +DRLC+A+NG F   +SA+ I 
Sbjct: 88  IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+++  +G+VTGG+Y S EGC+PY + PC    +G   +C  
Sbjct: 148 FCCHTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGK-SSCAG 206

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
               K   C + CY   +  Y   +RF
Sbjct: 207 KPIEKNHRCTRMCYGNQDLDYNDDHRF 233



 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVYQ     + +G HAV+++GWGVE   PYWL+ NS
Sbjct: 247 KDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKI RG +E  I+
Sbjct: 307 WNAQWGDNGLFKIRRGTDECGID 329


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 93/160 (58%), Gaps = 13/160 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  +D R+ W  C S  +I DQ+NCGSCWAVS A AISDR+CIA+       ISA  +V
Sbjct: 86  IPEEYDPRKIWSNCTSF-YIRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLV 144

Query: 247 AC-TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C TP C +GC+GGW   AW ++ + G+V+GG+Y S+  C+PY + PC HH      N T
Sbjct: 145 TCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHH-----GNDT 199

Query: 305 LLG----KLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G    +  TP CK+ C  P Y   YR D + G  A  +
Sbjct: 200 YYGECPEEASTPSCKKKC-QPGYRKLYRMDKRYGTDAFQL 238



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 62/82 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++ ++GP+ A F+VY DF  YKSG+Y+H  G+  G HAV+++GWG EN   YWL+ANSW
Sbjct: 247 KELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTDYWLIANSW 306

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +D WG++G F+I+RG N+  IE
Sbjct: 307 HDDWGENGYFRIIRGINDCGIE 328


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/163 (42%), Positives = 92/163 (56%), Gaps = 10/163 (6%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A   P +FDAR  W  C S+ HI +Q NC + WA+SV +A++DR+CIAS G  T   S Q
Sbjct: 93  ANETPESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQ 152

Query: 244 HIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE--------H 294
            +V+C  +C  GC+GG+   AWR+    G+VTGGDY S EGCQP+ + PC          
Sbjct: 153 KLVSCCEDCGNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPS 212

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKA 337
            V GP   C       TP+C  +CYN  +E  Y  D+ K KK 
Sbjct: 213 SVLGPHGVCG-GDPATTPKCDLSCYNARHEGKYLDDIIKAKKV 254



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 48/94 (51%), Positives = 59/94 (62%)

Query: 66  KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
           KK      C+A + + +HGP V    VY DFL YKSGVY H  GD +GL +VR++GWG+E
Sbjct: 252 KKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLE 311

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               +WL+ANSW   WGD G FKI R  NE  IE
Sbjct: 312 GGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIE 345


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/185 (39%), Positives = 99/185 (53%), Gaps = 13/185 (7%)

Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
           +++ F   +E +S    +L   G    + LP  FDAREKWP+C +++ I +Q+ CGSCWA
Sbjct: 63  MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122

Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTG 275
              A  ISDR+CI SNG     IS + I++C  T   +GC GG+   A RFW  +G VTG
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTG 182

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           GDY    GC PY+ APC        +NC    +  TP CK  C +      Y+ D   G 
Sbjct: 183 GDYGGH-GCMPYSFAPCT-------KNCP---ESTTPSCKTTCQSSYKTEEYKKDKHYGA 231

Query: 336 KAHMV 340
            A+ V
Sbjct: 232 SAYKV 236



 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVR 118
           HY   A+ V    ++ +I    Y +GP+ A + VY DF  YKSGVY +  G  +G HAV+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVK 287

Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           ++GWGVEN + YWL+ANSW   +G+ G FKI RG NE  IE
Sbjct: 288 IIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIE 328


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+CKQ C   YN SYE
Sbjct: 209 -DKLYKTPQCKQICQKGYNTSYE 230



 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/155 (45%), Positives = 86/155 (55%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAREKWP C S+R I DQS+CGSCWAV+   A+SDR+CI SNG    ++SA  +V
Sbjct: 63  LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 122

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG P  AW +W  NG+VTGG   +  GC PY    C H       N   
Sbjct: 123 SCCSYCGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCP 182

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C   C    Y+ TY  D   GK ++ V
Sbjct: 183 GYIYPTPSCYPYC-QAGYDKTYEEDKVYGKTSYNV 216



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           M++I ++GP+ A F VY DF  YKSG+Y H  G   G HA+R++GWGVEN + YW
Sbjct: 224 MQEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVNYW 278


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 129/277 (46%), Gaps = 44/277 (15%)

Query: 62  SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           SHY+         + M++IY+ GP+VA F VY DFL Y SG  Q   G+        +  
Sbjct: 140 SHYYVNQDEF---DIMQEIYQRGPVVAGFKVYHDFLYYISG--QFICGNKRCEEEENLTS 194

Query: 122 WGV------ENDIPYWLVANSWNDHWGD-------------HGTFKILRGENEADIEMGF 162
           W V      E +    LV  +     G               G      G ++ +I +  
Sbjct: 195 WEVNFAYVEEQEKKNALVKLNLKRRKGTKLKQKLCMQKEKIMGLNPYFSGMSKEEILIRM 254

Query: 163 NNRVEANSSE-DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVA 221
             ++  +S+E D  L        K LP++FD+REKWPEC  +R I DQSNCGSCWAVS A
Sbjct: 255 GTKLMNSSTEFDSKLSNNNEALIKKLPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAA 314

Query: 222 NAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
           + ++DR CIAS G  T  IS + I+AC         G     + +W   G+ TGG Y  +
Sbjct: 315 SVMTDRHCIASKGQETPYISDEQILAC---------GMIPSPFNYWKKMGIATGGPYGDK 365

Query: 282 EGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
             CQPY++APC          C+      TP CK +C
Sbjct: 366 SCCQPYSIAPC--------SKCSYTA--STPSCKYDC 392



 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 47/83 (56%), Positives = 59/83 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY HGP+VA F VY DF  Y SG+YQ     ++G HA+R++GWG EN IPYWL+ANS
Sbjct: 421 MNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIGWGEENGIPYWLIANS 480

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  +G+ G F+I RG NE  IE
Sbjct: 481 WNTTFGEKGFFRIRRGTNECRIE 503



 Score = 45.1 bits (105), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 15/94 (15%)

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           A  +++      GC  G  + A+ +W  +G+VTGG Y  +  C PY+++PC         
Sbjct: 57  ALFVISRIAALVGCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCT-------- 108

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
                  +  P+C++ C     +++Y   LK+ K
Sbjct: 109 --MCRPYMLAPKCQRTC-----QASYNLSLKRDK 135


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 90/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL +TP+CKQ C   YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230



 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 94/156 (60%), Gaps = 3/156 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R++WP+C S+ +I DQS CG+ WA +   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPTSFDSRKEWPQCKSISNIRDQSRCGAGWAFAAVQAMSDRICIESKGKKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC  G+P +AW +W   G+VTGG   +  GCQPY    CEHH +G    C  
Sbjct: 150 SCCIECGLGCQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGE 209

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           +  +K P+C Q C    Y++ Y  D   GK ++ +L
Sbjct: 210 IIYMK-PKCHQKC-QKGYKTPYEKDKYYGKVSYNLL 243



 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 46/87 (52%), Positives = 64/87 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F V++DFL YKSG+Y+H  G  IG H VR++GWGVE + PYWL+ANSW
Sbjct: 251 KEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKETPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNN 164
           N+ WG+ G F++LRG++E  IE    +
Sbjct: 311 NEDWGEKGYFRMLRGKDECGIESAVTS 337


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/144 (48%), Positives = 85/144 (59%), Gaps = 10/144 (6%)

Query: 187 LPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP ++D REKW   CPS   I DQ +CGSCWA     A +DR+CI SNG     ISA+ +
Sbjct: 77  LPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDL 136

Query: 246 VACTPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           + C    W   GCNGG    AW F+ + G VTGG YNS EGCQPY +  CEHH  G  + 
Sbjct: 137 LTCC-GFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP 195

Query: 303 CTLLGKLKTPECKQNC---YNPSY 323
           C   G   TP+CK++C   YN SY
Sbjct: 196 CE--GSEPTPKCKRSCREGYNVSY 217



 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 66/81 (81%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY +GP+ A F+VY+DF  YKSGVY++  G+++G HA+++LGWGVEN++PYWLVANSWN
Sbjct: 240 EIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENNVPYWLVANSWN 299

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKILRG NE  IE
Sbjct: 300 PDWGDKGFFKILRGSNECGIE 320


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  I+
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSID 332


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 63/99 (63%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP  A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 87/134 (64%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW +C ++  + DQ NCGSCWA++ ++A +DRLC+A++  F   +S + + 
Sbjct: 88  IPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSSAFADRLCVATDADFNEFLSPEELT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW  +  +G+VTGGDY S EGC+PY + PC HH +G   +C+ 
Sbjct: 148 FCCHTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGN-NSCSD 206

Query: 306 LGKLKTPECKQNCY 319
               K   C + CY
Sbjct: 207 KPMEKNHRCTRMCY 220



 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
           Y + ++ +   +  + +  +GP+ A F VY DF  YKSGVY + +    +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E+ +PYWL+ NSWN  WGD G FKI RG NE  ++
Sbjct: 293 GEESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVD 329


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+R+KWP C S+  I DQS CGSCWA     A+SDR CI S G    ++SA  ++
Sbjct: 89  IPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW FW   G+VTG    +  GC+PY    CEHH +G    C  
Sbjct: 149 SCCESCGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207

Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             K+ KTP CKQ C    Y++ Y  D  +GK ++ V
Sbjct: 208 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 241



 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 250 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 309

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 310 NEDWGENGYFRIVRGRDECFIE 331


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 261

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206

Query: 306 LGKLKTPECKQNCY 319
             +     C + CY
Sbjct: 207 KPRESNHRCTRMCY 220


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 56/82 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++G GVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECLIE 332


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+R+KWP C S+  I DQS CGSCWA     A+SDR CI S G    ++SA  ++
Sbjct: 94  IPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 153

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW FW   G+VTG    +  GC+PY    CEHH +G    C  
Sbjct: 154 SCCESCGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 212

Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             K+ KTP CKQ C    Y++ Y  D  +GK ++ V
Sbjct: 213 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 246



 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 255 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 314

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 315 NEDWGENGYFRIVRGRDECFIE 336


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++GWGVEN   YWL AN+W
Sbjct: 251 KDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAANTW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG NE  IE
Sbjct: 311 NEDWGEKGYFRIVRGRNECSIE 332


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 69/154 (44%), Positives = 91/154 (59%), Gaps = 11/154 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +F A+EKWP CPS+  I DQ NCGSCWAVS A+ +SDRLCIAS      QISA+ ++
Sbjct: 71  LPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLL 130

Query: 247 ACT-PNC-----WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH-HVQGP 299
           +C   NC      GC+GG+P  AW++   +G+VTGG YN    C+PY+  PC H +  G 
Sbjct: 131 SCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGK 190

Query: 300 LQNCT---LLGKLKTPECKQNCYNPSYESTYRFD 330
              C     +    TP C + C+ P +  TY  D
Sbjct: 191 YSKCENDFFMLTEVTPSCTKKCH-PQFSRTYDVD 223



 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY +GP+ A+F+V+ DFL YKSGVYQ   G   G HAV+++GWG EN +PYW   NSWN
Sbjct: 244 EIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTENGVPYWEAINSWN 303

Query: 139 DHWGDHGTFKILRGENEADIE 159
           D WG +G FKILRG N  DIE
Sbjct: 304 DGWGINGKFKILRGFNHLDIE 324


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 209 -DKLYKTPQCNQTCQKGYNTSYE 230



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332


>gi|393902164|gb|EFO13452.2| hypothetical protein LOAG_15077, partial [Loa loa]
          Length = 186

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 97/155 (62%), Gaps = 6/155 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR +WP C S+  +A+Q  CGSCWA+S A+ +SDRLCIA+N     QISA+ ++
Sbjct: 11  LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 70

Query: 247 ACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C GC G  W   A+ +W ++GVVTGGDY S EGC+PYT AP   +   P      
Sbjct: 71  SCCTECGGCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAP---NCGSPCSFEYY 127

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             K+ +P C++ C  P Y  +Y  DL   +KA+ +
Sbjct: 128 RRKI-SPACQKTC-QPLYGLSYEEDLISSQKAYWI 160


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 57  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 116

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C  
Sbjct: 117 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG- 175

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL KTP+C Q C   YN SYE
Sbjct: 176 -DKLYKTPQCNQTCQKGYNTSYE 197



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 201 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 260

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 261 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 299


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 107/208 (51%), Gaps = 9/208 (4%)

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--SEDDDLETMGCQNAKGLPR 189
           L   ++ D+  +H +F   R E   + E     R+  +    E    E +        P 
Sbjct: 34  LSGQAFVDYINEHQSF--YRAEYSPEAEAFVKARIMDSKYLVEPKKEEVLEDVYGNDPPA 91

Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
           +FDAR  WPEC S+  I DQS+CGSCWAVS A A+SD +C+ SN      IS   I++C 
Sbjct: 92  SFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCC 151

Query: 250 P-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
             +C +GC GGWP  A+++   +GVVTGG Y  ++ C+PY   PC HH   P       G
Sbjct: 152 GISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGG 211

Query: 308 KLKTPECKQNC---YNPSYESTYRFDLK 332
              TP+C++ C   YN SY+    F  +
Sbjct: 212 LWPTPKCRKTCQRKYNKSYQEDKHFATR 239



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+  +A+ +P    N  ++IY++GP+VA F VY DF  YK G+Y H +G   G HAV+V+
Sbjct: 235 HFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 294

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG EN   YWL+ANSWN  WG+ G F+I+RG NE  IE
Sbjct: 295 GWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIE 333


>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
          Length = 243

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 86  IPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 146 FCCYSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 204

Query: 306 LGKLKTPECKQNCY 319
             +     C + CY
Sbjct: 205 KPRESNHRCTRMCY 218


>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
          Length = 156

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 63/147 (42%), Positives = 89/147 (60%), Gaps = 4/147 (2%)

Query: 196 KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-W 253
           +WP CP++  I DQ +CGSCWA      ISDR+C+ +N   + ++SA+ +++C    C  
Sbjct: 2   QWPNCPTISEIRDQGSCGSCWAFGSVEVISDRICVHTNAKVSVEVSAEDLLSCCGFECGM 61

Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
           GCNGG+P  AWR+W   G+V+GG Y+S  GC  YT+ PCEHHV G    CT  G  +TP 
Sbjct: 62  GCNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEHHVNGSRPPCTGEGG-ETPR 120

Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C ++C  P Y  +Y+ D   G   + V
Sbjct: 121 CSRHC-EPGYSPSYKEDKHYGSHIYGV 146


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 71/160 (44%), Positives = 99/160 (61%), Gaps = 14/160 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  +D REK+ +C +  +I DQ+NCGSCWAVS A AISDR+CIA+NG     IS+  I+
Sbjct: 86  IPEEYDPREKF-KCSTF-YIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDIL 143

Query: 247 A-CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
             C P C +GC GGW   AW ++ + GVV+GG+Y ++  C+PY + PC HH      N T
Sbjct: 144 TCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHH-----GNDT 198

Query: 305 LLG----KLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             G    +  TP CK+ C  P Y+  +R D ++GK A+ V
Sbjct: 199 YYGECPREAATPPCKKKC-QPGYKKIFRMDKRQGKVAYGV 237



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 68/101 (67%), Gaps = 6/101 (5%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI--PYWLVAN 135
           R+I  HGP+VA F+VY DF  YK+GVY+H  G   G HAV+++GWGV++     YWL+AN
Sbjct: 246 REILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLIAN 305

Query: 136 SWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
           SW++ WG++G F+ +RG N+ +IE    + V A   + D L
Sbjct: 306 SWHNDWGENGYFRFIRGINDCEIE----DTVAAGIVDVDSL 342


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206

Query: 306 LGKLKTPECKQNCY 319
             +     C + CY
Sbjct: 207 KPRESNHRCTRMCY 220



 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
           Y + ++ +   +  + +  +GP+ A F VY DF  YKSGVY +      +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWN  WGD+G FKI RG NE  I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGID 329


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206

Query: 306 LGKLKTPECKQNCY 319
             +     C + CY
Sbjct: 207 KPRESNHRCTRMCY 220



 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
           Y + ++ +   +  + +  +GP+ A F VY DF  YKSGVY +      +G HAV+++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWN  WGD+G FKI RG NE  I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGID 329


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/166 (43%), Positives = 99/166 (59%), Gaps = 14/166 (8%)

Query: 182 QNAKGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ- 239
           + A  LP  FD+R +W + C SL  + DQSNCGSCWA   A ++SDR CI       GQ 
Sbjct: 88  RQANNLPSEFDSRVQWGDKCSSLWEVRDQSNCGSCWAFGAAESLSDRHCI-----HLGQD 142

Query: 240 --ISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
             +S Q++V C   C +GC+GGWP+ A  ++ +NG+VTG  Y +   CQ Y+LAPC HHV
Sbjct: 143 IRLSTQNLVTCCDECGFGCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHV 202

Query: 297 QGPLQ-NCTLLGKLKTPECKQNC-YNPSYESTYRFDLKKGKKAHMV 340
              +   CT  G+L TP C ++C  N +Y   Y  DL KG KA+ +
Sbjct: 203 TSDVYPPCT--GELPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSI 246



 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 51/83 (61%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I  +GP+   F+VY DFL YKSGVYQH  G  +G HAV+++GWGVEN  PYW++ NS
Sbjct: 254 MTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVENGTPYWIIVNS 313

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WGD GTFKILRG+NE  IE
Sbjct: 314 WNESWGDKGTFKILRGQNECGIE 336


>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
          Length = 201

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 56/113 (49%), Positives = 81/113 (71%), Gaps = 1/113 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+NG F   +SA+ I 
Sbjct: 72  IPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 131

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
            C   C +GC+GG+P  AW+ +  +G+VTGG+YNS EGC+PY + PC +  QG
Sbjct: 132 FCCHTCGFGCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPYRVPPCPYDDQG 184


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 78/213 (36%), Positives = 110/213 (51%), Gaps = 10/213 (4%)

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--SEDDDLETMGC 181
           VE D+   L   ++ D+  +H +F   R E   + E     R+  +   +E    E +  
Sbjct: 26  VEKDVEK-LTGQAFVDYINEHQSF--YRAEYSPEAEAFVKARIMDSKFLAEQKKEEVLAD 82

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
                 P +FDAR +WPEC S+  I DQS CGSCWAVS A A+SD +C+ SN      IS
Sbjct: 83  VYGDDPPDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMIS 142

Query: 242 AQHIVACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
              I++C   +C +GC GGWP  A+R+   +GVVTGG Y  ++ C+PY+  PC  H   P
Sbjct: 143 DTDILSCCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVP 202

Query: 300 LQNCTLLGKLKTPECK---QNCYNPSYESTYRF 329
                  G   TP+C+   Q  YN +Y+    F
Sbjct: 203 YYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHF 235



 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 71/117 (60%), Gaps = 3/117 (2%)

Query: 45  KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQ-IYEHGPLVAIFSVYADFLQYKSG 102
           K +K  +R Y  T     H+  +++ +P    ++RQ IY++GP+VA F VY D+     G
Sbjct: 216 KCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYEDYSS-TGG 274

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +Y H +G   G HA +V+GWG EN   YWL+ANSWN  WG+ G ++I+R  +  +IE
Sbjct: 275 IYVHKWGIQTGAHADKVIGWGRENGTDYWLIANSWNTDWGEDGYYRIVRETDNCEIE 331


>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
          Length = 239

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 55/116 (47%), Positives = 79/116 (68%), Gaps = 1/116 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNTDFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G + 
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHIH 203


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 89/143 (62%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   +W +W   G+VTGG   +   C+PY    C+H V+G  + C  
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACG- 208

Query: 306 LGKL-KTPECKQNC---YNPSYE 324
             KL +TP+CKQ C   YN SYE
Sbjct: 209 -DKLYETPQCKQTCQKGYNTSYE 230



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 234 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 294 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332


>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
          Length = 230

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++ H+ +Q NCGSCWA     A +DRLC+A+NG F   ISA+ + 
Sbjct: 44  VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 103

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+++  +GVVTGGDY++ +GCQPY + PC    +G   +C+ 
Sbjct: 104 FCCHTCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEG-HNSCSG 162

Query: 306 LGKLKTPECKQNCY 319
               +  +C + CY
Sbjct: 163 QPTERNHKCSKKCY 176



 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 3/65 (4%)

Query: 44  KKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYK 100
           ++  K  K+ Y   +I    +HY  K A+ +      +    +GP+ A F VY DF+ Y+
Sbjct: 166 ERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDFMNYE 225

Query: 101 SGVYQ 105
           SGVYQ
Sbjct: 226 SGVYQ 230


>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 255

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 84/134 (62%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FDAR KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C +GCNGG+P  AW  +   G+VTGGDY S EGC+PY + PC +  +G    C  
Sbjct: 148 FCCHSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGH-NTCAG 206

Query: 306 LGKLKTPECKQNCY 319
             +     C + CY
Sbjct: 207 KPRESNHRCTRMCY 220


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 64/134 (47%), Positives = 86/134 (64%), Gaps = 3/134 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +  +R KWP+C SL+ I DQ+NCGSCWAVS A+A+SDR+CIASNG     +SA  I+
Sbjct: 2   IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  N C +GCNGGWP  A+ ++   G VTGGDY +  GC+PY   PC HH +       
Sbjct: 62  SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYG-E 120

Query: 305 LLGKLKTPECKQNC 318
              +  TP+C + C
Sbjct: 121 CPNEATTPKCVRKC 134



 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I ++GP+V  F+VY DF  YK G+Y+H  G + G HA++++GWG E  +PYWL+ANSW
Sbjct: 164 REIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIANSW 223

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           ++ WG++G F+IL G N   IE
Sbjct: 224 HNDWGENGYFRILCGSNHCGIE 245


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 66/151 (43%), Positives = 91/151 (60%), Gaps = 14/151 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA-----SNGYFTGQIS 241
           +P  FDAR++WP CPS+  I DQ +CGSCWA+ +      RLC+      SNG     +S
Sbjct: 81  IPDEFDARKQWPNCPSITDIRDQGSCGSCWALELL-----RLCLIVFVSHSNGKLQVHLS 135

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+++V C  +C  GC GG P  AW +W   G+V+GG+Y S+EGCQPY++APCEHH+ G  
Sbjct: 136 AENLVTCCGSCGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEHHIPGSR 195

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
             C   G+  T +C++ C    Y   Y  DL
Sbjct: 196 PPCR--GEGHTADCRKQC-EKGYSIPYDKDL 223



 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 61/82 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I ++GP+ A F VY D L YK GVY+H  G  +G HA+++LGWGVEN  PYWL+ANSWN
Sbjct: 242 EILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGVENGTPYWLIANSWN 301

Query: 139 DHWGDHGTFKILRGENEADIEM 160
             WG++G FKILRG +E  IE+
Sbjct: 302 TDWGNNGFFKILRGSDECGIEI 323


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 85/154 (55%), Gaps = 12/154 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDARE WP+C S++ I +Q+ CGSCWA   A  ISDR+CI SNG     IS + I+
Sbjct: 93  IPDTFDARENWPDCKSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDIL 152

Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  T    GC GG+   A RFW  NG VTGGDYN   GC PY+ APC+   + P    T
Sbjct: 153 SCCGTTCGKGCQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQ---KSPCVEST 208

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
                 TP CK  C +    + Y  D   G  A+
Sbjct: 209 ------TPTCKTTCQSSYTTANYTTDKHYGTSAY 236



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 72/130 (55%), Gaps = 10/130 (7%)

Query: 63  HYFKKAHMVPRCNAM-----RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAV 117
           HY   A+ +   N +      +IY +GP+ A + VY DF QYKSGVY +  G  +G HAV
Sbjct: 230 HYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAV 289

Query: 118 RVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV-----EANSSE 172
           +++GWG END+ YWLVANSW   +G+ G FKI RG NE  IE      V      A   +
Sbjct: 290 KIIGWGTENDVDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGVAKLGTHAEKGD 349

Query: 173 DDDLETMGCQ 182
           DDD     C 
Sbjct: 350 DDDGSATSCS 359


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 86/158 (54%), Gaps = 12/158 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWP C S+R I DQSNCGSCWAVS A+ +SDRLCI SNG      S   I+
Sbjct: 89  LPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDIL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  NC  GC+GG P  A+ F   NGV TGG +     C+PY   PC  H     QN   
Sbjct: 149 SCCWNCGMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRH-----QNQKY 203

Query: 306 LGKL-----KTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            G        TP+C++ C    Y   Y+ D   G  A+
Sbjct: 204 FGPCPKELWPTPKCRKMC-QLKYNVAYKDDKIYGNDAY 240



 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 64/98 (65%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y   A+ +P      M++I+ +GP+V  FSV+ADF  YK GVY  N     G HAV+++G
Sbjct: 235 YGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIG 294

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGV++ + YWL+ANSWN+ WGD G  + LRG+N   IE
Sbjct: 295 WGVQDGLKYWLIANSWNNDWGDEGYVRFLRGDNHCGIE 332


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 62/133 (46%), Positives = 76/133 (57%), Gaps = 2/133 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR  WP CPS+  I DQS+CGSCWA     A+SDRLCI S G F   +SA  +V
Sbjct: 639 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 698

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+  +AW FW  +G+VTGG      GC+ Y    CEH  +G    C  
Sbjct: 699 SCCTECGCGCRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPCP- 757

Query: 306 LGKLKTPECKQNC 318
                TPEC + C
Sbjct: 758 HQLYPTPECIKRC 770



 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/159 (39%), Positives = 86/159 (54%), Gaps = 4/159 (2%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + K LP++FDAR KWP CPS+  I DQS+C S WA     ++SDRLCI SNG F   +SA
Sbjct: 47  DEKELPKSFDARTKWPHCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSA 106

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
             +++C  +C  GC  G+  +AW FW  +G+VTGG      GC+ +    C H  +G   
Sbjct: 107 TDLLSCCEDCGLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYP 166

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            C       TPEC + C  P  E  Y  D  +   ++ V
Sbjct: 167 PCP-RHIYPTPECIKQCDEP--EVNYEKDKTRANISYNV 202



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 61/85 (71%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M++I  +GP+ A F +YADFL+Y  GVY H +G  I  HA+R+LGWG ++ +PYWL+A
Sbjct: 208 SIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWGEDDGVPYWLIA 267

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN+ WG+ G  + LRG NE  IE
Sbjct: 268 NSWNEDWGEKGYVRFLRGHNECGIE 292



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 57/83 (68%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
            M++I   GP+ AI  VY D L YKSGVY H +G  +G H +R+LGWG E+ +PYWLVAN
Sbjct: 854 VMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVAN 913

Query: 136 SWNDHWGDHGTFKILRGENEADI 158
           SWN+ WG+ G  ++LR  NE  I
Sbjct: 914 SWNEDWGEKGYMRVLRWRNECGI 936


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 86/149 (57%), Gaps = 6/149 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FD+ EKWPECPS+  + DQS+C SCWA  V    +DR+CI S G    ++SA+ ++
Sbjct: 69  LPKEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVL 128

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C + C GG+  +AW +    GVVTGG YNS E C+ Y   PC H ++G    C+ 
Sbjct: 129 ECCKDCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHGIEGQYPQCST 188

Query: 306 LGKLKTPECKQNC---YNPSYE-STYRFD 330
              +  P+C+  C   Y   YE   Y+F 
Sbjct: 189 KPPV-VPKCETTCQEGYPIEYEKDRYKFS 216



 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 53/81 (65%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I E+GP+ A F VY DF+ YKSG+Y H  G  + LH V+++GWG EN   YW   NSWN
Sbjct: 231 EIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEENGEAYWKAVNSWN 290

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G F+I  G NE  IE
Sbjct: 291 SEWGENGLFRIRLGTNECTIE 311


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++ H+ +Q NCGSCWA     A +DRLC+A+NG F   ISA+ + 
Sbjct: 84  VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+++  +GVVTGGDY++ +GCQPY + PC    +G   +C+ 
Sbjct: 144 FCCHRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGH-NSCSG 202

Query: 306 LGKLKTPECKQNCY 319
               +  +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 4/134 (2%)

Query: 30  KKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPL 86
           K  E       +  ++  K  K+ Y   +I    +HY  K A+ +      +    +GP+
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPI 251

Query: 87  VAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
            A F VY DF+ Y+SGVYQ     S +G HAV+++GWGVE   PYWL+ NSW + WGD G
Sbjct: 252 EASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKG 311

Query: 146 TFKILRGENEADIE 159
            FKILRG +E  IE
Sbjct: 312 MFKILRGTDECGIE 325


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 85/134 (63%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++ H+ +Q NCGSCWA     A +DRLC+A+NG F   ISA+ + 
Sbjct: 84  VPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELT 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW+++  +GVVTGGDY++ +GCQPY + PC    +G   +C+ 
Sbjct: 144 FCCHRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGH-NSCSG 202

Query: 306 LGKLKTPECKQNCY 319
               +  +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 4/134 (2%)

Query: 30  KKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPL--SHY-FKKAHMVPRCNAMRQIYEHGPL 86
           K  E       +  ++  K  K+ Y   +I    +HY  K A+ +      +    +GP+
Sbjct: 192 KDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPI 251

Query: 87  VAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
            A F VY DF+ Y+SGVYQ     S +G HAV+++GWGVE   PYWL+ NSW + WGD G
Sbjct: 252 EASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKG 311

Query: 146 TFKILRGENEADIE 159
            FKILRG +E  IE
Sbjct: 312 MFKILRGTDECGIE 325


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/150 (40%), Positives = 94/150 (62%), Gaps = 18/150 (12%)

Query: 188 PRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           P  FD+R+ WP+C   +  I DQ+NCGSCWAVS A+ +SDR CIA++G FT  +S   ++
Sbjct: 290 PVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELL 349

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C +GCNGG+PQ  +++W ++G+ TGG Y S + C+PY + PC         NC+ 
Sbjct: 350 SCCTSCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPC--------SNCS- 400

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
             + +TP+C ++C      STY   L + +
Sbjct: 401 --ETRTPKCSKSCI-----STYPLSLNEDR 423



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 60/85 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M+ I  +GP+VA  SVY DFL YK GVY    G  +G HAVR++GWG +++IPYWLVA
Sbjct: 438 SMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDNIPYWLVA 497

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN  +G+ G FKI RG +E  IE
Sbjct: 498 NSWNTTFGEDGLFKIRRGFDECGIE 522


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGSCWA     A+SDR CI S G    ++SA  ++
Sbjct: 89  IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW +W   G+VTG    +  GC+PY    CEHH +G    C  
Sbjct: 149 SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207

Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             K+ KTP CKQ C    Y++ Y  D  +GK ++ V
Sbjct: 208 -SKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 241



 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 250 KEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIANSW 309

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 310 NEDWGENGYFRIVRGRDECSIE 331


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 63/149 (42%), Positives = 92/149 (61%), Gaps = 9/149 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR+KW  C ++  + DQ +CGSCWA   ++A +DRLC+A++G F   +SA+ I 
Sbjct: 88  IPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW+++  +G+VTGG+Y S EGC+PY + PC    +G   N T 
Sbjct: 148 FCCHTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKG---NNTC 204

Query: 306 LGKL--KTPECKQNCY---NPSYESTYRF 329
            GK   K   C + CY   +  Y   +RF
Sbjct: 205 AGKPIEKNHRCTRMCYGDQDLDYNDDHRF 233



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 55/83 (66%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVY+     S +G HAV+++GWGVE   PYWL+ NS
Sbjct: 247 KDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPYWLMVNS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G FKI RG NE  I+
Sbjct: 307 WNAQWGDKGLFKIRRGTNECGID 329


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/171 (39%), Positives = 95/171 (55%), Gaps = 19/171 (11%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P  FDARE W  CP+L+ I DQ  CGSCWAV+  +A++DR+CI S G      S + +++
Sbjct: 85  PNQFDAREHWKNCPTLKDIRDQGGCGSCWAVAAVSAMTDRMCILSKGKEHFYFSIKDVLS 144

Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC--- 303
           C   C  GC GG    AW ++   G+V+GG Y S++GCQPYT+ PC H V G ++ C   
Sbjct: 145 CCGYCGNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNI 204

Query: 304 TLLGKLK--------------TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            +  K K              TPEC++ C N +Y+  Y  D  +GK  + V
Sbjct: 205 PMTPKCKNIPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRV 254



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 60/89 (67%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
           H  K  + V +    ++IYE+GP+ + F+VY DFL YK G+Y +  G  +GLH+V+++GW
Sbjct: 246 HRGKSVYRVKKSEIFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGW 305

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILR 151
           G E  I YWL ANS+N  WGD G FKI+R
Sbjct: 306 GEERGIKYWLAANSFNTDWGDKGFFKIIR 334


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 71/180 (39%), Positives = 97/180 (53%), Gaps = 13/180 (7%)

Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
           +++ F   +E +S    +L   G    + LP  FDAREKWP+C +++ I +Q+ CGSCWA
Sbjct: 1   MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 60

Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGVVTG 275
              A  ISDR+CI SNG     IS + I++C  T   +GC GG+   A RFW  +G VTG
Sbjct: 61  FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTG 120

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           GDY    GC PY+ APC        +NC    +  TP CK  C +      Y+ D   G+
Sbjct: 121 GDYGGH-GCMPYSFAPC-------TKNCP---ESTTPSCKTTCQSSYKTEEYKKDKHYGE 169



 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 57/81 (70%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY +GP+ A + VY DF  YKSGVY +  G  +G HAV+++GWGVEN + YWL+ANSW 
Sbjct: 202 EIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWG 261

Query: 139 DHWGDHGTFKILRGENEADIE 159
             +G+ G FKI RG NE  IE
Sbjct: 262 TSFGEKGFFKIRRGTNECQIE 282


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/177 (41%), Positives = 92/177 (51%), Gaps = 11/177 (6%)

Query: 169 NSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRL 228
           N   +DD +T        LP N+D R  W  C S   I DQ+NCGSCWAVS A AISDR+
Sbjct: 76  NPVVNDDNDT-----GADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRI 130

Query: 229 CIASNGYFTGQISAQHIVACT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
           CIA+ G      S   I+ C    C  GC GGWP  AW+F+ ++GVV+GG Y  +  C P
Sbjct: 131 CIATKGKKQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSP 190

Query: 287 YTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
           Y L PC  H       NC  +G   TP CK+ C  P +   YR D + G+      +
Sbjct: 191 YPLHPCGRHGNDTFYGNC--VGMAPTPPCKRKC-QPGFRGMYRVDKRYGEPGRTYTL 244



 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/96 (45%), Positives = 65/96 (67%), Gaps = 3/96 (3%)

Query: 67  KAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWG 123
           + + +PR      R I E G +VA+F+VY DF  Y+SG+Y+H  G  + G HAV+++GWG
Sbjct: 240 RTYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG 299

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            +N   YWL+ANSW+D WG++G F+++RG N   IE
Sbjct: 300 KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIE 335


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/136 (47%), Positives = 85/136 (62%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAR+KW +C ++  + DQ NCG+CWA   ++A +DRLCIA+NG F   +SA+ + 
Sbjct: 85  IPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEELA 144

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GC+GG+P  AW  +  +G+VTGGDYNS EGCQPY + PC     G   N T 
Sbjct: 145 FCCHKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217



 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y + A+ +        +  +GP+ A + VY DF  YKSGVY      S +G HAV+++GW
Sbjct: 230 YTRDAYYLNYQIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGW 289

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 290 GEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326


>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 199

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 56/125 (44%), Positives = 85/125 (68%), Gaps = 5/125 (4%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           SED++ + +  +    +P+ FDAR+KW  C ++  + DQ NCGSCWA+S ++A +DRLC+
Sbjct: 76  SEDENYDNLFGR----IPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTSSAFADRLCV 131

Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
           A+NG F   +SA+ +  C   C +GCNGG+P  AW  +  +G+VTGG+Y S EGC+PY +
Sbjct: 132 ATNGDFNQLLSAEELTFCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRV 191

Query: 290 APCEH 294
            PC +
Sbjct: 192 PPCPY 196


>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 260

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 58/133 (43%), Positives = 83/133 (62%), Gaps = 2/133 (1%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           PR FDAR+KW  C ++  + DQ +CGSCWA   ++A +DRLC+A++G F   +SA+ I  
Sbjct: 89  PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 148

Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C   C +GCNGG P  AW+++  +G+VTGG+Y S EGC+PY + PC    +G    C   
Sbjct: 149 CCHTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK-NTCAGK 207

Query: 307 GKLKTPECKQNCY 319
            + K   C + CY
Sbjct: 208 PREKNHRCTRMCY 220


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 87/155 (56%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGSCWA     A+SDR CI S G    ++SA  ++
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW +W   G+VTG    +  GC+PY    CEHH +G    C  
Sbjct: 63  SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               KTP CKQ C    Y++ Y  D  +GK ++ V
Sbjct: 123 K-IYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNV 155



 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 164 KEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIANSW 223

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 224 NEDWGENGYFRIVRGRDECSIE 245


>gi|219565128|dbj|BAH04068.1| cathepsin B [Equus caballus]
          Length = 162

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 55/107 (51%), Positives = 77/107 (71%), Gaps = 2/107 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP CP+++ I DQ +CGSCWA     AISDR+CI +NG+ + ++SA+ ++
Sbjct: 56  LPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDML 115

Query: 247 ACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
            C  + C  GCNGG+P  AW FW   G+V+GG Y+S  GC+PY++ P
Sbjct: 116 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPP 162


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 91/147 (61%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR++W  C ++  + DQ +CGSCWA   ++A +DRLC+A++G F   +SA+ + 
Sbjct: 87  IPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELT 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GCNGG+P  AW+++  +G+VTGG+Y S +GC+PY + PC  +  G   +C  
Sbjct: 147 FCCHACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGK-SSCAG 205

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
             K K   C + CY   +  Y+  +RF
Sbjct: 206 KPKEKNHRCTRMCYGNQDLDYDDDHRF 232



 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVYQ     + +G HAV+++GWGVE   PYWL+ NS
Sbjct: 246 KDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 305

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKI RG +E  I+
Sbjct: 306 WNAQWGDNGLFKIRRGTDECRID 328


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 64/162 (39%), Positives = 94/162 (58%), Gaps = 10/162 (6%)

Query: 158 IEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
           +E  F +  E    +++D++      ++ +P +FDAR+KWP+C S+  I DQS+CGSCWA
Sbjct: 66  MESRFLDNEEGEMLKEEDMDF-----SEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWA 120

Query: 218 VSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGG 276
           VS A  +SDRLC+ SNG     +S   I+AC PNC  GC GG    AW ++ + GV TGG
Sbjct: 121 VSSAETMSDRLCVQSNGTIKVLLSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGG 180

Query: 277 DYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
            Y +++ C+PY   PC+    G            TP+C++ C
Sbjct: 181 LYGTKDSCKPYAFYPCKDESYGKCPK----DSFPTPKCRKIC 218



 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 7/104 (6%)

Query: 63  HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y   A+ +P+       +I  +GP+ A F +Y DF  Y+ GVY  + G  +G HA++++
Sbjct: 231 YYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKII 290

Query: 121 GWGVE----NDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
           GWG E     D+PYWL+ANSW   WG+ +G F+ILRG+N   IE
Sbjct: 291 GWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIE 334


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 86/155 (55%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+R+KWP C S+  I DQS CGS WA     A+SDR CI S G    ++SA  ++
Sbjct: 67  IPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 126

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  G  GG+P LAW +W   G+VTG    +   CQPY    CEHH +G    C  
Sbjct: 127 SCCEHCGDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEHHTKGKYPAC-F 185

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               KTP C+  C   SY++ Y  D  +GK  + V
Sbjct: 186 EEIYKTPNCENTC-QKSYKTPYAQDKHRGKSRYNV 219



 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 63/82 (76%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F VY DFL YKSG+Y+H  G  +  HA+R++GWGVEN+ PYWL+ NSW
Sbjct: 228 KEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVENNTPYWLIPNSW 287

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+ILRG +E  IE
Sbjct: 288 NEDWGENGNFRILRGRHECSIE 309


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 65/152 (42%), Positives = 92/152 (60%), Gaps = 10/152 (6%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           SED++ + +  +    +PR FDAR+KW  C ++  I DQ NCGSCWA++ ++A +DRLC+
Sbjct: 76  SEDENYDNLFGR----IPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCV 131

Query: 231 ASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
            SN  F   +SA+ +  C   C +GCNGG+P  AW  +  +G+VTGGDY S EGC+PY +
Sbjct: 132 VSNEDFNQLLSAEELTFCCHKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRV 191

Query: 290 APCEHHVQGPLQNCTLLGKLKTP--ECKQNCY 319
            PC +   G   N T  GK       C + CY
Sbjct: 192 PPCPYDESG---NNTCAGKPMEANHRCTRMCY 220



 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y + ++ +   +  + +  +GP+ A F VY DF  YKSGVY  +   S +G HA +++GW
Sbjct: 233 YTRDSYYLTYGSIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWN  WGD+G FKI RG NE  I+
Sbjct: 293 GEEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGID 329


>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
          Length = 238

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 58/133 (43%), Positives = 83/133 (62%), Gaps = 2/133 (1%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           PR FDAR+KW  C ++  + DQ +CGSCWA   ++A +DRLC+A++G F   +SA+ I  
Sbjct: 67  PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 126

Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C   C +GCNGG P  AW+++  +G+VTGG+Y S EGC+PY + PC    +G    C   
Sbjct: 127 CCHTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK-NTCAGK 185

Query: 307 GKLKTPECKQNCY 319
            + K   C + CY
Sbjct: 186 PREKNHRCTRMCY 198


>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
          Length = 218

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 3/156 (1%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           G+P+ FDAR +W  C ++  + DQ NCGSCWA   + A +DRLCIA+ G F   ISA+ +
Sbjct: 43  GIPKAFDARLEWKYCKTIGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELISAEEL 102

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
             C   C  GCNGG P  AW+++  +GVVTGG+YN+  GCQPY + PC +  +G   +C+
Sbjct: 103 TFCCHLCGIGCNGGNPLRAWQYFKRHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHY-SCS 161

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              K +  +C + CY       Y+ D  K K A+ +
Sbjct: 162 GQQKERNHKCLKTCYGDK-TVDYKRDHYKTKDAYYL 196


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 68/160 (42%), Positives = 91/160 (56%), Gaps = 11/160 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +WP+C ++  I DQSNCGSCWA     ++SDR CI    +    ISA +++
Sbjct: 80  IPDMFDSRTQWPDCKTIGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHLL--ISAANLM 137

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYN----SQEGCQPYTLAPCEHHVQGPLQ 301
            C  NC  GC GG+   AW +W   G+VTGG YN      + CQPY L  CEHH+ G   
Sbjct: 138 ECCRNCGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYPLPSCEHHINGSKP 197

Query: 302 NCTLLGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            C    K+ KTPEC   C+   Y ++Y  DL  G+ A+ V
Sbjct: 198 ACP--SKIAKTPECVHTCH-AGYPTSYEQDLHYGESAYSV 234



 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ V R       +I  +GP+ A F+VYADF  YKSGVY+ +    +G HAV+++
Sbjct: 226 HYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMI 285

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+ IPYWL+ANSWN  WGDHG FKI+RG++E  IE
Sbjct: 286 GWGEEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 64/160 (40%), Positives = 100/160 (62%), Gaps = 6/160 (3%)

Query: 184 AKGLPRNFDAREKW-PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
            + LP+NFD+R++W  +CPSL  + DQS CGSCWA + A ++SDR+CI +      ++S 
Sbjct: 94  VENLPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRICIHTGEDV--RLST 151

Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           +++V+C  +C  GCNGG+P+ A +++   G+VTG  +     CQ Y+  PC HHV    +
Sbjct: 152 ENLVSCCSSCGDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHV-ASTK 210

Query: 302 NCTLLGKLKTPECKQNCYNPS-YESTYRFDLKKGKKAHMV 340
                G++ TPECK+ C + S  +  Y  DL KG+K++ V
Sbjct: 211 YPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSV 250



 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 52/83 (62%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I  +GP+   F+VY DF+ YKSGVYQH  G+ +G HAV+++GWGVEND PYWL+ NS
Sbjct: 258 MTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTPYWLIVNS 317

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WGD GTFKILRG NE  IE
Sbjct: 318 WNETWGDQGTFKILRGSNECGIE 340


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW  C ++  + DQ NCGSCWA++ ++A +DRLC+A+N  F   +SA+ I 
Sbjct: 88  IPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG+P  AW  +  +G+VTGG+Y S EGC+PY + PC +   G   N T 
Sbjct: 148 FCCHKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESG---NNTC 204

Query: 306 LGKL--KTPECKQNCY 319
            GK   +   C + CY
Sbjct: 205 SGKPMEQNHRCTRMCY 220



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 55/83 (66%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DFL YKSGVY  +   S +G HAV+++GWG E   PYWL+ NS
Sbjct: 247 KDVMTYGPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTPYWLMMNS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G FKI RG NE  ++
Sbjct: 307 WNADWGDEGLFKIRRGTNECGVD 329


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 73/172 (42%), Positives = 99/172 (57%), Gaps = 15/172 (8%)

Query: 176 LETMGCQNAKGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           LET+  Q A GLP  FDAR +W + C SL  + DQS CGSCWA   A ++SDR CI    
Sbjct: 83  LETVSAQ-ANGLPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCI---- 137

Query: 235 YFTGQ---ISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
              GQ   +S Q+++ C   C  GC+GGWP+ A  ++ + G+VTG  Y +   CQ YT A
Sbjct: 138 -HLGQDIRLSTQNLLTCCAACGDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFA 196

Query: 291 PCEHHVQGPLQ-NCTLLGKLKTPECKQNC-YNPSYESTYRFDLKKGKKAHMV 340
           PC HHV   +   CT  G+L TP C  +C  N ++   Y  D+ +G KA+ +
Sbjct: 197 PCAHHVTSDIYPPCT--GELPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGI 246



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 51/83 (61%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+    +VY DFL YK+GVYQH  GD +G HAV+++GWGVEN  PYW + NS
Sbjct: 254 MAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGVENGTPYWTIVNS 313

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WGD GTFKILRG+NE  IE
Sbjct: 314 WNESWGDKGTFKILRGKNECGIE 336


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 90/154 (58%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P NFDAR  +P C  +  H+ DQ +CGSCWA +   A +DRLCI S G     +S QH 
Sbjct: 168 VPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGVMPLSTQHT 227

Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
            +C    +C  +GCNGG P +AWR++   GVVTGGD+++      C PY +  C HH + 
Sbjct: 228 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAHHAKA 287

Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
           P  NC T +   KTP+C+++C   +Y E    FD
Sbjct: 288 PFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFD 321



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 68/127 (53%), Gaps = 4/127 (3%)

Query: 38  KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAH----MVPRCNAMRQIYEHGPLVAIFSVY 93
           + +K  K +K  ++  Y    +P      KA     +  R    R +  HG +   F VY
Sbjct: 297 RPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLRSRDAVKRDMMAHGTVTGAFMVY 356

Query: 94  ADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGE 153
            DFL YKSGVY+H +G  +G HA++++GWG E+   YW   NSWN +WGD G FKI  G+
Sbjct: 357 EDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVNSWNTYWGDSGHFKIEMGQ 416

Query: 154 NEADIEM 160
              D EM
Sbjct: 417 CGVDNEM 423


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 62/136 (45%), Positives = 87/136 (63%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAR+KW +C ++  + DQ +CGSCWA   ++A +DRLCIA++G F   +SA+ + 
Sbjct: 85  IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW ++  +G+VTGGDY+S EGCQPY + PC     G   N T 
Sbjct: 145 FCCHKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 62/98 (63%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
           H+ + A+ +      + +  +GP+ A F VY DF  YKSGVY      S +G HAV+++G
Sbjct: 229 HWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKILRG NE  I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKILRGTNECGID 326


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 64/139 (46%), Positives = 87/139 (62%), Gaps = 6/139 (4%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A+ +P NFDAR+KW +C S+  + DQ +CGSCWA   ++A +DRLCIA+ G F   +SA+
Sbjct: 81  AQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAE 140

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            +  C   C +GCNGG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N
Sbjct: 141 ELTFCCHKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYG---N 197

Query: 303 CTLLGKL--KTPECKQNCY 319
            T  GK   K   C + CY
Sbjct: 198 NTCHGKPMEKNHRCTRMCY 216



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
           HY + A+ +        +  +GP+ A F VY DF  YKSGVY      S +G HAV+++G
Sbjct: 228 HYTRDAYYLTYGTIQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIG 287

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 288 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 325


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 87  YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 146

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 147 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 194



 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 5/104 (4%)

Query: 239 QISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
           ++SA+ ++ C  +  G  CNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV
Sbjct: 4   EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHV 63

Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G    CT  G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 64  NGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 104


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 29  YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 88

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 89  MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 136


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 87/147 (59%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR +W  C ++  + DQ  CGSCWA   ++A +DRLC+A++G F   +SA+ + 
Sbjct: 87  IPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELT 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C  GCNGG+P  AW+++  +G+VTGG+Y S EGC+PY + PC  +  G   +C  
Sbjct: 147 FCCHTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDG-TSSCAG 205

Query: 306 LGKLKTPECKQNCY---NPSYESTYRF 329
               K   C + CY   +  Y   +RF
Sbjct: 206 QPIEKNHRCTRMCYGNQDLDYNDDHRF 232



 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 57/83 (68%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVYQ     + +G HAV+++GWGVE  IPYWL+ NS
Sbjct: 246 KDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNS 305

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W+  WGD+G FKI RG +E  I+
Sbjct: 306 WSAQWGDNGLFKIRRGTDECGID 328


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 149 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 208

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 209 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 256



 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 37/48 (77%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           LP +FDARE+WP+CP+++ I DQ +CGSCWA     AISDR+CI  NG
Sbjct: 80  LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHVNG 127


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 85  YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 144

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 145 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 192



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 5/104 (4%)

Query: 239 QISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
           ++SA+ ++ C  +  G  CNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV
Sbjct: 2   EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHV 61

Query: 297 QGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G    CT  G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 62  NGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 102


>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
          Length = 218

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++ H+ +Q NCGSCWA     A +DRLC+A+NG     ISA+ + 
Sbjct: 44  VPEFFDSRLEWKYCKTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEVT 103

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG P  AW+++  +GVVTGGDYN+ +GCQPY + PC    +G   +C+ 
Sbjct: 104 FCCHRCGFGCNGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDKG-HNSCSG 162

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               +  +C + CY       Y+ D  K K A+ +
Sbjct: 163 QPTERNHKCSKKCYGDD-TVDYKSDHYKTKDAYYL 196


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 54/113 (47%), Positives = 74/113 (65%), Gaps = 1/113 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FDAR  WP+C SLR + DQS CGS WAV+   AI DR+CIAS G     +SA  I+
Sbjct: 94  IPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSADDIL 153

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
           +C   C +GC GG    AW +W  +G+VTG +Y ++ GC+PY   PCEH++  
Sbjct: 154 SCCTECGYGCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDA 206



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+   F VY DF  Y SG+Y+H  G+ +G+HAV++LGWG EN + YW+ ANSW
Sbjct: 256 QEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGTENGVDYWICANSW 315

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG++G F+ILRGENE  IE
Sbjct: 316 NSDWGENGFFRILRGENECGIE 337


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 82/133 (61%), Gaps = 3/133 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P+ FD+RE W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNELLSPEELAF 145

Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C  +C  GC GG+P  AWR++   GV TGGDY+++EGC+PY +APC ++ QG    C   
Sbjct: 146 CCKDCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPC-YNKQGK-NTCGGK 203

Query: 307 GKLKTPECKQNCY 319
              +  +C + CY
Sbjct: 204 PMERNHQCPKTCY 216



 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 64/107 (59%), Gaps = 2/107 (1%)

Query: 66  KKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWG 123
           K  +++     + Q I  +GP+ A F VY DF  YKSG+Y+          H+V+++GWG
Sbjct: 228 KSEYVINSIKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG 287

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
            EN  PYWL  NSW+  WGDHGTFKI++G+NE  IE      + ++S
Sbjct: 288 QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIPSSS 334


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/157 (40%), Positives = 94/157 (59%), Gaps = 6/157 (3%)

Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           ++V  N  ++DD       N + +P  FDAR+KW  C ++  + DQ NCGS WA+S ++A
Sbjct: 67  DKVNYNMYKNDDH----ADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSA 122

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
            +DRLC+A+NG F   +SA+ I  C   C  GCNGG+P  AW+ + ++G+VTGG+Y S E
Sbjct: 123 FADRLCVATNGDFNQLLSAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGE 182

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
           GC+PY + PC +   G    C+        +C + CY
Sbjct: 183 GCEPYRVPPCPYDKDGK-NTCSGQPMESNHKCSKKCY 218



 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/97 (40%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y +  + +      + +  +GP+   F VY DF  YKSG+Y  +   S +G H+V+++GW
Sbjct: 231 YTRDDYYLTYRGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGW 290

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  + YWL+ NSWN  WGD G FKI RG NE  ++
Sbjct: 291 GEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVD 327


>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
          Length = 239

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 76/106 (71%), Gaps = 1/106 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW +C ++  + DQ  CGSCWAVS ++A +DRLCIA++G F   +SA  I 
Sbjct: 47  IPKTFDARKKWVQCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELLSADEIT 106

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
            C   C +GC+GG+P  AW+ +  +G+VTGGD++S EGC+PY + P
Sbjct: 107 FCCYTCGFGCDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPP 152


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/136 (45%), Positives = 86/136 (63%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA+NG F   +SA+ + 
Sbjct: 83  IPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFNELLSAEELT 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGGDY+S EGCQPY ++PC     G   N T 
Sbjct: 143 FCCHKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYG---NNTC 199

Query: 306 LGK--LKTPECKQNCY 319
            GK   K   C + CY
Sbjct: 200 RGKPAEKNHRCTRMCY 215



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLG 121
           H+ + A+ +      R +  +GP+ A + VY DF  YKSGVY      + +G HAV+++G
Sbjct: 227 HFTRDAYYLTFGIIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIG 286

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 287 WGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGID 324


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 121 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 180

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 181 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 228



 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 77/131 (58%), Gaps = 5/131 (3%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGH 269
           C   WA     AISDR+CI +N + + ++SA+ ++ C  +  G  CNGG+P  AW FW  
Sbjct: 11  CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
            G+V+GG Y S  GC+PY++ PCEHHV G    CT  G+  TP+C + C  P Y  TY+ 
Sbjct: 71  KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQ 127

Query: 330 DLKKGKKAHMV 340
           D   G  ++ V
Sbjct: 128 DKHYGYNSYSV 138


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/162 (40%), Positives = 96/162 (59%), Gaps = 14/162 (8%)

Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
           FDARE +P+C   + H+ DQ +CGSCWA +   A++DR CI S G     +S QH  +C 
Sbjct: 241 FDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCC 300

Query: 250 P--NC--WGCNGGWPQLAWRFWGHNGVVTGGDYN---SQEGCQPYTLAPCEHHVQGPLQN 302
              +C  +GC+GG P++AWR++ ++GVVTGGDYN   + + C PY +  C HH +GP   
Sbjct: 301 DLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPK 360

Query: 303 CTLLGKL-KTPECKQNCYNPSYES---TYRFDLKKGKKAHMV 340
           C   G L K P+C+++C    Y S    ++ DL     A+ V
Sbjct: 361 CE--GPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSV 400



 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 1/99 (1%)

Query: 63  HYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           H+   A+ V  R    R++ E+G L   F VY DFL YK GVY H  G  +G HAV+V+G
Sbjct: 392 HFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIG 451

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           +G E+   YWL  NSWN++WGD GTFKI  GE   D E 
Sbjct: 452 FGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEF 490


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/165 (41%), Positives = 91/165 (55%), Gaps = 6/165 (3%)

Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
           +E + ++++ L      +   +P +FD+REKW +CPSLR I DQSNCGSCWAVS A  +S
Sbjct: 75  IERSYNQENVLPVANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134

Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           DRLCI S G     +SA  I+AC      +GC+GG+   AW++    GVVTGG Y  +  
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194

Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
           C+PY    C  H      NC       TP CK  C   Y   YE+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCP-SHPYATPACKPYCQYGYGKRYEN 238



 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I + GP+ A F++Y DF  Y  GVY H  G   G H+++++GWGV+  + YWL+ANSW+
Sbjct: 259 EIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318

Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
             WG D G F+++RG N  DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/163 (40%), Positives = 94/163 (57%), Gaps = 12/163 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSC--W-----AVSVANAISDRLCIASNGYFTGQ 239
           LP +F ARE+WP+CP++     Q   G    W     A     AISDR+CI +N + + +
Sbjct: 85  LPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAISDRICIHTNAHISVE 144

Query: 240 ISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
           +SA+ ++ C  +  G  CNGG+P  AW FW   G+V+GG Y+S  GC+PY++ PCEHHV 
Sbjct: 145 VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVN 204

Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           G    CT  G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 205 GSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 244



 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 227 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEM 286

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 287 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 334


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/162 (40%), Positives = 96/162 (59%), Gaps = 14/162 (8%)

Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
           FDARE +P+C   + H+ DQ +CGSCWA +   A++DR CI S G     +S QH  +C 
Sbjct: 241 FDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCC 300

Query: 250 P--NC--WGCNGGWPQLAWRFWGHNGVVTGGDYN---SQEGCQPYTLAPCEHHVQGPLQN 302
              +C  +GC+GG P++AWR++ ++GVVTGGDYN   + + C PY +  C HH +GP   
Sbjct: 301 DLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPK 360

Query: 303 CTLLGKL-KTPECKQNCYNPSYES---TYRFDLKKGKKAHMV 340
           C   G L K P+C+++C    Y S    ++ DL     A+ V
Sbjct: 361 CE--GPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSV 400



 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 1/99 (1%)

Query: 63  HYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           H+   A+ V  R    R++ E+G L   F VY DFL YK GVY H  G  +G HAV+V+G
Sbjct: 392 HFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIG 451

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           +G E+   YWL  NSWN++WGD GTFKI  GE   D E 
Sbjct: 452 FGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEF 490


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/133 (45%), Positives = 75/133 (56%), Gaps = 2/133 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR  WP CPS+  I DQS+CGSCWA     A+SDRLCI S G F   +SA  +V
Sbjct: 86  LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+  +AW  W  +G+VTGG      GC+ Y    CEH  +G    C  
Sbjct: 146 SCCTECGCGCRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKGQYPPCPH 205

Query: 306 LGKLKTPECKQNC 318
                TPEC + C
Sbjct: 206 Q-LYPTPECIKRC 217



 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 57/83 (68%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
            M++I   GP+ AI  VY D L YKSGVY H +G  +G H +R+LGWG E+ +PYWLVAN
Sbjct: 244 VMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVAN 303

Query: 136 SWNDHWGDHGTFKILRGENEADI 158
           SWN+ WG+ G  ++LR  NE  I
Sbjct: 304 SWNEDWGEKGYMRVLRWRNECGI 326


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 75/108 (69%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 152 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 211

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 212 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 259



 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 6/101 (5%)

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
           +S   I  C    + CNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G 
Sbjct: 75  LSEVFITGCL---FSCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 131

Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              CT  G+  TP+C + C  P Y  TY+ D   G  ++ V
Sbjct: 132 RPPCT--GEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSV 169


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/177 (41%), Positives = 102/177 (57%), Gaps = 18/177 (10%)

Query: 172 EDDDLETMGCQNA--KGLPRNFDAREKWPE-CPSLRHIADQSNCGSCWAVSVANAISDRL 228
           E D+ E +   NA    LP  FDAR++W + C SL  + DQSNCGSCWA     +++DR 
Sbjct: 75  EGDNGENLPVSNAVKADLPTAFDARQQWGDKCTSLWEVRDQSNCGSCWAFGAVESLTDRH 134

Query: 229 CIASNGYFTGQ---ISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
           CI       GQ   +SAQ+++ C   C  GCNGG+P  A  ++   G+VTG  YN+   C
Sbjct: 135 CI-----HLGQDIRLSAQNMLTCCATCGQGCNGGYPASAMSYYVKTGLVTGDLYNTTGWC 189

Query: 285 QPYTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           Q Y+ APC HHV  PL   CT  G+L TP+C + C + S ++   + + KG KA+ V
Sbjct: 190 QAYSFAPCAHHVDTPLYPACT--GELPTPKCAKTCDSGSGQT---YTVHKGSKAYSV 241



 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 51/83 (61%), Positives = 66/83 (79%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I  +GP+ A F+VY DFL YKSGVY+H  G ++G HA++++GWGVEN+ PYW+V NS
Sbjct: 249 MTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTPYWIVVNS 308

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+GTFKILRG+NE  IE
Sbjct: 309 WNQTWGDNGTFKILRGKNECGIE 331


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/165 (41%), Positives = 91/165 (55%), Gaps = 6/165 (3%)

Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
           +E + ++++ L      +   +P +FD+REKW +CPSLR I DQSNCGSCWAVS A  +S
Sbjct: 75  IERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134

Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           DRLCI S G     +SA  I+AC      +GC+GG+   AW++    GVVTGG Y  +  
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194

Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYES 325
           C+PY    C  H      NC       TP CK  C   Y   YE+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCP-SHPYATPACKPYCQYGYGKRYEN 238



 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I + GP+ A F++Y DF  Y+ GVY H  G   G H+++++GWGV+  + YWL+ANSW+
Sbjct: 259 EIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318

Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
             WG D G F+++RG N  DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 92/157 (58%), Gaps = 7/157 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWP C S+  I D S CGSCWAVS A+ +SDRLCI +NG     +S+  I+
Sbjct: 88  LPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADIL 147

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC   +C  GC GG+P  A+ +  + GV +GG+Y  +  C+PY   PC+ +  GP   C 
Sbjct: 148 ACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNY-GP---CP 203

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
             G   TP+C++ C    Y   Y  D   GK +H++L
Sbjct: 204 KEGAFDTPKCRKIC-QFRYPVPYEEDKVFGKNSHILL 239



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 68/97 (70%), Gaps = 3/97 (3%)

Query: 66  KKAHMVPRCNAMR---QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
           K +H++ + N  R   +I+ +GP+ A F V+ DF+ YK G+Y+  +G  IG+HA++++GW
Sbjct: 233 KNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGW 292

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G EN   YWLVANS+N  WG++GTF+ILRG N   IE
Sbjct: 293 GTENGTDYWLVANSYNYDWGENGTFRILRGTNHCLIE 329


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 54/88 (61%), Positives = 69/88 (78%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I+++GP+ A F+ YADFL YKSGVYQH+  D IG HA+R+LGWG E++ PYWL+ANSWN
Sbjct: 132 EIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNPYWLLANSWN 191

Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRV 166
           + WGDHG FK+LRG NE DIE   N  +
Sbjct: 192 EDWGDHGYFKMLRGVNECDIESFVNAGI 219



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 74/125 (59%), Gaps = 4/125 (3%)

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTG 275
           A     A+SDR+CI SNG     ISA+ ++ C   C  GC+GG    AW++W   G+V+G
Sbjct: 1   AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCDKCGSGCSGGVSAAAWQYWKDAGLVSG 60

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           G YN+ +GC+PY+LAPCEH  QG L  C  +G L TP+CK+ C    YE +Y  D    K
Sbjct: 61  GLYNTTDGCKPYSLAPCEHSSQGSLPEC--VGTLPTPKCKRQC-REGYERSYDDDKYFAK 117

Query: 336 KAHMV 340
             + +
Sbjct: 118 NVYSI 122


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 90/154 (58%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAREKWP C S+  I +Q  CG+CWAV+  + +SDRLCI S G F  +++A+ ++
Sbjct: 85  IPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLM 144

Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +C  GCNGG+    ++++W   G+V+G  YNS +GC+PY   PC +    P   C 
Sbjct: 145 GCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPFKPCLY----PFVGCH 200

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
                KTP C  +C    Y+ TYR D   G  A+
Sbjct: 201 ---PEKTPSCTHHC-TEGYDGTYRRDKYYGSAAY 230



 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y   A+ +P    M Q  I  +GP+ + FSVY D   YK+GVYQH  G  +G HAVR++
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E  +PYWL+ANS+ + WG+HG FK LRG N   IE
Sbjct: 284 GWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIE 322


>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 180

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/154 (42%), Positives = 85/154 (55%), Gaps = 8/154 (5%)

Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
           +  + LP +FDAR  +P C   + HI DQS CGSCWA  V  A +DRLCI S+G FT  +
Sbjct: 27  EELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSDGAFTELL 86

Query: 241 SAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEH 294
           SA  + ACT   +GC GG P  AW +    G+ TGGDY ++      +GC PY   PC H
Sbjct: 87  SAGEMNACTLF-FGCGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAH 145

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYR 328
           H+          G   TP C + C+NP Y +T R
Sbjct: 146 HINDTKYPKCPEGLYPTPNCVEQCHNPKYTTTLR 179


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/108 (55%), Positives = 74/108 (68%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V       M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 71  YSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 130

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+R+LGWGVEN  PYWLVANSWN  WGD+G FKILRG++   IE
Sbjct: 131 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 178



 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/87 (47%), Positives = 53/87 (60%), Gaps = 3/87 (3%)

Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
           GCNGG+P  AW FW   G+V+GG Y S  GC+PY++ PCEHHV G    CT  G+  TP+
Sbjct: 5   GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPK 62

Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C + C  P Y  TY+ D   G  ++ V
Sbjct: 63  CSKIC-EPGYSPTYKQDKHYGYNSYSV 88


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/129 (48%), Positives = 81/129 (62%), Gaps = 3/129 (2%)

Query: 213 GSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNG 271
           GSCWA   A AISDRLCI SNG  + +IS++ ++AC  +C  GCNGG+P  AW FW   G
Sbjct: 1   GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSCGMGCNGGYPSAAWDFWTDVG 60

Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
           +V+GG Y+S  GC+PYT+ PCEHHV G    CT  G   TP+C   C    Y  +Y+ D 
Sbjct: 61  LVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGG-DTPQCILQC-ESGYTPSYKADK 118

Query: 332 KKGKKAHMV 340
             GK ++ V
Sbjct: 119 HYGKSSYSV 127



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ VP      Q  IY++GP+   F+VY DFL YK+GVYQH  G +
Sbjct: 110 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 169

Query: 112 IG 113
           +G
Sbjct: 170 VG 171


>gi|312105965|ref|XP_003150617.1| hypothetical protein LOAG_15077 [Loa loa]
          Length = 150

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR +WP C S+  +A+Q  CGSCWA+S A+ +SDRLCIA+N     QISA+ ++
Sbjct: 8   LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 67

Query: 247 ACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C GC G  W   A+ +W ++GVVTGGDY S EGC+PYT AP   +   P      
Sbjct: 68  SCCTECGGCQGSHWALSAFIYWRNHGVVTGGDYGSFEGCKPYTTAP---NCGSPCSFEYY 124

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDL 331
             K+ +P C++ C  P Y  +Y  DL
Sbjct: 125 RRKI-SPACQKTC-QPLYGLSYEEDL 148


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 67/83 (80%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWG+EN +PYWLVANS
Sbjct: 31  MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANS 90

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRGEN   IE
Sbjct: 91  WNVDWGDNGFFKILRGENHCGIE 113


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 81/147 (55%), Gaps = 15/147 (10%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           + +P  FD R  W +CPSL++I +Q NCGSCWA      ++DRLCIAS G    + SA  
Sbjct: 80  EAIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADD 139

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           ++AC   C  GC+GG P  A+ +W   G+V+GGDYNS EGCQPY             +  
Sbjct: 140 LLACCTACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPY-------------EGS 186

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFD 330
             L  + TP+C   C N  Y + Y  D
Sbjct: 187 AFLNSV-TPKCSTKCLNSKYTTPYAKD 212



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+V    VY DF  YKSGVYQH  G+S+G HAV+++GWG E  +PYWL+ANSW 
Sbjct: 233 EIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIANSWG 292

Query: 139 DHWGD-HGTFKILRGENEADIE 159
             W D  G +KILRG+N   IE
Sbjct: 293 AKWADLDGFYKILRGKNHCKIE 314


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 94/156 (60%), Gaps = 10/156 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR  WP C SLR I +Q  CGSCWAV+ A+ +SDR+CI SNG     ++A+ ++
Sbjct: 87  IPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLM 146

Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +C  GCNGG+    ++++W   G+V+GG YNS +GC+PY   PCE+    P  +C 
Sbjct: 147 GCCVDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEY----PFNDCH 202

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +     +P+C  +C +   +  Y  D   GK A+ V
Sbjct: 203 V---EISPKCTHHCRD-GVDRHYSKDKLFGKVAYSV 234



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 54/96 (56%), Positives = 68/96 (70%), Gaps = 2/96 (2%)

Query: 66  KKAHMVPRCN-AMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           K A+ VPR   A+R +I  +GP+ A F VY D L YKSGVY+H +G+ IG HAVR++GWG
Sbjct: 229 KVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG 288

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            +  IPYWL+ANS+ D WGDHG FK +RG N   IE
Sbjct: 289 RDGGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIE 324


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 88/156 (56%), Gaps = 12/156 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+RE+WPEC S++ I +Q+ CGSCWA   A  ISDR+CI SN   T  IS + I+
Sbjct: 97  LPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDIL 156

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+   A RFW  +G VTGGDYN   GC PY+ APC+        +C 
Sbjct: 157 SCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGA-GCMPYSFAPCKK------DSC- 208

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              +  TP CK  C +    + Y  D   G  A+ +
Sbjct: 209 --AQGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKI 242



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 72/130 (55%), Gaps = 13/130 (10%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ +    A  Q  IY +GP+ A F VY DF +YKSGVYQ+  G  +G HAV+++
Sbjct: 234 HFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGHAVKII 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS--------SE 172
           GWG EN + YWL+ANSW   +GD G FK+ RG NE  IE    N V   +         E
Sbjct: 294 GWGTENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIE---GNVVAGTAKLGTHDEKRE 350

Query: 173 DDDLETMGCQ 182
           DDD     C 
Sbjct: 351 DDDGAATSCS 360


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 108/198 (54%), Gaps = 19/198 (9%)

Query: 145 GTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLR 204
           G F+ ++G  E+ ++    ++    SS D+ +          +P  FDAREKWP C S+ 
Sbjct: 59  GEFRSIKGIYESPLDFTLPSKRLHASSLDEVV----------IPDRFDAREKWPFCQSIH 108

Query: 205 HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQ-L 262
            + +Q  CGSCWAV+  + +SDRLCI S+G    +++ + ++ C  +C  GCNGG+    
Sbjct: 109 SVRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCKDCGNGCNGGFLDGT 168

Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
           A+++W   G+V+G  YNS EGC+PY   PC +    P   C    + K P+C  +C N  
Sbjct: 169 AFQYWVDAGLVSGAPYNSSEGCKPYPFEPCSY----PFVGCH--HEKKNPKCLHHCIN-G 221

Query: 323 YESTYRFDLKKGKKAHMV 340
           Y+  YR D   G  A+ +
Sbjct: 222 YDRKYRKDKFFGATAYKI 239



 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 61/94 (64%), Gaps = 2/94 (2%)

Query: 68  AHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
           A+ +P    M Q  I  +GP+   F V+ DF  Y SGVY+H  G  +G+HA+R++GWG E
Sbjct: 236 AYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTE 295

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           N  PYWL+ANS+ D WGD G FK+LRG N   IE
Sbjct: 296 NGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIE 329


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 84/138 (60%), Gaps = 2/138 (1%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           N   +P  FDAR +W  C ++  + +Q NCGSCWA     A +DRLCIA+NG F   ISA
Sbjct: 80  NDSEIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATNGDFNELISA 139

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           + +  C   C +GCNGG P  AW+++  +GVVTGG+YN+ +GCQPY + PC    +G   
Sbjct: 140 EELTFCCHRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGH-N 198

Query: 302 NCTLLGKLKTPECKQNCY 319
           +C+        +C ++CY
Sbjct: 199 SCSGQPTEPNHKCSRSCY 216



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 44/95 (46%), Positives = 59/95 (62%), Gaps = 1/95 (1%)

Query: 66  KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGV 124
           K A+ +      +    +GP+ A F VY DF+ Y+SGVYQ       +G HAV+++GWG 
Sbjct: 231 KNAYYLNIDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGE 290

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           E+  PYWL+ NSW + WG +G FKILRG NE  IE
Sbjct: 291 EDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIE 325


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 93/172 (54%), Gaps = 1/172 (0%)

Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           + ED  L       A+ +P  FD+R  WP CP+++ + DQS CGSCWA     ++SDR+C
Sbjct: 62  TPEDQRLPLKVAPIAEAIPDTFDSRTNWPACPTIKEVRDQSACGSCWAFGAVESMSDRIC 121

Query: 230 IASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           IASN     ++SA  +++C  +C  GC+GG    +W ++ + G+VTG  YN+   C+PY 
Sbjct: 122 IASNATKIVRLSASDLLSCCTSCGDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYD 181

Query: 289 LAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              C HH   P           TP+C ++C      +TY  DL  G+ ++ V
Sbjct: 182 FPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSV 233



 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 53/106 (50%), Positives = 68/106 (64%), Gaps = 8/106 (7%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + ++ V R +A  Q  I  HGP+ A F+VY+DF  Y+SGVY+H  G  +G HA+ ++
Sbjct: 225 HYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIV 284

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           GWG E+  PYWLV NSWN  WGD G FKILRG      + G NN V
Sbjct: 285 GWGTESGSPYWLVKNSWNPSWGDGGFFKILRG------DCGINNDV 324


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 90/158 (56%), Gaps = 15/158 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           NA+ +P +FDAR KWP C SL  I DQ  CGSCWA +   ++SDR+CI S+G      S 
Sbjct: 79  NARDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSP 138

Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           + +++C  +C  C GG+   A  F+ + G+V+GGD NS EGC+PYT    + H QG    
Sbjct: 139 EDLLSCCTSCGDCGGGYMMSALDFYINEGIVSGGDVNSNEGCRPYT---ADAHDQG---- 191

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                  +TP C ++C N  Y ++Y  D   G   ++V
Sbjct: 192 -------QTPACTKSCRN-GYSTSYSADKHYGSNDYVV 221



 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 61/81 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++  +GP++  F V+ DF  Y SGVY+H  G+S+G H V+++GWGVEN +PYWL+ANSW 
Sbjct: 231 EVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGVENGVPYWLIANSWG 290

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGDHG FK+LRG+NE  IE
Sbjct: 291 SSWGDHGFFKMLRGQNECGIE 311


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR  +P C  +  H+ DQ +CGSCWA +   A +DRLCI S G     +SAQH 
Sbjct: 277 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHT 336

Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
            +C    +C  +GCNGG P +AWR++   GVVTGGD+++      C PY +  C HH + 
Sbjct: 337 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 396

Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
           P  +C   L   KTP+C+++C   +Y ++ + FD
Sbjct: 397 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 430



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)

Query: 40  KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
           +K  K +K  +++ Y     P      KA     +  R +  R +  HGP+   F VY D
Sbjct: 408 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 467

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           FL YKSGVY+H  G  +G HA++++GWG EN   YW   NSWN +WGD G FKI  G+  
Sbjct: 468 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 527

Query: 156 ADIEM 160
            D EM
Sbjct: 528 IDGEM 532


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 5/156 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFD+R+KWP C S+  I DQS CGSCW+     A+SDR CI S G    ++SA  ++
Sbjct: 89  IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C  +C  GC GG    AW +W   G+VT     +  GC+PY    CEHH +G    C  
Sbjct: 149 TCCESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCG- 207

Query: 306 LGKL-KTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             K+  TP CKQ C    Y++ Y  D  +GK ++ V
Sbjct: 208 -SKIYNTPRCKQTCQR-KYKTPYTQDKHRGKSSYNV 241



 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 250 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIANSW 309

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 310 NEDWGENGYFRIVRGRDECSIE 331


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 61/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ NCGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 85  IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY ++PC     G   N T 
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C Q CY
Sbjct: 202 SGKPAEKNHRCTQMCY 217



 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 59/98 (60%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        +  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE   +
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTD 326


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/157 (40%), Positives = 94/157 (59%), Gaps = 6/157 (3%)

Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           ++V  N  ++DD       N + +P  FDAR+KW  C ++  + DQ NCGS WA+S ++A
Sbjct: 9   DKVNYNMYKNDDHA----DNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSA 64

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
            +DRLC+A+NG F   +SA+ I  C   C  GCNGG+P  AW+ + ++G+VTGG+Y S E
Sbjct: 65  FADRLCVATNGDFNQLLSAEEITFCCHKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGE 124

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
           GC+PY + PC +   G    C+        +C + CY
Sbjct: 125 GCEPYRVPPCPYDKDGK-NTCSGQPMEPNHKCSKKCY 160



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 1/83 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y +  + +      + +  +GP+ A F VY DF  YKSG+Y  +   S +G H+V+++GW
Sbjct: 173 YTRDDYYLTYRGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGW 232

Query: 123 GVENDIPYWLVANSWNDHWGDHG 145
           G E  + YWL+ NSWN  WGD G
Sbjct: 233 GEEYGVLYWLMVNSWNADWGDKG 255


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 73/99 (73%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K A+ VP    +  ++I  HGP+ + F+VY+DFL YKSG+Y+H  G  IG+H VR++
Sbjct: 234 HYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIV 293

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVE   PYWL+ANSWN+ WG+ G F+ILRG++E DIE
Sbjct: 294 GWGVEKGTPYWLIANSWNEGWGEKGYFRILRGKDECDIE 332



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 88/155 (56%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R+KW +C S+  I DQS CGS WA +    +SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSTFDSRKKWSQCKSISSIHDQSRCGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDLL 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W   GVVTG    +  GCQPY    CEH+  G    C  
Sbjct: 150 SCCRECGLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEHNTTGKYPACG- 208

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               +TP+C++ C    Y++ Y+ D   GK A+ V
Sbjct: 209 QKIYETPKCQKKC-QKGYKTPYKKDKHYGKVAYNV 242


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 53/106 (50%), Positives = 75/106 (70%), Gaps = 1/106 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR KW  C ++  + DQ NCGSCWAV+ ++A +DRLC+A+ G F   +SA+ I 
Sbjct: 88  IPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
            C   C +GC+GG+P  AW+ +  +G+VTGGDYNS EGC+PY + P
Sbjct: 148 FCCHTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPP 193



 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGW 122
           Y +  + +   +  + +  +GP+ A F VY DF  YKSGVY + +    +G HAV+++GW
Sbjct: 230 YTRDYYYLTYGSIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGW 289

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E+  PYWL+ NSWN  WGD+G FKI RG NE  ++
Sbjct: 290 GEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVD 326


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR  +P C  +  H+ DQ +CGSCWA +   A +DRLCI S G     +SAQH 
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
            +C    +C  +GCNGG P +AWR++   GVVTGGD+++      C PY +  C HH + 
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393

Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
           P  +C   L   KTP+C+++C   +Y ++ + FD
Sbjct: 394 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 427



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)

Query: 40  KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
           +K  K +K  +++ Y     P      KA     +  R +  R +  HGP+   F VY D
Sbjct: 405 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 464

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           FL YKSGVY+H  G  +G HA++++GWG EN   YW   NSWN +WGD G FKI  G+  
Sbjct: 465 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 524

Query: 156 ADIEM 160
            D EM
Sbjct: 525 IDGEM 529


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 86/136 (63%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA++G F   +SA+ + 
Sbjct: 83  IPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDFNELLSAEELT 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY ++PC     G   N T 
Sbjct: 143 FCCHTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYG---NNTC 199

Query: 306 LGK--LKTPECKQNCY 319
            GK   K   C + CY
Sbjct: 200 RGKPAEKNHRCTRMCY 215



 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           + + A+ +      + +  +GP+ A + VY DF  YKSGVY      + +G HAV+++GW
Sbjct: 228 FTRDAYYLTYGTIQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGW 287

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 288 GEEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGID 324


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 62/127 (48%), Positives = 79/127 (62%), Gaps = 13/127 (10%)

Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
           VE    EDDD+           P +FDAR  W  C SLRHI DQ+NCGSCWAVS A+A+S
Sbjct: 84  VENADDEDDDI-----------PESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALS 132

Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
           DR+CIAS G     IS+  IV+C   C +GC+GGWP  A+ ++   G VT G+  S++GC
Sbjct: 133 DRICIASKGETQLHISSIDIVSCCKLCGYGCDGGWPIEAFDYFSRQGAVT-GETTSKDGC 191

Query: 285 QPYTLAP 291
           +PY   P
Sbjct: 192 RPYPFHP 198



 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 44/76 (57%), Positives = 59/76 (77%)

Query: 84  GPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
           GP+VA+F+VY DF  YK G+Y H  G + G HA++++GWGVEN +PYWL+ANSW+D WG+
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVENGLPYWLIANSWHDDWGE 319

Query: 144 HGTFKILRGENEADIE 159
            G F+I+RG NE  IE
Sbjct: 320 QGLFRIVRGINECGIE 335


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 92/154 (59%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR  +P C  +  H+ DQ +CGSCWA +   A +DRLCI S G     +SAQH 
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 246 VAC--TPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNS---QEGCQPYTLAPCEHHVQG 298
            +C    +C  +GCNGG P +AWR++   GVVTGGD+++      C PY +  C HH + 
Sbjct: 334 TSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAKA 393

Query: 299 PLQNC-TLLGKLKTPECKQNCYNPSY-ESTYRFD 330
           P  +C   L   KTP+C+++C   +Y ++ + FD
Sbjct: 394 PFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFD 427



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 4/125 (3%)

Query: 40  KKKKKKKKKKKKRLYLPTSIPLSHYFKKA----HMVPRCNAMRQIYEHGPLVAIFSVYAD 95
           +K  K +K  +++ Y     P      KA     +  R +  R +  HGP+   F VY D
Sbjct: 405 RKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYED 464

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           FL YKSGVY+H  G  +G HA++++GWG EN   YW   NSWN +WGD G FKI  G+  
Sbjct: 465 FLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCG 524

Query: 156 ADIEM 160
            D EM
Sbjct: 525 IDGEM 529


>gi|189308104|gb|ACD86936.1| cysteine protease [Caenorhabditis brenneri]
          Length = 210

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 57/111 (51%), Positives = 76/111 (68%), Gaps = 1/111 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S+ +I DQS+CGSCWA + A A SDR CIASNG     +SA+ ++
Sbjct: 81  IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
           +C  NC +GC GG+P  AW++   +G  TGG Y +Q GC+PY+LAPC   V
Sbjct: 141 SCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETV 191


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/154 (40%), Positives = 90/154 (58%), Gaps = 10/154 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAREKWP C S+  I +Q  CG+CWAV+  + +SDRLCI S G F  +++A+ ++
Sbjct: 85  IPEFFDAREKWPYCKSISTIKNQGLCGACWAVATVSVMSDRLCIHSEGKFDVELAAEDLM 144

Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +C  GCNGG+    ++++W   G+V+G  YN+ +GC+PY   PC +    P   C 
Sbjct: 145 GCCKDCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPYPFKPCLY----PFVGCH 200

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
                KTP C  +C    Y+ TYR D   G  A+
Sbjct: 201 ---PEKTPSCTHHC-TEGYDGTYRRDKYYGSAAY 230



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y   A+ +P    M Q  I  +GP+ + FSVY D   YK+GVYQH  G  +G HAVR++
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E  +PYWL+ANS+ + WG+HG FK LRG N   IE
Sbjct: 284 GWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIE 322


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/180 (37%), Positives = 97/180 (53%), Gaps = 20/180 (11%)

Query: 156 ADIEMGF---NNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNC 212
           +++EM F   + +    S +D+ L   G      +P +FDAR+ WP C S++ I +Q+ C
Sbjct: 46  SELEMKFKVMDLKFSEISPKDEPLTVQGVY----VPISFDARDHWPNCKSIKLIRNQAYC 101

Query: 213 GSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHN 270
           G+CWA   A  ISDR+CI S G     IS + I++C  +    GC GG+P    +FW ++
Sbjct: 102 GACWAFGAAEIISDRICIQSGGAHQPIISVEDILSCCGSSCGEGCKGGYPLEGLKFWMNS 161

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           GVVTGGDYN   GCQPYT  PC           +      TP C++ C     E+TY+ D
Sbjct: 162 GVVTGGDYNGT-GCQPYTFPPCS----------SCEASKSTPSCQKKCQTGYLEATYKND 210



 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 55/81 (67%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY +GP+   + V+ DF QYKSGVY +  G   G HAV+++GWG EN + YWLVANSW 
Sbjct: 263 EIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGTENKVDYWLVANSWG 322

Query: 139 DHWGDHGTFKILRGENEADIE 159
             +G+ G FKI RG NE  IE
Sbjct: 323 TDFGEKGFFKIRRGTNECGIE 343


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 56/134 (41%), Positives = 87/134 (64%), Gaps = 9/134 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R++WP C SLR I +Q  CGSCWAV+ A+ +SDR+CI +NG     I+A+ ++
Sbjct: 92  IPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDLM 151

Query: 247 ACTPNCW-GCNGGWPQ-LAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
            C  +C  GC GG+    ++++W   G+V+GG YNS EGC+PY   PC +    P  +C 
Sbjct: 152 GCCADCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPYPFKPCLY----PFTDCH 207

Query: 305 LLGKLKTPECKQNC 318
              + ++P+CK +C
Sbjct: 208 ---REESPKCKHHC 218



 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/94 (51%), Positives = 64/94 (68%), Gaps = 2/94 (2%)

Query: 68  AHMVPRCNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
           A+ VPR   +   +I  +GP+   F VY D   YKSGVY+H +G+ +G HAVR++GWG E
Sbjct: 236 AYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE 295

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             IPYWL++NS+ + WGDHG FKI+RG N   IE
Sbjct: 296 GGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIE 329


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 91/158 (57%), Gaps = 15/158 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           N + +P+ FDAR+KWP+C SL  I DQ +CGSCWA +    +SDR+CI S+G      SA
Sbjct: 77  NPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCWAFAAVETMSDRICIHSSGAKKFFFSA 136

Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           + +++C   C  C+GG+   A+ F+   GVV+GGD NS EGC+PYT    + H +G    
Sbjct: 137 EDLLSCCTACGSCSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPYT---ADAHDKG---- 189

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                   TP C ++C    Y ++Y  D   G K ++V
Sbjct: 190 -------VTPSCTKSC-RKGYPTSYSSDKHYGSKDYIV 219



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY  K ++V     N   +I  +GP++  F VY DF  Y SGVY H  G+  G H V+++
Sbjct: 211 HYGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIV 270

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E +  YWL+ANSW   WG+HG FKILRG+NE  IE
Sbjct: 271 GWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIE 309


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 83/134 (61%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ NCGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 85  IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G    C+ 
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGN-NTCSG 203

Query: 306 LGKLKTPECKQNCY 319
               K   C Q CY
Sbjct: 204 KPAEKNHRCTQMCY 217



 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 45/94 (47%), Positives = 58/94 (61%), Gaps = 1/94 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        +  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           WG E  +PYWL+ NSWND WGD G FKI RG NE
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNE 322


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 87/158 (55%), Gaps = 13/158 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +WP C S++ I +Q+ CGSCWA   A  +SDR+CIAS G     IS   ++
Sbjct: 75  IPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLL 134

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  N   +GC G  P  A+R+W   GVVTGGDY    GC+PY  APC          CT
Sbjct: 135 SCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRG-SGCKPYPFAPCTA------LPCT 187

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
              K +TP C  NC  P+Y   Y  D   G  A++V M
Sbjct: 188 ---KSETPRCSLNC-QPAYSKAYSKDKYFGTPAYIVGM 221



 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 61/84 (72%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
           A++    +GP+ A F VY DF  Y+SGVY+H  G  +G HAV+++GWG++N  PYWL+AN
Sbjct: 225 AIQTEITNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQNGAPYWLMAN 284

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW  +WG++G FK+LRG +E  IE
Sbjct: 285 SWGPYWGENGFFKMLRGVDECGIE 308


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 56/134 (41%), Positives = 83/134 (61%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++  + +Q NCGSCWA     A +DRLCIA++G F   ISA+ + 
Sbjct: 84  VPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELT 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG P  AW+++  +GVVTGG+YN+ +GCQPY + PC    +G   +C+ 
Sbjct: 144 FCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVRDDEGH-NSCSG 202

Query: 306 LGKLKTPECKQNCY 319
               +  +C + CY
Sbjct: 203 QPTERNHKCSKKCY 216



 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 62/100 (62%), Gaps = 2/100 (2%)

Query: 62  SHY-FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRV 119
           +HY  K A+ +      +    +GP+ A F VY DF  Y+SGVYQ     S +G HAV++
Sbjct: 226 NHYKTKDAYYLSNTTMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKM 285

Query: 120 LGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWGVE   PYWL+ NSW + WGD G FKILRG +E  +E
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVE 325


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 63/151 (41%), Positives = 91/151 (60%), Gaps = 9/151 (5%)

Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           + +DDD       N + +P  FDAR+KW  C ++  + DQ +CGS WA+S ++A SDRLC
Sbjct: 15  TGDDDD-------NYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLC 67

Query: 230 IASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +A+NG F   +SA+ I  C   C  GC+GG+P  AW+ +  +G+VTGG+Y S EGC+PY 
Sbjct: 68  VATNGDFNQLLSAEEITFCCHTCGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYR 127

Query: 289 LAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
           + PC +  QG    C+     K   C + CY
Sbjct: 128 VPPCPNDDQGN-NTCSGQPMEKNHRCTRMCY 157



 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/106 (39%), Positives = 60/106 (56%), Gaps = 1/106 (0%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y +  + +      + +  +GP+ A F VY DF  YKSG+Y  +   S +G H+V+++GW
Sbjct: 170 YTRDHYYLTYRGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGW 229

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
           G E  + YWL+ NSWN  WGD G FKI RG NE  ++      V A
Sbjct: 230 GEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPA 275


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE W  C S+ +I DQSNCGSCWAVS A  +SDR+C+ S G     IS   I+
Sbjct: 95  IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC    C  GCNGG    AW +    GVVTGG Y  +  C+PY L PC +H  G   +C 
Sbjct: 155 ACCGSECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213

Query: 305 LLGKLKTPECKQNC---YNPSYE 324
                +TP CK+ C   Y   YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 51/75 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A F  Y DF  Y  G+Y H  G   G HAV+V+GWGVEN   YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316

Query: 138 NDHWGDHGTFKILRG 152
           +  WG++G F+ILRG
Sbjct: 317 STDWGENGYFRILRG 331


>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
          Length = 219

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR +W  C ++  + +Q NCGSCWA     A +DRLC+A+NG F   ISA+ + 
Sbjct: 46  VPDFFDARIEWKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEELT 105

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG P  AW ++  +GVVTGG+YN+ +GCQPY + PC    +G   +C+ 
Sbjct: 106 FCCHTCGFGCNGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEGH-NSCSG 164

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
               +   C ++CY  +  S Y+    K K A+ +
Sbjct: 165 QRTERNHRCSKSCYGNT-TSDYKNGHYKTKDAYYL 198


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 88/155 (56%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR +WP+C ++  I DQ++CGSCWA + A+A+SDR+CI SNG    +++A   +
Sbjct: 86  LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPL 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W   G+VTGG + ++ GCQP+    C+H       +   
Sbjct: 146 SCCTYCGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCP 205

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP C + C    Y  TY  D   G  ++ V
Sbjct: 206 HYTYPTPPCARAC-QTGYNKTYEQDKFYGNSSYNV 239



 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I ++GP+   F+++ DF  Y+SG+Y H  G  IG HAVR++GWGVEN + YWL+ANS
Sbjct: 247 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG++G F+++RG NE  IE
Sbjct: 307 WNEEWGENGYFRMVRGRNECGIE 329


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 57/112 (50%), Positives = 72/112 (64%), Gaps = 9/112 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAR  WP+CPS+ HI DQS CGSCWA     A+SDRLCIASNG    ++SA+ ++
Sbjct: 15  IPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDML 74

Query: 247 A-CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
           + C   C  GCNGG+P  AWRF+  +G+ T   Y       PY   PCEHH+
Sbjct: 75  SCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHI 119



 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 66/97 (68%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW 122
           ++ K  + V       +I  +GP+ A F+VY DFL Y+SGVY+H  G  +G HA++++GW
Sbjct: 146 YHGKSVYSVSPAKIQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGW 205

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GVE    YWLVANSWN+ WGD GTFKI RG++E  IE
Sbjct: 206 GVEAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIE 242


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 58/99 (58%), Positives = 71/99 (71%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY K ++ VP      Q  I ++GP+   F+VYADF  YKSGVY+ +  D++G HA+R+L
Sbjct: 232 HYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRIL 291

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEND+PYWLVANSWN  WGD G FKILRG NE  IE
Sbjct: 292 GWGVENDVPYWLVANSWNTEWGDKGYFKILRGSNECGIE 330



 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 15/161 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIAD---QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           +P  FD+R++W + P   H  D   +             ++SDR CI S       ++A 
Sbjct: 88  IPAQFDSRQQWQDWP--HHPGDPGTKERADPVGHFGAVESMSDRHCIHSGAKNIVHLAAD 145

Query: 244 HIVACTPNCWGC----NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
            +++C   CWGC    NGG+P  AW +W   G+VTGG+Y++ EGC PY +  C+HHV G 
Sbjct: 146 DVLSC---CWGCGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGT 202

Query: 300 LQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           L  C       TP+C + C    Y   ++ D   GK ++ V
Sbjct: 203 LGPCGQ--DPPTPKCVRLC-RKGYNVDFKDDKHYGKSSYSV 240


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 61/110 (55%), Positives = 76/110 (69%), Gaps = 5/110 (4%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQIYE----HGPLVAIFSVYADFLQYKSGVYQHNFG 109
           Y P+     HY   ++ V R  A R+ ++    +GP+ A F+VY+DFLQYKSGVYQH  G
Sbjct: 9   YSPSYKEDKHYGCSSYSVSR-GARRRSWQRSSKNGPVEAAFTVYSDFLQYKSGVYQHVAG 67

Query: 110 DSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           D +G HAVR+LGWGVEN  PYWLV NSWN  WGD+G FKILRG++   IE
Sbjct: 68  DMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 117


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 64/154 (41%), Positives = 88/154 (57%), Gaps = 12/154 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR++WP+C SL+ I +Q++CGSCWA   A  ISDR+CI SNG     ISA+ I+
Sbjct: 95  LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  +    GC GG+   A ++W ++GVVTGGDYN   GC PY+  PC+   + P     
Sbjct: 155 SCCGSTCGKGCQGGYTIEAMKYWMNSGVVTGGDYNGA-GCMPYSFPPCK---KSPCV--- 207

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
              +  TP CK  C      + Y+ D      A+
Sbjct: 208 ---EFSTPSCKTTCQEKYTTADYKNDKHFATSAY 238



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 63/101 (62%), Gaps = 4/101 (3%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY +GP+ A + V+ DF QYKSGVY H  G+ +G HAV+++GWG EN + YWLVANSW 
Sbjct: 253 EIYHNGPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTENGVDYWLVANSWG 312

Query: 139 DHWGDHGTFKILRGENEADIE----MGFNNRVEANSSEDDD 175
             +G+ G FKI RG NE  IE     G       N   DDD
Sbjct: 313 TSFGEKGFFKIRRGTNECQIESNIVAGLAKLGTHNEKTDDD 353


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 86/147 (58%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR++W  C ++  + DQ NCGSCWA++ ++A +DRLCIA+N  F   +SA+ + 
Sbjct: 90  IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C + C+GG+P  AW ++  +G+VTGGDY S EGC PY + PC     G    C  
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGN-NTCRG 208

Query: 306 LGKLKTPECKQNCYNP---SYESTYRF 329
               K   C + CY      Y+  +RF
Sbjct: 209 QPMEKHHRCTRMCYGDQEIDYDDDHRF 235



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/97 (40%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           + +  + +   +  + +  +GP+ A   VY DF  YKSGVY+ +   + +G HAV+++GW
Sbjct: 235 FTRDYYYLTYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGW 294

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E+ +PYWL+ NSW++ WGD G FKI RG NE  ++
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVD 331


>gi|161343831|tpg|DAA06096.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 194

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 63/128 (49%), Positives = 83/128 (64%), Gaps = 3/128 (2%)

Query: 171 SEDDDLETMGC-QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           SE D L T      ++ LP ++D  + W EC S+  I DQSNCGSCWA+S A+A S RLC
Sbjct: 48  SEKDTLLTYDSPAGSEPLPESYDVTQTWSECKSVVSIRDQSNCGSCWALSTASAFSGRLC 107

Query: 230 IASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           IASN  F   +S ++I +C    C  GCNGG P+ AW++   NG+ TGG+YNS EGCQPY
Sbjct: 108 IASNMDFNIVLSGEYINSCCNGKCGDGCNGGHPEKAWKYIKKNGLCTGGEYNSNEGCQPY 167

Query: 288 TLAPCEHH 295
           ++ PC  +
Sbjct: 168 SIFPCPRN 175


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/157 (40%), Positives = 91/157 (57%), Gaps = 18/157 (11%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
            G+P +FDARE+WP C S+R I D+  CGSCWA +    +SDRLC+AS G      SA+ 
Sbjct: 82  SGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEE 141

Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +V+C   C  GC GG+    +++W  NG+ +GGDY S+ GC+PYT A     V G     
Sbjct: 142 VVSCCTACGGGCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYTAA-----VSG----- 191

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                 +TP+C++ C +  YE ++  DL+    A+ V
Sbjct: 192 ------ETPQCQKACVS-GYEKSWEKDLRHATSAYQV 221



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I ++GP+ A   VY DF  Y +G+YQH  G  +G HAV+++GWG END+PYW+ ANSW
Sbjct: 230 REILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAANSW 289

Query: 138 NDHWGDHGTFKILRGENEADIE 159
              +G+ G F+ILRG N A IE
Sbjct: 290 GTGFGEDGFFRILRGSNCAGIE 311


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 81/134 (60%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR++WP C S++HI DQS+CGSCWAV+ A+A+SDR+C  +NG     +S   ++
Sbjct: 94  IPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVL 153

Query: 247 ACT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C +GC GG+P  A+ +    G+ TGG Y  ++ CQPY   PC +H   P     
Sbjct: 154 SCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGPC 213

Query: 305 LLGKLKTPECKQNC 318
                 TP C++ C
Sbjct: 214 PDELWPTPTCRRTC 227



 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I   GP+VA + VY DF  YK GVY H  G+  GLHAV+++GWG  ND+PYWLVANSWN
Sbjct: 258 EIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGNDVPYWLVANSWN 317

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD+G F+I+RG +  +IE
Sbjct: 318 TDWGDNGYFRIVRGTDNCEIE 338


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE W  C S+ +I DQSNCGSCWAVS A  +SDR+C+ S G     IS   I+
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC    C  GCNGG    AW +    GVVTGG Y  +  C+PY L PC +H  G   +C 
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213

Query: 305 LLGKLKTPECKQNC---YNPSYE 324
                +TP CK+ C   Y   YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 50/75 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A F  Y DF  Y  G+Y H  G   G HAV+V+GWGVEN   YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316

Query: 138 NDHWGDHGTFKILRG 152
           +  WG+ G F+ILRG
Sbjct: 317 STDWGEDGYFRILRG 331


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 81/143 (56%), Gaps = 6/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE W  C S+ +I DQSNCGSCWAVS A  +SDR+C+ S G     IS   I+
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC    C  GCNGG    AW +    GVVTGG Y  +  C+PY L PC +H  G   +C 
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213

Query: 305 LLGKLKTPECKQNC---YNPSYE 324
                +TP CK+ C   Y   YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/75 (52%), Positives = 51/75 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A F  Y DF  Y  G+Y H  G   G HAV+V+GWGVEN   YW VANSW
Sbjct: 257 REMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316

Query: 138 NDHWGDHGTFKILRG 152
           +  WG++G F+ILRG
Sbjct: 317 STDWGENGYFRILRG 331


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/159 (39%), Positives = 89/159 (55%), Gaps = 11/159 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+ W +C S+  I DQS+CG+CWA     AISDR+CI + G     ISAQ ++
Sbjct: 83  LPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLL 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
            C   C  GC GG P  AW F+   G+VTGG Y +++GCQPY++    +   G    P+ 
Sbjct: 143 TCCDYCRTGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPIN 202

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           + + +     P CK+ C   SY   Y  D   G+K + +
Sbjct: 203 DLSPM-----PPCKREC-RKSYGKKYSEDKHYGEKVYTL 235



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 54/103 (52%), Positives = 68/103 (66%), Gaps = 2/103 (1%)

Query: 63  HYFKKAHMVPRCNAM--RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY +K + +    A    +I+++GP+ A F+VYADF  YKSGVYQ +     G HA+R+L
Sbjct: 227 HYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRIL 286

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
           GWG EN +PYWL ANSW +HWGD G FKI RG NE  IE   N
Sbjct: 287 GWGTENGVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDIN 329


>gi|402585445|gb|EJW79385.1| hypothetical protein WUBG_09708, partial [Wuchereria bancrofti]
          Length = 190

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/147 (44%), Positives = 88/147 (59%), Gaps = 10/147 (6%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P  FDAR +WP C S+  +A+Q  CGSCWA+S A+ +SDRLCIA+N     QISA+ +++
Sbjct: 49  PEQFDARLQWPLCWSVHQVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLIS 108

Query: 248 CTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL- 305
           C   C GC G  W   A+ +W ++G+VTGGDY S EGC+PY  AP   +   P   C+  
Sbjct: 109 CCAECGGCQGSNWALSAFIYWRNHGIVTGGDYGSFEGCKPYATAP---NCGSP---CSFE 162

Query: 306 -LGKLKTPECKQNCYNPSYESTYRFDL 331
              K   P C++ C  P Y  +Y  DL
Sbjct: 163 YYRKKAAPICQKTC-QPLYGLSYEEDL 188


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 113 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 172

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 173 WNTDWGDNGFFKILRGQDHCGIE 195



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 49/82 (59%), Gaps = 3/82 (3%)

Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
           +P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT  G+  TP+C + C
Sbjct: 27  FPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCNKTC 84

Query: 319 YNPSYESTYRFDLKKGKKAHMV 340
             P Y  +Y+ D   G  ++ V
Sbjct: 85  -EPGYSPSYKEDKHFGCSSYSV 105


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 101/196 (51%), Gaps = 3/196 (1%)

Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
           F+   G    D    F +      +E   L +     +  +P  FDAR +WP CP++  I
Sbjct: 73  FRSFMGARAYDPWRYFMSVKRRQVNERRSLSSPSGFYSSSIPAEFDARLRWPNCPTIGEI 132

Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWR 265
            +Q +C SCWAV+  + +SDR+CI S      ++SA ++++C   C  GC GG+P  AW 
Sbjct: 133 FEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCKLCGKGCKGGFPGGAWM 192

Query: 266 FWGHNGVVTGGDYNSQEGCQPYTLAPC-EHHVQGPLQNCTLLGKLKTPECKQNCYNPSYE 324
            W  +G+VTGG Y+S  GCQ Y   PC +   +G ++N          EC++ C   SY 
Sbjct: 193 HWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETC-RTSYN 251

Query: 325 STYRFDLKKGKKAHMV 340
            +Y+ DL  G+  + +
Sbjct: 252 KSYKQDLYYGESVYRI 267



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 54/81 (66%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I E+GP+ A   +Y DFL YK GVY+H  G  +  HAV++ GWG E   PYWL AN W+
Sbjct: 277 EIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWS 336

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG+ G FKILRG N A+IE
Sbjct: 337 KRWGNGGFFKILRGSNHAEIE 357


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/140 (43%), Positives = 82/140 (58%), Gaps = 2/140 (1%)

Query: 166 VEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAIS 225
           +E + ++++ L      +   +P +FD+REKW +CPSLR I DQSNCGSCWAVS A  +S
Sbjct: 75  IERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMS 134

Query: 226 DRLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           DRLCI S G     +SA  I+AC      +GC+GG+   AW++    GVVTGG Y  +  
Sbjct: 135 DRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGN 194

Query: 284 CQPYTLAPCEHHVQGPLQNC 303
           C+PY    C  H      NC
Sbjct: 195 CKPYVFPQCGAHKGKAFNNC 214



 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I + GP+ A F++Y DF  Y  GVY H  G   G H+++++GWGV+  + YWL+ANSW+
Sbjct: 259 EIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWS 318

Query: 139 DHWG-DHGTFKILRGENEADIEMG 161
             WG D G F+++RG N  DIE G
Sbjct: 319 TDWGEDGGYFRVVRGINNCDIEGG 342


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ VP+     M +++  GP  A F++Y DF+QYKSGVY+H FG  +G H+V+V+
Sbjct: 266 HYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVM 325

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEND+ YWL ANSW   WGD G FKI+RGE+    E
Sbjct: 326 GWGVENDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFE 364



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 79/156 (50%), Gaps = 14/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW  CPS+  + +Q  C S +AV+  + ++DR C+ S G       A  ++
Sbjct: 131 LPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVL 190

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC+GG P   W +W  NG+ +GG + S EGCQ Y           P   C  
Sbjct: 191 SCCHRCGFGCDGGVPSAVWHYWVENGITSGGAFGSHEGCQSY-----------PFDVCKK 239

Query: 306 LGKLK-TPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            G    TP C + C  P Y  TY  D   G+ A+ V
Sbjct: 240 SGDSNDTPRCLRFC-QPGYNVTYPEDKHYGRVAYTV 274


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 55/133 (41%), Positives = 79/133 (59%), Gaps = 3/133 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P+ FD+RE W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145

Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C  +C  GC GG+P  AW+++   GV TGGDY+++EGC PY + PC +  QG    C   
Sbjct: 146 CCMDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC-YDEQGK-NTCGGK 203

Query: 307 GKLKTPECKQNCY 319
              +  +C + CY
Sbjct: 204 PMERNHQCPKTCY 216



 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 69/122 (56%), Gaps = 2/122 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
           K  Y  T++   +  K  +++     + Q +  +GP+ A F VY DF  YKSG+Y+    
Sbjct: 213 KTCYGKTTVQDRYKTKNEYVINSIETIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPK 272

Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
               G H+++++GWG EN  PYWL  NSW+  WGDHGTFKI++G NE  IE      + +
Sbjct: 273 AKYEGGHSIKIIGWGEENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPS 332

Query: 169 NS 170
            S
Sbjct: 333 TS 334


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 85/136 (62%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA++G F   +SA+ + 
Sbjct: 85  IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N T 
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        I  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/154 (41%), Positives = 83/154 (53%), Gaps = 4/154 (2%)

Query: 188 PRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           P  FDAR+ WP C  +  H+ DQS CGSCWAVS A+ +SDRLC+ SNG     +S   I+
Sbjct: 85  PEKFDARDAWPYCREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDIL 144

Query: 247 ACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC     G  C+GGWP  AW +    GV TGGDY ++  C+PY   PC +H         
Sbjct: 145 ACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVC 204

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
             G   TP C++ C    Y   Y+ D    KK++
Sbjct: 205 PKGSWPTPRCEKFC-QRGYIKPYKKDKFYAKKSY 237



 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 64/98 (65%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y KK++ +P         I ++GP+ A F VY DF  YK G+Y+H  G   G HAV+++G
Sbjct: 232 YAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG +N   YWL+ANSW+  WG+ G F+++RGEN+ +IE
Sbjct: 292 WGKDNGTDYWLIANSWSKDWGESGFFRMVRGENDCEIE 329


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 53/83 (63%), Positives = 65/83 (78%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+   FSVY+DFL YKSGVYQH  G+ +G HA+R+LGWGVEN  PYWLV NS
Sbjct: 27  MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNS 86

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKILRG++   IE
Sbjct: 87  WNTDWGDNGFFKILRGQDHCGIE 109


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 87/155 (56%), Gaps = 2/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR +WP+C ++  I DQ++CGSCWA + A+A+SDR+CI SNG    +++A   +
Sbjct: 63  LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPL 122

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC GG+P  AW +W   G+VTGG + ++ GCQP+    C+H       +   
Sbjct: 123 SCCTYCGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCP 182

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                 P C + C    Y  TY  D   G  ++ V
Sbjct: 183 HYTYPKPPCARAC-QTGYNKTYEQDKFYGNSSYNV 216



 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 40/55 (72%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           M++I ++GP+   F+++ DF  Y+SG+Y H  G  IG HAVR++GWGVEN + YW
Sbjct: 224 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYW 278


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 80/143 (55%), Gaps = 7/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+RE W  C S+ +I DQSN GSCWAVS A  +SDR+C+ S G     IS   I+
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC    C  GCNGG    AW +    GVVTGG Y  +  C+PY L PCE  + G   +C 
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCE--ITGKFWSCP 212

Query: 305 LLGKLKTPECKQNC---YNPSYE 324
                +TP CK+ C   Y   YE
Sbjct: 213 RDHSFRTPACKKYCQYGYGKRYE 235



 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 45/64 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A F+ Y DF  Y+ G+Y H++G   G HAV+V+GWGVEN   YW VANSW
Sbjct: 256 REMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGVENGTKYWNVANSW 315

Query: 138 NDHW 141
           +  W
Sbjct: 316 STDW 319


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/160 (41%), Positives = 94/160 (58%), Gaps = 14/160 (8%)

Query: 183 NAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           +A  LP++FD+R+++ +C   +  I DQSNCGSCWAVS A+ I DR+CIASNG     IS
Sbjct: 104 DATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVHIS 163

Query: 242 AQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           AQ I++C T    GCNGG+P  A+  +  +GVVTG   ++ +GC+PY   P         
Sbjct: 164 AQDILSCATDRSQGCNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLP--------- 214

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            + T+  +  TPEC + C N  Y+  Y+ D   G   + V
Sbjct: 215 -HTTV--EYSTPECSKKCENYQYKKAYKQDKHFGMSVYNV 251



 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 58/83 (69%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +I  +GP+ A   VY DF+ YKSGVYQ  F   +G HAVR++GWGV+    +PYWLVANS
Sbjct: 262 EIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLVANS 321

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WG+ G F+I RG +E+ IE
Sbjct: 322 WNTDWGEDGYFRIRRGTDESYIE 344


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 54/133 (40%), Positives = 79/133 (59%), Gaps = 3/133 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P+ FD+R  W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145

Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C  +C  GC GG+P  AW+++   GV TGGDY+++EGC PY + PC ++ QG    C   
Sbjct: 146 CCKDCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203

Query: 307 GKLKTPECKQNCY 319
              +  +C + CY
Sbjct: 204 PMERNHQCPKTCY 216



 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 2/122 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
           K  Y  T++   +  K  + +     + Q +  +GP+ A F VY DF  YKSG+Y+    
Sbjct: 213 KTCYGKTTVQNRYKTKSEYSINSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPK 272

Query: 110 DSI-GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
               G H+++++GWG EN   YWL  NSW+  WG+HGTFKI++G NE  IE      + +
Sbjct: 273 AKYEGRHSIKIIGWGQENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPS 332

Query: 169 NS 170
           +S
Sbjct: 333 SS 334


>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
          Length = 215

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 2/139 (1%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
            N + +PR FDAR+KW  C ++  + DQ NC S WA+S ++A +DRLC+A+NG F   +S
Sbjct: 1   DNYQEIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLS 60

Query: 242 AQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           A+ I  C   C  GC GG+P  AW+ +  +G+VTGG+Y S EGC+PY + PC +   G  
Sbjct: 61  AEEITFCCHTCGNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGN- 119

Query: 301 QNCTLLGKLKTPECKQNCY 319
             C+         C + CY
Sbjct: 120 NTCSGQPMESNHRCTRMCY 138



 Score = 45.1 bits (105), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 32/49 (65%), Gaps = 1/49 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE 125
           + +  +GP+ A F VY DF  YKSG+Y  +   S +G H+V+++GWG E
Sbjct: 165 KDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEE 213


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/153 (41%), Positives = 81/153 (52%), Gaps = 15/153 (9%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FDAR+ WP C S++ I DQ+ CGSCWA   A  ISDR+CI SNG     IS + I+ 
Sbjct: 81  PTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVISDRICIQSNGTDQPIISPEDILT 140

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE--HHVQGPLQNCTL 305
           C  N  GC GG+   A +FW   GVVTGGD+   +GC PY+   C   H  Q        
Sbjct: 141 CCTNSHGCQGGFVLEAMKFWKSKGVVTGGDFQG-DGCIPYSYGSCSDCHTAQ-------- 191

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
                TP+CK  C     ++ Y+ D   G  A+
Sbjct: 192 ----TTPKCKNECQVKYTKNEYKEDKYYGSSAY 220



 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/101 (45%), Positives = 67/101 (66%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRCNAMR----QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVR 118
           +Y   A+ +   NA+R    +I  +GP+ A + VY DF  YKSGVY++  G  +G HAV+
Sbjct: 214 YYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVK 273

Query: 119 VLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           ++GWGVE ++ YWL+ANSW   +G++G FK+ RG NE  IE
Sbjct: 274 IIGWGVEENVNYWLIANSWGTGFGENGFFKMRRGNNECGIE 314


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 80/143 (55%), Gaps = 6/143 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  W  C S+ +I DQSNCGSCWAVS A  +SDR+C+ S G     IS   I+
Sbjct: 95  IPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC    C  GCNGG    AW +    GVVTGG Y  +  C+PY L PC +H  G   +C 
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNH-GGKFWSCP 213

Query: 305 LLGKLKTPECKQNC---YNPSYE 324
                +TP CK+ C   Y   YE
Sbjct: 214 RDHSFRTPACKKYCQYGYGKRYE 236



 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 50/75 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A    Y DF  Y+ G+Y H  G   G HAV+V+GWGVEN   YW VANSW
Sbjct: 257 REMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSW 316

Query: 138 NDHWGDHGTFKILRG 152
           +  WG+ G F+ILRG
Sbjct: 317 STDWGEDGYFRILRG 331


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 86/156 (55%), Gaps = 15/156 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +W  C S+  I DQ+ CGSCWA S A  ISDR+CIA+ G     IS   ++
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC  N    GC GG+P  A+R+W   GVVTGGD+    GC+PY  APC   +  P +   
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWNSRGVVTGGDFRGS-GCRPYPFAPC---ISCPEE--- 193

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                KTP C  +C    Y + Y  D + G  A+ V
Sbjct: 194 -----KTPTCSLSC-QFGYSTAYAKDKRFGVSAYAV 223



 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 64/95 (67%), Gaps = 2/95 (2%)

Query: 67  KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
            A+ V R  A  Q  I  +GP+V  F++Y D  +YKSGVY+H  G  +G HA++++GWG 
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 278

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +N IPYWL+ANSW  +WG++G  K+ RG NE  IE
Sbjct: 279 QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIE 313


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 58/147 (39%), Positives = 85/147 (57%), Gaps = 5/147 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P+ FDAR++W  C ++  + DQ NCGSCWA++ ++A +DRLCIA+N  F   +SA+ + 
Sbjct: 90  IPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELT 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C + C+GG+P  AW ++  +G+VTGG Y S EGC PY + PC     G    C  
Sbjct: 150 FCCHLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGN-NTCRG 208

Query: 306 LGKLKTPECKQNCYNP---SYESTYRF 329
               K   C + CY      Y+  +RF
Sbjct: 209 QPMEKHHRCTRMCYGDQEIDYDDDHRF 235



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/97 (40%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           + +  + +   +  + +  +GP+ A   VY DF  YKSGVY+ +   + +G HAV+++GW
Sbjct: 235 FTRDYYYLTYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGW 294

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G E+ +PYWL+ NSW++ WGD G FKI RG NE  ++
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVD 331


>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
 gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
          Length = 421

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 60/157 (38%), Positives = 85/157 (54%), Gaps = 10/157 (6%)

Query: 174 DDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN 233
           D+LE     N+  +P+NFDAR+KWP CPS+ ++ +Q  CGSC+AV+ A   SDR CI SN
Sbjct: 128 DELENF---NSSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSN 184

Query: 234 GYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE 293
           G F   +S + I+ C   C  C GG P  A  +W + G+VTGG    ++GC+PY+    +
Sbjct: 185 GTFKSLLSEEDIIGCCSVCGNCYGGDPLKALTYWVNQGLVTGG----RDGCRPYSF---D 237

Query: 294 HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
                P    T     +   C + C N  Y+  Y  D
Sbjct: 238 LSCGVPCSPATFFEAEEKRTCMKRCQNIYYQQKYEED 274



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 50/88 (56%), Gaps = 10/88 (11%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQ----HNFGDSIGL-HAVRVLGWGVEND-IPYW 131
           ++I  +GP    F V  +FL Y SGV++      F D I   H VR++GWG  +D   YW
Sbjct: 327 KEILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFDDRIVYWHVVRLIGWGESDDGTHYW 386

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
           L  NS+ +HWGD+G FKI    N  D+E
Sbjct: 387 LAVNSFGNHWGDNGLFKI----NTDDME 410


>gi|227018340|gb|ACP18836.1| cysteine proteinase 3 [Chrysomela tremula]
          Length = 190

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 55/107 (51%), Positives = 78/107 (72%), Gaps = 1/107 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDARE WPEC S+R I DQS+CGSCWAV+ A A+SDR+CI S G     +S + ++
Sbjct: 83  IPENFDARENWPECESIRMIRDQSDCGSCWAVAAAAAVSDRICIYSYGANQTIVSDEDLL 142

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPC 292
           +C  +C +GC+GG+   AW +W ++G+V+GG YNS  GC+ Y++ PC
Sbjct: 143 SCCDDCGFGCDGGYSWEAWNYWKNDGIVSGGPYNSTRGCKAYSMQPC 189


>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 515

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 86/158 (54%), Gaps = 8/158 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+ W +CPS+R I  QS+CGSCWA     A+SDRLCI S   +   +SA  ++
Sbjct: 81  IPMQFDARKYWLKCPSIREIRGQSSCGSCWAFGAVEAMSDRLCIHSGAKYQKGLSAVDLL 140

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
           +C   C +GC+GG+P  AW +W  +G+VTGG   +  GC+ Y    C H  +G  PL   
Sbjct: 141 SCCWKCGYGCDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPSCSHDERGRHPLCPS 200

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
            +     TP C + C        Y  +L K   ++ VL
Sbjct: 201 EI---YHTPRCTKKCDTDKLH--YSAELTKANSSYNVL 233



 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           M +I  +GP+ A+F VY DFLQY+ G+Y
Sbjct: 240 MMEIMNNGPVEAVFDVYEDFLQYEKGIY 267


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA-CTPNCWGCNGGWPQLAWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++ C     GC GG+P  AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+++RG +E  IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 64/144 (44%), Positives = 82/144 (56%), Gaps = 22/144 (15%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG--YFTGQISAQH 244
           +P +FD+R+KWP C S+  I DQS CGS WAVS   AISDR+CI S G   + G      
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQSYCGS----- 144

Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
                    GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C 
Sbjct: 145 ---------GCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACG 195

Query: 305 LLGKL-KTPECKQNC---YNPSYE 324
              KL KTP+CKQ C   YN SYE
Sbjct: 196 --DKLYKTPQCKQTCQKGYNTSYE 217



 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   ++ V    ++ Q  I  HGP+ A   +Y DFL YKSG+Y++  G  I  HAVR++
Sbjct: 221 HYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLI 280

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN   YWL AN+WN+ WG+ G F+I+RG NE  IE
Sbjct: 281 GWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 319


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 88  IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N T 
Sbjct: 148 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 204

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 205 RGKPAEKNHRCTRMCY 220



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        I  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 232 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 292 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 329


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 68/171 (39%), Positives = 92/171 (53%), Gaps = 9/171 (5%)

Query: 161 GFNNRV-EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
           G+ N + E    +DD L T      K    +FDARE W  C  + H+ DQ NCGSCWA  
Sbjct: 63  GYKNYLNEVEIKKDDPLYTKNNDTIK----HFDAREDWKICKQIGHVRDQGNCGSCWAFG 118

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A +DRLC+A+ G F  Q+SA+ +  C   C  GC GG P  AW+++  +G+ TGGDY
Sbjct: 119 TTGAFADRLCVATGGGFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRHGITTGGDY 178

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYR 328
            S EGC PY + PC +  QG    C         +C + CY N + E+ Y+
Sbjct: 179 GSNEGCAPYKVPPC-YDDQGEFL-CQGKPTEHNHKCPRACYGNSTVENRYK 227



 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 70/111 (63%), Gaps = 2/111 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHN-F 108
           +  Y  +++   +  K  +++     + Q I ++GP+ A F VY DF+ YKSG+YQ    
Sbjct: 214 RACYGNSTVENRYKVKSIYVLDSSKTIEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPN 273

Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
              +G H+V+++GWG E+ IPYWL+ NSW+  WG+ GTF+I++G NE  IE
Sbjct: 274 AFYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIE 324


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 88  IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 147

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N T 
Sbjct: 148 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 204

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 205 RGKPAEKNHRCTRMCY 220



 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        I  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 232 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 291

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 292 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 329


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 53/87 (60%), Positives = 66/87 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++ +HGP+   F VYADF  YKSGVYQH  G  +G HAVR+LGWG EN++PYWL+ANS
Sbjct: 27  MKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANS 86

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN  WGD+G FKI+RG+NE  IE   N
Sbjct: 87  WNTDWGDNGYFKIIRGKNECGIESDVN 113


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/154 (40%), Positives = 84/154 (54%), Gaps = 4/154 (2%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDA EKWPEC  +  +I DQSNCGSCWAVS A  +SDR+C+A+NG     IS    
Sbjct: 72  IPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICVATNGKVKVSISGIAT 131

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
            +C     GCNGG  ++A+  +  NG  TG + +  +GCQPY    C HHV       C 
Sbjct: 132 ASCVGGD-GCNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCD 190

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            + + K   C   C    Y+  Y  DL  GK+ +
Sbjct: 191 SVPEYKADTCSHEC-QKDYDRKYEEDLYYGKEQY 223



 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 2/86 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANS 136
           R+I  +GP+   F+VY  FL Y  G+Y+   G+ I G HAVRV+GWGVEN   YW +ANS
Sbjct: 233 REIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGVENGTKYWKIANS 292

Query: 137 WNDHWGDHGTFK-ILRGENEADIEMG 161
           WN+ WG          G +E+DIE G
Sbjct: 293 WNEQWGRERLLPHTPAGVDESDIEDG 318


>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
          Length = 220

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 82/134 (61%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W  C ++  + +Q NCGSCWA     A +DRLCIA++G F   ISA+ + 
Sbjct: 47  VPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELT 106

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GCNGG P  AW+++  +GVVTGG+YN+ +GCQP  + PC    +G   +C+ 
Sbjct: 107 FCCHTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEG-HNSCSG 165

Query: 306 LGKLKTPECKQNCY 319
               +  +C + CY
Sbjct: 166 QPTERNHKCSKKCY 179


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 84/136 (61%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+KW +C ++  + DQ  CGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 85  IPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N T 
Sbjct: 145 FCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 202 RGKPAEKNHRCTRMCY 217



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        I  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 65/166 (39%), Positives = 93/166 (56%), Gaps = 8/166 (4%)

Query: 174 DDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN 233
           DD  ++  +N+  L + FDARE+WPEC S+  I D S C S WA + A ++SDRLCI S 
Sbjct: 16  DDGPSVPTENSD-LSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSG 74

Query: 234 GYFTGQISAQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
           G     +SAQ +++C         GC GG    AW++WG +G+ TGG Y SQ GC+PY++
Sbjct: 75  GTINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSI 134

Query: 290 APCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           APC   V            L TP C++ C   + ++ Y  D+ K +
Sbjct: 135 APCGKTVGNVTYPACTNTTLPTPSCEKKC---TSKNGYPVDIDKDR 177



 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 60/99 (60%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY      +P  +      +  +GP+   F VY DFLQY +G+Y H  G+  G  +VR+L
Sbjct: 178 HYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 237

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG+   +PYWL+ANSW   WG++GTF+ LRG NE  +E
Sbjct: 238 GWGMYEGVPYWLLANSWGKEWGENGTFRALRGTNECGLE 276


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 67/82 (81%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSG+Y+H  G+++G HA+R++GWGVEN  PYWL+ANSW
Sbjct: 213 KEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIANSW 272

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+I+RG +E  IE
Sbjct: 273 NEDWGENGYFRIVRGRDECSIE 294



 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 68/145 (46%), Gaps = 28/145 (19%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGSC A     A+S+R CI S G    ++SA  + 
Sbjct: 89  IPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSERSCIQSGGKQNVELSAVDL- 147

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
                                   G+VTG    +  GC+PY    CEH  +G    C   
Sbjct: 148 -----------------------EGIVTGSSKENNTGCEPYPFPKCEHFTKGQYPPCG-- 182

Query: 307 GKL-KTPECKQNCYNPSYESTYRFD 330
            K+ KTP CK  C    Y+++Y  D
Sbjct: 183 SKIYKTPRCKTTC-QKRYKTSYAQD 206


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/128 (46%), Positives = 81/128 (63%), Gaps = 3/128 (2%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
           SCWAVS A A+SDR+CIAS G     ISAQ +V+C   C +GC+GGWP  AW+F+   GV
Sbjct: 1   SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGYGCDGGWPIKAWQFFAREGV 60

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
           VTGG+Y  Q  C+PY + PC HH + P          +TP CK+ C    Y++TY+ D +
Sbjct: 61  VTGGNYGRQGCCRPYEITPCGHHGREPYYG-ECYDDAQTPRCKRKC-QSGYKTTYKKDKR 118

Query: 333 KGKKAHMV 340
            G+KA+ +
Sbjct: 119 YGRKAYQL 126



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 47  KKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           K+K +  Y  T      Y +KA+ +P       R+I  HGP+VA ++VY DF  Y  G+Y
Sbjct: 102 KRKCQSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIY 161

Query: 105 QHNFGDSIGLHAVR 118
           +H  G   G HAV+
Sbjct: 162 KHTAGRETGGHAVK 175


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 82/155 (52%), Gaps = 3/155 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+RE+W  CPS++ I DQS C S WA++   AISDR+CI +NG    ++SA  +V
Sbjct: 84  LPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELV 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GCN G+ + AW +W  NG+VTG    +  GC PY    C+H        C  
Sbjct: 144 SCCSKCAVGCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGSSDSYPMCGY 203

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +     P C   C  P Y   Y  D   GK A+ V
Sbjct: 204 V-VYTPPVCNGTC-RPGYPIPYNDDKHFGKSAYQV 236



 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/103 (50%), Positives = 70/103 (67%), Gaps = 2/103 (1%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K A+ V +   +  R+I  +GP+ A   +Y DF+ YKSGVY+H  G  I + +VR++
Sbjct: 228 HFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRII 287

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFN 163
           GWG+EN IPYWL ANSWN+ WG +G FKILRG NE +IE   N
Sbjct: 288 GWGIENGIPYWLCANSWNEEWGLNGFFKILRGSNECEIEAFVN 330


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/173 (39%), Positives = 92/173 (53%), Gaps = 9/173 (5%)

Query: 161 GFNNRV-EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVS 219
           G+ N + E    +DD L T      K    +FDARE W  C  + H+ DQ NCGSCWA  
Sbjct: 63  GYKNYLNEVEIKKDDPLYTKNNNKIK----HFDARENWKICKQIGHVRDQGNCGSCWAFG 118

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A +DRLC+A+ G F  Q+SA+ +  C   C  GC GG P  AW+++   G+ TGGDY
Sbjct: 119 TTGAFADRLCVATGGGFNEQLSAEKLTFCCWTCGLGCQGGNPIKAWKYFKRRGITTGGDY 178

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFD 330
            S EGC PY + PC +  QG    C         +C + CY N + E+ Y+ +
Sbjct: 179 GSNEGCAPYKVPPC-YDDQGEFL-CQGKPTEHNHKCPRACYGNSTVENRYKVE 229



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 41/83 (49%), Positives = 58/83 (69%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           + I  +GP+ A F VY DF+ YKSG+YQ       +G H+V+++GWG E+ IPYWL+ NS
Sbjct: 242 QDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNS 301

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W+  WG+ GTF+I++G NE  IE
Sbjct: 302 WSKFWGEQGTFRIIKGRNECGIE 324


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 86/153 (56%), Gaps = 7/153 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L + FDARE+WPEC S+  I D S C S WA + A ++SDRLCI S G     +SAQ ++
Sbjct: 72  LSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGMINTILSAQELL 131

Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           +C         GC GG    AW++WG +G+ TGG Y +Q GC+PY++APC   V      
Sbjct: 132 SCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYP 191

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
                 L TP C++ C   + ++ Y  D+ K +
Sbjct: 192 ACTNTTLPTPSCEKKC---TSKNGYPVDIDKDR 221



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 55/80 (68%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           +  +GP+   F VY DFLQY +G+Y H  G+  G  +VR+LGWG+   +PYWL+ANSW  
Sbjct: 242 VMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLANSWGK 301

Query: 140 HWGDHGTFKILRGENEADIE 159
            WG++GTF+ LRG NE  +E
Sbjct: 302 EWGENGTFRALRGTNECGLE 321


>gi|56758470|gb|AAW27375.1| unknown [Schistosoma japonicum]
          Length = 217

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/118 (44%), Positives = 75/118 (63%), Gaps = 1/118 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS CGS WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +C   C  GC+GG+   +W +W   G+VTGG   +  GC+PY    C+H V+G  + C
Sbjct: 150 SCCKYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRAC 207


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/123 (48%), Positives = 79/123 (64%), Gaps = 1/123 (0%)

Query: 38  KKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQIYEHGPLVAIFSVYADF 96
           K   K  + +KK +  Y  T     HY ++++ V   N  M ++   GP+ A F+VY+DF
Sbjct: 77  KGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSSVNDIMEELVTRGPVEAAFTVYSDF 136

Query: 97  LQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEA 156
           LQY SGVY+H  G ++G HAV++LG+GVEN   YWLVANSWN  WGD G FKILRG +E 
Sbjct: 137 LQYHSGVYRHTTGSALGGHAVKILGYGVENGDKYWLVANSWNPDWGDQGFFKILRGVDEC 196

Query: 157 DIE 159
            IE
Sbjct: 197 GIE 199



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/109 (44%), Positives = 68/109 (62%), Gaps = 4/109 (3%)

Query: 233 NGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           N      +SA  ++AC  +C  GCNGG+P  AW  + H+GVVTGG YNS++GCQPY +A 
Sbjct: 5   NATVHAHVSANELLACCESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA 64

Query: 292 CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C+HHV G L+ C   G  KTP C++ C    Y  T++ D   G++++ V
Sbjct: 65  CDHHVVGKLKPCK--GDGKTPRCEKKC-EAGYNVTFKDDKHYGQRSYSV 110


>gi|339242631|ref|XP_003377241.1| cathepsin B [Trichinella spiralis]
 gi|316973973|gb|EFV57514.1| cathepsin B [Trichinella spiralis]
          Length = 199

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 90/157 (57%), Gaps = 13/157 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ +D R+ +P C  +  I DQSNCGSCWAVS A+ +SDR CIA+NG     +S + ++
Sbjct: 54  LPKEYDVRKAYPHCKYINFIKDQSNCGSCWAVSSASVMSDRHCIATNGTEQPFLSEEELI 113

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG+   A+ +W   G+ +GG Y  + GC+PY++APC         NC  
Sbjct: 114 SCCKTCGLGCDGGYVSHAFEYWVEKGLPSGGAYGWKTGCKPYSIAPC--------NNCD- 164

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
             + +TP+CK  C  P Y  T + D   G K  + + 
Sbjct: 165 --EAETPKCKNTCI-PEYPLTPKDDKYFGNKIMLRIF 198


>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
 gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
          Length = 273

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/140 (42%), Positives = 74/140 (52%), Gaps = 11/140 (7%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           A GLP +FDAR KWP C  L  +A DQ NCGSCWA++ A  +SDR CI S G    ++S 
Sbjct: 15  ALGLPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSP 74

Query: 243 QHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
             ++AC    +GC GG    A+ F   NGVVTGG ++ Q  C PY  APC H  +     
Sbjct: 75  FQLLACAQGSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHHPCE----- 129

Query: 303 CTLLGKLKTPECKQNCYNPS 322
                   TP C   C   S
Sbjct: 130 -----VFPTPACPATCVGGS 144



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 13/87 (14%)

Query: 79  QIYEHGPLVAIFS-VYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWG------VEN 126
           +IY +GP+ +    +Y +F  YKSGV++ +      G + G H V+V+GWG       E 
Sbjct: 174 EIYHNGPVSSYAGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEG 233

Query: 127 DIPYWLVANSWNDHWGDHGTFKILRGE 153
           +  YW+V NSW + WGD G  +I  GE
Sbjct: 234 EGYYWIVVNSWLN-WGDDGVGRIAVGE 259


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 84/136 (61%), Gaps = 6/136 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAR+KW +C ++  + DQ +CGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 85  IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG P  AW  +  +G+VTGG+Y+S EGCQPY + PC     G   N T 
Sbjct: 145 FCCHKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYG---NNTC 201

Query: 306 LGKL--KTPECKQNCY 319
            GK   K   C + CY
Sbjct: 202 SGKPAEKNHRCTRMCY 217



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        +  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 49/108 (45%), Positives = 67/108 (62%), Gaps = 1/108 (0%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P+ FD+R  W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELTF 145

Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
           C  +C  GC GG P  AW ++   GV TGGDYN++EGC PY + PC +
Sbjct: 146 CCKDCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRN 193



 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 2/122 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
           K  Y  T++   +  K  + +     + Q I  +GP+ A F  Y D   YKSG+Y+ +  
Sbjct: 213 KTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPN 272

Query: 110 DSI-GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
               G H+++++GWG E+  PYWL  NSW+  WGDHGTFKI++G NE  IE      + +
Sbjct: 273 AKYKGGHSIKIIGWGQEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPS 332

Query: 169 NS 170
           +S
Sbjct: 333 SS 334


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 90/176 (51%), Gaps = 10/176 (5%)

Query: 167 EANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISD 226
           E +  ED DL       A  LP +FDAREKWPECPS+  I DQS  G CWAVS A  ++D
Sbjct: 81  EMDQQEDIDL-------AVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTD 133

Query: 227 RLCIASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
           R+CI SNG     +S   I++C    C  GC  G P+ A+ +    GV +GG Y ++  C
Sbjct: 134 RICIQSNGTKQVYVSETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVC 193

Query: 285 QPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           +PY   PC +H   P       G   TP C++ C    Y   Y  D   G K  ++
Sbjct: 194 KPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKAC-QSDYTVPYNDDRIFGSKTIVL 248



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 62/83 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I+ +GPLVA ++VY DF  YK+G+Y    G + G HAV+++GWG EN + YWL+ANSW
Sbjct: 256 REIFNNGPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWGEENGVKYWLIANSW 315

Query: 138 NDHWGDHGTFKILRGENEADIEM 160
           N  WG++G F++LRG N  DIE+
Sbjct: 316 NTDWGENGFFRMLRGTNLCDIEL 338


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 68/98 (69%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y  K++ VP       ++I  +GP+   F+VY D +QYK GVYQH  G  +G HA+R+LG
Sbjct: 62  YGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILG 121

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGVEND+PYWL+ANSWN  WG++G FKILRG +   IE
Sbjct: 122 WGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 274 TGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKK 333
           +GG + S +GC PY +APCEHHV G    C    + KTP+C ++C   SY   Y  D   
Sbjct: 5   SGGPFGSNQGCHPYKIAPCEHHVNGTRPACNGE-EGKTPKCIKHC-QASYTVAYEQDKSY 62

Query: 334 GKKAHMV 340
           G K++ V
Sbjct: 63  GAKSYSV 69


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/86 (60%), Positives = 66/86 (76%), Gaps = 1/86 (1%)

Query: 75  NAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
           NAM+ +IY++GP+V  F VY DF QY+SGVY+H  G   G HAVRV+GWGVEN + YWL 
Sbjct: 193 NAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGVENGVKYWLC 252

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           ANSWN+ WG++G FKI+RGEN   +E
Sbjct: 253 ANSWNERWGENGFFKIVRGENHVGVE 278



 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/173 (35%), Positives = 94/173 (54%), Gaps = 14/173 (8%)

Query: 169 NSSEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
           N SE ++L  +  Q + + LP +FDAR+KWP CPSL  I  Q +CGSC+AVS A  I+DR
Sbjct: 28  NESELNNLPRLQNQRSVRALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDR 87

Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            CI S G       +   ++C  +C+ C+GG+    + +W   G+ +GG Y+S +GC+PY
Sbjct: 88  YCIHSGGERQFYFGSTGYLSCCTDCYKCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPY 147

Query: 288 TLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                     G  Q+  ++ K     C + C    Y  TY  DLK G  ++++
Sbjct: 148 PFG-------GATQDVNIVLK-----CDRQC-QAGYPLTYSQDLKHGASSYIL 187


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 82/134 (61%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C +GC+GG+P  AWR++  +G+ +         CQPY    C+HH  +G    C+
Sbjct: 149 SCCKDCGYGCDGGYPDAAWRYYVSHGLAS-------SYCQPYPFPHCDHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-DFHTPKCNTTC 214



 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 59/82 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F VY+DF  YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 240 RELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G F ILRG++E  IE
Sbjct: 300 DTDWGMNGHFLILRGKDECGIE 321


>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
          Length = 426

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 7/184 (3%)

Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
           FK  R +   +  M    +   + +    LE +    +  LP++FDAR+KWP CPS+ ++
Sbjct: 103 FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNV 162

Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
            +Q  CGSC+AV+ A   SDR CI SNG F   +S + I+ C   C  C GG P  A  +
Sbjct: 163 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDIIGCCSVCGNCYGGDPLKALTY 222

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W + G+VTGG    ++GC+PY+    +     P    T     +   C + C N  Y+  
Sbjct: 223 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQK 275

Query: 327 YRFD 330
           Y  D
Sbjct: 276 YEED 279



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 17/121 (14%)

Query: 50  KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
           K+R+ +PT I   H+     +K ++    N ++ +I  +GP    F V  +FL Y SGV+
Sbjct: 301 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 358

Query: 105 Q----HNFGDSIGL-HAVRVLGWGVENDIP-YWLVANSWNDHWGDHGTFKILRGENEADI 158
           +      F D I   H VR++GWG  +D   YWL  NS+ +HWGD+G FKI    N  D+
Sbjct: 359 RPFPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI----NTDDM 414

Query: 159 E 159
           E
Sbjct: 415 E 415


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 80/137 (58%), Gaps = 11/137 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR++WP C S++ + DQS CGSCWA   A A+SDRLCIA+ G  T   +   + 
Sbjct: 75  IPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIAT-GKQTRISTEDLLT 133

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ----GPLQ 301
            C   C  GCNGG+P  AW ++ + G+VTG  +     C+PYT  PC+HHV     GP  
Sbjct: 134 CCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCG 193

Query: 302 NCTLLGKLKTPECKQNC 318
           +        TP C ++C
Sbjct: 194 D-----SQPTPACVKSC 205



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/84 (58%), Positives = 63/84 (75%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I   GP+ A F+VY DFL YKSGVYQ+  G ++G HAV+++GWGVE ++PYWLV NSWN
Sbjct: 236 EIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVPYWLVVNSWN 295

Query: 139 DHWGDHGTFKILRGENEADIEMGF 162
           + WG++G FKILRG N   IE G 
Sbjct: 296 EGWGENGLFKILRGSNHVGIEGGI 319


>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
          Length = 422

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 91/184 (49%), Gaps = 7/184 (3%)

Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
           FK  R +   +  M    +   + +    LE +    +  LP+ FDAR+KWP CPS+ ++
Sbjct: 99  FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELDNYKSSDLPKAFDARQKWPNCPSISNV 158

Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
            +Q  CGSC+AV+ A   SDR CI SNG F   +S + I+ C   C  C GG P  A  +
Sbjct: 159 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCSVCGNCYGGDPLKALTY 218

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W + G+VTGG    ++GC+PY+    +     P    T     +   C + C N  Y+  
Sbjct: 219 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQR 271

Query: 327 YRFD 330
           Y  D
Sbjct: 272 YEED 275



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 17/121 (14%)

Query: 50  KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
           K+R+ +PT I   H+     +K ++    N ++ +I  +GP    F V  +FL Y SGV+
Sbjct: 297 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 354

Query: 105 Q----HNFGDSIGL-HAVRVLGWG-VENDIPYWLVANSWNDHWGDHGTFKILRGENEADI 158
           +      F D I   H VR++GWG  E+   YWL  NS+  HWGD+G FKI    N  D+
Sbjct: 355 RPFPLDGFDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI----NTDDM 410

Query: 159 E 159
           E
Sbjct: 411 E 411


>gi|161343881|tpg|DAA06121.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 182

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/111 (47%), Positives = 76/111 (68%), Gaps = 2/111 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P+ FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIA+NG FT  +SAQ++
Sbjct: 66  IPKEFDARQYFFNCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATNGTFTQNLSAQNL 125

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
           ++C  +   GCNGG    AW F    G+VTGG+++S EGCQPY   PC+H+
Sbjct: 126 MSCGDDEKSGCNGGSAFKAWEFITGKGIVTGGNFDSNEGCQPYKNRPCDHY 176


>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
 gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
          Length = 410

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 92/184 (50%), Gaps = 7/184 (3%)

Query: 147 FKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHI 206
           FK  R +   +  M    +   + +    LE +    +  LP++FDAR+KWP CPS+ ++
Sbjct: 87  FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELENYKSSDLPKHFDARQKWPNCPSISNV 146

Query: 207 ADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266
            +Q  CGSC+AV+ A   SDR CI SNG F   +S + I+ C   C  C GG P  A  +
Sbjct: 147 PNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCSVCGNCYGGDPLKALTY 206

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W + G+VTGG    ++GC+PY+    +     P    T     +   C + C N  Y+  
Sbjct: 207 WVNQGLVTGG----RDGCRPYSF---DLSCGVPCSPATFFEAEEKRTCMRRCQNIYYQQK 259

Query: 327 YRFD 330
           Y  D
Sbjct: 260 YEED 263



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 17/121 (14%)

Query: 50  KKRLYLPTSIPLSHY----FKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY 104
           K+R+ +PT I   H+     +K ++    N ++ +I  +GP    F V  +FL Y SGV+
Sbjct: 285 KERVKVPTII--GHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVF 342

Query: 105 Q----HNFGDSIGL-HAVRVLGWGVENDIP-YWLVANSWNDHWGDHGTFKILRGENEADI 158
           +      F D I   H VR++GWG   D   YWL  NS+ +HWGD+G FKI    N  D+
Sbjct: 343 RPFPLDGFDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI----NTDDM 398

Query: 159 E 159
           E
Sbjct: 399 E 399


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 57/157 (36%), Positives = 85/157 (54%), Gaps = 16/157 (10%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P  FDARE WP+C   + +I +Q  CGSCWA + A  +SDRLC+A+NG    + S + +
Sbjct: 73  IPATFDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDL 132

Query: 246 VACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           + C   C   C GG+   AW+++   G+V+GGDYN+  GCQPY+ +     V        
Sbjct: 133 INCCETCGKKCKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSNFNDGV-------- 184

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
                 +PEC + C N  Y ++Y  D   G   + +L
Sbjct: 185 ------SPECSKTCQNTKYPTSYLNDRHFGDGTYYIL 215



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/77 (55%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 84  GPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD 143
           GP++A F VY DF  Y+ GVY H  G  +G HAV+++GWG EN   YWLVANSW   WG 
Sbjct: 230 GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGA 289

Query: 144 -HGTFKILRGENEADIE 159
             G FKI RG NE  IE
Sbjct: 290 LGGVFKIRRGTNECKIE 306


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 52/83 (62%), Positives = 62/83 (74%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY++GP+V  F V+ADF QYKSGVY+H  G + G HAVRV+GWGVEN + YWLVANS
Sbjct: 228 MTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVENGVKYWLVANS 287

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WGD G FK +RGEN   IE
Sbjct: 288 WGVRWGDKGFFKFVRGENHLGIE 310



 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 84/160 (52%), Gaps = 14/160 (8%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           ++ + LP +FD+R+KWP CPSL  I DQ  CGSC+ VS A AI+DR CI S G       
Sbjct: 76  RSVRSLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFG 135

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           A   +AC  +C+ C+GG+    W++W  +G+ + G Y S +GC  Y      + V  PL 
Sbjct: 136 ATDYLACCTDCFKCDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPFG--SYCVNDPL- 192

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
                     P C + C    Y  TY  DLK G  A+ V+
Sbjct: 193 ----------PTCSRTC-QAGYPLTYSQDLKYGGSAYRVM 221


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 101/195 (51%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM  N R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C G   G     AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243



 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+++RG +E  IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L   FDA E WPECP++  I DQS+CGSCWAV+ A+AISDR C    G    +ISA  ++
Sbjct: 92  LQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLM 150

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
           +C   C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+
Sbjct: 151 SCCDVCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203

Query: 305 LLGKLKTPECKQNC 318
             G+  TP C   C
Sbjct: 204 --GEYDTPTCNSTC 215



 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 54/82 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADF+ Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG +G F I RG +E  IE
Sbjct: 301 NREWGMNGYFLIARGVDECGIE 322


>gi|239790489|dbj|BAH71802.1| ACYPI000009 [Acyrthosiphon pisum]
          Length = 178

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/111 (47%), Positives = 75/111 (67%), Gaps = 2/111 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +PR FDAR+ +  C + +  + DQ NC S WAV+VA+  +DRLCIASNG FT  +SAQ++
Sbjct: 64  IPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 123

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
           ++C      GC+GG    AW    + G+VTGG+++S EGCQPY   PC+H+
Sbjct: 124 MSCGDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDHY 174


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWPECPSLR I DQ  CGSCWAVS A+A++DR C+ S G       +  ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW+FW   G+ +GG  NS++GC PY           P+  C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233

Query: 306 LGKLK-TPECKQNC 318
            G+ + TP+C   C
Sbjct: 234 PGEDEDTPKCSNKC 247



 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 59/83 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I+ +GP+ A F  Y D   YKSG+Y+H +G   G HAV++LGWGVEN + YWLVANS
Sbjct: 277 MEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANS 336

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG++G FK++RGEN   IE
Sbjct: 337 WGREWGENGFFKMVRGENHCGIE 359


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 102/195 (52%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C G   G     AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y+ D   G +++ V+
Sbjct: 229 YKQDKHYGDESYNVI 243



 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 KEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+++RG +E  IE
Sbjct: 311 NEDWGEKGLFRMVRGRDECSIE 332


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 101/193 (52%), Gaps = 8/193 (4%)

Query: 151 RGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKG-LPRNFDAREKWPECPSLRHIADQ 209
           R  +  DIE  F   +E  + +   ++T+   +    +PR+FDAR  W  C ++R I D+
Sbjct: 52  RFRSSKDIEKMFRKYIEIENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDE 111

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC--TPNCWGCNGGWPQLAWRFW 267
           S C + WA++  ++ISDR+CI SNG  + Q+SA+  ++C  +P   GC  G       +W
Sbjct: 112 SLCRADWAIATVDSISDRICIRSNGRISVQLSARDAISCGFSP---GCFHGSEVEVLVYW 168

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              G+VTGG Y  Q GCQPY L  C +H +    +C      + P+C   C +  Y  TY
Sbjct: 169 ITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN-NNTFEFPQCTNECQD-GYNKTY 226

Query: 328 RFDLKKGKKAHMV 340
             D   G++ + V
Sbjct: 227 DDDKFYGERIYNV 239



 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 54/86 (62%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLV 133
           +  ++I  +GP++A  SV  DFL YKSGVY       ++G   +R++GWG E  IPYWL 
Sbjct: 245 DIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLC 304

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           ANSWN+ WGD+G  KI RG     IE
Sbjct: 305 ANSWNEEWGDNGYVKIQRGVQAGYIE 330


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWPECPSLR I DQ  CGSCWAVS A+A++DR C+ S G       +  ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW+FW   G+ +GG  NS++GC PY           P+  C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233

Query: 306 LGKLK-TPECKQNC 318
            G+ + TP+C   C
Sbjct: 234 PGEDEDTPKCSNKC 247



 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ +P      M +I+ +GP+ A F  Y D   YKSG+Y+H +G   G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN + YWLVANSW   WG++G FK++RGEN   IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWPECPSLR I DQ  CGSCWAVS A+A++DR C+ S G       +  ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW+FW   G+ +GG  NS++GC PY           P+  C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233

Query: 306 LGKLK-TPECKQNC 318
            G+ + TP+C   C
Sbjct: 234 PGEDEDTPKCSNKC 247



 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ +P      M +I+ +GP+ A F  Y D   YKSG+Y+H +G   G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN + YWLVANSW   WG++G FK++RGEN   IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 56/134 (41%), Positives = 81/134 (60%), Gaps = 2/134 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR+KW +C ++  + DQ +CGSCWA   ++A +DRLCIA++G F   +S + + 
Sbjct: 85  IPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
            C   C +GC+GG+P  AW  +  +G+VTGG+Y S EGCQPY + PC     G    C+ 
Sbjct: 145 FCCHKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGN-NTCSG 203

Query: 306 LGKLKTPECKQNCY 319
               K   C + CY
Sbjct: 204 KPTEKNHRCTRMCY 217



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 60/98 (61%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLG 121
           HY + A+ +        +  +GP+ A F VY DF  YKSGVY +      +G HAV+++G
Sbjct: 229 HYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIG 288

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WG E  +PYWL+ NSWND WGD G FKI RG NE  I+
Sbjct: 289 WGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGID 326


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 15/156 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +W  C S+  I DQ+ CGSCWA S A  ISDR+CIA+ G     IS   ++
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           AC  N    GC G +P  A+R+W   GVVTGGD+    GC+PY  APC   +  P +   
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFRGS-GCRPYPFAPC---ISCPEE--- 193

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                KTP C  +C    Y + Y  D + G  A+ V
Sbjct: 194 -----KTPTCSLSC-QFGYSTAYAKDKRFGVSAYAV 223



 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 64/95 (67%), Gaps = 2/95 (2%)

Query: 67  KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
            A+ V R  A  Q  I  +GP+V  F++Y D  +YKSGVY+H  G  +G HA++++GWG 
Sbjct: 219 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 278

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +N IPYWL+ANSW  +WG++G  K+ RG NE  IE
Sbjct: 279 QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIE 313


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 79/134 (58%), Gaps = 13/134 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAREKWPECPSLR I DQ  CGSCWAVS A+A++DR C+ S G       +  ++
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW+FW   G+ +GG  NS++GC PY           P+  C +
Sbjct: 185 SCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPY-----------PIGECRI 233

Query: 306 LGKLK-TPECKQNC 318
            G+ + TP+C   C
Sbjct: 234 PGEDEDTPKCSNKC 247



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY + A+ +P      M +I+ +GP+ A F  Y D   YKSG+Y+H +G   G HAV++L
Sbjct: 261 HYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLL 320

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWGVEN + YWLVANSW   WG++G FKI+RGEN   IE
Sbjct: 321 GWGVENGVKYWLVANSWGREWGENGFFKIVRGENHCGIE 359


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L   FDA E WP+CP++  I DQS+CGSCWAV+ A+AISDR C    G    +ISA  ++
Sbjct: 92  LQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLM 150

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
           +C   C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+
Sbjct: 151 SCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203

Query: 305 LLGKLKTPECKQNC 318
             G+  TP C   C
Sbjct: 204 --GEYDTPTCNSTC 215



 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 54/82 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADFL Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG +G F I RG +E  IE
Sbjct: 301 NREWGMNGYFLIARGVDECGIE 322


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 52/99 (52%), Positives = 74/99 (74%), Gaps = 2/99 (2%)

Query: 63  HYFKKAH-MVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K  + ++ +C+A++  IY++GP+ + F VYADF  YKSGVYQ +    +G+HA+++L
Sbjct: 222 HFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIKIL 281

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+ +PYWLVANSWN  WGD G FKILRG++E  IE
Sbjct: 282 GWGTEDGVPYWLVANSWNVGWGDKGYFKILRGKDECGIE 320



 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 80/155 (51%), Gaps = 12/155 (7%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +F  RE W  C S+R I DQS CGSCWA + A +ISDR+CI +NG     ISA+ ++A
Sbjct: 88  PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147

Query: 248 CTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           C   C  GC+G     +        +V      +++GCQPY+L PC       + NCT  
Sbjct: 148 CCHTCGHGCDGRCHCSSVAILQGRRLVP-EPVRTEDGCQPYSLPPC-------VPNCT-- 197

Query: 307 GKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
               TP+C+  C    YE +Y  D    K  + +L
Sbjct: 198 HPEPTPKCQHVC-RKGYEKSYEEDKHFAKNVYRLL 231


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/148 (41%), Positives = 86/148 (58%), Gaps = 8/148 (5%)

Query: 185 KGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           K LP +F+A E++ EC S + HI DQS CGSCWA +   A +DRLCI S G FT  +S  
Sbjct: 137 KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPG 196

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ 297
           ++ AC+    GC+GG    AW++    GVVTGGDY+++      +GC PY + PC H+  
Sbjct: 197 NVAACSKTS-GCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTN 255

Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYES 325
             L       K   P C+++C N  Y++
Sbjct: 256 STLYPKCPKTKYDFPTCQESCPNKKYDT 283



 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/76 (50%), Positives = 54/76 (71%), Gaps = 4/76 (5%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A + VY DFL YKSGVY+    +++G HAV+++GWG +    YWLV NSW
Sbjct: 308 KEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED----YWLVVNSW 363

Query: 138 NDHWGDHGTFKILRGE 153
           N +WGD+G FKI  G+
Sbjct: 364 NKNWGDNGMFKIGCGQ 379


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 87/155 (56%), Gaps = 11/155 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW +CPSL  I +Q  CGSCWA+S A+A++DR CI S G       A  ++
Sbjct: 87  LPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDML 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           AC   C  GC GG+   AW+FW   GV +GG YNS++GC PY +  C+   +        
Sbjct: 147 ACCHACGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGE-------- 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             +  TP+C + C +    +    D + G+ A+ +
Sbjct: 199 --EADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231



 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y + A+ +P      M +IY +GP+ A F  Y D   YKSGVY+H +G   G HAV+++G
Sbjct: 224 YGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMG 283

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGVEN + YWLVANSW D WGD+G FKI+RGEN   IE
Sbjct: 284 WGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIE 321


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 71/106 (66%), Gaps = 2/106 (1%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK + +         +IY++GP+ A FSVYADF  YKSGVYQ +  + +G HA+R+L
Sbjct: 81  HFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRIL 140

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           GWG E+ +PYWLVANSWN+ WGD G FKI RG +E  IE   N  +
Sbjct: 141 GWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIEDDINAGI 186



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 54/87 (62%), Gaps = 3/87 (3%)

Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPE 313
           GCNGG+P  AW+F+    +VTGG Y +++GCQPY   PCEHH  GPL NCT  G   TPE
Sbjct: 6   GCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHTVGPLPNCT--GIKPTPE 63

Query: 314 CKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C + C    Y+ +Y  D   GKK + +
Sbjct: 64  CAKTC-REGYQKSYTRDKHFGKKVYSI 89


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 87/155 (56%), Gaps = 11/155 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW +CPSL  I +Q  CGSCWA+S A+A++DR CI S G       A  ++
Sbjct: 87  LPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDML 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           AC   C  GC GG+   AW+FW   GV +GG YNS++GC PY +  C+   +        
Sbjct: 147 ACCHACGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGE-------- 198

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             +  TP+C + C +    +    D + G+ A+ +
Sbjct: 199 --EADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSI 231



 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 52/98 (53%), Positives = 66/98 (67%), Gaps = 2/98 (2%)

Query: 64  YFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           Y + A+ +P      M +IY +GP+ A F  Y D   YKSGVY+H +G   G HAV+++G
Sbjct: 224 YGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMG 283

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           WGVEN + YWLVANSW D WGD+G FKI+RGEN   IE
Sbjct: 284 WGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIE 321


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 85/156 (54%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R +W EC S++ I DQ+ CGSCWA   A  ISDR CI + G     IS   ++
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+P  A R+W   GVVTGGDY+   GC+PY +APC         NC 
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 197

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              + KTP C  +C    Y + Y  D   G  A+ V
Sbjct: 198 ---ESKTPSCSMSC-QSGYSTAYAKDKHFGVSAYAV 229



 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/99 (51%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ VP+  A  Q  IY +GP+ A FSVY DF +YKSGVY+H  G  +G HA++++
Sbjct: 221 HFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKII 280

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+  PYWLVANSW  +WG+ G FKI RG+++  IE
Sbjct: 281 GWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIE 319


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 65/103 (63%), Gaps = 1/103 (0%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FD+RE W  C  +  I DQ NCGSCWA     A +DRLC+++ G F   +S + +  C  
Sbjct: 89  FDSRENWKSCKQIGRIRDQGNCGSCWAFGTTGAFADRLCVSTGGKFNELLSPEDVAFCCQ 148

Query: 251 NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPC 292
           NC  GC GG+P  AW+++   GV TGGDY+S+EGC PY + PC
Sbjct: 149 NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPC 191



 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 68/111 (61%), Gaps = 2/111 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
           K  Y  T++   +  K  +++   N M Q + ++GP+ A F+++ D   YKSG+YQ    
Sbjct: 213 KTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPK 272

Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
              +  H+++++GWG EN +PYWL  NSW+  WG+ GTF+I++G NE  IE
Sbjct: 273 AKFLSGHSIKIIGWGKENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIE 323


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 54/108 (50%), Positives = 72/108 (66%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y P+     HY K ++ V     +   +IY++GP+   F+VY DF+ YK+GVYQH  G +
Sbjct: 130 YTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSA 189

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G HA+++LGWG EN IPYWL ANSWN  WG++G FKILRG N   IE
Sbjct: 190 LGGHAIKILGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIE 237



 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 53/125 (42%), Positives = 79/125 (63%), Gaps = 3/125 (2%)

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
           A   + A+SDR+CI SN   + ++SA+ +++C  +C  GCNGG+P  AW FW  +G+V+G
Sbjct: 25  AFGASEAMSDRICIHSNAKISVELSAEDLLSCCESCGMGCNGGYPSAAWDFWTKDGLVSG 84

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           G Y+S  GC+PYT+ PCEHHV G   +C+  G  +TP+C   C    Y  +Y+ D   GK
Sbjct: 85  GLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGG-ETPQCVYRC-EAGYTPSYKQDKHYGK 142

Query: 336 KAHMV 340
            ++ V
Sbjct: 143 TSYSV 147


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 85/156 (54%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  W EC S++ I DQ+ CGSCWA   A  ISDR CI + G     IS   ++
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+P  A R+W   GVVTGGDY+   GC+PY +APC         NC 
Sbjct: 182 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 234

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              + KTP C  +C    Y + Y  D   G  A+ V
Sbjct: 235 ---ESKTPSCSLSC-QSGYTTAYAKDKHFGTSAYAV 266



 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V R   +   +I  +GP+ A F+VY DF +YKSGVY+H  G ++G HA++++
Sbjct: 258 HFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 317

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+  PYWLVANSW + WG+ G F+I RG+++  IE
Sbjct: 318 GWGTESGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIE 356


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 51/85 (60%), Positives = 61/85 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A F+VY DF  YKSGVYQH  G  +G HA+++LGWGVE    YWLVANSWN
Sbjct: 265 EIMTNGPVEADFTVYEDFPTYKSGVYQHTTGGVLGGHAIKILGWGVEEGTKYWLVANSWN 324

Query: 139 DHWGDHGTFKILRGENEADIEMGFN 163
           + WGD+G FKILRG NE  IE   N
Sbjct: 325 NEWGDNGFFKILRGSNECGIESDIN 349



 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 49/83 (59%), Gaps = 4/83 (4%)

Query: 249 TPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
           TP C   CNGG+P  AW ++   G+VTGG +NS +GCQPY +  C+HHV G    C   G
Sbjct: 166 TPECKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQ--G 223

Query: 308 KLKTPECKQNCYNPSYESTYRFD 330
           +  TPECK  C   SY + Y  D
Sbjct: 224 EGPTPECKHKC-EASYSTPYEQD 245



 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 36/59 (61%), Gaps = 2/59 (3%)

Query: 260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
           P  AW ++   G+VTGG +NS +GCQPY +  C+HHV G    C   G+  TPECK  C
Sbjct: 117 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQ--GEGPTPECKHKC 173


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 56/113 (49%), Positives = 74/113 (65%), Gaps = 10/113 (8%)

Query: 57  TSIPLSH--YFKKA--HMVP------RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH 106
           T+IP+S   Y+ K+  H+ P        +  ++IY +GP+   FSVY DF+ YKSGVY H
Sbjct: 194 TNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSH 253

Query: 107 NFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             G  +G HA++++GWGVE  + YWLVANSW+  WG  GTFKILRG NE  IE
Sbjct: 254 KTGSFLGGHAIKIIGWGVEGGVDYWLVANSWSTDWGIDGTFKILRGHNECGIE 306



 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 66/133 (49%), Gaps = 26/133 (19%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP NFD+R++W +C  +  I +Q  CGSCWA S + ++SDR CIASNG     +S Q +
Sbjct: 85  GLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDM 142

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           V+C  N  GC+GG    AW +  + G+V        + C PY                 +
Sbjct: 143 VSCDYNDMGCDGGNLDNAWWWMKNKGIVP-------DSCMPY-----------------V 178

Query: 306 LGKLKTPECKQNC 318
            G    P C  NC
Sbjct: 179 SGGGNVPACPSNC 191


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 12/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L   FDA E WP CP++  I DQS+CGSCWAV+ A+A+SDR C    G    +ISA  ++
Sbjct: 92  LEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLM 150

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
           +C   C +GCNGG+P++AW F+  +G+V+       E CQPY    C HHV    L  C+
Sbjct: 151 SCCDVCGYGCNGGFPEVAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCS 203

Query: 305 LLGKLKTPECKQNC 318
             G  KTP+C   C
Sbjct: 204 --GDYKTPKCNSTC 215



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 54/82 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    F VYADF+ Y  GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGELNGEPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG +G F I RG NE  IE
Sbjct: 301 NHEWGMNGYFLIARGVNECGIE 322


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 14/184 (7%)

Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
           R    S+E+D+ +       + +P +FDAR+KWP C  +  + DQS+CGS   +  A   
Sbjct: 77  RSHEQSTENDNSQVF-----EEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIA 131

Query: 225 SDRLCIASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGD 277
           SDR CI SNG F   +SAQ  ++C         + WGC+G WP+   ++W  +G+ TGG+
Sbjct: 132 SDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGN 191

Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKK 336
           Y+ Q GC+PYT+ PC+        +    G   TP C++ C  N ++  +Y+ D   GK 
Sbjct: 192 YDDQFGCKPYTIYPCDKKYPNGTTSVPCPG-YHTPVCEERCTSNITWPISYKQDKHFGKA 250

Query: 337 AHMV 340
            + V
Sbjct: 251 HYNV 254



 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 62/107 (57%), Gaps = 3/107 (2%)

Query: 56  PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
           P S     +F KAH        +   +I  +GP++A F +Y DF  YKSG+Y H  GD  
Sbjct: 238 PISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQE 297

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G    +++GWGV+N +PYWL  + W   +G++G  +ILRG NE +IE
Sbjct: 298 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIE 344


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C G   G     AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243



 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 62/82 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+++RG +E  IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 13/195 (6%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C G   G     AW +
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 170

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYEST 326
           W   G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ 
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQTC-QKGYKTP 228

Query: 327 YRFDLKKGKKAHMVL 341
           Y  D   G + + V+
Sbjct: 229 YEQDKHYGDQRYNVI 243



 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 62/82 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 251 REIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+++RG +E  IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 96/179 (53%), Gaps = 10/179 (5%)

Query: 171 SEDDDLETMGCQNA-KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           S D+  E  G  N    +P +FD+R+KWP C  +  + DQS+CGS   +      SDR C
Sbjct: 75  SNDEVSEKTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTC 134

Query: 230 IASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           IASNG F   +SAQ  ++C         + WGC+G WP+   ++W  +G+ TGG+YN Q 
Sbjct: 135 IASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQF 194

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           GC+PY++ PC+        +    G   TP C+++C  N ++   Y+ D   GK  + V
Sbjct: 195 GCKPYSIYPCDKKYANGTTSVPCPG-YHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNV 252



 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 3/107 (2%)

Query: 56  PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
           P +     +F KAH        +   +I  +GP++A F +Y DF  YK+G+Y H  GD  
Sbjct: 236 PIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQE 295

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G    +++GWGV+N +PYWL  + W   +G++G  + LRG NE +IE
Sbjct: 296 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIE 342


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 80/134 (59%), Gaps = 12/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L   FDA E WP+CP++  I DQS+CGSCWAV+ A+A+SDR C    G    +ISA  ++
Sbjct: 92  LQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLM 150

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCT 304
           +C   C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+
Sbjct: 151 SCCDVCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS 203

Query: 305 LLGKLKTPECKQNC 318
             G+  TP C   C
Sbjct: 204 --GEYDTPTCNSTC 215



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 54/82 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADF+ Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG +G F I RG +E  IE
Sbjct: 301 NHEWGMNGYFLIARGVDECGIE 322


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/161 (41%), Positives = 86/161 (53%), Gaps = 21/161 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR KWP+C S++ I +Q+NCGSCWA   A  ISDR+CIA+ G     IS   +V
Sbjct: 87  IPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMV 146

Query: 247 ACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL---APCEHHVQGPLQ 301
            C    C +GC+GG+   A R+W  +GVVTGGDY   +GC+PY     A C   V     
Sbjct: 147 DCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQG-DGCKPYQFCNSAGCPDAV----- 200

Query: 302 NCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVLM 342
                    TPEC  +C    Y + Y  D   G  A+ V M
Sbjct: 201 ---------TPECALSC-QSKYNTEYAKDKNFGTSAYYVGM 231



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/103 (48%), Positives = 67/103 (65%), Gaps = 5/103 (4%)

Query: 75  NAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
           NA++  I  +GP+ A F VY DF +YKSGVY++  G  +G HA++++GWG EN   YWL+
Sbjct: 234 NAIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGTENGTAYWLI 293

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
           ANSW   WG++G FKI RG NE  IE    N V A  ++ D L
Sbjct: 294 ANSWGTKWGENGFFKIRRGVNECGIE----NNVVAGKADVDTL 332


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  W EC S++ I +Q+ CGSCWA   A  ISDR CI + G     IS   ++
Sbjct: 86  IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+P  A R+W   GVVTGGDY+   GC+PY +APC         +C 
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GSCP 198

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              + KTP C  +C  P Y + Y  D   G  A+ V
Sbjct: 199 ---ESKTPACSLSC-QPGYTTAYAKDKHFGTSAYAV 230



 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +   +   +I  +GP+ A F+VY DF +YKSGVY+H  G ++G HA++++
Sbjct: 222 HFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+  PYWLVANSW   WG+ G FKI RG+++  IE
Sbjct: 282 GWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIE 320


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 71/127 (55%), Gaps = 9/127 (7%)

Query: 182 QNAKGLPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
           +  K LP +FDAR  +P+C   + H+ DQS CG CWA  V  A +DRLCI SNG FT  +
Sbjct: 135 EELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKLL 194

Query: 241 SAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPC 292
           SA  + AC P+    GC GG+P  AW +    G+ TGGDY         +GC PY   PC
Sbjct: 195 SAGEMNACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPC 254

Query: 293 EHHVQGP 299
            H  + P
Sbjct: 255 AHFFKDP 261



 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 49/109 (44%), Positives = 65/109 (59%), Gaps = 8/109 (7%)

Query: 52  RLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           R ++  S+P  ++F         +A   I   GP+ A F VY DFL YKSGVY+H  G  
Sbjct: 291 RYFMVESVP--YHFSAD------DAKNAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSL 342

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           +G HAV+++GWG +    YWLV NSWN+ WGDHG FKI  G+   D E+
Sbjct: 343 LGAHAVKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIALGDCGIDNEL 391


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 80/134 (59%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYCTV-GGVQQLRISAAHLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C +GC+GG+P  AW ++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCKDCGYGCDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-DFHTPKCNTTC 214



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 60/82 (73%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F VY+DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 240 RELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G F ILRG++E  IE
Sbjct: 300 DTDWGMNGHFLILRGKDECGIE 321


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/130 (45%), Positives = 79/130 (60%), Gaps = 12/130 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDA E WPECP++  I DQS+CGSCWAV+ A+AISDR C    G    +ISA  +++C  
Sbjct: 1   FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59

Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
            C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+  G+
Sbjct: 60  VCGFGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110

Query: 309 LKTPECKQNC 318
             TP C   C
Sbjct: 111 YDTPTCNSTC 120



 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 33/61 (54%), Positives = 42/61 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADF+ Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 146 RELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 205

Query: 138 N 138
           N
Sbjct: 206 N 206


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 69/99 (69%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V R  A  Q  I  +GP+ A F+VY DF +YKSGVY+H  G ++G HA++++
Sbjct: 222 HFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+  PYWLVANSW  +WG+ G FKILRG+++  IE
Sbjct: 282 GWGTESGSPYWLVANSWGTNWGESGFFKILRGDDQCGIE 320



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R +W EC S++ I +Q+ CGSCWA   A  ISDR CI + G     IS   ++
Sbjct: 86  IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+P  A R+W   GVVTGGDY+   GC+PY +APC         NC 
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GNCP 198

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              + KTP C  +C    Y + Y  D   G  A+ V
Sbjct: 199 ---ESKTPACSLSC-QSGYSTAYAKDKHFGASAYAV 230


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 80/134 (59%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C  GC+GG+P  AWR++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCKDCGDGCDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-DFHTPKCNTTC 214



 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 58/83 (69%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F V++DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIEM 160
           +  WG +G F  LRG NE  IE 
Sbjct: 301 DTDWGMNGHFLFLRGNNECGIEF 323


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 65/144 (45%), Positives = 80/144 (55%), Gaps = 6/144 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
           LP  FDARE WPEC ++  I DQS CGSCWA +   A+SDR+CI SN      Q+SA  +
Sbjct: 86  LPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDL 145

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNC 303
           +AC   C +GC GGW  +AW +W  NG+VTGG+Y     C PY   PC HH  +G     
Sbjct: 146 LACCTTCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPP 205

Query: 304 TLLGKLKTPECKQNC---YNPSYE 324
                  TP+C   C   Y   YE
Sbjct: 206 CPEKMYSTPQCVSECQKGYATKYE 229



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I+  GP+ A  +VY DF  Y  GVY+H  G+ +G HA+R+LGWGVE D  PYWL ANS
Sbjct: 250 KEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANS 309

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
           WN  WG+ G F+ILRG +   IE   +  +  N
Sbjct: 310 WNPSWGEKGFFRILRGSDHCGIESDVSAGLPVN 342


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 97/187 (51%), Gaps = 18/187 (9%)

Query: 158 IEMGFNNRVEANSSE-DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCW 216
           +++ F    E+  SE  DDLE    +    LP  FD+R +WP C S++ I DQ+ CGSCW
Sbjct: 58  MDVRFAEVPESEKSEKSDDLEF---ETLIQLPTAFDSRVQWPNCNSIKLIRDQTYCGSCW 114

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVT 274
           A + A  ISDR+CI SNG     IS + I++C  +    GC GG+   A ++W ++GVVT
Sbjct: 115 AFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVT 174

Query: 275 GGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKG 334
           GGDY    GC PY+  PC           T       P CK  C   SY++   + L   
Sbjct: 175 GGDYQGA-GCIPYSFRPCS----------TCKEPKDAPSCKTTC-QASYKAKSAYRLPTT 222

Query: 335 KKAHMVL 341
             ++ ++
Sbjct: 223 TSSNAIV 229



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/112 (43%), Positives = 66/112 (58%), Gaps = 4/112 (3%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN 107
           K K    LPT+   +     A  + +     +IY +GP+   + VY DF  YKSGVY H 
Sbjct: 212 KAKSAYRLPTTTSSNAIVANAVQMIQ----TEIYNNGPVEVAYQVYDDFYHYKSGVYYHV 267

Query: 108 FGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GD    HAV+++GWG E  + YWLVANSW+  +G++G FKI RG NE  IE
Sbjct: 268 YGDKPSGHAVKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIE 319


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 7/149 (4%)

Query: 152 GENEADIE--MGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIAD 208
           G N  DI+  +GF   +  +   D  ++T   + AK +P +FDAREKWPEC   +  I D
Sbjct: 42  GTNSLDIKSRLGF---LGLHPDPDYKIQTKHHKIAKSIPESFDAREKWPECKDVIGKIRD 98

Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
           Q  CGSCWA +    ++DRLCI + G      S ++++ C  +C   C GG+   AW ++
Sbjct: 99  QGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLTCCEDCRLECVGGYTAKAWDYY 158

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
            + G+V+GGDYNS EGCQPY+ A  ++ V
Sbjct: 159 INEGIVSGGDYNSSEGCQPYSKASFQYAV 187



 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 10/82 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP++A F+V+ D + YKSG+   N         V +L WG E  +PYWL+ANSW 
Sbjct: 227 EILTNGPVMATFNVFEDIIYYKSGIQLSN---------VSILRWGTEEGVPYWLIANSWG 277

Query: 139 DHWGDHGTF-KILRGENEADIE 159
             WGD G F KI RG NE  IE
Sbjct: 278 TWWGDLGGFIKIKRGTNECAIE 299


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 50/83 (60%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F VY DFL YKSGVY H+ G  +G HA+R+LGWG EN + YWL+ANS
Sbjct: 130 MKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANS 189

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WND WG+ G FK+LRG+NE  IE
Sbjct: 190 WNDGWGEDGYFKMLRGKNECGIE 212



 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 63/125 (50%), Gaps = 4/125 (3%)

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
           A     A+SDRLCI +NG FT +ISA  +++C   C +GC GG+P  AW FW   G+VTG
Sbjct: 1   AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFGCQGGFPPTAWDFWQTEGIVTG 60

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           G   +  GC+ Y    C HH       C+      TP C Q C  P  ++ Y  D  +  
Sbjct: 61  GSKENPTGCRSYPFPRCSHHGSKKYPPCSHR-IYDTPNCVQKCDTP--DTDYATDKTRAN 117

Query: 336 KAHMV 340
             + V
Sbjct: 118 ITYNV 122


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 65/86 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   V++DFL YKSGVY+H  G  + +H+VR++GWG+ENDIPYWL ANSW
Sbjct: 157 KEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCANSW 216

Query: 138 NDHWGDHGTFKILRGENEADIEMGFN 163
           N+ WG +G FKILRG NE +IE   N
Sbjct: 217 NEDWGLNGYFKILRGSNECEIESFVN 242



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 6/123 (4%)

Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
           WAV+ A +ISDR CI +NG    Q+SA  +++C+ N  GC  G+ + +W +W  NG+VTG
Sbjct: 30  WAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLGCQIGFSEFSWDYWLKNGLVTG 89

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
                  GC PY    C+H        C  +     P C + C    Y   Y+ D   G+
Sbjct: 90  ----DPTGCLPYPFPKCDHRSSNSYPKCGYI-TYTAPPCTKTC-RSGYPIPYKADKHYGR 143

Query: 336 KAH 338
             +
Sbjct: 144 VIY 146


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 65/144 (45%), Positives = 80/144 (55%), Gaps = 6/144 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
           LP  FDARE WPEC ++  I DQS CGSCWA +   A+SDR+CI SN      Q+SA  +
Sbjct: 86  LPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDL 145

Query: 246 VACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNC 303
           +AC   C +GC GGW  +AW +W  NG+VTGG+Y     C PY   PC HH  +G     
Sbjct: 146 LACCTTCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPP 205

Query: 304 TLLGKLKTPECKQNC---YNPSYE 324
                  TP+C   C   Y   YE
Sbjct: 206 CPEKMYSTPQCVSECQKGYATKYE 229



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           ++I+  GP+ A  +VY DF  Y  GVY+H  G+ +G HA+R+LGWGVE D  PYWL ANS
Sbjct: 250 KEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANS 309

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEAN 169
           WN  WG+ G F+ILRG +   IE   +  +  N
Sbjct: 310 WNPSWGEKGFFRILRGSDHCGIESDVSAGLPVN 342


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 82/134 (61%), Gaps = 2/134 (1%)

Query: 37  KKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYAD 95
           K+  K     KK ++   +P +  L H      +    + +RQ IY +GP+   F+VY D
Sbjct: 49  KEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYED 108

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDHWGDHGTFKILRGEN 154
           F+ Y++GVY+H  G ++G HA+R+LGWGV+N +IPYWLVANSWN  WG  G FKILRG +
Sbjct: 109 FIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSD 168

Query: 155 EADIEMGFNNRVEA 168
           E  IE   N  + A
Sbjct: 169 ECGIEGQINAGLPA 182



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/70 (45%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           G+V+GG Y S  GC PY +APCEHHV G    C   G  KTP C + C    Y+  Y  D
Sbjct: 16  GIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGG--KTPTCVKKC-EEGYKVPYAQD 72

Query: 331 LKKGKKAHMV 340
           L  GK A+ +
Sbjct: 73  LHHGKSAYSI 82


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 58/130 (44%), Positives = 79/130 (60%), Gaps = 12/130 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDA E WP+CP++  I DQS+CGSCWAV+ A+AISDR C    G    +ISA  +++C  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59

Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
            C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+  G+
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110

Query: 309 LKTPECKQNC 318
             TP C   C
Sbjct: 111 YDTPTCNSTC 120



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 42/61 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADFL Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIANSW 205

Query: 138 N 138
           N
Sbjct: 206 N 206


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 58/130 (44%), Positives = 79/130 (60%), Gaps = 12/130 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDA E WP+CP++  I DQS+CGSCWAV+ A+AISDR C    G    +ISA  +++C  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTL-GGVRDLRISAGDLMSCCD 59

Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
            C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+  G+
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110

Query: 309 LKTPECKQNC 318
             TP C   C
Sbjct: 111 YDTPTCNSTC 120



 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 42/61 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADFL Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIANSW 205

Query: 138 N 138
           N
Sbjct: 206 N 206


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 57/154 (37%), Positives = 81/154 (52%), Gaps = 6/154 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD+RE+W +CPS+  I DQS C S WAV+ A +ISDR CI +NG    Q+SA  ++
Sbjct: 84  LPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAIELI 143

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           +C+ N  GC  G+ + +W +W  NG+VTG       GC PY    C+H        C  +
Sbjct: 144 SCSKNKLGCQIGFSEFSWDYWLKNGLVTG----DPTGCLPYPFPKCDHRSSNSYPKCGYI 199

Query: 307 GKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                P C + C    Y   Y+ D   G+  + +
Sbjct: 200 -TYTAPPCTKTC-RSGYPIPYKADKHYGRVIYSL 231



 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 65/86 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A   V++DFL YKSGVY+H  G  + +H+VR++GWG+ENDIPYWL ANSW
Sbjct: 240 KEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIEMGFN 163
           N+ WG +G FKILRG NE +IE   N
Sbjct: 300 NEDWGLNGYFKILRGSNECEIESFVN 325


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 64/91 (70%)

Query: 69  HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
           + +P   AM++IYE+GP+ A F +Y DF+ Y+SGVY  N G  +   AV++LGWG EN  
Sbjct: 46  YRIPGYTAMKEIYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWGEENGT 105

Query: 129 PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           PYWL ANS+N +WGD+G  KILRG NE  IE
Sbjct: 106 PYWLAANSFNTYWGDNGFVKILRGANECYIE 136


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 80/156 (51%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  W  C S+ ++ DQS CGSCWAVS A+ +SDR+C+ + G     +S   I+
Sbjct: 94  IPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDIL 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C       GC GG+  LAW +    GVVTGG Y  +  C+PY   PC  H  G   +C 
Sbjct: 154 SCCGRMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLH-HGRRYDCP 212

Query: 305 LLGKLKTPECKQNC---YNPSYE-------STYRFD 330
                 TP CK  C   Y   YE       STY  D
Sbjct: 213 WDHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILD 248



 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 34/65 (52%), Positives = 44/65 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++ ++GP+ A F  Y DF  YK G+Y H  G   G HAV+++GWGVEN   YW VANSW
Sbjct: 256 REMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERGAHAVKLIGWGVENGTKYWTVANSW 315

Query: 138 NDHWG 142
           +D WG
Sbjct: 316 HDDWG 320


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 79/138 (57%), Gaps = 10/138 (7%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FD+R  WPEC  +   I DQS CGSCWA +   A+SDR+CI SN      +S+Q +
Sbjct: 81  IPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQDL 140

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS-QEGCQPYTLAPCEHHVQGPLQNCT 304
           + C     GCNGGWP +AW  W  NG+VTGG Y + ++GC+ Y L  C+ H       C 
Sbjct: 141 LTCG-TAGGCNGGWPAVAWSDW-TNGIVTGGLYGALEQGCKSYFLEGCDDHP----NKCR 194

Query: 305 LLGKLKTPECKQNCYNPS 322
               + TP C + C  PS
Sbjct: 195 --NYVSTPACVEQCDEPS 210



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A   VY DF QY+SG+YQ    +  G HAV++LGWGVE+ + YWLVANSWN
Sbjct: 235 EIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGVEDGVKYWLVANSWN 294

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WG++G F+I+RG +E  IE
Sbjct: 295 ERWGENGLFRIIRGRDEVGIE 315


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/140 (41%), Positives = 79/140 (56%), Gaps = 13/140 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + K LP  FDAR++W +C S++ I DQS CGSCWAVS A+ +SDR+CI S+     +ISA
Sbjct: 77  DGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISA 136

Query: 243 QHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG 298
             ++ C  +C     GC+GG P   +  W  +G V+GG+YNS  GC  Y L  C      
Sbjct: 137 ADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN----- 191

Query: 299 PLQNCTLLGKLKTPECKQNC 318
              +C  L     P CK+ C
Sbjct: 192 --PSCKTL--YDAPTCKKEC 207



 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/103 (52%), Positives = 73/103 (70%), Gaps = 7/103 (6%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--IGLHAV 117
           HY K+A+ +          +I ++GP+VA F+VYADF+ Y SGVY+ + G+S  +G HAV
Sbjct: 220 HYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFD-GESKLLGGHAV 278

Query: 118 RVLGWGVENDI-PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           R++GWG+EN   PYWLV+NSWN+ WGD G FKI RG+NE  IE
Sbjct: 279 RIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIE 321


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 79/131 (60%), Gaps = 7/131 (5%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGH 269
           CGSCWA     AISDR+CI +N   + ++SA+ ++ C  +  G  CNGG+P  AW FW  
Sbjct: 1   CGSCWAFGAVEAISDRICIHTN--VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 58

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
            G+V+GG Y S  GC+PY++ PCEHHV G    CT  G+  TP+C + C  P Y  TY+ 
Sbjct: 59  KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKCSKIC-EPGYSPTYKQ 115

Query: 330 DLKKGKKAHMV 340
           D   G  ++ V
Sbjct: 116 DKHYGYDSYSV 126



 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 48/87 (55%), Positives = 60/87 (68%), Gaps = 2/87 (2%)

Query: 54  YLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y PT     HY   ++ V     + M +IY++GP+   FSVY+DFL YKSGVYQH  G+ 
Sbjct: 109 YSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 168

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWN 138
           +G HA+R+LGWGVEN  PYWLVANSWN
Sbjct: 169 MGGHAIRILGWGVENGTPYWLVANSWN 195


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 60/81 (74%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  HGP+   F+VY DF QY +GVY H  G S+G HAV++LGWGV+N  PYWLVANSWN
Sbjct: 110 EILAHGPIEVAFTVYEDFYQYTTGVYVHTAGKSLGGHAVKILGWGVDNGTPYWLVANSWN 169

Query: 139 DHWGDHGTFKILRGENEADIE 159
            +WG+ G F+I+RG NE  IE
Sbjct: 170 VNWGEKGYFRIIRGLNECGIE 190



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 250 PNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL 309
           P+   C GG+P  AW++W  +G+VTGG Y SQ GC+PY++APC   V G           
Sbjct: 9   PSFSSCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTE 68

Query: 310 KTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
            TP+C + C  N +Y + Y  D   G  A+ V
Sbjct: 69  PTPKCVEACTSNNTYPTGYLQDKHFGATAYAV 100


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 51/115 (44%), Positives = 77/115 (66%), Gaps = 3/115 (2%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           K++  L  P S P   Y+ K+  + + +     R+I ++GP+VA F+VY DF  YKSG+Y
Sbjct: 222 KRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIY 281

Query: 105 QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +H  G+  G HAV+++GWG EN+  +WL+ANSW+  WG+ G F+I+RG+NE  IE
Sbjct: 282 KHTAGELRGYHAVKIIGWGKENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIE 336



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/168 (39%), Positives = 87/168 (51%), Gaps = 13/168 (7%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A  +P +FD R  W  C SL  I DQ+ CGSCWAVS A  +SDR+C+ SN      IS  
Sbjct: 81  ALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDT 139

Query: 244 HIVACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQ--- 297
            I++C    C +GCNGG+P  AWR +   G  TGG    + GC+PY    P   H++   
Sbjct: 140 DILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRND 199

Query: 298 -GPLQNCTL----LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
             P  N T     +G   TP CK+ C    Y  +Y  D   GK A++V
Sbjct: 200 YAPCPNDTYYGECVGMADTPRCKRRCL-LGYPKSYPSDRYYGKSAYIV 246


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 69/174 (39%), Positives = 96/174 (55%), Gaps = 9/174 (5%)

Query: 171 SEDDDLETMGCQ-NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           SE D L T     + + LP ++D  + W EC S+  I DQSNCGSCWA+S A+A SDRLC
Sbjct: 69  SEKDILLTYDVSIDLESLPESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLC 128

Query: 230 IASNGYFTGQISAQHIVACTPNCWGCNGGW--PQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           I SN      +S ++I +C     G       P+ AW++   NG+ TGG+Y S EGCQPY
Sbjct: 129 ITSNMGVNKVLSGEYINSCCNGKCGNGCNGGHPEKAWKYIKKNGLCTGGEYGSNEGCQPY 188

Query: 288 TLAPCEHHVQGPLQNCTLLGKLKTPEC-KQNCYNPSYESTYRFDLKKGKKAHMV 340
           ++ PC  +      +C+   +  TP+C K  C N +YE+    DL    K + V
Sbjct: 189 SIVPCPRNA----NSCSKENE-DTPQCYKDQCTNNNYETPLVSDLYYAYKVYSV 237



 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/83 (50%), Positives = 57/83 (68%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +++++GP+VA   VY DFL YK G+YQ+  G   G HAV+++GWG ++ I YWL AN+
Sbjct: 245 MSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWGEDDGIDYWLCANT 304

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W + WG  G FKI RG NE  IE
Sbjct: 305 WGNSWGMGGMFKIRRGRNECGIE 327


>gi|161343847|tpg|DAA06104.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 187

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 1/101 (0%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDAR KW EC S+ HI +Q NC + WA+SV +AI+DR+CI S    T   S Q ++
Sbjct: 87  MPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKML 146

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
           +C  +C  GCNGG+   AW++W   G+VTGGDY S EGCQP
Sbjct: 147 SCCDDCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQP 187


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 77/139 (55%), Gaps = 13/139 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 91  LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG+P  AWR++   G+       +   CQPY    CEH  QG   N T 
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200

Query: 306 LGK--LKTPECKQNCYNPS 322
             K    TP+C   C + S
Sbjct: 201 CSKYNFDTPKCNATCTDKS 219



 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY++  GD +G  AV+V+GWG  N  PYW VANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKLNGTPYWKVANSW 301

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    E +
Sbjct: 302 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 334


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 72/120 (60%), Gaps = 2/120 (1%)

Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGG 258
           CPSL+ I DQ+NCGSCWA     A++DR+CIASNG  T  +SAQ + +C      GCNGG
Sbjct: 1   CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKLGDMGCNGG 60

Query: 259 WPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
            P   + +W  +G+V GG+Y  + GC  Y L PC HHV    +      +++ P+C + C
Sbjct: 61  IPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSS-KYPACPDEVRAPKCARKC 119



 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANSW 137
            IY++GP+  +F V  DFL YKSGVY+       +G HA++++G+G E+   YWLVANSW
Sbjct: 156 DIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVANSW 215

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WGD G FKI+RG+N   IE
Sbjct: 216 NEDWGDDGYFKIIRGKNACQIE 237


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 90  KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 149

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 150 NEDWGEKGLFRIVRGRDECSIE 171



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 2/71 (2%)

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           G+VTGG   +  GCQPY    CEH  +G    C      KTP+CKQ C    Y++ Y  D
Sbjct: 14  GIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACG-TKIYKTPQCKQKC-QKGYKTPYEQD 71

Query: 331 LKKGKKAHMVL 341
              G + + V+
Sbjct: 72  KNYGDQRYNVI 82


>gi|325303156|tpg|DAA34330.1| TPA_inf: cysteine proteinase cathepsin L [Amblyomma variegatum]
          Length = 207

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 4/105 (3%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG---YFTGQIS 241
             LP NFDARE+WP+CP++  I DQ +CGSCWA     A+SDR CI S          ++
Sbjct: 103 TALPENFDAREQWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPARKPRVNVHLA 162

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ 285
           A  +++C  +C  GCNGG+P  AW +W H+G+V GG Y++ EGC 
Sbjct: 163 ADDVLSCCKDCGAGCNGGFPGAAWSYWVHHGIVDGGHYDTDEGCM 207


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/156 (37%), Positives = 83/156 (53%), Gaps = 11/156 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR+KWP+CPSLR I +Q  CGSCWA+S A A +DR CI S  + T    +  ++
Sbjct: 98  LPEQFDARDKWPQCPSLREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLI 157

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW +W   GV +GG YNS++GC  Y    C      P ++   
Sbjct: 158 SCCHSCGDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTC----HSPDED--- 210

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
                 P+C + C +         D + G+ A+ V+
Sbjct: 211 ---DDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSVV 243



 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/83 (57%), Positives = 59/83 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I+ +GP+ A F VY DF  YKSGVY+H  G   G HA+++LGWGVEN   YWL +NS
Sbjct: 250 MEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNS 309

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W + WGDHG FKI+RGEN   IE
Sbjct: 310 WGEDWGDHGFFKIVRGENHLGIE 332


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/117 (47%), Positives = 72/117 (61%), Gaps = 4/117 (3%)

Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           SDR+CI + G     ISA+ ++ C  +C  GCNGG+P  AW+F+   G+VTGG Y +++G
Sbjct: 1   SDRICIHTKGKVQVNISAEDLLTCCDSCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDG 60

Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           CQPY   PCEHH  GPL NCT  G   TPEC + C    YE +Y  D   GKK + +
Sbjct: 61  CQPYYFPPCEHHTVGPLPNCT--GIKPTPECAKTC-REGYEKSYTRDKHFGKKVYSI 114



 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 2/106 (1%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ KK + +         +I ++GP+ A F+VYADF  YKSGVYQ +  + +G HA+R+L
Sbjct: 106 HFGKKVYSISSDETQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRIL 165

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRV 166
           GWG E+ +PYWLVANSWN+ WGD G FKI RG +E  IE   N  +
Sbjct: 166 GWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAGI 211


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score =  110 bits (275), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 49/83 (59%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I ++GP+   F+VYADF  YKSGVYQH  G+++G HA+++LGWG E+   YWLVANS
Sbjct: 1   MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHDYWLVANS 60

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WGD G FKILRG +E  IE
Sbjct: 61  WNEDWGDQGFFKILRGVDECGIE 83


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 97/179 (54%), Gaps = 10/179 (5%)

Query: 170 SSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLC 229
           S ED + E    +    +P +FD+R++WPEC  +  + DQS+CGS   +      SDR C
Sbjct: 75  SHEDQETEN-SAEVLINIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTC 133

Query: 230 IASNGYFTGQISAQHIVACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           I+SNG F   +SAQ  ++C         + WGC+G WP+   ++W  +G+ TGG+Y+ Q 
Sbjct: 134 ISSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQF 193

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
           GC+PY++ PC+ +      +    G   TP C+ +C  N ++   Y+ D   GK  + V
Sbjct: 194 GCKPYSIYPCDKNYPNGTTSVPCPG-YHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNV 251



 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 62/107 (57%), Gaps = 3/107 (2%)

Query: 56  PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
           P +     +F KAH        +   +I  +GP++A F +Y DF  YKSG+Y H  GD  
Sbjct: 235 PIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQE 294

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G    +++GWGV+N +PYWL  + W   +G++G  +ILRG NE +IE
Sbjct: 295 GGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIE 341


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 91  LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG+P  AWR++   G+       +   CQPY    CEH  QG   N T 
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200

Query: 306 LGK--LKTPECKQNC 318
             K    TP+C   C
Sbjct: 201 CSKYNFDTPKCNATC 215



 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY+H  GD +G  AV+V+GWG  N  PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 86/156 (55%), Gaps = 7/156 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR+FDAR  W  C ++R I D+S C + WA++  ++ISDR+CI SNG  + Q+SA+  +
Sbjct: 28  IPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAI 87

Query: 247 AC--TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C  +P   GC  G       +W   G+VTGG Y  Q GCQPY L  C +H +    +C 
Sbjct: 88  SCGFSP---GCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFLDCN 144

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                + P+C   C +  Y  TY  D   G++ + V
Sbjct: 145 -NNTFEFPQCTNECQD-GYNKTYDDDKFYGERIYNV 178



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVENDIPYWLV 133
           +  ++I  +GP++A  SV  DFL YKSGVY       ++G   +R++GWG E  IPYWL 
Sbjct: 184 DIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLC 243

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           ANSWN+ WG +G  KI RG     IE
Sbjct: 244 ANSWNEEWGANGYVKIQRGVQAGYIE 269


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 91  LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG+P  AWR++   G+       +   CQPY    CEH  QG   N T 
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200

Query: 306 LGK--LKTPECKQNC 318
             K    TP+C   C
Sbjct: 201 CSKYNFDTPKCNATC 215



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY+H  GD +G  AV+V+GWG  N  PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/108 (45%), Positives = 73/108 (67%), Gaps = 2/108 (1%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCN--AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           Y  T +   H+ + A+ VPR     + +++  GP+ A F+VY DF+QYKSGVY+H +G  
Sbjct: 263 YNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVR 322

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G H+V+++GWGVEN   +WL ANSW   WG++G FKI+RGE+   +E
Sbjct: 323 VGDHSVKIVGWGVENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVE 370



 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 78/156 (50%), Gaps = 12/156 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P +FDAR+KW  CPS+  I +Q  C S +AV+    I+DR CI S G       A  ++
Sbjct: 135 FPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDVL 194

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCE-HHVQGPLQNCT 304
           +C   C +GC+GG P   W +W  NG+ +GG Y S EGCQ Y    C+   +  P  +  
Sbjct: 195 SCCHRCGFGCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLI 254

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                    C + C  P Y +TY  D   G+ A+ V
Sbjct: 255 ---------CLRQC-QPGYNTTYLEDKHFGRVAYSV 280


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 75/135 (55%), Gaps = 13/135 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 91  LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK-QLRISAAHLL 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG+P  AWR++   G+       +   CQPY    CEH  QG   N T 
Sbjct: 150 SCCKDCGDGCKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEH--QGAQGNKTP 200

Query: 306 LGK--LKTPECKQNC 318
             K    TP+C   C
Sbjct: 201 CSKYNFDTPKCNATC 215



 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 62/93 (66%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY+H  GD +G  AV+V+GWG  N  PYW +ANSW
Sbjct: 242 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSW 301

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    EA+
Sbjct: 302 DTDWGMGGYLLILRGNNECNIEHLGFAGTPEAS 334


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 79/130 (60%), Gaps = 12/130 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDA E WP+CP++  I DQS+CGSCWAV+ A+A+SDR C    G    +ISA  +++C  
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTL-GGVRDLRISAGDLMSCCD 59

Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
            C +GCNGG+P++AW ++  +G+V+       E CQPY    C HHV    L  C+  G+
Sbjct: 60  VCGYGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCS--GE 110

Query: 309 LKTPECKQNC 318
             TP C   C
Sbjct: 111 YDTPTCNSTC 120



 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 33/61 (54%), Positives = 42/61 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++  +GP    FSVYADF+ Y  GVY+H  G  +G HAVR++GWG  N  PYW +ANSW
Sbjct: 146 RELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIANSW 205

Query: 138 N 138
           N
Sbjct: 206 N 206


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/166 (39%), Positives = 84/166 (50%), Gaps = 13/166 (7%)

Query: 172 EDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA 231
           EDD       +    +P  FDAR  WP+C S++ + DQSNCGSCWA   A  ISDR+CI 
Sbjct: 55  EDDSYVLRNQRILPSIPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIH 114

Query: 232 SNGYFTGQISAQHIVACTPNCWGCNGGWPQL--AWRFWGHNGVVTGGDYNSQEGCQPYTL 289
           SNG     ISA+ I+ C     G      Q   A +FW   G VTGGDY   +GC+PY+ 
Sbjct: 115 SNGKEQPVISAEDILTCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKG-DGCKPYSF 173

Query: 290 APCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           APC + V+             TP C+  C +    + Y+ D   GK
Sbjct: 174 APCSNCVESK----------TTPSCQSKCQSTYTVTNYKGDKHYGK 209



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 54/81 (66%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY++GP+   ++VY DF  YKSGVY H  G   G HAV+++GWG E  + YWLV NSW 
Sbjct: 242 EIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTNSWG 301

Query: 139 DHWGDHGTFKILRGENEADIE 159
             +GD G FKI RG NE  IE
Sbjct: 302 TSFGDKGFFKIRRGTNECGIE 322


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/86 (58%), Positives = 63/86 (73%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
           + M++I  +GP+   F VY DFL YKSGVY+H  G  +G HA+R++GWG+ +N IPYWL 
Sbjct: 174 SIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQNHIPYWLC 233

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           ANSWN+ WGD G FKILRG NE  IE
Sbjct: 234 ANSWNNQWGDQGYFKILRGTNECGIE 259



 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 2/125 (1%)

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTG 275
           A     ++SDR+CI S    + ++SA ++++C   C +GC GG P +AW +W + G+VTG
Sbjct: 45  AFGAVESMSDRICIHSKNKISVELSAINLLSCCTRCGFGCRGGIPGMAWDYWKYEGIVTG 104

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           G   +  GCQPY    C HH               TPEC + C +  Y   Y+ D   GK
Sbjct: 105 GSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQD-DYGKPYKKDKFYGK 163

Query: 336 KAHMV 340
            ++ V
Sbjct: 164 SSYNV 168


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 80/134 (59%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLM 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C +GC+GG+P  +W ++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCEDCGYGCDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-HFHTPKCNTTC 214



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V +F VY+DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 240 RELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G    LRG NE  IE
Sbjct: 300 DTDWGMNGHLLFLRGNNECGIE 321


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGVE   PYWL+ANSW
Sbjct: 125 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIANSW 184

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 185 NEDWGEKGLFRIVRGRDECSIE 206



 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 68/119 (57%), Gaps = 3/119 (2%)

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           ++DR+CI S G  + ++SA  +++C  +C  GC GG+P  AW +W   G+VTGG   +  
Sbjct: 1   MTDRICIQSGGQQSAELSALDLISCCEDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENHT 60

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           GCQPY    CEHH +G    C      KTP+CKQ C    Y++ Y  D   G +++ V+
Sbjct: 61  GCQPYPFPKCEHHTKGKYPACGTK-IYKTPQCKQTC-QKGYKTPYEQDKHYGDESYNVI 117


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 65/81 (80%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +++++GP+ A F+VYAD L YKSGVY+H  GD++G HA++++GWGVEN   YWL+ANSWN
Sbjct: 124 ELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNKYWLIANSWN 183

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G FKILRGE+   IE
Sbjct: 184 TDWGNNGFFKILRGEDHCGIE 204



 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 71/117 (60%), Gaps = 4/117 (3%)

Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
           +DR+C  SNG      SA+ +++C P C  GCNGG P LAW +W H G+V+GG+YNS +G
Sbjct: 1   TDRVCTYSNGTKHFHFSAEDLLSCCPICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQG 60

Query: 284 CQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           C PY + PCEHHV G    C   G  KTP+C + C N  Y   Y+ D + GK  + V
Sbjct: 61  CSPYVIPPCEHHVPGNRLPCN--GDTKTPKCSKTCEN-GYNVLYKKDKRYGKHVYAV 114


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLL 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C  GC+GG+P  AW ++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCKDCGDGCDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-DFHTPKCNTTC 214



 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 59/82 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F VY+DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G F ILRG NE  IE
Sbjct: 301 DTDWGMNGHFLILRGNNECGIE 322


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 85/156 (54%), Gaps = 13/156 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  W EC S++ I +Q+ CGSCWA   A  ISDR CI + G     IS   ++
Sbjct: 86  IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145

Query: 247 ACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +C   +C  GC GG+P  A R+W   GVVTGGDY+   GC+PY +APC         +C 
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGA-GCKPYPIAPCTS------GSCP 198

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
              + KTP C  +C    Y + Y  D   G  A+ V
Sbjct: 199 ---ESKTPACSLSC-QSGYTTAYAKDKHFGTSAYAV 230



 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 67/99 (67%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+ V +   +   +I  +GP+ A F+VY DF +YKSGVY+H  G ++G HA++++
Sbjct: 222 HFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKII 281

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG E+  PYWLVANSW   WG+ G FKI RG+++  IE
Sbjct: 282 GWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIE 320


>gi|149436731|ref|XP_001513125.1| PREDICTED: cathepsin B-like [Ornithorhynchus anatinus]
          Length = 211

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 68/99 (68%), Gaps = 2/99 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDAR++WP CP+++ I DQ +CGSCWA     AISDR+C+ +NG  + ++SA+ ++
Sbjct: 81  LPENFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCVHTNGQVSVEVSAEDLL 140

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEG 283
            C    C  GCNGG+P  AW +W   G+V+GG Y+S  G
Sbjct: 141 TCCGLECGMGCNGGYPTGAWTYWTKKGLVSGGLYDSHVG 179


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/123 (46%), Positives = 74/123 (60%), Gaps = 7/123 (5%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFWGH 269
           CGSCWA     AISDR CI +NG    ++SA+ ++ C    C  GCNGG+P  AW FW  
Sbjct: 1   CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60

Query: 270 NGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC---YNPSYEST 326
            G+V+GG Y+S  GC PYT+ PCEHHV G      + G+  TP C ++C   Y+PSY+  
Sbjct: 61  KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRP--PMHGEGDTPRCNKSCEAGYSPSYKED 118

Query: 327 YRF 329
             F
Sbjct: 119 KHF 121



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 36/59 (61%), Positives = 48/59 (81%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
           M +IY++GP+   F+V++DFL YKSGVY+H  GD +G HA+R+LGWGVEN +PYWL AN
Sbjct: 136 MAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 194


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 78/135 (57%), Gaps = 13/135 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP   DAR++WP+C  +  + DQ+NCGSCWAVS A+ ++DR+CI S       +S + +V
Sbjct: 84  LPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEELV 143

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C +GC+GG+P  A+ +W   G+ TGG Y S +GC+PY++                
Sbjct: 144 SCCKICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSED---------- 193

Query: 306 LGKLKTPECKQNCYN 320
             + +TP C + C N
Sbjct: 194 --EAETPLCTRQCIN 206



 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 64/83 (77%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+++Y++GP+V  F+VY DF+ Y  GVY+H FG  +G HAV+++GWG+EN   YWL++NS
Sbjct: 233 MQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNS 292

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WG++G FKI+RG+N   IE
Sbjct: 293 WNTTWGENGFFKIIRGKNCCAIE 315


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 79/135 (58%), Gaps = 9/135 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA   WP+CP+++ IADQS+CGSCWAV+ A A+SDR C+ + G     ISA  ++
Sbjct: 91  LPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-TGGVRDLGISAGDLL 149

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC+GG+P  AW ++  +G+V+  DY     CQPY   PC+H           
Sbjct: 150 SCCTSCGDGCDGGYPDEAWLYFTESGLVS--DY-----CQPYPFPPCKHSGGRSKNPSCH 202

Query: 306 LGKLKTPECKQNCYN 320
                TP+C   C +
Sbjct: 203 DMHFHTPKCNATCTD 217



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 2/103 (1%)

Query: 59  IPLSHYF--KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHA 116
           IP+  YF  +   +    +  R++Y  GP    F+VY DFL Y+SGVY+H  G  +G HA
Sbjct: 220 IPVVRYFASESYSLQGEEDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHA 279

Query: 117 VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           VRV+GWG  N +PYW +ANSWN  WG++G     RG++E  IE
Sbjct: 280 VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIE 322


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 85/160 (53%), Gaps = 18/160 (11%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYF-TGQISAQH 244
           LP  FDARE +  C + + H+ DQS CGSCWA + + A SDRLCI S+G F    +SA H
Sbjct: 127 LPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAGH 186

Query: 245 IVACTPNC-----WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
             AC         +GC+GG P  AWR++  +GVV+  D     GC PY    C HHV+  
Sbjct: 187 TAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELD----SGCWPYNFPECSHHVETK 242

Query: 300 -LQNCTLLGKLKTPECKQNCYN----PSYESTYRFDLKKG 334
            ++ C   G   +P C   C N    PS+ES   F   +G
Sbjct: 243 GMEPCK--GNSPSPVCSTTCRNHHFKPSFESDRHFTEDEG 280



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 59/83 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F+VY DFL YKSGVY+H  G  +G HAV+++GWG + +  YWLV NSW
Sbjct: 291 KEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQNEQYWLVMNSW 350

Query: 138 NDHWGDHGTFKILRGENEADIEM 160
           N +WGD G FKI  GE   D E+
Sbjct: 351 NVNWGDQGIFKIAIGECGIDSEV 373


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 81/155 (52%), Gaps = 11/155 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR++WPEC SL+ I +Q  CGSCWA+S A   +DR CI S         A  ++
Sbjct: 89  LPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLL 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C  +C  GC GG    AW+FW   GV +GG YNS++GC PY +  C    +        
Sbjct: 149 SCCHSCGDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADE-------- 200

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                TP+C + C +    +    D + G+ A+ V
Sbjct: 201 --DADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSV 233



 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 58/81 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I+ +GP+ A F VY DF  YK+GVY+H FG   G HAV+++GWGVEN   YWL +NSW 
Sbjct: 243 EIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVENGTKYWLCSNSWG 302

Query: 139 DHWGDHGTFKILRGENEADIE 159
           + WG+ G FKI+RGEN   IE
Sbjct: 303 EDWGERGFFKIVRGENHCGIE 323


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 88/162 (54%), Gaps = 9/162 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R+KWPEC  +  + DQS+CGS   +      SDR CI SNG F   +SAQ  +
Sbjct: 94  IPTYFDSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPL 153

Query: 247 ACTP-------NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
           +C         + WGC+G WP+   ++W  +G+ TGG+Y  Q GC+PY++ PC+      
Sbjct: 154 SCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNG 213

Query: 300 LQNCTLLGKLKTPECKQNCY-NPSYESTYRFDLKKGKKAHMV 340
             +    G   TP C+++C  N ++   Y+ D   GK  + V
Sbjct: 214 TTSVPCPG-YHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNV 254



 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 3/107 (2%)

Query: 56  PTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI 112
           P +     +F KAH        +   +I  +GP++A F +Y DF  YKSG+Y H  GD  
Sbjct: 238 PIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQE 297

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G    +++GWGV++ +PYWL  + W   +G++G  + LRG NE +IE
Sbjct: 298 GGMDTKIIGWGVDSGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIE 344


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP+ A F VY DFL YKSG+Y+H  G  +G HA+R++GWGV+   PYWL+ANSW
Sbjct: 125 KEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKRTPYWLIANSW 184

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG+ G F+I+RG +E  IE
Sbjct: 185 NEDWGEKGLFRIVRGRDECSIE 206



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 70/119 (58%), Gaps = 3/119 (2%)

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQE 282
           ++DR+CI S G  + ++SA  +++C  +C  GC GG+P +AW +W   G+VTGG   +  
Sbjct: 1   MTDRICIQSGGGQSAELSALDLISCCEDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENHT 60

Query: 283 GCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           GCQPY    CEHH +G    C      KTP+CKQ C    Y++ Y+ D   G +++ V+
Sbjct: 61  GCQPYPFPKCEHHTKGKYPACGTK-IYKTPQCKQKC-QKGYKTPYKQDKHYGDESYNVI 117


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 79/134 (58%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHCTV-GGVQQLRISAAHLM 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C  GC+GG+P  +W ++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCEDCGDGCDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-HFHTPKCNTTC 214



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V +F VY+DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 240 RELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G    LRG NE  IE
Sbjct: 300 DTDWGMNGHLLFLRGNNECGIE 321


>gi|402583630|gb|EJW77574.1| hypothetical protein WUBG_11516 [Wuchereria bancrofti]
          Length = 168

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 58/147 (39%), Positives = 78/147 (53%), Gaps = 7/147 (4%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A  LP  FDAR KWP CPS+ ++ +Q  CGSC+AV+VA   SDR+CIA+NG     +S+ 
Sbjct: 13  ASELPDEFDARRKWPLCPSIHNVPNQGGCGSCYAVAVAGVASDRICIATNGTVQVILSSD 72

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
            I++C  +C  C GG    A  +W + G+VTGG    ++GCQPY   P +     P    
Sbjct: 73  DIISCCISCGACTGGDSLKAMIYWVNEGIVTGG----RDGCQPY---PYDIKCGIPCPLL 125

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRFD 330
                 K   C   C N  Y + Y  D
Sbjct: 126 EFAKNAKMQRCHHKCQNIYYRNDYFND 152


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ E WP CP++  IADQS CGSCWAV+ A+A+SDR C    G     ISA  ++
Sbjct: 72  LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 130

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
           AC  +C  GCNGG P  AW ++   G+V+  DY     CQPY    C HH +       C
Sbjct: 131 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 183

Query: 304 TLLGKLKTPECKQNCYNPS 322
           +      TP+C   C +P+
Sbjct: 184 SQF-NFDTPKCDYTCDDPT 201



 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR+++  GP    F VY DF+ Y SGVY H  G  +G HAVR++GWG  N +PYW +ANS
Sbjct: 222 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 281

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WG  G F I RG +E  IE G
Sbjct: 282 WNTEWGMDGYFLIRRGSSECGIEDG 306


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 48/98 (48%), Positives = 67/98 (68%), Gaps = 4/98 (4%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+V +F++Y D  +YKSGVY+H  G  +G HA++++GWG +N IPYWL+ANSW 
Sbjct: 127 EIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIANSWG 186

Query: 139 DHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDL 176
             WG++G FKI RG NE  IE    N V A  ++ D L
Sbjct: 187 TKWGENGFFKIRRGVNECGIE----NNVVAGKADVDTL 220



 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV-ACTPNC-WGCNG 257
           N  SCWA   A  ISDR+CIA+ G     IS   +V  C   C +GC+G
Sbjct: 63  NVKSCWAFGAAEVISDRICIATKGARQPIISPMDMVDCCGKYCGYGCDG 111


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 46/80 (57%), Positives = 58/80 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP  A FSVY DFL YKSGVY+H  G  +G HAV ++GWG E  + YWLV NSW
Sbjct: 275 KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 334

Query: 138 NDHWGDHGTFKILRGENEAD 157
           N+ WGDHGTFKI++G+   D
Sbjct: 335 NEEWGDHGTFKIVQGDCGID 354



 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 83/174 (47%), Gaps = 12/174 (6%)

Query: 169 NSSEDDDLETMGCQNAKGLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDR 227
           N +E+ + +    +    +P +FDAR+ + EC   + H+ DQS CGSCWA     A + R
Sbjct: 82  NGTEELEEKVYPAEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNAR 141

Query: 228 LCIASNGYFTGQISAQHIVACTPN-----CWGCNGGWPQLAWRFWGHNGVVTGGDY---- 278
           +CI S G     +SA  ++AC         +GC+GG P  +W F   NG+V+GG +    
Sbjct: 142 VCIKSGGKLNQLLSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEK 201

Query: 279 --NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
              + +GC PY    C HH +             TP C  +C N  Y + +  D
Sbjct: 202 NMKAADGCWPYNFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKD 255


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ E WP CP++  IADQS CGSCWAV+ A+A+SDR C    G     ISA  ++
Sbjct: 71  LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 129

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
           AC  +C  GCNGG P  AW ++   G+V+  DY     CQPY    C HH +       C
Sbjct: 130 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 182

Query: 304 TLLGKLKTPECKQNCYNPS 322
           +      TP+C   C +P+
Sbjct: 183 SQF-NFDTPKCNYTCDDPT 200



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR+++  GP    F VY DF+ Y SGVY H  G  +G HAVR++GWG  N +PYW +ANS
Sbjct: 221 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 280

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WG  G F I RG +E  IE G
Sbjct: 281 WNTEWGMDGYFLIRRGSSECGIEDG 305


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 77/134 (57%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS CGSCWAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYCTV-GGVQQLRISAAHLM 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C  +C  GC GG P  AW ++  +G+       +   CQPY    C HH  +G    C+
Sbjct: 149 SCCEDCGDGCKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCS 201

Query: 305 LLGKLKTPECKQNC 318
                 TP+C   C
Sbjct: 202 KY-HFHTPKCNTTC 214



 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 47/82 (57%), Positives = 59/82 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F VY+DFL YK+GVY+H  GD +G HAVR++GWG  N  PYW +ANSW
Sbjct: 241 RELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           +  WG +G F ILRG NE  IE
Sbjct: 301 DTDWGMNGHFLILRGNNECGIE 322


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ E WP CP++  IADQS CGSCWAV+ A+A+SDR C    G     ISA  ++
Sbjct: 94  LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 152

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
           AC  +C  GCNGG P  AW ++   G+V+  DY     CQPY    C HH +       C
Sbjct: 153 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 205

Query: 304 TLLGKLKTPECKQNCYNPS 322
           +      TP+C   C +P+
Sbjct: 206 SQF-NFDTPKCNYTCDDPT 223



 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR+++  GP    F VY DF+ Y SGVY H  G  +G HAVR++GWG  N +PYW +ANS
Sbjct: 244 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 303

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WG  G F I RG +E  IE G
Sbjct: 304 WNTEWGMDGYFLIRRGSSECGIEDG 328


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 55/148 (37%), Positives = 82/148 (55%), Gaps = 2/148 (1%)

Query: 194 REKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC- 252
           R +WP+C ++  I DQ++CGSCWA + A+A+SDR+CI SNG    +++A   ++C   C 
Sbjct: 1   RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYCG 60

Query: 253 WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTP 312
            GC GG+P  AW +W   G+VTGG + ++ GCQP+    C+H       +        TP
Sbjct: 61  QGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTP 120

Query: 313 ECKQNCYNPSYESTYRFDLKKGKKAHMV 340
            C + C    Y  TY  D   G  ++ V
Sbjct: 121 PCARAC-QTGYNKTYEQDKFYGNSSYNV 147



 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 63/83 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I ++GP+   F+++ DF  Y+SG+Y H  G  IG HAVR++GWGVEN + YWL+ANS
Sbjct: 155 MQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANS 214

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN+ WG++G F+++RG NE  IE
Sbjct: 215 WNEEWGENGYFRMVRGRNECGIE 237


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 78/139 (56%), Gaps = 12/139 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ E WP CP++  IADQS CGSCWAV+ A+A+SDR C    G     ISA  ++
Sbjct: 94  LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTM-GGVQDVHISAGDLL 152

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG--PLQNC 303
           AC  +C  GCNGG P  AW ++   G+V+  DY     CQPY    C HH +       C
Sbjct: 153 ACCSDCGDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSKSKNGYPPC 205

Query: 304 TLLGKLKTPECKQNCYNPS 322
           +      TP+C   C +P+
Sbjct: 206 SQF-NFDTPKCNYTCDDPT 223



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           MR+++  GP    F VY DF+ Y SGVY H  G  +G HAVR++GWG  N +PYW +ANS
Sbjct: 244 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 303

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WG  G F I RG +E  IE G
Sbjct: 304 WNTEWGMDGYFLIRRGSSECGIEDG 328


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 46/80 (57%), Positives = 58/80 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP  A FSVY DFL YKSGVY+H  G  +G HAV ++GWG E  + YWLV NSW
Sbjct: 154 KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 213

Query: 138 NDHWGDHGTFKILRGENEAD 157
           N+ WGDHGTFKI++G+   D
Sbjct: 214 NEEWGDHGTFKIVQGDCGID 233



 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 64/134 (47%), Gaps = 11/134 (8%)

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-----CWGCNGGWPQL 262
           DQS CGSCWA     A + R+CI S G     +SA +++AC         +GC+GG P  
Sbjct: 1   DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60

Query: 263 AWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQ 316
           +W F   NG+V+GG +       + +GC PY+   C HH  G            TP C  
Sbjct: 61  SWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120

Query: 317 NCYNPSYESTYRFD 330
           +C N  Y + +  D
Sbjct: 121 SCPNAKYGTAFDKD 134


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 68/102 (66%)

Query: 61  LSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           + + ++KA      N   ++ ++GP+   F+VY+DF+ YKSGVYQH  G   G HAV ++
Sbjct: 173 IRYKYEKAETYTVQNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLI 232

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           GWGVE+ +PYWL+ NSW   WG+ G FKI+RG+NE   E GF
Sbjct: 233 GWGVEDGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQGF 274



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 68/144 (47%), Gaps = 28/144 (19%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDARE+WP    +  + DQ++CGSCWA + + AI +R  I   G   G +S Q +V
Sbjct: 63  VPENFDAREQWPG--KIYPVRDQASCGSCWAHAASEAIGNRFSIKGCG--KGMLSVQDLV 118

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           +C     GCNGG   L+ ++   NGV T       E C PY                 + 
Sbjct: 119 SCDKGDSGCNGGSGPLSSKWLVSNGVTT-------EECLPY-----------------VS 154

Query: 307 GKLKTPECKQNCYNPSYESTYRFD 330
           G  + P C   C N S    Y+++
Sbjct: 155 GNGRVPACAAKCSNGSQIIRYKYE 178


>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
          Length = 541

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/137 (43%), Positives = 75/137 (54%), Gaps = 11/137 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAREKWPEC      A DQ  CGSCWA++    +SDRLCIAS G    +++A  I
Sbjct: 276 LPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERLAASEI 335

Query: 246 VACTP-----NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH--HVQG 298
           ++C       +   C GG P  A+ F    GV +GG Y  ++GC  Y   PC H  HVQ 
Sbjct: 336 LSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYGDEKGCAAYPFPPCHHPCHVQ- 394

Query: 299 PLQNCTLLGKLKTPECK 315
           P   C L  K  T +C+
Sbjct: 395 PTPACPL--KSDTAQCQ 409



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 8/83 (9%)

Query: 78  RQIYEHGPLVAIF-SVYADFLQYKSGVYQHNF-----GDSIGLHAVRVLGWGVENDIPY- 130
           R+IY  GP+ +   ++Y +F  YK G Y+ +      G S G H + V+GW  E+D  Y 
Sbjct: 440 REIYNSGPVSSYAGTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYS 499

Query: 131 WLVANSWNDHWGDHGTFKILRGE 153
           W + NSW + WG  G  +I  GE
Sbjct: 500 WKIINSWLN-WGKKGHGRIAVGE 521


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/166 (37%), Positives = 84/166 (50%), Gaps = 22/166 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA EKWP C ++  I DQSNCGSCWA++   A+SDR C  S G    +IS  +++
Sbjct: 98  LPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   GV T       E CQPY   PC HH       P  
Sbjct: 157 SCCFICGFGCYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCP 209

Query: 302 NCTLLGKLKTPECKQNCYNPS-----YESTYRFDLKKGKKAHMVLM 342
           N        TP+C   C N       Y+    + +K  ++  + LM
Sbjct: 210 NTI----YNTPKCNTTCDNVEMELVKYKGVSSYSIKGERELMVELM 251



 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 59/83 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VYADF+ YKSGVY+H  GD +G HAV+++GWGV++ IPYW +ANS
Sbjct: 247 MVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD G F I RG +E  IE
Sbjct: 307 WNTDWGDKGYFLIQRGNDECGIE 329


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 58/83 (69%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GP    F VYADF+ YKSGVY H  G+ +G HAV+++GWGV+N  PYW +ANS
Sbjct: 247 MIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGVQNGTPYWKIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G F I RG +E  IE
Sbjct: 307 WNSDWGDNGYFLIRRGTDECGIE 329



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 73/136 (53%), Gaps = 9/136 (6%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A+ LP +FD+ +KWP+C ++  I DQSNCGSCWA++   A+SDR C  + G    ++S  
Sbjct: 95  AQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVA-GITDLRVSTG 153

Query: 244 HIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           H+++C   C  GC GG P +AW +W   G+       + E CQPY   PC HH  G    
Sbjct: 154 HLLSCCFVCGMGCQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGGKYP 206

Query: 303 CTLLGKLKTPECKQNC 318
                   TP C   C
Sbjct: 207 ACPSTIYDTPTCNSTC 222


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 79/138 (57%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA E WP CP++R IADQS C + WAV+ A+AISDR C    G    +ISA  ++
Sbjct: 91  LPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGK-QLRISAADLM 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           AC  +C  GC GG+P  AW ++  +G+ +     SQ  CQPY    CEH   QG    C+
Sbjct: 150 ACCKDCGGGCEGGYPDAAWEYYVSHGITS-----SQ--CQPYPFPRCEHRGAQGKKPPCS 202

Query: 305 LLGKLKTPECKQNCYNPS 322
              K  TP+C   C + S
Sbjct: 203 KY-KFVTPQCNATCTDKS 219



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 63/86 (73%), Gaps = 1/86 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F V++DFL YKSGVYQH  G+ +G  AVR++GWG  N  PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
           +  WG +G F ILRG+NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGDNECNIEHLGF 326


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 75/143 (52%), Gaps = 17/143 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA EKWP C ++  I DQSNCGSCWA++   A+SDR C  S G    +IS  +++
Sbjct: 98  LPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   GV T       E CQPY   PC HH       P  
Sbjct: 157 SCCFICGFGCYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCP 209

Query: 302 NCTLLGKLKTPECKQNCYNPSYE 324
           N        TP+C   C N   E
Sbjct: 210 NTI----YNTPKCNTTCDNVEME 228



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 58/81 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++  +GPL     VYADF+ YKSGVY+H  GD +G HAV+++GWGV++ IPYW +ANSWN
Sbjct: 249 ELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWN 308

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G F I RG +E  IE
Sbjct: 309 TDWGDKGYFLIQRGNDECGIE 329


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 56/81 (69%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y  GP  A FSVY DF  YKSGVY H  G  +G HAV V+GWGVE+  PYWL+ NSW 
Sbjct: 191 ELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGVEDGTPYWLIQNSWG 250

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG+ G FKILRG+NE  IE
Sbjct: 251 TTWGEQGFFKILRGKNECGIE 271



 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 64/136 (47%), Gaps = 28/136 (20%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDARE+WP    +  + +Q  CGSCWA +VA    +RL I   G   G +S Q +V
Sbjct: 63  LPDNFDAREQWPG--KILPVRNQEQCGSCWAFAVAETTGNRLNILGCG--RGDMSPQDLV 118

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           +C     GCNGG P  +W +  H+G+ T       E C PY                 + 
Sbjct: 119 SCDKVDHGCNGGSPLFSWEWVKHSGITT-------EECIPY-----------------VS 154

Query: 307 GKLKTPECKQNCYNPS 322
           G  + P C + C N S
Sbjct: 155 GGGRVPSCPKKCTNGS 170


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 74/104 (71%), Gaps = 9/104 (8%)

Query: 63  HYFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--IGLHA 116
           HY K+A+ +      RQI     ++GP+VA F+VYADF+ Y SGVY+ + G+S  +G HA
Sbjct: 150 HYAKQAYRI-MSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFD-GESKLLGGHA 207

Query: 117 VRVLGWGVENDI-PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           VR++GWG+EN   PYWLV+NSWN+ WGD G FKI RG+NE  IE
Sbjct: 208 VRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIE 251



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 25/35 (71%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWA 217
           + K LP  FDAR++W +C S++ I DQS CGSCW 
Sbjct: 77  DGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWG 111


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 75/138 (54%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA  ++
Sbjct: 90  LPETFDAAEHWPHCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAADLL 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           +C   C  GC GG+P  AW ++   G+       +  GCQPY    CEH   QG    C+
Sbjct: 149 SCCKQCGDGCKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
              K  TP+C   C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 68/110 (61%), Gaps = 4/110 (3%)

Query: 58  SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           SIPL  Y   A  +      +  R++Y +GP VA+F VY D   YKSGVY++  GD +G 
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
            AVR++GWG  N  PYW VANSW+  WG +G   ILRG NE +IE +GF 
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFT 327


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 59/83 (71%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N  ++I  +GP  A FSVY DF+ YKSGVY+H  G  +G+H+V ++GWG E  + YWLV 
Sbjct: 224 NIKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVM 283

Query: 135 NSWNDHWGDHGTFKILRGENEAD 157
           NSWN+ WGDHGTFKI +G+   D
Sbjct: 284 NSWNEGWGDHGTFKIAQGDCGID 306



 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 72/143 (50%), Gaps = 6/143 (4%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR+ + EC   + H+ DQS C SCWA++   A + RLCI S G F   +SA  +
Sbjct: 59  IPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118

Query: 246 VAC--TPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL 300
           +AC  + + W   GC GG    AW F   +G+ T G  ++ +GC PY    C HH +   
Sbjct: 119 IACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSK 178

Query: 301 QNCTLLGKLKTPECKQNCYNPSY 323
                     TP C   C N  Y
Sbjct: 179 YEPCSKKLYDTPSCLDRCPNEKY 201


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 77/134 (57%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA E WP CP++R IADQS C + WAV+ A+AISDR C    G    +ISA  ++
Sbjct: 91  LPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQL-RISAADLM 149

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           AC  +C  GC GG+P  AW ++  +G+ +     SQ  CQPY    CEH   QG    C+
Sbjct: 150 ACCKDCGGGCEGGYPDAAWEYYVSHGIAS-----SQ--CQPYPFPRCEHRGAQGKKTPCS 202

Query: 305 LLGKLKTPECKQNC 318
              K  TP+C   C
Sbjct: 203 KY-KFVTPQCNATC 215



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 63/86 (73%), Gaps = 1/86 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F V++DFL YK+GVYQH  G+ +G  AVR++GWG  N  PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
           +  WG +G F ILRG+NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGDNECNIEHLGF 326


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/83 (57%), Positives = 58/83 (69%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M+ + E+GPL   F VY+DF+ Y+SGVYQH  G   G HAV + GWGVEN +PYWLV NS
Sbjct: 189 MQALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGVENGLPYWLVQNS 248

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG+ G FKILRG N  +IE
Sbjct: 249 WGPAWGEKGFFKILRGSNHCEIE 271



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 6/112 (5%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P  FDARE+WP    +  + DQ++CGSCWA SVA A+ D   IA  G   G +S Q +V+
Sbjct: 64  PTEFDAREQWPG--KILPVRDQASCGSCWAHSVAEAMGDAQNIA--GCPRGAMSVQDLVS 119

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGG--DYNSQEGCQPYTLAPCEHHVQ 297
           C      CNGG  + A  +    G+ T     Y S  G  P   + C++  Q
Sbjct: 120 CDKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSGRVPACPSKCDNGSQ 171


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 45/76 (59%), Positives = 57/76 (75%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  +GP  A FSVY DFL YKSGVY+H  G  +G HAV ++GWG E  + YWLV NSW
Sbjct: 44  KEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSW 103

Query: 138 NDHWGDHGTFKILRGE 153
           N+ WGDHGTFKI++G+
Sbjct: 104 NEEWGDHGTFKIVQGD 119


>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 88

 Score =  106 bits (264), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 44/71 (61%), Positives = 54/71 (76%)

Query: 83  HGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWG 142
           +GP  A FSVY DFL YKSGVY+H  G  +G HAV ++GWG E  + YWLV NSWN+ WG
Sbjct: 3   NGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWG 62

Query: 143 DHGTFKILRGE 153
           DHGTFKI++G+
Sbjct: 63  DHGTFKIVQGD 73


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 60/85 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M ++Y +GP+   FSVY DF  YKSGVY++  GD +G HAV+++GWG E+   YWLVA
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVA 298

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN  WG+ G FKI RG NE  IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECGIE 323



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 70/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDARE WP+C S++ I DQ +CGSCWA     A+SDR CI      T  +S   +V
Sbjct: 96  LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVT--LSENDLV 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW+++   GVVT         C PY   A C+H    PL   
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFISTGVVTA-------ECDPYFDDAGCQHPGCEPL--- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 84/158 (53%), Gaps = 7/158 (4%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           Q    L  +FDARE+WPEC S+  I D S C + WA + A ++SDRLCI S G+    +S
Sbjct: 71  QANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILS 130

Query: 242 AQHIVACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQ 297
           A+ +++C    +    GC GG P  AW++   +G+ TGG Y SQ GC+PY++ PC   V 
Sbjct: 131 AEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVG 190

Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
                        TP C++ C   +    Y  D+ K +
Sbjct: 191 NVTYPACTNTTSPTPSCEKKC---TSRIGYPIDIDKDR 225



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 50/117 (42%), Positives = 68/117 (58%), Gaps = 3/117 (2%)

Query: 46  KKKKKKRLYLPTSIPLS-HYFKKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSG 102
           +KK   R+  P  I    HY      +P      Q  +  +GP+ A F VY DFLQY +G
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTG 267

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +Y H  G+  G  +VR++GWGV   +PYWL ANSW   WG++GTF++LRG NE  +E
Sbjct: 268 IYVHLTGNKQGHLSVRIIGWGVWQGVPYWLCANSWGRQWGENGTFRVLRGTNECGLE 324


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 47/83 (56%), Positives = 61/83 (73%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I  +GP+ + FSVY DF+ YKSGVY H  G  +G HA++++GWGVEN++ YWLVANS
Sbjct: 140 MNEIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVENNVKYWLVANS 199

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG +G FKI RG+NE  IE
Sbjct: 200 WGPDWGLNGLFKIKRGDNECGIE 222



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 58/107 (54%), Gaps = 14/107 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWA-----VSVANAISDRLCIASNGYFTGQIS 241
           LP +FD+REKWP C  +  I +Q  CGSCWA     +  +  +SDR CIAS G     +S
Sbjct: 2   LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
            Q +V+C     GC+GG    AW +  H G+VT       + C PY+
Sbjct: 60  PQDLVSCNWYNAGCDGGILWAAWIYLKHTGIVT-------DQCLPYS 99


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 60/85 (70%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M ++Y +GP+   FSVY DF  YKSGVY++  GD +G HAV+++GWG E+   YWLVA
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVA 298

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN  WG+ G FKI RG NE  IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECGIE 323



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 70/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDARE WP+C S++ I DQ +CGSCWA     A+SDR CI      T  +S   +V
Sbjct: 96  LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVT--LSENDLV 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW+++   GVVT         C PY   A C+H    PL   
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFISTGVVTA-------ECDPYFDDAGCQHPGCEPL--- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 74/122 (60%), Gaps = 7/122 (5%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHN 270
           CGSCWA S A+  SDRLCIA+ G     +SA+ +  C   C  GC+GG P+ AW F+  +
Sbjct: 1   CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRCGNGCDGGSPEAAWYFFMRH 60

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECK-QNCYNPSYESTYRF 329
           G+VTGGDY S +GCQPY++ P     +G  +N  +   + TP+C  + C N +Y   YR 
Sbjct: 61  GIVTGGDYESGDGCQPYSIYP-----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRA 115

Query: 330 DL 331
           DL
Sbjct: 116 DL 117



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 62/92 (67%), Gaps = 2/92 (2%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY    + + R   + M  IY++GP+ A F VY DF+ YKSGVY +  G   G HA+++L
Sbjct: 118 HYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKIL 177

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRG 152
           GWGV+++  YWL ANSW+  WG++G F+ILRG
Sbjct: 178 GWGVDDNTKYWLCANSWSRSWGENGLFRILRG 209


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score =  105 bits (263), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 44/82 (53%), Positives = 61/82 (74%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I ++GP+ A F VY DFL YKSG+Y+H  G     HA+R++GWG EN+ PYWL+ NSW
Sbjct: 5   KEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWGEENNTPYWLIPNSW 64

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+ILRG +E  IE
Sbjct: 65  NEDWGENGNFRILRGRHECSIE 86


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 58/81 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY++GP+VA F VY DF  Y+ G+Y H +G   G HAV+V+GWG EN   YWL+ANSWN
Sbjct: 250 EIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRENGTDYWLIANSWN 309

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G F+I RG NE  IE
Sbjct: 310 TDWGENGYFRIARGSNECGIE 330



 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 80/141 (56%), Gaps = 6/141 (4%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FDAR  WPEC S+  I DQS CGSCWAVS A A+SD++C+ SN      IS   I++
Sbjct: 88  PDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILS 147

Query: 248 CT-PNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           C   +C +GC    P  A+R+   + VVTGG Y  ++ C+PY   PC +H          
Sbjct: 148 CCGISCGYGCE-VLPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCP 206

Query: 306 LGKLKTPECKQNC---YNPSY 323
            G   TP+C++ C   YN SY
Sbjct: 207 RGLWPTPKCRKACQRKYNKSY 227


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/138 (44%), Positives = 76/138 (55%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP CP++R IADQS C + WAVS A+AISDR C    G    +ISA  ++
Sbjct: 90  LPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGGGK-QLRISAADLM 148

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           AC   C  GC GG+P  AW ++   G+ +     SQ  CQPY    CEH   QG    C+
Sbjct: 149 ACCKQCGDGCKGGFPGFAWLYYVEYGITS-----SQ--CQPYPFPHCEHRGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
              K  TP+C   C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 68/110 (61%), Gaps = 4/110 (3%)

Query: 58  SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           SIPL  Y   A  +      +  R++Y +GP VA+F VY D   YKSGVY++  GD +G 
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
            AVR++GWG  N  PYW VANSW+  WG +G   ILRG NE +IE +GF 
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFT 327


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 93/198 (46%), Gaps = 36/198 (18%)

Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISD 226
           A SS D+D+           P NFDARE +PEC S+   + DQS+CGSCWA +   A +D
Sbjct: 272 AQSSSDEDI-----------PANFDAREAFPECASIIGRVRDQSDCGSCWAFASTEAFND 320

Query: 227 RLCIASNGYFTGQ-------------ISAQHIVACTPN-----CWGCNGGWPQLAWRFWG 268
           R CIA  G                  +SA+   AC          GCNGG P  AW+++ 
Sbjct: 321 RRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFT 380

Query: 269 HNGVVTGGDY---NSQEGCQPYTLAPCEHHVQGPLQNCTLL--GKLKTPECKQNCYNPSY 323
             GVVTGGDY    +   C+PY   PC HHV            G+  TPEC   C   ++
Sbjct: 381 KTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNF 440

Query: 324 E-STYRFDLKKGKKAHMV 340
              +Y  D K  ++A+ +
Sbjct: 441 SGGSYGEDKKMAREAYSL 458



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/87 (49%), Positives = 58/87 (66%), Gaps = 2/87 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWL 132
           N  R + ++G + A FSV++DFL Y  GVY H  G  +G HAV+++GWG +  +   YWL
Sbjct: 463 NIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWL 522

Query: 133 VANSWNDHWGDHGTFKILRGENEADIE 159
           +ANSWN  WG+ G F+ILRG NE  IE
Sbjct: 523 IANSWNPSWGEGGLFRILRGVNECGIE 549


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 55/138 (39%), Positives = 78/138 (56%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA EKWP CP++  I+DQS+CGSCWAV+ A +++DR C   +G    +ISA  ++
Sbjct: 90  LPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
           AC  +C +GC GG P +AW ++   G+ +G        CQPY    C H+        C+
Sbjct: 149 ACCGDCGYGCLGGDPDMAWAYFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
            L  L TP C   C + +
Sbjct: 202 AL-HLWTPTCNPACTDST 218



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y  GP  A+F V++D   YK GVY+H  G  IG HAVR++GWG ++ +PYW +ANSW
Sbjct: 240 RELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD G F +LRG+NE  IE
Sbjct: 300 NAEWGDRGYFFMLRGDNECGIE 321


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 76/134 (56%), Gaps = 11/134 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA EKWP CP++  I+DQS+CGSCWAV+ A +++DR C   +G    +ISA  ++
Sbjct: 90  LPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRYCTI-HGVRGLRISAADLL 148

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPL-QNCT 304
           AC  +C +GC GG P +AW ++   G+ +G        CQPY    C H+        C+
Sbjct: 149 ACCGDCGYGCLGGDPDMAWAYFSSEGIASG-------RCQPYPFPRCSHYTNSTTYPQCS 201

Query: 305 LLGKLKTPECKQNC 318
            L  L TP C   C
Sbjct: 202 AL-HLWTPTCNPAC 214



 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y  GP  A+F V++D   YK GVY+H  G  IG HAVR++GWG ++ +PYW +ANSW
Sbjct: 240 RELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSW 299

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WGD G F +LRG+NE  IE
Sbjct: 300 NAEWGDRGYFFMLRGDNECGIE 321


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score =  105 bits (262), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 45/87 (51%), Positives = 62/87 (71%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +I ++GP+  IF ++ DFL YKSG+Y +  G  +G HA+RV+GWGVEN + YWL+ANS
Sbjct: 22  MMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANS 81

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN 163
           WN+ WG+ G F++ RG NE  IE   N
Sbjct: 82  WNEGWGEKGYFRMRRGNNECGIEARIN 108


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 62/86 (72%), Gaps = 1/86 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP V  F V++DFL YKSGVYQH  G+ +G  AVR++GWG  N  PYW VANSW
Sbjct: 241 RELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKMNGTPYWKVANSW 300

Query: 138 NDHWGDHGTFKILRGENEADIE-MGF 162
           +  WG +G F ILRG NE +IE +GF
Sbjct: 301 DTDWGMNGYFLILRGNNECNIEHLGF 326



 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 73/137 (53%), Gaps = 9/137 (6%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA EKWP CP++R I DQS C + WAV+ A+AISDR C   NG      +A  + 
Sbjct: 91  LPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLRISAADLMA 150

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCTL 305
            CT    GC GG+P  AW ++  NG+ +     SQ  CQPY    CEH   QG    C+ 
Sbjct: 151 CCTGCGGGCEGGYPDAAWEYYVSNGITS-----SQ--CQPYPFPRCEHRGAQGKKPPCSK 203

Query: 306 LGKLKTPECKQNCYNPS 322
                TP C   C + S
Sbjct: 204 Y-NFDTPTCNATCTDKS 219


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 58/85 (68%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M ++Y  GP+   F VY DF  YKSGVY++  GD +G HAV+++GWG EN   YWLVA
Sbjct: 239 DIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVA 298

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSWN  WG+ G FKI RG NE  IE
Sbjct: 299 NSWNTAWGEDGYFKIARGSNECSIE 323



 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 71/135 (52%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR+ WP+C S+R I DQ +CGSCWA     A+SDR CI      T  +S   +V
Sbjct: 96  LPKQFDARKAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVT--LSENDLV 153

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC    C  GC+GG+P  AW+++   GVVT         C PY   A C+H    PL   
Sbjct: 154 ACCGFRCGDGCDGGYPLSAWQYFISTGVVTA-------ECDPYFDEAGCQHPGCEPL--- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 204 -----YPTPQCVKQC 213


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  105 bits (261), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 55/136 (40%), Positives = 72/136 (52%), Gaps = 15/136 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDA ++WP+CP++  I +Q+ CGSCWA     +ISDR CI  N   + Q+S Q ++
Sbjct: 70  LPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNE--SVQLSFQDLI 127

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
            C     GC GG P  A+++   NGVVT         CQPYT+  C    Q P  N    
Sbjct: 128 TCDNQDNGCEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCP-PAQQPCMNF--- 176

Query: 307 GKLKTPECKQNCYNPS 322
             + TP C   C N S
Sbjct: 177 --VNTPPCSAKCANSS 190



 Score = 92.0 bits (227), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 43/95 (45%), Positives = 60/95 (63%), Gaps = 2/95 (2%)

Query: 63  HYFKKAHMV-PRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+ K  + V P   A++ +I  +GP+ A F VY DFL YKSGVY H  G  +G H ++++
Sbjct: 198 HHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIV 257

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           G+GV N  PYW+  NSW   WG++G F I  G+NE
Sbjct: 258 GFGVSNGTPYWICNNSWTTSWGNNGIFWIEAGKNE 292


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 59/85 (69%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSGVY+H  GD +G HAV+++GWG ++ +PYW VANS
Sbjct: 247 MIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGTQDGVPYWKVANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 307 WNTDWGDKGYFLIQRGNNECKIESG 331



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 79/162 (48%), Gaps = 14/162 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    ++S  +++
Sbjct: 98  LPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG P +AW +W   G+ T       E CQPY   PC HH          
Sbjct: 157 SCCFICGLGCHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCP 209

Query: 306 LGKLKTPECKQNCYN-----PSYESTYRFDLKKGKKAHMVLM 342
                TP+C   C         Y+ +  + +K  K+  + LM
Sbjct: 210 STIYDTPKCNTTCERNEMDLVKYKGSTSYSVKGEKELMIELM 251


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 57/85 (67%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I   GP+ A F VY DFL Y  G+Y+H  G   G HAV+VLGWG++  +PYWL ANSW
Sbjct: 151 KEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAANSW 210

Query: 138 NDHWGDHGTFKILRGENEADIEMGF 162
           N  WG+ G F+ILRG NE  IE G 
Sbjct: 211 NTDWGEDGYFRILRGVNECGIESGI 235



 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/133 (37%), Positives = 74/133 (55%), Gaps = 2/133 (1%)

Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
           +S+ GSCWAV+   A+SDR+CI S G     +SA  +++C   C +GC GG P  AW++W
Sbjct: 11  KSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGFGCFGGEPMAAWKYW 70

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              G+VTG +Y +  GC+PY   PCEHH               TP+C + C + +Y  +Y
Sbjct: 71  VLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNYGKSY 129

Query: 328 RFDLKKGKKAHMV 340
           + D   G+  + V
Sbjct: 130 KADKYYGQSVYNV 142


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 66/95 (69%), Gaps = 4/95 (4%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I ++GP+VA ++VY DF  Y+SG+Y+H  G   GLHAV+V+GWG E   PYW+VANSW
Sbjct: 15  KDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVANSW 74

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVEANSSE 172
           +D WG++G F++ RG N+     GF  R+ A S +
Sbjct: 75  HDDWGENGFFRMHRGSNDC----GFEERMAAGSVQ 105


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 65/101 (64%), Gaps = 2/101 (1%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY    + V +   +  ++I  +GP+   F VY DF  Y SG+Y+H  GD +G HAV++L
Sbjct: 125 HYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKML 184

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
           GWG EN   YW+ ANSWN  WG++G F+ILRG +E +IE G
Sbjct: 185 GWGTENGTDYWICANSWNSDWGENGFFRILRGVDECEIESG 225



 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 60/89 (67%), Gaps = 1/89 (1%)

Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFW 267
           QS+CGSCWAV    A++DR+CIAS G     ISA  +++C   C +GC+G  P  AW +W
Sbjct: 2   QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGFGCDGRDPYAAWSYW 61

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
             NG+VTG +Y S+ GC+PY   PCEHH+
Sbjct: 62  VSNGIVTGSNYTSKSGCKPYPYPPCEHHI 90


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 73/120 (60%), Gaps = 2/120 (1%)

Query: 42  KKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQY 99
           K  K +K  +R YL       H+ K A+ +P       R I ++GP+VA F VY DF  Y
Sbjct: 47  KTPKCQKTCQRGYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHY 106

Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           KSG+Y+H  G   G HAV+++GWG E   PYWL+ANSW+D WG+ G ++++RG N   IE
Sbjct: 107 KSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIE 166



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 263 AWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPS 322
           AW+++   GVVTGG+Y  Q  C+PY   PC  H + P          KTP+C++ C    
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYG-ECYDTAKTPKCQKTC-QRG 58

Query: 323 YESTYRFDLKKGKKAHMV 340
           Y   Y+ D   GK A+ +
Sbjct: 59  YLKAYKEDKHFGKSAYRL 76


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/153 (39%), Positives = 79/153 (51%), Gaps = 20/153 (13%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D+ T+   N   LP  FD+R  WP C  +  I DQ +CGSCWA+S    + DR CI S G
Sbjct: 67  DVSTVPVAN---LPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEG 123

Query: 235 YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
             T ++S QH+ +CTP C GCNGGW   A+ F   NG++        E C PY +  C+H
Sbjct: 124 KQTPELSPQHLTSCTPGCSGCNGGWMSTAFGFMQSNGILG-------EDCIPYQMGKCKH 176

Query: 295 HVQGPLQNCTLLGKLKTPEC-KQNCYNPSYEST 326
                   C+      TP+C K  CY    +ST
Sbjct: 177 ------PGCS---TWPTPKCNKTKCYPNDTKST 200



 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 56/85 (65%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +  ++IYE+GP+ A F+VY D   Y+SGVYQH  G   GLHA++V+GWG+ + + YW + 
Sbjct: 217 DIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWGILDGVKYWTIV 276

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW + WG  G   I RG +E  IE
Sbjct: 277 NSWAEDWGFDGLLLIRRGVDECGIE 301


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 56/83 (67%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++YE+GP+   F+VY DF+ YKSGVY H  G   G HAV  +GWGVE++ PYWL  NS
Sbjct: 189 MEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNS 248

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG+ G FKILRG N   IE
Sbjct: 249 WGPAWGEKGHFKILRGSNHCGIE 271



 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 64/137 (46%), Gaps = 28/137 (20%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
            LP NFD+RE+WP    +  + DQ++CGSCWA SVA  + DRL I    +  G +S Q +
Sbjct: 62  ALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDF--GDMSPQDL 117

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           V+C     GCNGG+   AW +   +G+ T       E C PY                  
Sbjct: 118 VSCDTTDMGCNGGYMDHAWAWTKSHGITT-------EKCMPYQ----------------- 153

Query: 306 LGKLKTPECKQNCYNPS 322
            G  + P C   C N S
Sbjct: 154 SGSGRVPACPAKCVNGS 170


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 56/83 (67%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++YE+GP+   F+VY DF+ YKSGVY H  G   G HAV  +GWGVE++ PYWL  NS
Sbjct: 189 MEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNS 248

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG+ G FKILRG N   IE
Sbjct: 249 WGPAWGEKGHFKILRGSNHCGIE 271



 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 49/137 (35%), Positives = 64/137 (46%), Gaps = 28/137 (20%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
            LP NFD+RE+WP    +  + DQ++CGSCWA SVA  + DRL I    Y  G ++ Q +
Sbjct: 62  ALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDY--GDMAPQDL 117

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           V+C     GCNGG+   AW +   +GV T       E C PY                  
Sbjct: 118 VSCDTTDMGCNGGYMDHAWAWTKSHGVTT-------EKCMPYQ----------------- 153

Query: 306 LGKLKTPECKQNCYNPS 322
            G  + P C   C N S
Sbjct: 154 SGSGRVPACPAKCVNGS 170


>gi|256052325|ref|XP_002569723.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228438|emb|CCD74609.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 198

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 59/151 (39%), Positives = 79/151 (52%), Gaps = 9/151 (5%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           S DD    MG   A+    +   ++KWP C S+  I DQS CGS WA     A+SDR CI
Sbjct: 50  SLDDARIQMG---ARREESDLRRKKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCI 106

Query: 231 ASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289
            S G    ++SA  +++C  +C  G  GG+P LAW +W   G+VTG    +   CQPY  
Sbjct: 107 QSGGKQNVELSAVDLLSCCEHCGDGFEGGFPALAWDYWVKEGIVTGSSKENHTVCQPYPF 166

Query: 290 APCEHHVQGPLQNCTLLGK--LKTPECKQNC 318
             CEHH +G    C   G+   +TP C+  C
Sbjct: 167 PKCEHHTKGKYPAC---GEEIYRTPNCENTC 194


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 58/81 (71%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP+ A F+VY+D + YKSGVY H  G  +G HAV+VLGWGVE++  YWLVANSW 
Sbjct: 179 EIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDEEEYWLVANSWG 238

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WGD G FKI RG +E  IE
Sbjct: 239 PDWGDQGFFKIKRGSDECGIE 259



 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 56/103 (54%), Gaps = 8/103 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FDAR +W  C     I DQ +CGSCWA +    +SDRLCI + G     +S++ ++
Sbjct: 43  IPKSFDARMEWSTCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLL 102

Query: 247 ACTPNCWGC-NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C     GC +GG    AWR+    GVV          C+PYT
Sbjct: 103 SCDKAGRGCSDGGRLSEAWRYMQKKGVVA-------NRCKPYT 138


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 58/85 (68%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSGVY+H  GD +G HAV+++GWG +  +PYW +ANS
Sbjct: 252 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 311

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 312 WNTDWGDKGYFLIQRGSNECGIESG 336



 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    +IS  +++
Sbjct: 103 LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 161

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   G+ T       E CQPY   PC HH       P  
Sbjct: 162 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 214

Query: 302 NCTLLGKLKTPECKQNC 318
           N        TP+C   C
Sbjct: 215 NTI----YDTPKCNTTC 227


>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
          Length = 476

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 95/208 (45%), Gaps = 37/208 (17%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N M++IY HGP+     V  D L+YK G+Y+   G +   H + V+GWG EN IPYW+V 
Sbjct: 95  NIMKEIYAHGPVTCSIDVPDDLLEYKGGIYEDKTGIAGDGHDISVVGWGEENGIPYWIVR 154

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMG------------FNNRVEANSSEDDDLETMGCQ 182
           NSW  +WG+ G F+I+RG+N   IE G              N V        +    GC 
Sbjct: 155 NSWGTYWGEEGFFRIVRGKNNLGIEEGCTYGIPRIPEEKITNPVSLGVKHRINYFPQGCV 214

Query: 183 NA----------KGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
                         LP  +   E  P    +R+I D  N             CGSCWA +
Sbjct: 215 LESRKEMEEVIKSPLPHTYIKTEDLPTSYDIRNI-DGYNYATWDKNQHIPHYCGSCWAQA 273

Query: 220 VANAISDRLCIASNG-YFTGQISAQHIV 246
             +A+SDR+ +   G + T  +S Q ++
Sbjct: 274 PTSALSDRINLMRKGKWPTINLSEQEVI 301



 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 37/83 (44%), Positives = 49/83 (59%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
           +IY  GP+  +  V   FL Y  GV+    G  +G HAV V GWGV  E   PYW+V NS
Sbjct: 382 EIYARGPISCVMDVTQTFLDYTGGVFTSREGKWLGKHAVEVTGWGVDEETRTPYWIVRNS 441

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG++G F+I  G+N  +IE
Sbjct: 442 WGTYWGENGWFRIAMGQNLLNIE 464


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 88/183 (48%), Gaps = 30/183 (16%)

Query: 149 ILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECP-SLRHIA 207
           ++RG N+  ++ G+                   +  + LP +FDAR  +P C   + HI 
Sbjct: 121 LMRGSNDKAVKKGY-----------------AIEELQDLPTDFDARTAFPNCSKVIGHIR 163

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
           DQS CGSCWA  V  A +DRLCI SNG FT  +SA  + ACT   +GC GG P  AW + 
Sbjct: 164 DQSACGSCWAFGVTEAFNDRLCIKSNGAFTELLSAGEMNACTL-FFGCGGGDPYSAWSWV 222

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              G+ TG      EG +P  ++  E       Q+        TP C + C NP Y +T 
Sbjct: 223 HDKGIATG------EGSRPKRVSESEAIPVIAYQDI-----YPTPNCVEQCRNPKYTTTL 271

Query: 328 RFD 330
           R D
Sbjct: 272 RDD 274



 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 55/86 (63%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +A   I   GP+ A F+VY DFL YKSGVY+H  G  +G HAV+++GWG ++   YWL  
Sbjct: 290 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 349

Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
           NSWN+ WGD G FKI  G    D ++
Sbjct: 350 NSWNEDWGDKGLFKIALGNCGIDDDL 375


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 58/85 (68%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSGVY+H  GD +G HAV+++GWG +  +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    +IS  +++
Sbjct: 98  LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   G+ T       E CQPY   PC HH       P  
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209

Query: 302 NCTLLGKLKTPECKQNC 318
           N        TP+C   C
Sbjct: 210 NTI----YDTPKCNTTC 222


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 56/82 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++YE+GPL   F+VY DF+ YKSGVY H  G   G HAV  +GWGVE++ PYWL  NSW
Sbjct: 190 QELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDNTPYWLCQNSW 249

Query: 138 NDHWGDHGTFKILRGENEADIE 159
              WG+ G FKILRG N   IE
Sbjct: 250 GPAWGEKGHFKILRGSNHCGIE 271



 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/106 (42%), Positives = 60/106 (56%), Gaps = 11/106 (10%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           ++   LP NFDARE+WPE   +  + DQ++CGSCWA SVA  + DRL I   G   G +S
Sbjct: 58  ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCG--RGHMS 113

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q +V+C     GCNGG+   AW +   +GV       + E C PY
Sbjct: 114 PQDLVSCDTTDMGCNGGYMDKAWAWTKSHGV-------TNEECMPY 152


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 58/85 (68%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSGVY+H  GD +G HAV+++GWG +  +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331



 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    +IS  +++
Sbjct: 98  LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   G+ T       E CQPY   PC HH       P  
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209

Query: 302 NCTLLGKLKTPECKQNC 318
           N        TP+C   C
Sbjct: 210 NTI----YDTPKCNTTC 222


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 75/131 (57%), Gaps = 11/131 (8%)

Query: 45  KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           K+K  K R Y       S+Y   +   P     R+I +HGP+VA   ++  FL YKSGVY
Sbjct: 221 KRKLDKDRYYGE-----SYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVY 275

Query: 105 ---QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMG 161
              + N   S+GLHAV+++GWG +  IPYWLV NSWN  +G+ G FKI RG NE  IE  
Sbjct: 276 SANKRNDDPSLGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRRGTNECGIE-- 333

Query: 162 FNNRVEANSSE 172
            N  V A  +E
Sbjct: 334 -NLHVTAGLAE 343



 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 45/167 (26%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCW------------------------------ 216
           L  +FDAREKWPEC  +  I DQS C  CW                              
Sbjct: 60  LEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSHWLFI 119

Query: 217 ----AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNG 271
               A+S A+ ++DR CIA  G     +S + + +C  +C +GCNGG+P LA+++W   G
Sbjct: 120 STFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGYGCNGGFPLLAFKYWNEIG 179

Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
           V TGG Y S+ GC+P+++AP       P  + T     +TP C+  C
Sbjct: 180 VPTGGPYGSKSGCKPFSIAP-------PTSSST---AAQTPLCQLKC 216


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 59/85 (69%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSGVY+H  G+ +G HAV+++GWG ++ +PYW VANS
Sbjct: 247 MIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGTQDGVPYWKVANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 307 WNTDWGDKGYFLIQRGNNECKIESG 331



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 79/162 (48%), Gaps = 14/162 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    ++S  +++
Sbjct: 98  LPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   C  GC+GG P +AW +W   G+ T       E CQPY   PC HH          
Sbjct: 157 SCCFICGLGCHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCP 209

Query: 306 LGKLKTPECKQNCYNPS-----YESTYRFDLKKGKKAHMVLM 342
                TP+C   C         Y+ +  + +K  K+  + LM
Sbjct: 210 STIYDTPKCNTTCERSEMDLVKYKGSTSYSVKGEKELMIELM 251


>gi|268566081|ref|XP_002647468.1| Hypothetical protein CBG06540 [Caenorhabditis briggsae]
          Length = 188

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 68/106 (64%), Gaps = 3/106 (2%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           AK +P  FDAR+KW  C S++ I +Q+NCGSCWA   A  ISDR+CI + G     IS  
Sbjct: 73  AKKIPDTFDARQKWKNCTSIKMIRNQANCGSCWAFGAAEVISDRICIVTKGARQPIISPT 132

Query: 244 HIVACTPN-C-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            ++ C    C +GC+GG+   A R+W  NGVVTGGDY   +GC+PY
Sbjct: 133 DMLDCCGEYCGYGCDGGYSIQALRWWVSNGVVTGGDYQG-DGCKPY 177


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 69/110 (62%), Gaps = 8/110 (7%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  +D   YWL+
Sbjct: 180 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 239

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF-------NNRVEANSSEDDDL 176
           AN WN  WGD G FKI RG NE  IE G         N V+  ++ DD L
Sbjct: 240 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLL 289



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C S+  I DQ +CGSCWA     ++SDR CI  N      +S   ++
Sbjct: 37  LPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLL 94

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++ H+GVVT       E C PY     C H    P    
Sbjct: 95  ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 143

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 144 ----AYPTPKCARKC 154


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 77/156 (49%), Gaps = 17/156 (10%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P  FDARE WPEC  +  +I +Q  C S WA + A  +SDRLCIA+NG    Q+S + +
Sbjct: 72  IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131

Query: 246 VACTPNCWG-CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           + C   C   C GG+   AW ++   G+V+GGDYN+  GCQPY+                
Sbjct: 132 IDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS---------------E 176

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           L     TP C   C N  Y   Y  D   G   + +
Sbjct: 177 LNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYI 212



 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 54/110 (49%), Gaps = 13/110 (11%)

Query: 63  HYFKKAHMVPRCNAMRQ---IYEHGPLVAIFSVYADFLQYKSG---------VYQHNFGD 110
           H+    + +P+     Q   +   GP+VA F VY DF  Y+ G         VY +  G 
Sbjct: 204 HFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGA 263

Query: 111 SIGLHAVRVLGWGVENDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
             G  AV+++GWG EN   YWL ANSW   WG   G FKI RG NE   E
Sbjct: 264 LFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFE 313


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 69/110 (62%), Gaps = 8/110 (7%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  +D   YWL+
Sbjct: 249 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 308

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF-------NNRVEANSSEDDDL 176
           AN WN  WGD G FKI RG NE  IE G         N V+  ++ DD L
Sbjct: 309 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSDDLL 358



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C S+  I DQ +CGSCWA     ++SDR CI  N      +S   ++
Sbjct: 106 LPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNVSLSVNDLL 163

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++ H+GVVT       E C PY     C H    P    
Sbjct: 164 ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 212

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 213 ----AYPTPKCARKC 223


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 58/82 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R I  +GP+ A F VY DFL  KSG+ +H  G  +G H +R++GWGVE   PYWL+ANSW
Sbjct: 251 RDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNPYWLIANSW 310

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WG++G F+++RG +E  IE
Sbjct: 311 NEDWGENGLFRMVRGRDECSIE 332



 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 66/144 (45%), Gaps = 24/144 (16%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R+KWP C S+  I DQS CGSCWA     A++DR+CI S G  + ++SA  ++
Sbjct: 90  IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149

Query: 247 A------------CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
           +                 W   G      WRF   N            GCQPY    CEH
Sbjct: 150 SCCEDCGGGCKGGFPGQAWDM-GKTRDSHWRFRKKN----------HTGCQPYPFPKCEH 198

Query: 295 HVQGPLQNCTLLGKLKTPECKQNC 318
             +G    C      KTP+CKQ C
Sbjct: 199 LTKGKYPACG-TKIYKTPQCKQTC 221


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 77/156 (49%), Gaps = 17/156 (10%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P  FDARE WPEC  +  +I +Q  C S WA + A  +SDRLCIA+NG    Q+S + +
Sbjct: 72  IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131

Query: 246 VACTPNCWG-CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           + C   C   C GG+   AW ++   G+V+GGDYN+  GCQPY+                
Sbjct: 132 IDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS---------------E 176

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
           L     TP C   C N  Y   Y  D   G   + +
Sbjct: 177 LNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYI 212



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/101 (42%), Positives = 54/101 (53%), Gaps = 4/101 (3%)

Query: 63  HYFKKAHMVPRCNAMRQ---IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           H+    + +P+     Q   +   GP+VA F VY DF  Y+ GVY +  G   G  AV++
Sbjct: 204 HFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKI 263

Query: 120 LGWGVENDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
           +GWG EN   YWL ANSW   WG   G FKI RG NE   E
Sbjct: 264 IGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFE 304


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 59/81 (72%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IYE+GP+   F VY+DF+ YKSGVY H  G   G HAV ++GWGVE+++PYWLV NSW 
Sbjct: 191 EIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDEVPYWLVQNSWG 250

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG++G FKILRG +  + E
Sbjct: 251 TDWGENGFFKILRGSDHCECE 271



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 60/106 (56%), Gaps = 11/106 (10%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           +++  +P  FDAREKWP+  ++  + DQ  CGSCWA S+A  I DRL +   G   G I+
Sbjct: 58  RDSNKVPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVL--GCSRGDIA 113

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            + +V+C     GC+GG+  +AW +   NG+ T       E C PY
Sbjct: 114 PEDLVSCDIFDDGCDGGFIDMAWDWCQENGLTT-------EECIPY 152


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/127 (44%), Positives = 77/127 (60%), Gaps = 5/127 (3%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGV 272
           SCWAVS A A+SDR+CIAS G     ISAQ IV+C   C  GC GGWP  AW++    GV
Sbjct: 1   SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTWCGAGCEGGWPIEAWKYGVTEGV 60

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQ-NCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
           VTGG++  +E C+ Y + PC +H   P   +C  +   +TP CK+ C  P Y+++Y  D 
Sbjct: 61  VTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMA--RTPPCKKRC-RPGYKNSYMMDK 117

Query: 332 KKGKKAH 338
           + G  A+
Sbjct: 118 RYGTSAY 124



 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 44/64 (68%), Gaps = 4/64 (6%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE----NDIPYWLV 133
           R I E+GP+VA F VY DF  YKSG+Y+H  G   G HAV+V+GWG E      IPYW++
Sbjct: 135 RDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWII 194

Query: 134 ANSW 137
           ANSW
Sbjct: 195 ANSW 198


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY+H  GD +G  AV+V+GWG  N  PYW VAN+W
Sbjct: 241 RELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKVANTW 300

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    E +
Sbjct: 301 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 333



 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 57/138 (41%), Positives = 75/138 (54%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS C + WAVS A+ ISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYCTV-GGVQQLRISAAHLL 148

Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           + C     GC GG+P  AWR++   G+       +   CQPY    CEH   QG    C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPHCEHRGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
                 TP+C   C + S
Sbjct: 202 KY-NFDTPKCNATCTDKS 218


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 47/84 (55%), Positives = 59/84 (70%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M ++Y++GP+   F+VY DF  YKSGVY+H  GD +G HAV+++GWG   D   YWL+AN
Sbjct: 11  MTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGEDYWLLAN 70

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
            WN  WGD G FKI RG NE DIE
Sbjct: 71  QWNRGWGDDGYFKIRRGTNECDIE 94


>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
          Length = 561

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 106/229 (46%), Gaps = 34/229 (14%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++IY  GP+        + + YK G+++   G +   HA+ V+GWG E+   YW+V NS
Sbjct: 185 MKEIYARGPITCALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNS 244

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFN---NRVEANSSEDDDLETM---------GCQNA 184
           W  +WG++G F+I+RG N   IE        RV      +D + ++          C   
Sbjct: 245 WGTYWGENGWFRIVRGTNNLGIESECTWAVPRVPEKMRLNDKMRSLHNRARYFPHSCAIR 304

Query: 185 K--------GLPRNFDAREKWPECPSLRHIADQS------------NCGSCWAVSVANAI 224
           K         LP  +   E  P+   +R+I  ++             CGSCWA    +AI
Sbjct: 305 KQEPAVVTEPLPHFYLKSEDIPKSYDIRNIDGRNYATWDKNQHIPQYCGSCWAQGSTSAI 364

Query: 225 SDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
           +DR+ I   G + T ++S Q ++ C  N   CNGGW    +R+    G+
Sbjct: 365 ADRINIMRKGKWPTVELSVQEVINCG-NTGSCNGGWDSGVYRYAHEEGI 412



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSW 137
           +I+  GP+    SV  +FL Y  GV+  +    +G H + V GWGV E+   YW+  NSW
Sbjct: 468 EIFARGPISCYVSVSQEFLDYTGGVFVEHDHSMLGGHIIEVAGWGVTEDGQEYWIGRNSW 527

Query: 138 NDHWGDHGTFKILRGENEADIE 159
            ++WG++G F+I   ++  +IE
Sbjct: 528 GEYWGENGWFRIQTDKDNLEIE 549



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 14/106 (13%)

Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRL-CIASNGYFTGQIS 241
           LP+++D R      +      +HI     CGSCWA S A+A++DRL  +  N + T ++S
Sbjct: 43  LPKSYDPRNIDGVSYVSVSRNQHIPQY--CGSCWAFSAASAVADRLRLMTKNAWPTAELS 100

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q IV C     GC+GG    A++     GV T       EGC  Y
Sbjct: 101 PQMIVNCATTAMGCHGGSMTSAYKLMKERGVPT-------EGCMRY 139


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/137 (40%), Positives = 70/137 (51%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E WP C ++  I DQSNCGSCWA++   AISDR C    G    +IS  +++
Sbjct: 98  LPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTL-GGVPDRRISTSNLL 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQG----PLQ 301
           +C   C +GC GG P +AW +W   G+ T       E CQPY   PC HH       P  
Sbjct: 157 SCCFICGFGCYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCP 209

Query: 302 NCTLLGKLKTPECKQNC 318
           N        TP+C   C
Sbjct: 210 NTI----YDTPKCNTTC 222



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M ++  +GPL     VY+DF+ YKSG Y+H  GD +G HAV+++GWG +  +PYW +ANS
Sbjct: 247 MIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIANS 306

Query: 137 WNDHWGDHGTFKILRGENEADIEMG 161
           WN  WGD G F I RG NE  IE G
Sbjct: 307 WNTDWGDKGYFLIQRGSNECGIESG 331


>gi|303289014|ref|XP_003063795.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
 gi|226454863|gb|EEH52168.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
          Length = 390

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 76/145 (52%), Gaps = 18/145 (12%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQ--- 239
           A GLP  FDARE+WP C  +   A DQ  CGSCWAV+ A  ++DR CIA+NG   G    
Sbjct: 113 ADGLPELFDARERWPRCARVVGTALDQGKCGSCWAVATAAVLTDRACIATNGALGGGGGG 172

Query: 240 ---ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHV 296
              +SA  +++C     GC GG  + A+ +   +GVVTGG Y  +  C PY    C+H  
Sbjct: 173 GEFLSASQLLSCGAAD-GCEGGDERDAFEYAKTHGVVTGGAYGDESTCAPYLFDACQHPC 231

Query: 297 QGPLQNCTLLGKLKTPECKQNCYNP 321
           +          K  TPEC  +C  P
Sbjct: 232 E----------KSPTPECPLSCVRP 246


>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 245

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 49/120 (40%), Positives = 71/120 (59%), Gaps = 12/120 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIA-DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP++FD REKWP+C +L   A DQ  CGSCWAV+ A  ++DRLCIA+NG     +SA  +
Sbjct: 2   LPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQL 61

Query: 246 VAC-----------TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
           ++C           +     C+GG+P  A+     +G+V+GG +   + C PY  APC+H
Sbjct: 62  LSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAPCQH 121



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 9/86 (10%)

Query: 74  CNAMRQIYEHGPLVA-IFSVYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWG-VEN 126
           C A+   Y HGP+ + +  V+ +F +YKSGVY  +      G++ G H + V+GWG  E+
Sbjct: 162 CMALELFY-HGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGTTES 220

Query: 127 DIPYWLVANSWNDHWGDHGTFKILRG 152
              YW V NSW + WGD G  KI  G
Sbjct: 221 GTRYWKVYNSWLN-WGDQGYGKIAVG 245


>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 97

 Score =  101 bits (251), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 41/79 (51%), Positives = 58/79 (73%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N  ++I  +GP  A  S+Y DFL Y+SGVY+H  G  +G+H+V ++GWG+E  + YWLV 
Sbjct: 4   NIKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMGVHSVEIIGWGIEKGVDYWLVM 63

Query: 135 NSWNDHWGDHGTFKILRGE 153
           NSWN+ WGD+GTFKI +G+
Sbjct: 64  NSWNEDWGDNGTFKIAQGD 82


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/100 (51%), Positives = 66/100 (66%), Gaps = 3/100 (3%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V R   + M ++Y++GP+   F+VY DF  YKSGVY+H  GD +G HAV+++
Sbjct: 231 HYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 290

Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG  +D   YWL+AN WN  WGD G FKI RG NE  IE
Sbjct: 291 GWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 330



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 71/137 (51%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE WP+C S+  I DQ +CGSCWA     ++SDR CI  +   T  +S   ++
Sbjct: 102 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNIT--LSVNDLL 159

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR++  +GVVT       E C PY     C H    P    
Sbjct: 160 ACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSHPGCEP---- 208

Query: 304 TLLGKLKTPECKQNCYN 320
                  TP C ++C +
Sbjct: 209 ----AYPTPRCVRHCVD 221


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 56/83 (67%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N  ++I  +GP  A FS Y DF  YKSGVY+H  G  +G H+V ++GWG E  + YWLV 
Sbjct: 251 NIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVM 310

Query: 135 NSWNDHWGDHGTFKILRGENEAD 157
           NSWN+ WGDHGTFKI +G+   D
Sbjct: 311 NSWNEGWGDHGTFKIAQGDCGID 333



 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/120 (40%), Positives = 70/120 (58%), Gaps = 12/120 (10%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR+ + EC   + H+ DQS CGSCWA++   A + RLCI S G F   +SA  +
Sbjct: 59  IPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118

Query: 246 VAC-----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEH 294
           +AC     + N  GC GG  + AW F   +G+VTGGD+      ++ +GC PY+   C H
Sbjct: 119 LACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAH 178


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 51/100 (51%), Positives = 66/100 (66%), Gaps = 3/100 (3%)

Query: 63  HYFKKAHMVPR--CNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V R   + M ++Y++GP+   F+VY DF  YKSGVY+H  GD +G HAV+++
Sbjct: 230 HYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 289

Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG  +D   YWL+AN WN  WGD G FKI RG NE  IE
Sbjct: 290 GWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 329



 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 71/137 (51%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE WP+C S+  I DQ +CGSCWA     ++SDR CI  +   T  +S   ++
Sbjct: 101 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNIT--LSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR++  +GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNCYN 320
                  TP C ++C +
Sbjct: 208 ----AYPTPRCVRHCVD 220


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 68/132 (51%), Gaps = 16/132 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR  WP CP++ HI DQ +CGSCWA+     + DR CI SNG     +S Q I 
Sbjct: 70  IPESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDIT 129

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           +C     GCNGGW + A+ +    GV T       E C PY +  C H        C+  
Sbjct: 130 SCDSRSHGCNGGWTETAFEYAKKAGVPT-------EECVPYLMGKCHH------PGCS-- 174

Query: 307 GKLKTPECKQNC 318
              +TP CK+ C
Sbjct: 175 -SWQTPTCKKEC 185



 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 2/64 (3%)

Query: 63  HYFKKAHMVPR-CNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           +Y  K++ + R   A++ ++  +GP+ A+F+ Y D   Y  GVY H  G   GLHA++++
Sbjct: 198 YYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIV 257

Query: 121 GWGV 124
           GWGV
Sbjct: 258 GWGV 261



 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           E  IPYW++ NSW + +G  G   I RG NE  IE
Sbjct: 321 EEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIE 355


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 67/105 (63%), Gaps = 1/105 (0%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  IG HAV+++GWG  +D   YWL+
Sbjct: 246 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLL 305

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLET 178
           AN WN  WGD G FKI RG NE  IE G    + ++ +   D+ T
Sbjct: 306 ANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDVTT 350



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 11/103 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C S+  I DQ +CGSCWA     ++SDR CI  N      +SA  +V
Sbjct: 103 LPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCIKYN--LNVSLSANDVV 160

Query: 247 A--CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           A        GCNGG+P  AW ++ ++GVVT       E C PY
Sbjct: 161 ACCGLLCGLGCNGGFPMGAWLYFKYHGVVT-------EECDPY 196


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 47/88 (53%), Positives = 61/88 (69%), Gaps = 1/88 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  +D   YWL+
Sbjct: 247 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLL 306

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
           AN WN  WGD G FKI RG NE  IE G
Sbjct: 307 ANQWNRSWGDDGYFKIRRGTNECGIEHG 334



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C S+  I DQ +CGSCWA     ++SDR CI  N      +S   ++
Sbjct: 104 LPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYN--MNISLSVNDLL 161

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++ H+GVVT       E C PY     C H    P    
Sbjct: 162 ACCGFLCGQGCNGGYPIAAWRYFKHHGVVT-------EECDPYFDNTGCSHPGCEP---- 210

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 211 ----AYPTPKCARKC 221


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R++Y +GP VA+F VY D   YKSGVY++  GD +G  AVR++GWG  N  PYW VAN+W
Sbjct: 241 RELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQAVRIVGWGKLNGTPYWKVANTW 300

Query: 138 NDHWGDHGTFKILRGENEADIE-MGFNNRVEAN 169
           +  WG  G   ILRG NE +IE +GF    E +
Sbjct: 301 DTDWGMDGYLLILRGNNECNIEHLGFAGTPETS 333



 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 57/138 (41%), Positives = 75/138 (54%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS C + WAVS A+ ISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYCTV-GGVQQLRISAAHLL 148

Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           + C     GC GG+P  AWR++   G+       +   CQPY    CEH   QG    C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPHCEHRGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
                 TP+C   C + S
Sbjct: 202 KY-NFDTPKCNATCTDKS 218


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 49/100 (49%), Positives = 66/100 (66%), Gaps = 3/100 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+M+     + M ++Y++GP+   F+VY DF  YKSGVY+H  GD +G HAV+++
Sbjct: 229 HFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLI 288

Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG   D   YWL+AN WN  WGD G FKI RG NE +IE
Sbjct: 289 GWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIE 328



 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W  C ++  I DQ +CGSCWA     ++SDR CI         +SA  + 
Sbjct: 100 LPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--LNISLSANDLY 157

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++   GVVT     Y   EGC 
Sbjct: 158 ACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCS 200


>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
          Length = 562

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 109/234 (46%), Gaps = 38/234 (16%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N M++IY  GP+    +   + ++YK G+Y+   G     H++ V+GWG E+   YW+  
Sbjct: 183 NMMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWGEEDGQKYWIAR 242

Query: 135 NSWNDHWGDHGTFKILRGEN----EADI---------EMGFNNRVEAN---------SSE 172
           NSW   WG+ G F+I+RGEN    EAD          EM  N+++ +          S  
Sbjct: 243 NSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRVPEEMILNDQMRSQRNRARYFPRSCA 302

Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
             D + M        P  +   E  P+   +R+I D  N             CGSCWA +
Sbjct: 303 RPDTKEMKEHVVSPRPHTYIKSEDIPKNYDIRNI-DGVNYATWDKNQHIPQYCGSCWAQA 361

Query: 220 VANAISDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
             +A+SDR+ +   G + T ++S Q I+ C+     C GGW    +++  H G+
Sbjct: 362 PTSALSDRINLMRKGKWPTVELSVQEIINCSGKG-SCEGGWQSGVYQYAYHQGI 414



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 47/76 (61%), Gaps = 1/76 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
           +I+  GP+     V  +FL Y+ G+++ N  + +G H+V V GWG  E+   YW+  NSW
Sbjct: 470 EIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETEDGTKYWIGRNSW 529

Query: 138 NDHWGDHGTFKILRGE 153
             +WG+HG F+I+ GE
Sbjct: 530 GTYWGEHGWFRIIIGE 545



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 14/106 (13%)

Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG-YFTGQIS 241
           LP+++D R+     +      +HI     CGSCW+ +  +++SDRL + + G +    +S
Sbjct: 43  LPKSYDPRDIDGRNYVTVTKNQHIPQY--CGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q I+ C  N  GC GG P  A+++   +GV        +EGC  Y
Sbjct: 101 PQVILNCDHNSNGCQGGHPLTAFKYMHDHGV-------PEEGCMRY 139


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 63/95 (66%), Gaps = 2/95 (2%)

Query: 67  KAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
            A+ V R  A  Q  I  +GP+V  F++Y D  +YKSGVY+H  G  +G HA++++GWG 
Sbjct: 176 SAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 235

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +N IPYWL+ANSW   WG++G  K+ RG NE  IE
Sbjct: 236 QNGIPYWLIANSWGADWGENGFLKMRRGVNECGIE 270



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 55/133 (41%), Positives = 72/133 (54%), Gaps = 13/133 (9%)

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCW-GCNGGWPQLAWRFW 267
           + CGSCWA S A  ISDR+CIA+ G     IS   ++AC   +C  GC GG+P  A+R+W
Sbjct: 59  AQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWW 118

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTY 327
              GVVTGGD+    GC+PY  APC  +       C    + KTP C  +C    Y + Y
Sbjct: 119 NSRGVVTGGDFRG-SGCRPYPFAPCNSY------KCP---EEKTPTCSLSC-QFGYSTAY 167

Query: 328 RFDLKKGKKAHMV 340
             D + G  A+ V
Sbjct: 168 AKDKRFGVSAYAV 180



 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 43/66 (65%)

Query: 83  HGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWG 142
           +GP+ A F+VY DF  YK GVYQ+  G  +G+HA++++GWG E+   YWL+ANSW    G
Sbjct: 3   NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQCG 62

Query: 143 DHGTFK 148
               F 
Sbjct: 63  SCWAFS 68


>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 476

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 17/163 (10%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A  LP  FDAR KW  C SL ++ +Q  CG+C+AV+     SDR CIASNG      S +
Sbjct: 190 ADSLPSEFDARRKWSYCSSLHNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEE 249

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL-----APCEHHVQG 298
            ++ C   C  C GG P  A  +W   G+VTGG    ++GC+PY++      PC   V  
Sbjct: 250 DVLGCCAVCGNCYGGDPLKALVYWVDEGLVTGG----RDGCRPYSVDLSCGVPCSPAVY- 304

Query: 299 PLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           PL            +C + C +  ++  Y  D   G  A+ + 
Sbjct: 305 PLAE-------YRRKCYRQCQDIYFQYNYESDKHYGSMAYSMF 340



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 57/114 (50%), Gaps = 15/114 (13%)

Query: 48  KKKKRLYLPTSIPLSHYFKKAHMVP------RCNAMRQIYEHGPLVAIFSVYADFLQYKS 101
           K  +R+ LPT I    Y  +    P      R   M+++Y  GP+   F V  +FL Y S
Sbjct: 349 KGSERVKLPTVI---GYLNETSDEPLTDKEIRQIIMKELYLWGPMTMAFPVTEEFLHYSS 405

Query: 102 GVYQ----HNFGDSIGL-HAVRVLGWG-VENDIPYWLVANSWNDHWGDHGTFKI 149
           GV+      NF D I   H  R++GWG  + D  YWL  NS+  HWGD G F+I
Sbjct: 406 GVFSPFPAANFSDRIVYWHVARLIGWGKYDGDNHYWLAVNSFGRHWGDDGVFRI 459


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 54/83 (65%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M  I E GP+   F+VY DF  Y  G+Y H  G+  G HAV+ +GWGVEN   YW VANS
Sbjct: 197 MAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVENGTKYWKVANS 256

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN +WG+ G F+ILRG NE  IE
Sbjct: 257 WNPYWGEAGYFRILRGSNEGGIE 279



 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/117 (43%), Positives = 59/117 (50%), Gaps = 13/117 (11%)

Query: 188 PRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           P  FD+  +WPEC  L   I DQSNCG CWA + A A SDR CIA+ G     +SAQ  V
Sbjct: 25  PEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQD-V 83

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT-------LAP-CEHH 295
               N  GC+GG     W +    G VTGG YN   G  P+         AP C HH
Sbjct: 84  CFNANVDGCDGGQIITPWTYVAKAGAVTGGQYN---GTGPFGAGLCADWFAPHCHHH 137


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/131 (43%), Positives = 76/131 (58%), Gaps = 5/131 (3%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
           SCWAVS A+A+SDR+CIA+ G     IS Q IV+C   C +GC GGW   AW ++   GV
Sbjct: 1   SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGYGCQGGWSIRAWYYFAEQGV 60

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQN-CTLLGKLKTPECKQNCYNPSYESTYRFDL 331
           VTGG+YN++  C+PY + PC +H   P    C  L    TP CK+ C    Y  +Y  D 
Sbjct: 61  VTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLA--DTPRCKRRC-QLGYPKSYPSDK 117

Query: 332 KKGKKAHMVLM 342
             G+ A+ + M
Sbjct: 118 HYGRTAYQLPM 128



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 7/91 (7%)

Query: 48  KKKKRLYLPTSIPLS-HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           K++ +L  P S P   HY + A+ +P    +  R+I  +GP+VA F+VY DF  YK G+Y
Sbjct: 102 KRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIY 161

Query: 105 QHNFGDSIGLHAVRVLGWGVEN----DIPYW 131
           +H  G   G HAV+V+GWG E      IPYW
Sbjct: 162 KHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 56/86 (65%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           NA   I   GP+ A + VY DFL YKSGVY+H  G  +G HAV+++GWG EN   YWLV 
Sbjct: 435 NAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVV 494

Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
           NSWN+ WGD G FKI  G  E D ++
Sbjct: 495 NSWNEDWGDQGLFKIALGNCEIDDDL 520



 Score = 43.9 bits (102), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 28/58 (48%)

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           V  G+    +GC PY   PC HH+          G  +TP C + C+NP Y ++ + D
Sbjct: 362 VARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKND 419


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/220 (31%), Positives = 105/220 (47%), Gaps = 38/220 (17%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N M++IY  GP+    +V  D ++YK G+Y+   G     HA+ V+GWG E+   YW+  
Sbjct: 183 NMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWGEEDGQKYWIAR 242

Query: 135 NSWNDHWGDHGTFKILRGEN----EADI---------EMGFNNRVEAN---------SSE 172
           NSW   WG+ G F+I+RGEN    EAD          EM  N+++ +          S  
Sbjct: 243 NSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRVPEEMILNDQMRSQRNRARYFPRSCL 302

Query: 173 DDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVS 219
             D   M        P  +   E  P+   +R+I D  N             CGSCWA +
Sbjct: 303 LKDANRMKEHVVSPRPHTYIKSEDIPKNYDIRNI-DGVNYATWDKNQHIPQYCGSCWAQA 361

Query: 220 VANAISDRLCIASNG-YFTGQISAQHIVACTPNCWGCNGG 258
             +A+SDR+ +   G + T ++SAQ ++ C+ N   C+GG
Sbjct: 362 PTSALSDRINLMRKGKWPTVELSAQEVINCS-NAGTCDGG 400



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
           +I+  GP+     V  +FL Y+ G++  + G  +G HAV V GWG  E+   YW+  NSW
Sbjct: 470 EIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSW 529

Query: 138 NDHWGDHGTFKILRGENEADI 158
             +WG+HG F+++ G ++  I
Sbjct: 530 GPYWGEHGWFRMIVGVSKGLI 550



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 14/106 (13%)

Query: 187 LPRNFDARE----KWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG-YFTGQIS 241
           LP+++D R+     +      +HI     CGSCW+ +  +++SDRL + + G +    +S
Sbjct: 43  LPKSYDPRDIDGRNYVTVTKNQHIPQY--CGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q I+ C  N  GC GG P  A+++   +GV        +EGC  Y
Sbjct: 101 PQVILNCDHNSNGCQGGHPLTAFKYMHDHGV-------PEEGCMRY 139


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 57/88 (64%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
             M ++  +GP+ A F V+ DFL YKSG+YQH  G S G H V ++GWG EN +PYWL+ 
Sbjct: 181 TVMDEVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQ 240

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGF 162
           NSW   WG+ G F+I RG N+  I+  F
Sbjct: 241 NSWGSGWGEKGFFRIRRGTNDCHIDEIF 268



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 59/135 (43%), Gaps = 28/135 (20%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FD R+KWP       + +Q++CGSCWA + +  +  R+ I   G + G +S Q +V+
Sbjct: 58  PASFDCRQKWPG--KAEPVRNQASCGSCWAHAASETMGFRMGI--RGCYKGVMSPQDLVS 113

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
           C  N  GC GG+    W +    G+ T       E C PY                 + G
Sbjct: 114 CESNNMGCEGGYADRVWNWIQKKGITT-------EQCLPY-----------------VSG 149

Query: 308 KLKTPECKQNCYNPS 322
             + P C   C N S
Sbjct: 150 SGRVPTCPSKCKNGS 164


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 86/185 (46%), Gaps = 45/185 (24%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDARE+WPEC S+  I D S+C S WA S A ++SDRLCI S G     +SAQ +++C  
Sbjct: 85  FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCT 144

Query: 251 NCWGCN----------------------------------------GGWPQLAWRFWGHN 270
             + C                                         GG    AW++W  +
Sbjct: 145 GVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKH 204

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           G+ TGG Y SQ GC+PY+++PC+  +        L   ++TP C++ C     +S Y  +
Sbjct: 205 GLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKC-----KSGYPVE 259

Query: 331 LKKGK 335
           L K +
Sbjct: 260 LDKDR 264



 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 2/99 (2%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY      +P  +      +  +GP+ A   VY DFLQY +G+Y H  G+  G  +VR+L
Sbjct: 265 HYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 324

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG+   +PYWL+ANSW   WG++GTF++LRG NE  +E
Sbjct: 325 GWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECGLE 363


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y +GP+   F V+ DF  YK+GVY+H +G  IG HAV+++GWG  +D + YW +
Sbjct: 237 DLMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTI 296

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NSWN +WG+HG F+I RG NE  IE
Sbjct: 297 VNSWNTNWGEHGLFRIARGGNECGIE 322



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 59/102 (57%), Gaps = 6/102 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR+ W  C ++  I DQ +CGSCWA   A +++DR CI  N   +  +S   ++
Sbjct: 95  LPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTDRFCIHMNESVS--LSENDLL 152

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGC 284
           AC    C  GC+GG+P  AWR++   GVVT     Y  Q GC
Sbjct: 153 ACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC 194


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/110 (45%), Positives = 66/110 (60%), Gaps = 8/110 (7%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y +GP    F+VY DF  YKSGVY+H  G  +G HAV+++GWG   D   YWL+
Sbjct: 239 SIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLL 298

Query: 134 ANSWNDHWGDHGTFKILRGENEA---DIEMGF----NNRVEANSSEDDDL 176
           AN WN  WGD G FKI+RG NE    D+  G     N  +E+   +DD L
Sbjct: 299 ANQWNRSWGDDGYFKIIRGTNECGIEDVTAGMPSTKNLDIESGVRDDDSL 348



 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+  I DQ +CGSCWA     +++DR CI      T  +S   ++
Sbjct: 96  LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNVT--LSVNDLL 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++   GVVT     Y  Q GC 
Sbjct: 154 ACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCS 196


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY  KA+ V   P+ + M ++Y++GP+   F+V+ DF  YKSGVY+H  G ++G HAV++
Sbjct: 232 HYSVKAYRVKSDPQ-DIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 290

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  ++   YWL+AN WN +WGD G FKI RG NE  IE
Sbjct: 291 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 331



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 65/135 (48%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C ++  I DQ +CGSCWA     ++ DR C  S+      +S   ++
Sbjct: 103 LPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFC--SHFDMNISLSVNDLL 160

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 161 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 209

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 210 ----AYQTPKCVRKC 220


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 59/83 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+I ++GP+ A F+VY DF  YKSGVY+H  G  +G HAV+++GWG++ +  YWLV NSW
Sbjct: 286 REIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQNEQYWLVMNSW 345

Query: 138 NDHWGDHGTFKILRGENEADIEM 160
           N +WGD G FKI  GE   D E+
Sbjct: 346 NVNWGDQGIFKIAIGECGIDSEV 368


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI-PYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G+ +G HAV+++GWG   D   YWL+
Sbjct: 241 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLL 300

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI+RG+NE  IE
Sbjct: 301 ANQWNRGWGDDGYFKIIRGKNECGIE 326



 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 64/135 (47%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR KW  C ++  I DQ +CGSCWA      + DR CI  N      +S   +V
Sbjct: 98  LPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLN--MNISLSVNDLV 155

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW++   NGVVT       + C PY     C+H    P    
Sbjct: 156 ACCGFMCGDGCDGGYPISAWQYLVENGVVT-------DECDPYFDQVGCKHPGCEP---- 204

Query: 304 TLLGKLKTPECKQNC 318
                  TP C++ C
Sbjct: 205 ----AYPTPACEKKC 215


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY  KA+ V   P+ + M ++Y++GP+   F+V+ DF  YKSGVY+H  G ++G HAV++
Sbjct: 232 HYSVKAYRVKSDPQ-DIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 290

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  ++   YWL+AN WN +WGD G FKI RG NE  IE
Sbjct: 291 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 331



 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C ++  I DQ +CGSCWA     ++ DR CI  +   +  +S   ++
Sbjct: 103 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNIS--LSVNDLL 160

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 161 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 209

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 210 ----AYQTPKCVRKC 220


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 69/101 (68%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY  KA+ V   P+ + M ++Y++GP+   F+V+ DF  YKSGVY+H  G ++G HAV++
Sbjct: 230 HYSVKAYRVKSDPQ-DIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKL 288

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  ++   YWL+AN WN +WGD G FKI RG NE  IE
Sbjct: 289 IGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 329



 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 66/135 (48%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C ++  I DQ +CGSCWA     ++ DR CI  +   +  +S   ++
Sbjct: 101 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNIS--LSVNDLL 158

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFLCGAGCDGGTPIYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 208 ----AYQTPKCVRKC 218


>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 163

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/94 (51%), Positives = 58/94 (61%)

Query: 66  KKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125
           KK      C+A + + +HGP V    VY DFL YKSGVY H  GD +GL +VR++GWG+E
Sbjct: 60  KKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLE 119

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               +WL ANSW   WGD G FKI R  NE  IE
Sbjct: 120 GGQAFWLFANSWGTSWGDKGFFKIRRFVNERWIE 153


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/132 (42%), Positives = 72/132 (54%), Gaps = 12/132 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FDA E WP CP++  I DQS CGSCWAV+  +A+SDR C    G    +ISA  +++C  
Sbjct: 1   FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYC-TRGGVRDLRISAGDLLSCCN 59

Query: 251 NC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP-LQNCTLLGK 308
            C  GCNGG P  AW ++   G+V+       E CQPY   PC HHV       C++  +
Sbjct: 60  ACGLGCNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPCSV--E 110

Query: 309 LKTPECKQNCYN 320
             TP C   C N
Sbjct: 111 YDTPFCNITCTN 122



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/61 (50%), Positives = 44/61 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           R+++ +GP    F+VY DF+ Y  GVY+H  G+++G HAVR++GWG  N  PYW +ANSW
Sbjct: 145 RELFLYGPFEVAFTVYEDFVAYSDGVYKHFSGNALGGHAVRLVGWGNLNGTPYWKIANSW 204

Query: 138 N 138
           N
Sbjct: 205 N 205


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 66/100 (66%), Gaps = 3/100 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+   A+M+     + M ++Y++GP+   F+VY DF  YKSGVY+H  GD +G HAV+++
Sbjct: 229 HFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLI 288

Query: 121 GWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG   D   YWL+AN WN  WGD G FKI RG +E +IE
Sbjct: 289 GWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIE 328



 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP C ++  I DQ +CGSCWA     ++SDR CI         +SA  ++
Sbjct: 100 LPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--LNISLSANDLL 157

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++   GVVT     Y   EGC 
Sbjct: 158 ACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCS 200


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 69/101 (68%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY  KA+ V   P+ + M ++Y++GP+   F+VY DF  YKSGVY+H  G ++G HAV++
Sbjct: 224 HYSVKAYTVNSDPQ-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKL 282

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  ++   YWL+AN WN +WGD G FKI RG NE  IE
Sbjct: 283 VGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 323



 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 66/137 (48%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   I+
Sbjct: 95  LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--MNVSLSVNDIL 152

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC GG P  AW +  H+GVVT       E C PY     C H    P    
Sbjct: 153 ACCGLLCGAGCAGGTPFSAWIYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 201

Query: 304 TLLGKLKTPECKQNCYN 320
                 +TP+C + C N
Sbjct: 202 ----TYRTPKCVKKCVN 214


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           N M +++ +GP+   FSVY DF  Y++GVY+H  G  +G HAV+++GWG  +D I YWL+
Sbjct: 237 NIMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLI 296

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           ANSWN  WG+ G FKI RG NE  IE
Sbjct: 297 ANSWNTAWGEGGYFKIARGVNECGIE 322



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 66/135 (48%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR+ WP C S R I DQ +CGSCWA +   A+SDR CI         +S   +V
Sbjct: 95  LPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSDRFCIHFQ--VNATLSENDLV 152

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
           AC    C  GCNGG+P  AWR++   GVVT       + C PY     C H    P    
Sbjct: 153 ACCGFRCGSGCNGGFPLSAWRYFSRRGVVT-------DECDPYFDNDGCNHPGCEP---- 201

Query: 304 TLLGKLKTPECKQNC 318
                  TP C +NC
Sbjct: 202 ----SYPTPRCVKNC 212


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 69/101 (68%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY  KA+ V   P+ + M ++Y++GP+   F+VY DF  YKSGVY+H  G ++G HAV++
Sbjct: 229 HYSVKAYTVNSDPQ-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKL 287

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  ++   YWL+AN WN +WGD G FKI RG NE  IE
Sbjct: 288 VGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 328



 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 66/137 (48%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   I+
Sbjct: 100 LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--MNVSLSVNDIL 157

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC GG P  AW +  H+GVVT       E C PY     C H    P    
Sbjct: 158 ACCGLLCGAGCAGGTPFSAWIYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 206

Query: 304 TLLGKLKTPECKQNCYN 320
                 +TP+C + C N
Sbjct: 207 ----TYRTPKCVKKCVN 219


>gi|60598652|gb|AAX25875.1| unknown [Schistosoma japonicum]
          Length = 195

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 57/148 (38%), Positives = 79/148 (53%), Gaps = 11/148 (7%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM    R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 28  RILMGARKEDAEMKRKRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 77

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-AWRF 266
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C G   G     AW +
Sbjct: 78  DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDY 137

Query: 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
           W   G+VTGG   +  GCQPY    CEH
Sbjct: 138 WVKRGIVTGGSKENHTGCQPYPFPKCEH 165


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 66/101 (65%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY    ++V   P+ + M +IY++GP+   F+VY DF  YKSGVY+H  G +IG HAV++
Sbjct: 233 HYSINTYVVESNPQ-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKL 291

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  +D   YWL+AN WN  WGD G F I RG NE  IE
Sbjct: 292 IGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIE 332



 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 71/135 (52%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR  WP+C S+  I DQ +CGSCWA     ++SDR CI      T  +S   ++
Sbjct: 104 LPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNIT--LSVNDLL 161

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC    C  GC+GG+P  AW+++ ++GVVT       E C PY     C H    P  N 
Sbjct: 162 ACCGFRCGDGCDGGYPISAWQYFSYSGVVT-------EECDPYFDQTGCSHPGCEPAYN- 213

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 214 -------TPQCLRKC 221


>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 109

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 50/109 (45%), Positives = 65/109 (59%), Gaps = 8/109 (7%)

Query: 52  RLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           R +L  S+P  +    A      NA+R     GP+ A F VY DFL Y+SGVY+H  G  
Sbjct: 2   RHFLVESVPYEYSVNDAK-----NAIRT---DGPVSASFIVYEDFLAYRSGVYKHTSGKE 53

Query: 112 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEM 160
           +G HAV+++GWG E    YWLV NSWN+ WGD+G FKI  G  E D ++
Sbjct: 54  LGGHAVKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCEIDDDL 102


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/98 (48%), Positives = 61/98 (62%), Gaps = 1/98 (1%)

Query: 63  HYFKKAHMVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           H   K + +    A M++I  +GP+ A FSVY DFL YKSGVYQH  G  +G H V++ G
Sbjct: 207 HKMAKIYSINSVEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFG 266

Query: 122 WGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +G  N + YW VANSW   WGD+G F I RG +E  IE
Sbjct: 267 YGTLNGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIE 304



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 61/131 (46%), Gaps = 15/131 (11%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FD+R  W  C ++ +I +Q+ CGSCWA     +  DR+CI        Q+S   +V 
Sbjct: 79  PASFDSRTAWSNCTTIGYIENQARCGSCWAFGAVESAQDRICIHKG--LDVQLSFLDLVT 136

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
           C  +  GC GG    AW F    GVVT       + C+PYT+  C      P      L 
Sbjct: 137 CDQSDDGCEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTC------PPAQQPCLN 183

Query: 308 KLKTPECKQNC 318
            + TP C + C
Sbjct: 184 FVNTPNCVKQC 194


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 4/110 (3%)

Query: 58  SIPLSHYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           SIPL  Y   A  +      +  R++Y +GP VA+F VY D   YKSGVY++  GD +G 
Sbjct: 218 SIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG 277

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGFN 163
            AVR++GWG  N  PYW VANSW+  WG +G   IL G NE +IE +GF 
Sbjct: 278 QAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFT 327



 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 77/138 (55%), Gaps = 11/138 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148

Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           + C     GC GG+P  AW ++   G+       +  GCQPY    CEH   QG    C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPS 322
              K  TP+C   C + S
Sbjct: 202 KY-KFDTPKCNATCTDKS 218


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 50/128 (39%), Positives = 73/128 (57%), Gaps = 2/128 (1%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
           SCWAV+   A+SDR+CI S G     +SA  +++C   C +GC GG P  AW++W  +G+
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGFGCFGGEPMAAWKYWVLSGI 222

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
           VTG DY +  GC+PY   PCEHH               TP+C + C + +Y+  Y+ D  
Sbjct: 223 VTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQC-DKNYKKPYKADKY 281

Query: 333 KGKKAHMV 340
            G++A+ V
Sbjct: 282 YGEQAYNV 289



 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 56/88 (63%), Gaps = 3/88 (3%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I   GP+ A F VY DFL Y  G+Y+H  G   G HAV++LGWG++  + YWL ANSW
Sbjct: 298 KEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAANSW 357

Query: 138 NDHWGD---HGTFKILRGENEADIEMGF 162
           N  WG+    G F+ILRG +E  IE G 
Sbjct: 358 NTDWGEDVFSGYFRILRGVDECGIESGI 385


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 65/157 (41%), Positives = 86/157 (54%), Gaps = 15/157 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148

Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           + C     GC GG+P  AWR++   G+       +   CQPY    CEHH  QG    C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPQCEHHGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
              K  TP+C   C     + T      +GK A+M+L
Sbjct: 202 NY-KFVTPQCNTTC----TDKTIPLIKYRGKDAYMLL 233



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/109 (48%), Positives = 69/109 (63%), Gaps = 4/109 (3%)

Query: 58  SIPLSHYF-KKAHMV-PRCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           +IPL  Y  K A+M+ P      R++Y +GP VAI  VY D   YKSGVY++  G  +G+
Sbjct: 218 TIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGF 162
            AV+V+GWG  N  PYW VAN+W+  WG  G   ILRG NE +IE +GF
Sbjct: 278 TAVKVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGF 326


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y +GP+   F VY DF  YK+GVY+H FG  +G HAV+++GWG  +D + YW +
Sbjct: 245 DLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTI 304

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NSWN +WG+ G F+I+RG +E  IE
Sbjct: 305 VNSWNTNWGEDGLFRIVRGNDECGIE 330



 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 73/141 (51%), Gaps = 22/141 (15%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR++W  CP++  I  Q +CGSCWA     +++DR CI  N   +  +S   ++
Sbjct: 103 LPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHLNESVS--LSENDLL 160

Query: 247 ACTP-NC-WGCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           AC    C +GC GG+P  AW+++ H+GVVT     Y  Q+GC      P           
Sbjct: 161 ACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYP----------- 209

Query: 303 CTLLGKLKTPECKQNCYNPSY 323
                  +TP+C++ C +  +
Sbjct: 210 -----TYETPKCEKQCVDDEF 225


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 56/87 (64%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
            M ++  +GP+ A F V+ DF  Y+SGVYQH  G S G H V ++GWG EN +PYWL+ N
Sbjct: 180 VMDEVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQN 239

Query: 136 SWNDHWGDHGTFKILRGENEADIEMGF 162
           SW   WG+ G F+I RG N+  I+  F
Sbjct: 240 SWGSGWGEKGFFRIRRGTNDCHIDEIF 266



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 57/135 (42%), Gaps = 28/135 (20%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FD R+KWP       + +Q +CGSCWA + +  +  R+ I       G +S Q +V+
Sbjct: 56  PASFDCRQKWPG--KAEPVRNQGSCGSCWAHAASETMGFRMGIRRCS--KGVMSPQDLVS 111

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
           C  N  GCNGG+    W +    G+ T       E C PY                 + G
Sbjct: 112 CESNNMGCNGGYADRVWNWIQKKGITT-------EQCIPY-----------------VSG 147

Query: 308 KLKTPECKQNCYNPS 322
             + P C   C N S
Sbjct: 148 SGRVPTCPSKCKNGS 162


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           N M ++Y++GP+   FSVY DF  YKSGVY+H  G ++G HAV++ GWG  ++   YWL+
Sbjct: 76  NIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKLNGWGTSDEGEDYWLL 135

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN +WGD G FKI RG NE  IE
Sbjct: 136 ANQWNTNWGDDGYFKIKRGTNECGIE 161


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 27/195 (13%)

Query: 157 DIEMGFNNRVEANSSEDDDLETMGCQ--NAKGLPRNFDAREKWPECPS-LRHIADQSNCG 213
           D+  G  N  + +S+ DD+   +G      K LP NFDAR+K+  C   + H+ DQS C 
Sbjct: 5   DVPTGCPNGPKPSSTSDDETRLLGPTKPELKDLPSNFDARQKFASCAGVIGHVRDQSACH 64

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC------TPNCWGCNGGWPQLAWRFW 267
           +CW VS    ++DR+CI S G F   +S  +  +C       P   GC GG       F 
Sbjct: 65  NCWTVSSTGMLNDRVCIKSGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFL 124

Query: 268 GHNGVVTG------GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
            ++G+VTG      G  +S +GC PY    C+H                +P C+  C N 
Sbjct: 125 KNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKH------------AGYSSPACQTKCTNK 172

Query: 322 SYESTYRFDLKKGKK 336
           +Y+++ + DL + K 
Sbjct: 173 AYKTSLQQDLHRAKS 187



 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 1/90 (1%)

Query: 65  FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           F +   +P+ N  ++I+ +GP++ + S+Y D   YK+GVY H  G   G+H ++++GWGV
Sbjct: 188 FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 246

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           E+   YWL  NSWN+ WGDHG  K+  G  
Sbjct: 247 ESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 276


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 51/128 (39%), Positives = 73/128 (57%), Gaps = 1/128 (0%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
           SCWA     AISDR+CIAS G     +SA  +++C  +C +GCNGG P  AW+FW   G+
Sbjct: 1   SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGFGCNGGDPLSAWKFWVKEGI 60

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
           VTG ++++  GC+PY    CEHH      +        TP+C+++C     E TY+ D  
Sbjct: 61  VTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKY 120

Query: 333 KGKKAHMV 340
            G+ A+ V
Sbjct: 121 FGRSAYGV 128



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 36/54 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           ++I  +GP+   F VY DFL Y  G+Y H  G   G HAV+++GWG++N +PYW
Sbjct: 137 KEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKMIGWGIDNGVPYW 190


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 76/133 (57%), Gaps = 3/133 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV- 246
           P+ FD+R  W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
            C     GC GG+P  AW+++   GV TGGDY ++EGC PY + PC ++ QG    C   
Sbjct: 146 CCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203

Query: 307 GKLKTPECKQNCY 319
              +  +C + CY
Sbjct: 204 PMERNHQCPKTCY 216



 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 70/122 (57%), Gaps = 2/122 (1%)

Query: 51  KRLYLPTSIPLSHYFKKAHMVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNF- 108
           K  Y  T++   +  K  +++     + Q +  +GP+ A F VY DF  YKSG+Y+    
Sbjct: 213 KTCYGKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPK 272

Query: 109 GDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
               G H+++++GWG +N  PYWL  NSW+  WG+HGTFKI++G NE  IE      + +
Sbjct: 273 AKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPS 332

Query: 169 NS 170
           +S
Sbjct: 333 SS 334


>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 131

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 43/78 (55%), Positives = 54/78 (69%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N  ++I  +GP  A FS Y DF  YKSGVY+H  G  +G H+V ++GWG E  + YWLV 
Sbjct: 30  NIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVM 89

Query: 135 NSWNDHWGDHGTFKILRG 152
           NSWN+ WGDHGTFKI +G
Sbjct: 90  NSWNEGWGDHGTFKIAQG 107


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 61/84 (72%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M +++ +GP+   F V+ DF  YK+GVY+H +G  IG HAV+++GWG  +D + YW + N
Sbjct: 245 MAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMVN 304

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SWN +WG+ GTF+ILRG++E  IE
Sbjct: 305 SWNTNWGEDGTFRILRGKDECGIE 328



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR+ W  C ++  I DQ +CGSCWA     +++DR CI  N   +  +S   ++
Sbjct: 101 LPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCIHLNESVS--LSENDLL 158

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGC 284
           AC    C  GC GG+P  AW+++   GVVT     Y  Q+GC
Sbjct: 159 ACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGC 200


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 58/84 (69%), Gaps = 2/84 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135
           R I +HGP++A + V+ DF +Y SGVY    +  DSIG HAV ++GWGVE++ PYWLV N
Sbjct: 252 RDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWGVEDNTPYWLVQN 311

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   +G  G FKI RG NE +IE
Sbjct: 312 SWGTGFGIDGYFKIARGTNECNIE 335



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 64/140 (45%), Gaps = 28/140 (20%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHI 245
           +P ++++ E + +C     I  Q +CGSCWA +    ++ R+CI S     G +++ Q +
Sbjct: 94  IPDSYNSHEAYSKCKP--DILQQGSCGSCWAFATTGVLAQRMCIKSEQIGQGYELAPQAL 151

Query: 246 VACT----------------PNCW---GCNGGWPQLAWRFWGHNGVV--TGGDYNSQEGC 284
           V+CT                  C+   GC+GG+P  A+RF    G+       Y S++G 
Sbjct: 152 VSCTDQICYTKAGDRCSSPSSTCYCSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGT 211

Query: 285 QPYTLAPCEHHVQGPLQNCT 304
            P   +     VQ  +  CT
Sbjct: 212 DPLECS----DVQTMVSECT 227


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY   A+ V   P  + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV++
Sbjct: 230 HYSVSAYRVNSDPH-DIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKL 288

Query: 120 LGWGVEND-IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  +D   YWL+AN WN  WGD G FKI RG NE  IE
Sbjct: 289 IGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIE 329



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +   +  +S   ++
Sbjct: 101 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNIS--LSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 208 ----AYRTPKCVKKC 218


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 76/133 (57%), Gaps = 3/133 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV- 246
           P+ FD+R  W  C  + HI DQ NCGSCW+ S   A +DRLC+++ G F   +S + +  
Sbjct: 86  PQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAF 145

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
            C     GC GG+P  AW+++   GV TGGDY ++EGC PY + PC ++ QG    C   
Sbjct: 146 CCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPC-YNKQGK-NTCGGQ 203

Query: 307 GKLKTPECKQNCY 319
              +  +C + CY
Sbjct: 204 PMERNHQCPKTCY 216



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 1/94 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANS 136
           R I  +GP+ A F VY D   YKSG+Y+        G H+++++GWG +N  PYWL  NS
Sbjct: 241 RDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNS 300

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
           W+  WG+HGTFKI++G NE  IE      + ++S
Sbjct: 301 WSKFWGEHGTFKIIKGRNECGIERAVTAGIPSSS 334


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 49/100 (49%), Positives = 64/100 (64%), Gaps = 3/100 (3%)

Query: 63  HYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V     + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++
Sbjct: 229 HYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLI 288

Query: 121 GWG-VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG  E+   YWL+AN WN  WGD G FKI RG NE  IE
Sbjct: 289 GWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIE 328



 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +   +  +S   ++
Sbjct: 100 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNIS--LSVNDLL 157

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW++  H+GVVT       E C PY     C H    P    
Sbjct: 158 ACCGFLCGSGCDGGYPLYAWQYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 206

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 207 ----AYRTPKCVKKC 217


>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
           yakuba]
          Length = 174

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 41/88 (46%), Positives = 56/88 (63%), Gaps = 1/88 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD+R++WP CP++  I DQ +CGSCWA     A+SDR+CI S G      SA  +V
Sbjct: 87  IPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
           +C   C +GCNGG+P  AW +W   G+V
Sbjct: 147 SCCHTCGFGCNGGFPGAAWSYWTRKGIV 174


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  N+   YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLM 305

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331



 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 11/103 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+  I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 103 LPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AC    C  GC+GG+P  AW+++ ++GVVT       E C PY
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPY 196


>gi|294899385|ref|XP_002776615.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239883670|gb|EER08431.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 233

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 62/101 (61%), Gaps = 8/101 (7%)

Query: 187 LPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAR  +P C   + HI DQS CGSCWA  V  A +DRLCI SNG FT  +SA  +
Sbjct: 117 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEM 176

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQP 286
            AC P+ +GC+GG+P  AW +    G+ TG      EG +P
Sbjct: 177 NACAPS-YGCDGGYPDSAWSWVHDEGIATG------EGSRP 210


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 3/128 (2%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGV 272
           SCWAVS A A+SDR+CIAS G     +S Q ++AC   C +GC GGWP  AW+++   GV
Sbjct: 1   SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGYGCEGGWPMKAWQYFXLEGV 60

Query: 273 VTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLK 332
           VTGG+Y  Q  C+PY   PC  H + P          KTP+C++ C    Y   Y+ D  
Sbjct: 61  VTGGNYRKQGCCRPYEFPPCGRHGKEPYYG-ECYDSAKTPKCQKTC-QRGYLKPYKEDKH 118

Query: 333 KGKKAHMV 340
            GK A+ +
Sbjct: 119 FGKSAYRL 126



 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 58/98 (59%), Gaps = 2/98 (2%)

Query: 42  KKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRC--NAMRQIYEHGPLVAIFSVYADFLQY 99
           K  K +K  +R YL       H+ K A+ +P       R I ++GP+VA F VY DF  Y
Sbjct: 97  KTPKCQKTCQRGYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHY 156

Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           KSG+Y+H  G   G HAV+++GWG E   PYWL+ANSW
Sbjct: 157 KSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIANSW 194


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 62/99 (62%), Gaps = 2/99 (2%)

Query: 63  HYFKKAH-MVPRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           H+    + M P  NA++Q I  +GP+ A F VY DFL YKSGVYQH  G  +G H V+++
Sbjct: 198 HFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGGHCVKMI 257

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG +N+  YW+  NSW  +WG+ G F I  G NE  IE
Sbjct: 258 GWGTQNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIE 296



 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 70/147 (47%), Gaps = 17/147 (11%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           + +P NF++ ++W  C  +  I +Q+ CGSCWA     ++SDR CI         +S Q 
Sbjct: 68  QAVPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVL--LSFQD 125

Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCT 304
           +V C  +  GC GG    A +F    G+V+         C PYT+  C      P Q   
Sbjct: 126 LVTCDQSDNGCQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTC-----APAQQ-P 172

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDL 331
            L  + TP+C + C N SY  TY  DL
Sbjct: 173 CLNFVDTPQCVEKCSNASY--TYAQDL 197


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G+ +G HAV+++GWG  +N   YWL+
Sbjct: 261 DVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLL 320

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 321 ANQWNRGWGDDGYFKIRRGTNECGIE 346



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 66/154 (42%), Gaps = 37/154 (24%)

Query: 187 LPRNFDAREKWPECPSLRHIADQ-----------------SNCGSCWAVSVANAISDRLC 229
           LP+ FDAR  WP C ++  I  Q                  +CGSCWA     ++SDR C
Sbjct: 101 LPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFC 160

Query: 230 IASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           I         +S   ++AC     G  C+GG+P  AWR++ H+GVVT       E C PY
Sbjct: 161 IHFG--MNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVT-------EECDPY 211

Query: 288 -TLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYN 320
                C H    P           TP+C + C +
Sbjct: 212 FDNIGCSHPGCEP--------GFPTPKCVRKCID 237


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M +IY++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG   D   YWL+
Sbjct: 244 SIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLL 303

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 304 ANQWNRGWGDDGYFKIRRGTNECGIE 329



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 67/137 (48%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR  WP+C ++  I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 101 LPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYG--MNISLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++ H+GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFVHHGVVT-------EECDPYFDDIGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNCYN 320
                  TP+C + C N
Sbjct: 208 ----GYPTPKCARKCVN 220


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 77/136 (56%), Gaps = 19/136 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE WP+C S+++I DQ +CGSCWA     A++DR CI +N   +  +S   +V
Sbjct: 99  LPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVS--LSENDLV 156

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCT 304
           AC  +C +GC+GG+P  AW ++   GVVT     SQ  C PY     C+H    P     
Sbjct: 157 ACCSSCGFGCDGGYPYAAWEYFAQTGVVT-----SQ--CDPYFDGKGCKHPGCEP----- 204

Query: 305 LLGKLKTPECKQNCYN 320
              +  TP C + C +
Sbjct: 205 ---EYDTPVCVKQCVD 217



 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 59/82 (71%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANSW 137
           +IY++GP+   ++VY DF  YKSGVY+H FG+ +G HAV+ +GWG  +D   YW+VANSW
Sbjct: 244 EIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVANSW 303

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG+ G F+I RG NE  IE
Sbjct: 304 NRSWGEDGFFQISRGSNECGIE 325


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)

Query: 61  LSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
            + +F    +    N  ++I  +GP  A FS+Y DFL Y+SGVY+H  G  +G H V ++
Sbjct: 237 FTAHFSPYQLKGTDNIKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEII 296

Query: 121 GWGVENDIPYWLVANSWNDHWGDHGTFKILRGE---NEADIE 159
           GWG +  + YWLV NSWN+ WG HGTFKI +G+   N+  IE
Sbjct: 297 GWGTKQGVDYWLVMNSWNEGWGVHGTFKIAQGDCGINDMAIE 338



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 30/72 (41%)

Query: 252 CWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKT 311
           C GC  G P  AW F    G+ T G  ++ +GC PY    C HH Q             T
Sbjct: 156 CDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDT 215

Query: 312 PECKQNCYNPSY 323
           P C   C N +Y
Sbjct: 216 PPCLDRCPNKNY 227


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 53/109 (48%), Positives = 69/109 (63%), Gaps = 4/109 (3%)

Query: 58  SIPLSHYF-KKAHMV-PRCNAM-RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           +IPL  Y  K A+M+ P      R++Y +GP VAI  VY D   YKSGVY++  G  +G+
Sbjct: 218 TIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE-MGF 162
            AV+V+GWG  N  PYW VAN+W+  WG  G   ILRG NE +IE +GF
Sbjct: 278 TAVKVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGF 326



 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 85/157 (54%), Gaps = 15/157 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+ EKWP CP++R IADQS C + WAVS A+AISDR C    G    +ISA H++
Sbjct: 90  LPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK-QLRISAAHLL 148

Query: 247 A-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHH-VQGPLQNCT 304
           + C     GC GG+P  AWR++   G+       +   CQPY    CEH   QG    C+
Sbjct: 149 SCCKQCGGGCKGGFPGFAWRYYVEYGI-------ASSYCQPYPFPQCEHQGAQGNKTPCS 201

Query: 305 LLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
              K  TP+C   C     + T      +GK A+M+L
Sbjct: 202 NY-KFVTPQCNTTC----TDKTIPLIKYRGKDAYMLL 233


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY++  G  IG HAV+++GWG  +D   YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 303

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 304 ANQWNRSWGDDGYFKIRRGTNECGIE 329



 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 68/135 (50%), Gaps = 22/135 (16%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W  C S+R I    +CGSCWA     ++SDR CI  N      +SA  ++
Sbjct: 103 LPKEFDARTAWSHCTSIRRIL--GHCGSCWAFGAVESLSDRFCIKYN--LNVSLSANDVI 158

Query: 247 ACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC      +GCNGG+P  AW ++ ++GVVT      QE C PY     C H    P    
Sbjct: 159 ACCGLLCGFGCNGGFPMGAWLYFKYHGVVT------QE-CDPYFDNTGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 208 ----TYPTPKCERKC 218


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 236 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 295

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
           AN WN  WGD G FKI+RG+NE  IE G
Sbjct: 296 ANQWNRGWGDDGYFKIIRGKNECGIEEG 323



 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W  C ++ +I +Q +CGSCWA      + DR CI  N   +  +S   ++
Sbjct: 93  LPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHLN--MSILLSVNDLL 150

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR++  NGVVT       + C PY     C+H    P    
Sbjct: 151 ACCGFMCGDGCDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKHPGCEP---- 199

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 200 ----AYPTPKCEKKC 210


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 236 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 295

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
           AN WN  WGD G FKI+RG+NE  IE G
Sbjct: 296 ANQWNRGWGDDGYFKIIRGKNECGIEEG 323



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W  C ++ +I DQ +CGSCWA      + DR CI  N   +  +S   ++
Sbjct: 93  LPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLN--MSILLSVNDLL 150

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR++  NGVVT       + C PY     C+H    P    
Sbjct: 151 ACCGFMCGDGCDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKHPGCEP---- 199

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 200 ----AYPTPKCEKKC 210


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M ++  +GP+   F+VY DF  YKSGVY+H  GD++G HAV+++GWG   D   YWL+AN
Sbjct: 228 MAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLAN 287

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
            WN  WGD G FKI RG NE  IE
Sbjct: 288 QWNRGWGDDGYFKIKRGTNECGIE 311



 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 24/139 (17%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR  WP C ++  I DQ +CGSCWA     ++SDR CI      +  +S   ++
Sbjct: 83  LPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGMNLS--LSVNDLL 140

Query: 247 ACTPNCW----GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQ 301
           AC    W    GC+GG P  AWR++  +GVVT       E C PY     C H    P  
Sbjct: 141 ACCG--WMCGAGCDGGSPIDAWRYFVQSGVVT-------EECDPYFDDIGCSHPGCEP-- 189

Query: 302 NCTLLGKLKTPECKQNCYN 320
                    TP+C++ C +
Sbjct: 190 ------GFPTPKCERKCAD 202


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 237 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLL 296

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI+RG+NE  IE
Sbjct: 297 ANQWNRGWGDDGYFKIIRGKNECGIE 322



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 68/135 (50%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR KW  C ++  I DQ +CGSCWA      + DR CI  N      +SA  +V
Sbjct: 94  LPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHQN--INISLSANDLV 151

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW+++  +GVVT       E C PY     C+H    P  + 
Sbjct: 152 ACCGFMCGDGCDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKHPGCEPAYD- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 204 -------TPKCEKKC 211


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 61/88 (69%), Gaps = 1/88 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 97  DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLL 156

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMG 161
           AN WN  WGD G FKI+RG+NE  IE G
Sbjct: 157 ANQWNRGWGDDGYFKIIRGKNECGIEEG 184


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 72/127 (56%), Gaps = 17/127 (13%)

Query: 67  KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
           + H  P  + M ++Y +GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +
Sbjct: 233 RVHSNPH-DIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 291

Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAK 185
               YWL+AN WN  WGD G FKI+RG+NE  IE            ED      G  + K
Sbjct: 292 AGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE------------ED---VVAGMPSTK 336

Query: 186 GLPRNFD 192
            + RN+D
Sbjct: 337 NMARNYD 343



 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 65/135 (48%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDAR +W  C ++  I DQ +CGSCWA      + DR CI  N      +S   ++
Sbjct: 97  LPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLN--MNISLSVNDLL 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++   GVVT       + C PY     C+H    P    
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRKGVVT-------DECDPYFDQVGCKHPGCEP---- 203

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C++ C
Sbjct: 204 ----AYRTPKCEKKC 214


>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 359

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/113 (45%), Positives = 67/113 (59%), Gaps = 2/113 (1%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP NFDAR+KWP+C ++   + DQ  CGSCWAV+ A  ++DRLCIAS G    ++S Q+ 
Sbjct: 105 LPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRELSPQYP 164

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYN-SQEGCQPYTLAPCEHHVQ 297
           ++C     GC GG   +A       G+V GG  N S+  C PY   PCEH  Q
Sbjct: 165 LSCYDGGSGCQGGDVAVAMHEATTKGMVFGGMLNRSKTACLPYEFEPCEHPCQ 217



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 54/92 (58%), Gaps = 10/92 (10%)

Query: 78  RQIYEHGPLVAIF-SVYADFLQYKSGVYQHNFGD----SIGLHAVRVLGWGVENDI--PY 130
           ++I  +GP+   F +V++DF  Y +GVY     D     +G+HA +++GWG +     PY
Sbjct: 266 QEIMTYGPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEGLGMHATKLIGWGFDEATGHPY 325

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           WL+ NSW D+WG HG  ++  G  E ++E G 
Sbjct: 326 WLMMNSW-DNWGIHGLGRV--GVGEMNMEQGI 354


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 290 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 349

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
           AN WN  WGD G FKI+RG NE  IE            ED      G  + K + RN+D+
Sbjct: 350 ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 394



 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C ++  I DQ +CGSCWA      + DR CI  N      +S   +V
Sbjct: 147 LPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLV 204

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P +AWR++  NGVVT     Y  Q GC+
Sbjct: 205 ACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK 247


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 245 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 304

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
           AN WN  WGD G FKI+RG NE  IE            ED      G  + K + RN+D+
Sbjct: 305 ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 349



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  W +C ++  I DQ +CGSCWA      + DR CI  N      +S   +V
Sbjct: 102 LPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLV 159

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P +AWR++  NGVVT     Y  Q GC+
Sbjct: 160 ACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK 202


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY++  G  IG HAV+++GWG  +D   YWL+
Sbjct: 70  DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 129

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 130 ANQWNRSWGDDGYFKIRRGTNECGIE 155


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  ++   YWL+
Sbjct: 249 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLL 308

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 309 ANQWNRSWGDDGYFMIRRGTNECGIE 334



 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 20/137 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+ +I DQ +CGSCWA     ++SDR CI      +  +S   ++
Sbjct: 106 LPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNIS--LSVNDLL 163

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC    C  GC+GG+P  AW+++ ++GVVT       E C PY     C H    P    
Sbjct: 164 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDDTGCSHPGCEP---- 212

Query: 304 TLLGKLKTPECKQNCYN 320
                  TP+C + C +
Sbjct: 213 ----AYPTPKCMRKCVS 225


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 239 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLL 298

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI+RG+NE  IE
Sbjct: 299 ANQWNRGWGDDGYFKIIRGKNECGIE 324



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR KW  C ++  I DQ +CGSCWA      + DR CI  N   +  +SA  +V
Sbjct: 96  LPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNIS--LSANDLV 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AW+++  NGVVT       E C PY     C+H    P    
Sbjct: 154 ACCGFMCGDGCDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKHPGCEP---- 202

Query: 304 TLLGKLKTPECKQNC 318
                  TP C++ C
Sbjct: 203 ----AYPTPVCEKKC 213


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY++  G  IG HAV+++GWG  +D   YWL+
Sbjct: 266 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLL 325

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 326 ANQWNRSWGDDGYFKIRRGTNECGIE 351



 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 68/155 (43%), Gaps = 40/155 (25%)

Query: 187 LPRNFDAREKWPECPSLRHIAD--------------------QSNCGSCWAVSVANAISD 226
           LP+ FDAR  W  C S+R I                        +CGSCWA     ++SD
Sbjct: 103 LPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSD 162

Query: 227 RLCIASNGYFTGQISAQHIVACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
           R CI  N      +SA  ++AC      +GCNGG+P  AW ++ ++GVVT      QE C
Sbjct: 163 RFCIKYN--LNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVT------QE-C 213

Query: 285 QPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
            PY     C H    P           TP+C++ C
Sbjct: 214 DPYFDNTGCSHPGCEP--------TYPTPKCERKC 240


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 58/82 (70%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANSW 137
           +IY++GP+   ++VY DF  YKSGVY+H FG  +G HAV+ +GWG  +D   YW+VANSW
Sbjct: 233 EIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVANSW 292

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N  WG+ G F+I RG NE  IE
Sbjct: 293 NRSWGEDGFFQISRGSNECGIE 314



 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 75/136 (55%), Gaps = 19/136 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDARE WP+C S++ I DQ +CGSCWA     A++DR CI +N   +  +S   +V
Sbjct: 88  LPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILNNENVS--LSENDLV 145

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCT 304
           AC  +C +GC GG+P  AW ++   GVVT     SQ  C PY     C+H    P     
Sbjct: 146 ACCSSCGFGCEGGYPYAAWEYFAQTGVVT-----SQ--CDPYFDGKGCKHPGCEP----- 193

Query: 305 LLGKLKTPECKQNCYN 320
              +  TP C + C +
Sbjct: 194 ---EYDTPVCVKQCVD 206


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 63/94 (67%), Gaps = 2/94 (2%)

Query: 67  KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
           + H  P  + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +
Sbjct: 237 RVHSNPH-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 295

Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               YWL+AN WN  WGD G FKI+RG+NE  IE
Sbjct: 296 AGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 329



 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR +W  C ++ +I DQ +CG+CWA +   ++ DR CI  N   +  +S   ++
Sbjct: 101 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLN--MSVSLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++  +GVVT       E C PY     C+H    P    
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 208 ----AYPTPKCHRKC 218


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/110 (44%), Positives = 64/110 (58%), Gaps = 8/110 (7%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y +GP    F+VY DF  YKSGVY+H  G  +G HAV+++GWG   D   YWL+
Sbjct: 239 SIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLL 298

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE-------MGFNNRVEANSSEDDDL 176
           AN WN  WG  G FKI+RG NE  IE          N  +E+   +DD L
Sbjct: 299 ANQWNRSWGGDGYFKIIRGTNECGIEDVTAGTPSTKNLDIESGVRDDDSL 348



 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+  I DQ +CGSCWA     +++DR CI      T  +S   ++
Sbjct: 96  LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNVT--LSVNDLL 153

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++   GVVT     Y  Q GC 
Sbjct: 154 ACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCS 196


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/120 (42%), Positives = 70/120 (58%), Gaps = 16/120 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 29  DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 88

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDA 193
           AN WN  WGD G FKI+RG NE  IE            ED      G  + K + RN+D+
Sbjct: 89  ANQWNRGWGDDGYFKIIRGTNECGIE------------ED---VVAGMPSTKNMVRNYDS 133


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/86 (50%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  ++   YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLI 303

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 304 ANQWNRSWGDDGYFMIRRGTNECGIE 329



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 68/135 (50%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   ++
Sbjct: 101 LPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLD--VNVSLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 208 ----AYQTPKCVRKC 218


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y +GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 200 DIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLL 259

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI+RG+NE  IE
Sbjct: 260 ANQWNRGWGDDGYFKIIRGKNECGIE 285



 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 58/103 (56%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR KW  C ++  I DQ +CGSCWA      + DR CI  N   T  +SA  +V
Sbjct: 57  LPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNIT--LSANDLV 114

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++  NGVVT     Y  Q GC+
Sbjct: 115 ACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK 157


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 58/99 (58%), Gaps = 2/99 (2%)

Query: 66  KKAHMVPRCNAMRQ--IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123
           K A+ V    A  Q  I  +GP+ A FSVY DF  Y SGVY H  G   G HAV+++GWG
Sbjct: 201 KTAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWG 260

Query: 124 VENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           V+   PYW+VANSW   WG  G F I RG +E  IE G 
Sbjct: 261 VDGTTPYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGI 299



 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 68/133 (51%), Gaps = 16/133 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR +WP   S+  I DQ  CGSCWA     A+SDRL IASN      +S Q +V
Sbjct: 81  IPTSFDARTQWPA--SIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDLV 138

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
           +C    +GC+GG+P  AW +    GVVT       + C PYT         G    C + 
Sbjct: 139 SCDSTDYGCDGGYPINAWHYMQSLGVVT-------DTCYPYTSG------NGDSGTCQIT 185

Query: 307 GKLKTPECKQNCY 319
           GK KTP C    +
Sbjct: 186 GK-KTPACATATF 197


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 63/102 (61%), Gaps = 7/102 (6%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ---HNFGDSIGLHAVRVLGWGVENDIPYW 131
           + M  I E GP+ A+ +VY DF  Y+ GVY+   H   +  G H+VR++GWG +    YW
Sbjct: 326 DIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDRGDRYW 385

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE----MGFNNRVEAN 169
           +VANSW   WG++G F+I RG NEADIE     G ++  EAN
Sbjct: 386 VVANSWGRQWGENGYFRIARGSNEADIESFVVTGLSDVTEAN 427



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/129 (37%), Positives = 63/129 (48%), Gaps = 14/129 (10%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P  FDAR +WP    +  I DQ  CGS WAVS+A   SDR  I SNG     +S Q +++
Sbjct: 191 PTQFDARTRWPG--FISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLSPQTLLS 248

Query: 248 CTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY--TLAPCEHHVQGPL--QN 302
           C      GC+GG   +AW F   +G+V        E C PY  ++  C    +G L    
Sbjct: 249 CNVRAQQGCHGGHIDVAWNFARGHGLV-------DEKCFPYKASVTRCPFRPRGNLIQDG 301

Query: 303 CTLLGKLKT 311
           C  L K +T
Sbjct: 302 CMPLVKRRT 310


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/86 (50%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M ++Y++GP+   F+VY DF  Y+SGVY++  GD +G HAV+++GWG  +D   YW++AN
Sbjct: 246 MAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILAN 305

Query: 136 SWNDHWGDHGTFKILRGENEADIEMG 161
            WN +WGD G F I RG NE  IE G
Sbjct: 306 QWNRNWGDDGYFMIRRGVNECGIEEG 331



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 51/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR  WP+C ++  I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 101 LPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
           AC       GC+GG+P  AWR++ H+GVVT       E C PY  A  C H    P    
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYFIHHGVVT-------EECDPYFDATGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 208 ----GYPTPKCVRKC 218


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 3/103 (2%)

Query: 63  HYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           ++ K A+ +P  N       I  +GP+ A F+++ DF  Y+SG+Y H  G  +G HA+++
Sbjct: 202 YHAKSAYKLPAKNVEAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKI 261

Query: 120 LGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGF 162
           LGWG E+++ YWL ANSW  +WG  G FKI RG +E  IE G 
Sbjct: 262 LGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGL 304



 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 2/91 (2%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           A+ +P NFDAR++WP   S+  I +Q  CGSCWA   +  +SDR  IAS       +SAQ
Sbjct: 80  AENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQ 137

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
            +V C  +  GC+GGWP  AW +    G++T
Sbjct: 138 QLVDCDLDNSGCSGGWPINAWNYMVKTGLLT 168


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  ++   YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+ +I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 103 LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC    C  GC+GG+P  AW+++ ++GVVT       E C PY     C H    P    
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDNTGCSHPGCEP---- 209

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 210 ----AYPTPKCSRKC 220


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G +IG HAV+++GWG  ++   YWL+
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIE 331



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C S+ +I    +CGSCWA     ++SDR CI         +S   ++
Sbjct: 103 LPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFG--MNISLSVNDLL 160

Query: 247 ACTP-NCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC    C  GC+GG+P  AW+++ ++GVVT       E C PY     C H    P    
Sbjct: 161 ACCGFRCGDGCDGGYPIAAWQYFSYSGVVT-------EECDPYFDNTGCSHPGCEP---- 209

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 210 ----AYPTPKCSRKC 220


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 59/85 (69%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI--PYWLVA 134
           M  + ++GP+ A FSVY DF+ YKSGVY H  G  +G HA++++GWGV++    PYW++A
Sbjct: 142 MEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWIIA 201

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG +G F ILRG +E  IE
Sbjct: 202 NSWGPSWGLNGFFWILRGSDECGIE 226



 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 9/98 (9%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FD+R KWP C  +  I +Q  CGSCWA S +  +SDR CIAS G     +S Q++V+C  
Sbjct: 17  FDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSCDS 74

Query: 251 NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
             +GC+GG+   AW F    G+ +       + C PYT
Sbjct: 75  TDYGCDGGYLNNAWAFLAGTGIPS-------DKCAPYT 105


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 58/89 (65%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
           + M +IY  GP+ A  +VY DF  Y SGVYQH   N G + G H+V+++GWG E N + Y
Sbjct: 325 DIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKY 384

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+ G F+ILRG NE  IE
Sbjct: 385 WIAANSWGPWWGERGYFRILRGSNECGIE 413



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 2/92 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LPR+F+A EKW     +  + DQ  CG+ W +S  +  SDR  I S G    Q+SAQ+I+
Sbjct: 187 LPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQNIL 244

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           +CT    GC+GG    AWR+   NGV+    Y
Sbjct: 245 SCTRRQQGCDGGHLDAAWRYMHKNGVLDANCY 276


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 43/86 (50%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M ++Y++GP+   F+VY DF  Y+SGVY++  GD +G HAV+++GWG  +D   YW++AN
Sbjct: 280 MAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILAN 339

Query: 136 SWNDHWGDHGTFKILRGENEADIEMG 161
            WN +WGD G F I RG NE  IE G
Sbjct: 340 QWNRNWGDDGYFMIRRGVNECGIEEG 365



 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 70/171 (40%), Gaps = 56/171 (32%)

Query: 187 LPRNFDAREKWPECPSL------------------------------------RHIADQS 210
           LP++FDAR  WP+C ++                                     +I DQ 
Sbjct: 99  LPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQG 158

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWG 268
           +CGSCWA     ++SDR CI         +S   ++AC       GC+GG+P  AWR++ 
Sbjct: 159 HCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFI 216

Query: 269 HNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCTLLGKLKTPECKQNC 318
           H+GVVT       E C PY  A  C H    P           TP+C + C
Sbjct: 217 HHGVVT-------EECDPYFDATGCSHPGCEP--------GYPTPKCVRKC 252


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +    YWL+
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 180

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI+RG NE  IE
Sbjct: 181 ANQWNRGWGDDGYFKIIRGTNECGIE 206



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 20/112 (17%)

Query: 210 SNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFW 267
            +CGSCWA      + DR CI  N      +S   +VAC     G  C+GG+P +AWR++
Sbjct: 1   GHCGSCWAFGAVECLQDRFCIHFN--MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYF 58

Query: 268 GHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
             NGVVT       + C PY     C+H    P           TP C++ C
Sbjct: 59  VRNGVVT-------DECDPYFDQVGCKHPGCEP--------AYPTPVCEKKC 95


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++  +GP+   F+VY DF  YKSGVY+H  GD +G HAV+++GWG  +D   YWL+
Sbjct: 212 SIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLL 271

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G FKI RG NE  IE
Sbjct: 272 ANQWNRGWGDDGYFKIRRGTNECGIE 297



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 53/113 (46%), Gaps = 24/113 (21%)

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW----GCNGGWPQLAWRF 266
           +CGSCWA     ++SDR CI      +  +S   ++AC    W    GC+GG+P  AWR+
Sbjct: 93  HCGSCWAFGAVESLSDRFCIHYGMNLS--LSVNDLLACCG--WMCGDGCDGGYPIDAWRY 148

Query: 267 WGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
           +  +GVVT       E C PY     C H    P           TP+C++ C
Sbjct: 149 FVQSGVVT-------EECDPYFDDIGCSHPGCEP--------GFPTPKCERKC 186


>gi|308163070|gb|EFO65432.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 97

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 3/86 (3%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
           M+ +   GP+ A+ SVY DFL Y+ GVY+H +G  I  HAV ++G+G  +D   +PYW+V
Sbjct: 1   MQALANDGPVQAVMSVYRDFLYYRGGVYRHVYGVQISSHAVEIIGYGTTDDEDRVPYWIV 60

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NS   +WG+ G F I+RG NE DIE
Sbjct: 61  KNSLGPNWGEDGYFNIVRGSNECDIE 86


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 48/96 (50%), Positives = 61/96 (63%), Gaps = 14/96 (14%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---------NFGDSIGLHAVRVLGWGVEND 127
           M +I  +GP+ A F V+ DF  YKSGVYQH          +  S G H+VR+LGWGV++ 
Sbjct: 448 MTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARS-GYHSVRILGWGVDHS 506

Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               I YWL ANSW + WG++G F+ILRGEN  DIE
Sbjct: 507 TGVPIKYWLCANSWGEEWGENGLFRILRGENHCDIE 542



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 57/110 (51%), Gaps = 5/110 (4%)

Query: 158 IEMGFNNRVEANSSEDD--DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSC 215
           ++ GF+ R+     E    ++  +  + +  LP +FDARE+WP    +  + DQ +C S 
Sbjct: 280 LDEGFSYRLGTLLPEKSVKNMNEILIEMSNFLPESFDARERWPS--FIHPVRDQGDCASS 337

Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNCWGCNGGWPQLAW 264
           WA S     +DRL I S G F   +S Q +++C      GCNGG+   AW
Sbjct: 338 WAFSTTAVSADRLAIQSGGKFYNPLSVQQLLSCNQARQRGCNGGYLDRAW 387


>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 328

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 86/162 (53%), Gaps = 19/162 (11%)

Query: 165 RVEANSSEDDDLETM--GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVAN 222
           RVE  +   +  +T+  G      + + FDAR++WP+C ++    ++ N    WA + A 
Sbjct: 60  RVETTTKSKELNKTLDSGVVKDNRIHKEFDARKRWPQCKTIGEFRNEGNFALSWAYAAAG 119

Query: 223 AISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQL-----AWRFWGHNGVVTGGD 277
            ++DR+CIA+NG +   IS + +++C+    G +GG+  +      W +   +G+V+GG 
Sbjct: 120 VLADRMCIATNGSYNQLISTEELISCS----GVSGGYHGIVSEREVWEYLKSHGLVSGGK 175

Query: 278 YNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
           YN+ +GCQP  + P E +++          ++K   C  +CY
Sbjct: 176 YNTSDGCQPSKIPPIEEYME--------YSEIKNYTCNDHCY 209



 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 117 VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILR 151
           V+++GWGVEN   YWL+ +SW    G +G FK+ R
Sbjct: 274 VKLIGWGVENGEDYWLLVDSWGYERGQNGVFKVER 308


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 55/83 (66%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M  I+ +GP+ A+F  Y D + Y  GVY+H  G   G HAV+++GWGVE+   YWLVANS
Sbjct: 218 MEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVEDGTKYWLVANS 277

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WGD G FK++RGEN   IE
Sbjct: 278 WGRVWGDDGFFKMVRGENHCGIE 300



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 77/158 (48%), Gaps = 13/158 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP++FDAR++WP+C SL  I  Q  CGSC  VS A+A++DR CI S G       A  ++
Sbjct: 62  LPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLL 121

Query: 247 ACTPNCWGCNGGWPQLA--WRFWGHNGVVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNC 303
           +C   C G   G       W +W   GV +GG Y S +GC PY + P C    +G   + 
Sbjct: 122 SCCYECGGGCTGGGIPGPIWSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD- 180

Query: 304 TLLGKLKTPECKQNCYNPSYESTYRF-DLKKGKKAHMV 340
                   P C   C N  Y  T    D + G+ A+ +
Sbjct: 181 -------EPNCSTRC-NAGYNVTEDLRDRRFGRVAYSI 210


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 66/101 (65%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMV---PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRV 119
           HY   A+ V   P+ + M ++Y++GP+   F+VY DF  YKSGVY+H  G ++G HAV++
Sbjct: 229 HYGVNAYRVSHDPQ-SIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKL 287

Query: 120 LGWGV-ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +GWG  E    YWL+ NSWN  WG+ G FKI RG NE  IE
Sbjct: 288 IGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTNECGIE 328



 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 23/163 (14%)

Query: 162 FNNRVEANSSEDDDLETMGCQN---AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
           F   +    + + DLE +        K LP+ FDAR+ WP+C ++  I DQ +CGSCWA 
Sbjct: 72  FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 131

Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVTGG 276
               ++SDR CI  N   +  +S   ++AC       GC+GG+P  AWR++  +GVVT  
Sbjct: 132 GAVESLSDRFCIHYN--LSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVT-- 187

Query: 277 DYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
                E C PY     C H    PL          TP+C + C
Sbjct: 188 -----EECDPYFDTTGCSHPGCEPLY--------PTPKCHRKC 217


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 57/83 (68%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVYQ     + +G HAV+++GWGVE  IPYWL+ NS
Sbjct: 76  KDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNS 135

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W+  WGD+G FKI RG +E  I+
Sbjct: 136 WSAQWGDNGLFKIRRGTDECGID 158


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 55/81 (67%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           ++Y +GP+   + VY DF  Y  G+Y+H  G+ +G HAV ++GWG+E+ + YWLV NSW 
Sbjct: 192 ELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGIEDGVKYWLVQNSWG 251

Query: 139 DHWGDHGTFKILRGENEADIE 159
             WG+ G F+ILRG NE  IE
Sbjct: 252 YEWGEQGYFRILRGSNECGIE 272



 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 78/154 (50%), Gaps = 20/154 (12%)

Query: 176 LETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGY 235
           +E    +++  +P +FDAR++WP   ++  + DQ  CGSCWA S+A ++ DR  I   G 
Sbjct: 53  VEKFTIEDSFYVPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLGDRFGILGCG- 109

Query: 236 FTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-----TLA 290
             G +S Q +++C  N  GCNGG+ + +W +    G+ T       E C PY      + 
Sbjct: 110 -KGHLSPQDLISCDSNDLGCNGGYQENSWTWVLTTGITT-------ESCWPYRSGSGRIP 161

Query: 291 PCEHH-VQGP-LQNCTL--LGKLKTPECKQNCYN 320
            C H  V G  LQ  T+    +L + E +   YN
Sbjct: 162 SCPHRCVNGSVLQRNTINNYRRLDSSELQDELYN 195


>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
          Length = 559

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/262 (28%), Positives = 109/262 (41%), Gaps = 41/262 (15%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N M++IY  GP+    +V  DF+ YK G+Y+   G    +HA+ V+GWG EN   YW+  
Sbjct: 184 NMMKEIYARGPITCGIAVPQDFVDYKGGIYKDESGAVEKVHAISVVGWGEENGEKYWIGR 243

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRV----EANSSEDDDLETM------GCQN- 183
           NSW ++WG+ G F+I RG N   IE      V    EA  S +     +      GC + 
Sbjct: 244 NSWGNYWGEEGWFRIARGINNLAIESECQWAVPKVPEARKSREFRRRELLLHVREGCVDK 303

Query: 184 ---------AKGLPRNFDAREKWPECPSLRHIADQSN-------------CGSCWAVSVA 221
                       LP  +      P    +R++ D  N             CGSCWA    
Sbjct: 304 SRAVNKEHVVSPLPHTYLKANDLPASYDIRNV-DGVNYATWNRNQHIPVWCGSCWAQGST 362

Query: 222 NAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
            A+SDR+ I   G +     A  +V    +   C+GGW    + +  H   +        
Sbjct: 363 AALSDRINIMRKGAWPAVNLAVQVVLNCGDAGSCHGGWDDGVYAY-AHEVDI------PD 415

Query: 282 EGCQPYTLAPCEHHVQGPLQNC 303
           + CQPY     E   +   +NC
Sbjct: 416 QTCQPYEAVDHECSPENICRNC 437



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 42/71 (59%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I+  GP+    +V   FL Y  GVY+ +    +  H V + GWGVEN  PYW+  NSW 
Sbjct: 468 EIFARGPVSCSMTVRESFLDYHGGVYESDSSPMVAGHIVEIAGWGVENGRPYWIGRNSWG 527

Query: 139 DHWGDHGTFKI 149
           ++WG+ G F+I
Sbjct: 528 EYWGEEGWFRI 538



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 49/108 (45%), Gaps = 18/108 (16%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTGQ- 239
           LP+N+D R        L  ++   N      CGSCWA S  +A+SDRL + + G +    
Sbjct: 44  LPKNYDPRN----INGLNMVSVNKNQHIPVWCGSCWAFSATSAVSDRLKLMTKGAWPEHD 99

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +S Q ++ C  N  GC GG P   +R     GV         EGC  Y
Sbjct: 100 LSVQVVINCADNAEGCGGGHPTDVYRLMNEMGV-------PAEGCMRY 140


>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 105

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 41/78 (52%), Positives = 53/78 (67%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +A   I   GP+ A F+VY DFL Y+SGVY+H  G  +G HAV+++GWG ++   YWL  
Sbjct: 13  DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 72

Query: 135 NSWNDHWGDHGTFKILRG 152
           NSWN+ WGDHG FKI  G
Sbjct: 73  NSWNEDWGDHGLFKIALG 90


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 60/81 (74%), Gaps = 2/81 (2%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWN 138
           I  +GP+++ F VY DF  Y+SG Y+H  G  +G HA++V+GWGV ++++PYW+VANSW+
Sbjct: 260 ILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWIVANSWS 318

Query: 139 DHWGDHGTFKILRGENEADIE 159
           D WG +G F ILRG NE  IE
Sbjct: 319 DEWGMNGYFWILRGTNECSIE 339



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 9/104 (8%)

Query: 186 GLPRNFDAR--EKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
            LP +++A     +  C  L  I +Q  CGSCWA S++  ++DR CI + G     +S Q
Sbjct: 120 ALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFSISEMVADRFCIGTRGKINTIMSPQ 179

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            +V+C     GCNGG    A++F    G+V+       +GC PY
Sbjct: 180 WMVSCDTADNGCNGGEFPTAFQFVETTGLVS-------DGCVPY 216


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +D   YWL+
Sbjct: 244 DIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLL 303

Query: 134 ANSWNDHWGDHGTFKILRGENE 155
           AN WN  WGD G FKI RG NE
Sbjct: 304 ANQWNREWGDDGYFKIRRGTNE 325



 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 68/135 (50%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+NFDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   ++
Sbjct: 101 LPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GC+GG+P  AWR+  H+GVVT       E C PY     C H    P    
Sbjct: 159 ACCGFLCGSGCDGGYPLYAWRYLAHHGVVT-------EECDPYFDQIGCSHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 208 ----AYRTPKCVKKC 218


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A+SDRLCI SNG F   +SA  +++C  NC +GC GG+P +AW +W  +G+VTGG  
Sbjct: 2   AVEAMSDRLCIHSNGAFNKSLSAVDLLSCCENCGFGCRGGYPAVAWDYWKTHGIVTGGSK 61

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
               GC+ Y    CEHHVQG    C       TPEC Q C  P
Sbjct: 62  EDPSGCRSYPFPKCEHHVQGHYPPCP-RELYPTPECVQQCDTP 103



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/55 (49%), Positives = 38/55 (69%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIP 129
           + M++I   GP+ AIF++Y DFL+Y SGVY H  G  +  HAVR+LGWG   ++P
Sbjct: 126 SIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVP 180


>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 149

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 40/79 (50%), Positives = 53/79 (67%), Gaps = 1/79 (1%)

Query: 72  PRCNAMRQ-IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPY 130
           P    ++Q I+EHGP+   F +Y DF  YKSGVY H  GD +G H ++++GWGVE+   Y
Sbjct: 45  PAVQQIKQEIFEHGPVFCAFDMYKDFGLYKSGVYVHTTGDLVGSHTLKIIGWGVESGQEY 104

Query: 131 WLVANSWNDHWGDHGTFKI 149
           WL  NSWN+ WGDHG  K+
Sbjct: 105 WLAMNSWNEEWGDHGLIKM 123


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 43/86 (50%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  ++   YWL+
Sbjct: 96  DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLI 155

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
           AN WN  WGD G F I RG NE  IE
Sbjct: 156 ANQWNRSWGDDGYFMIRRGTNECGIE 181


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 41/75 (54%), Positives = 54/75 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I  HGP+ A F VY+DF  YKSGVY+H  G   G+HAV+++GWG EN + YWL+ANSW
Sbjct: 145 KEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGTENGVDYWLIANSW 204

Query: 138 NDHWGDHGTFKILRG 152
              +G  G FKI+RG
Sbjct: 205 GTTFGLQGFFKIVRG 219



 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 9/109 (8%)

Query: 191 FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP 250
           FD+R+KWP C  +  I DQ NCGSC++ + +  +SDR CI SNG     +S Q +V C+ 
Sbjct: 6   FDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSW 63

Query: 251 NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
             +GCNGG P L + +   +G+V+       + C PY       HV+ P
Sbjct: 64  YSFGCNGGIPGLVFDYIHKDGLVS-------DACFPYLSYDGNTHVKCP 105


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 45/94 (47%), Positives = 62/94 (65%), Gaps = 2/94 (2%)

Query: 67  KAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN 126
           + H  P  + M ++Y++GP+   F+VY DF  YKSGVY+H  G  +G HAV+++GWG  +
Sbjct: 237 RVHSNPH-DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD 295

Query: 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               YWL+AN WN  WG  G FKI+RG+NE  IE
Sbjct: 296 AGEDYWLLANQWNRGWGGDGYFKIIRGKNECGIE 329



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 69/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR +W  C ++ +I DQ +CG+CWA +   ++ DR CI  N   +  +S   ++
Sbjct: 101 LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLN--MSVSLSVNDLL 158

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++  +GVVT       E C PY     C+H    P    
Sbjct: 159 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 207

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C + C
Sbjct: 208 ----AYPTPKCHRKC 218


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 57/86 (66%), Gaps = 3/86 (3%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
           M+ +   GP+ A+ SVY DFL Y+ GVY+H +G  I  HAV ++G+G  +D   IPYW+V
Sbjct: 235 MQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVEIIGYGTTDDEERIPYWIV 294

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NS   +WG+ G F I+RG NE DIE
Sbjct: 295 KNSLGPNWGEEGYFNIVRGSNECDIE 320



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 2/88 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD RE++P+C  +  + DQ  CG+CWA S   A  DR C+          S Q+ V
Sbjct: 104 IPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTV 161

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           +C     GC GG     W F   +G  T
Sbjct: 162 SCDDLDLGCAGGTSFNVWTFLTEHGTTT 189


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 50/116 (43%), Positives = 69/116 (59%), Gaps = 5/116 (4%)

Query: 217 AVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFWGHNGVVTG 275
           AVS   A+SDR+CI S G  + ++SA  +++C  NC  GC+GG+P  AW +W  +G+VTG
Sbjct: 42  AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTG 101

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL-KTPECKQNCYNPSYESTYRFD 330
           G   +  GCQPY    CEHH  G   +C    K+ KTP+CK+ C    Y + Y  D
Sbjct: 102 GSKENHTGCQPYPFPKCEHHSIGKYPSCG--DKIYKTPQCKRKC-QKGYTTPYEHD 154



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 36/50 (72%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND 127
           ++I  +GP+ A   ++ DFL YKSG+Y++  G  +G H VR++GWG+EN+
Sbjct: 173 KEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENE 222


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 72/134 (53%), Gaps = 13/134 (9%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CW-GCNGGWPQLAWRFWGHNG 271
           SCWA   A A+SDR+CIAS G     ISA  +++C    C  GC GG+P  AW++W   G
Sbjct: 1   SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60

Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG-----KLKTPECKQNCYNPSYEST 326
           + TGG Y SQ GC+PY + PC HH     +N T  G     +  TP C   C   +Y++ 
Sbjct: 61  ICTGGSYESQSGCKPYPIPPCGHH-----KNQTYFGPCPTDEYDTPVCTNKCIA-AYKTP 114

Query: 327 YRFDLKKGKKAHMV 340
           Y  D   G  A+ V
Sbjct: 115 YSDDKHYGTSAYNV 128



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 63  HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVL 120
           HY   A+ V +  A   ++I  +GP+ A ++VY DF QY  GVY H  G  +G HAVR+L
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179

Query: 121 GWGVENDIP 129
           GWGV    P
Sbjct: 180 GWGVRQQDP 188


>gi|23344736|gb|AAN28681.1| cathepsin B [Theromyzon tessulatum]
          Length = 65

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 39/65 (60%), Positives = 51/65 (78%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +++ +HGP+ A  +VY+DFLQYKSGVY H  GD +G HAV+++GWGVEN +PYWLV NSW
Sbjct: 1   KELMKHGPVEAALTVYSDFLQYKSGVYHHVAGDELGGHAVKLIGWGVENKVPYWLVVNSW 60

Query: 138 NDHWG 142
              WG
Sbjct: 61  GTTWG 65


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 55/77 (71%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++G ++ + S+Y DF  YKSGVY H  G  +G+H+++++GWGVE+   YWL  NSW
Sbjct: 68  QEIFDNGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSW 127

Query: 138 NDHWGDHGTFKILRGEN 154
           N+ WGDHG  K+  GE 
Sbjct: 128 NEEWGDHGMIKLAVGET 144


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 84/158 (53%), Gaps = 15/158 (9%)

Query: 9   TKKKKKKKKKKEEKKKKKKKKKKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKA 68
           + +  + +K K  ++      K +     ++K  +  K  +K     P +  ++ +    
Sbjct: 279 SGRTGQVEKCKVPRRGNLATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFED-- 336

Query: 69  HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE 125
                 + M +I +HGP+ A   V+ DF  Y+ GVY+++  +S    G H+VR++GWGV+
Sbjct: 337 ------DIMNEILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVD 390

Query: 126 ----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               N   YWLVANSW   WG+ G F+I+RGENE+DIE
Sbjct: 391 SSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIE 428



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 53/103 (51%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD R +W +  +L+ + DQ  CG+ WA S A   +DRL I S G+    +S Q+++
Sbjct: 185 LPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLL 242

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           AC      GCNGG    AW +    GVV        E C PY 
Sbjct: 243 ACNNRGQQGCNGGHLDRAWNYMRRFGVVN-------EECYPYI 278


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/160 (36%), Positives = 83/160 (51%), Gaps = 15/160 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASN-----GYFTGQIS 241
           LP +FD  E+WP+ P  R I DQ + G CWA+    AISD +CI  N     G    ++S
Sbjct: 93  LPESFDPXEQWPDXPX-REIRDQGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVS 151

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPL 300
           A+  + C     GCNGG P   W FW   G+V+GG Y+S  GC+ + +L PC+HH+ G  
Sbjct: 152 AEDKLTCLCGD-GCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGX- 209

Query: 301 QNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340
                +    +P+C   C  P    TY+ D   G  ++ +
Sbjct: 210 ---PYVXTGDSPKCSMTC-EPG--QTYKXDKHYGCSSYSI 243



 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 53/85 (62%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           + M  IY++  +   FSVY DFL YK   YQ   G+  G HA+ +LG  VEN   YWLVA
Sbjct: 249 DIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTSYWLVA 308

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           N WN  WGD+G FKILRG++   IE
Sbjct: 309 NXWNRDWGDNGFFKILRGQDHYGIE 333


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 56/89 (62%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEND-IPY 130
           + M +IY  GP+ A   VY DF  Y  GVY+    N G   G H+V+++GWG E+D + Y
Sbjct: 324 DIMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKY 383

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 384 WIAANSWGPWWGEHGYFRILRGSNECGIE 412



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
           + GLPR F+A E+W     +  + DQ  CGS W +S  +  SDR  I S G    Q+S Q
Sbjct: 184 SSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQ 241

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +I++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 242 NILSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DETCYPYT 279


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 12/149 (8%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR+ + EC   + H+ DQS C SCWA++   A S RLCI S G F   +SA  +
Sbjct: 83  IPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGEL 142

Query: 246 VAC-----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCEH 294
           +AC     +    GC GG  + AW F   +G+ TGGD+       + +GC PY    C H
Sbjct: 143 LACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAH 202

Query: 295 HVQGPLQNCTLLGKLKTPECKQNCYNPSY 323
           + +            +TP C   C N  Y
Sbjct: 203 YQKKSKYGPCPKKSYETPSCLDRCPNEKY 231



 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 48/76 (63%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I +HGP  A F  Y DF  YKSGVY++  G  +  H V ++GWG E  + YWL  N W
Sbjct: 258 KEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGTEKGVDYWLAKNDW 317

Query: 138 NDHWGDHGTFKILRGE 153
           N+ W D GTFKI +G+
Sbjct: 318 NEEWADLGTFKIAQGD 333


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 56/86 (65%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +A   I   GP+ A F+VY DFL Y+SGVY+H  G  +G HAV+++GWG ++   YWL  
Sbjct: 65  DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 124

Query: 135 NSWNDHWGDHGTFKILRGENEADIEM 160
           NSWN+ WGDHG FKI  G    D ++
Sbjct: 125 NSWNEDWGDHGLFKIALGNCGIDDDL 150



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 24/49 (48%)

Query: 282 EGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
           +GC PY   PC HH+          G   TP C + C+NP Y +T R D
Sbjct: 1   DGCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDD 49


>gi|118398308|ref|XP_001031483.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89285812|gb|EAR83820.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 591

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 121/285 (42%), Gaps = 57/285 (20%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVAN 135
           M++IY+ GP+    +V    L Y  G++    GD    H + V+G+G ++N   YW+V N
Sbjct: 195 MQEIYQRGPITCGIAVPDALLNYTGGIFYDRTGDLEIEHDISVVGYGTLKNGTKYWMVRN 254

Query: 136 SWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDD--DLETM------------GC 181
           SW  +WG++G F+I+RG N  +IE      V  ++  +D  +L T+            GC
Sbjct: 255 SWGTYWGENGFFRIIRGVNNLNIESACAWAVPRDTWSNDVRNLTTVNEKPVSNFQKSSGC 314

Query: 182 QN--------------------AKGLPRNFDAREKWPECPSLRHIADQSN------CGSC 215
           +                     A  LP++F     W       +++   N      CGSC
Sbjct: 315 KRESIFNLPEKIKSSRPHEYLKAADLPKSF----TWQNAYGKNYLSITRNQHIPVYCGSC 370

Query: 216 WAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           WA    ++I+DR+ IA NG F    +S Q I+ C      C+GG     + F   NG+  
Sbjct: 371 WAHGATSSIADRINIARNGTFPQVALSPQVIINCKAGG-SCSGGNAMGVYEFGHTNGI-- 427

Query: 275 GGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCY 319
                 +E CQ Y     E      +Q C         E  QNC+
Sbjct: 428 -----PEESCQQYVAKNPEKFTCSDIQQCM---NCAPSEKGQNCW 464



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +I+  GP+            Y  G+++ N   +   H V V+GWGV+    + YW+  NS
Sbjct: 489 EIFARGPIGCGIEATLKLENYSGGIFEQNLLFTSLNHEVAVVGWGVDEATGVEYWIARNS 548

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG++G F+I   +N   IE
Sbjct: 549 WGSYWGENGYFRIRMHKNNNGIE 571



 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 4/66 (6%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNC--WGCNGGWPQLAWRFWG 268
           CGSCWA +  +A+SDR+ IA N  F    +S Q +++C  +    GCNGG  + A+  W 
Sbjct: 74  CGSCWAFAATSALSDRIKIARNATFPDINLSPQFLLSCQQDQEDLGCNGGDARNAFA-WI 132

Query: 269 HNGVVT 274
           H+  +T
Sbjct: 133 HSNNIT 138


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 71/135 (52%), Gaps = 9/135 (6%)

Query: 154 NEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCG 213
           NE D++    ++  + ++   D   +     +  P  FDARE+WP+C  +R I +Q NCG
Sbjct: 92  NENDLKGEVMDKDNSTNTPLSDSRYLTILRLRDFPTQFDAREQWPQC--IRSIKNQKNCG 149

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVV 273
           SCWA S ++ ++DR CI S G     +S Q +V+C+    GCNGG+    WRF    G V
Sbjct: 150 SCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVSCSGQNNGCNGGFFDATWRFLVSVGTV 209

Query: 274 TGGDYNSQEGCQPYT 288
           +       E C PY 
Sbjct: 210 S-------EACVPYV 217



 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 39/86 (45%), Positives = 52/86 (60%), Gaps = 2/86 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
           + M  +  +GP+     VY DF  YKSGVY H  G  +G HAV+++GWG ++   +PYW+
Sbjct: 254 DIMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWI 313

Query: 133 VANSWNDHWGDHGTFKILRGENEADI 158
            ANSW + WG  G F ILRG  E  I
Sbjct: 314 CANSWGEDWGIKGYFWILRGRGECGI 339


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 1/97 (1%)

Query: 64  YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGW 122
           Y +  + +   +  + +  +GP+ A F VY+DF  YKSG+Y+     + +G HAV+++GW
Sbjct: 89  YTRDYYYLTYGSIQKDVMTYGPIEASFDVYSDFPSYKSGIYERTENATYLGGHAVKLIGW 148

Query: 123 GVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G +  IPYWL+ NSWN+ WGD+G FKI RG NE  ++
Sbjct: 149 GEQYGIPYWLMVNSWNEDWGDNGLFKIRRGTNECGVD 185



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 47/78 (60%), Gaps = 6/78 (7%)

Query: 245 IVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +  C   C +GC+GG+P  AW+ + ++G+VTGGDY S EGC+PY + PC +  QG   N 
Sbjct: 2   LTFCCHTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQG---NN 58

Query: 304 TLLGKL--KTPECKQNCY 319
           T  GK   K   C + CY
Sbjct: 59  TCAGKPMEKNHRCTRICY 76


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 19/127 (14%)

Query: 49  KKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           +++RL+      +  Y+  +H     + M +IY++GPL   F VY D   YK GVY+H  
Sbjct: 297 RQQRLHSSNYYFVGGYYGNSH---ELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVT 353

Query: 109 GDSI---GL-------------HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRG 152
            + +   GL             HAV ++GWGVEN  PYW + NSW+  WGD+G FKILRG
Sbjct: 354 AEELKAQGLSEDEMIPHFEVVNHAVLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRG 413

Query: 153 ENEADIE 159
            +E  +E
Sbjct: 414 SDECGVE 420



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 206 IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWR 265
           + +Q  CGSC+A S ++    R+ I SN       S Q IV C+    GC+GG+P L  +
Sbjct: 207 VRNQEQCGSCYAFSSSDMFGSRVRIPSNLTQVPVYSPQDIVDCSAYSQGCDGGFPFLVGK 266

Query: 266 FWGHNGVVTGGDYNSQEGCQPY 287
           +    G+         E C PY
Sbjct: 267 YAMDYGLTV-------ESCDPY 281


>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 238

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 75/153 (49%), Gaps = 9/153 (5%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAR+ + EC   + H+ DQS CGSCWA     A + R+CI S G     +SA  +
Sbjct: 59  IPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAADM 118

Query: 246 VACTPN-----CWGCNGGWPQLAWRFWGHNGVVTG---GDYNSQEGCQPYTLAPCEHHVQ 297
           +AC         +GC+GG P  +W F   NG+V+G    +  + +GC PY    C HH +
Sbjct: 119 LACCNIEHFCLSFGCSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQK 178

Query: 298 GPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
                        TP C  +C N  Y + +  D
Sbjct: 179 ESDYKPCAKELYDTPSCSSSCPNAKYGTAFDKD 211


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 56/82 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ AI +++ DF  YKSGVY++  G  +G H ++++GWGVE    YWL  NSW
Sbjct: 212 QEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVNSW 271

Query: 138 NDHWGDHGTFKILRGENEADIE 159
           N+ WGD G  K+  G+N  D E
Sbjct: 272 NEEWGDQGKIKLAVGKNALDEE 293



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 60/121 (49%), Gaps = 13/121 (10%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP NF+A+ K+  C   + HI DQ+ C +CWA +     +DR+CI S G  T  +S  ++
Sbjct: 39  LPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYL 98

Query: 246 VAC------TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY------NSQEGCQPYTLAPCE 293
            +C       P   GC  G       F  ++G+VTGG+Y       + +GC PY    C 
Sbjct: 99  TSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCN 158

Query: 294 H 294
           H
Sbjct: 159 H 159


>gi|294893885|ref|XP_002774682.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880102|gb|EER06498.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 121

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 57/89 (64%), Gaps = 2/89 (2%)

Query: 187 LPRNFDAREKWPECP-SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAR  +P C   + HI DQS CGSCWA  V  A +DRLC+ SNG FT  +SA  +
Sbjct: 34  LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 93

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
            AC P+ +GC+GG+P  AW +    G+ T
Sbjct: 94  NACAPS-YGCDGGYPDSAWSWVHDEGIAT 121


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 39/80 (48%), Positives = 52/80 (65%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           I   GP+     VY+D + YKSG+Y H  G+ +G HAV ++GWG +N I YW+++NSWN 
Sbjct: 200 IMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTKNGIDYWIISNSWNT 259

Query: 140 HWGDHGTFKILRGENEADIE 159
            WG +G F I RG NE  IE
Sbjct: 260 TWGMNGLFLIKRGVNECHIE 279



 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 11/101 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +++  E++P+C     + DQ  CGSCW+ +V+ + S R C   N       S  H+V
Sbjct: 68  IPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYNKPVL--FSQSHLV 123

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AC     GC GG    AWR+    G+         + CQPY
Sbjct: 124 ACDRRNSGCGGGIEVNAWRYIDLRGL-------PLDSCQPY 157


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 63/91 (69%), Gaps = 2/91 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
           +A+  +  HGP+VA F+V  DF+ YKSGVYQH +G  +G HAV V+G+GV ++ + YW V
Sbjct: 201 SAIDVLLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDSGLDYWTV 260

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEM-GFN 163
            NSW   WG+ G F+I+RG +E  IE  GF+
Sbjct: 261 RNSWGPDWGEDGYFRIVRGSDECGIEQEGFH 291



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           G P ++D R+++P C  +  + DQ +CGSCWA S     +D  C +         S Q++
Sbjct: 75  GAPESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYV 132

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
           + C     GCNGG P  A+ F    G V
Sbjct: 133 LDCDRKDHGCNGGEPTKAFDFLHSTGTV 160


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 56/83 (67%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDIPYWLVANS 136
           + +  +GP+ A F VY DF  YKSGVYQ     + +G HAV+++GWGVE   PYWL+ NS
Sbjct: 36  KDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNS 95

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           WN  WGD+G FKI RG +E  I+
Sbjct: 96  WNAQWGDNGLFKIRRGTDECRID 118


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/76 (52%), Positives = 49/76 (64%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
            +I  HGP+   F+VY DF  Y+SGVY     D  G HA+++LG+GVEN  PYWL ANSW
Sbjct: 208 SEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGVENGTPYWLCANSW 267

Query: 138 NDHWGDHGTFKILRGE 153
              WG  G FKI +GE
Sbjct: 268 GPSWGMQGFFKIKQGE 283



 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 64/141 (45%), Gaps = 26/141 (18%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P NFDAR++W     +  I DQ  CG+CWA     A+SDR  IASNG      S + +V+
Sbjct: 77  PDNFDARQQWGS--KIHAIRDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVS 134

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLG 307
           C  N +GCNGG+  +AW F   +GVV        + C PY+                  G
Sbjct: 135 CDTNDYGCNGGYMDMAWEFLDQHGVVA-------DSCFPYS-----------------AG 170

Query: 308 KLKTPECKQNCYNPSYESTYR 328
               P C   C + S E  Y 
Sbjct: 171 SGFAPACASKCADGSAEKKYS 191


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 90/190 (47%), Gaps = 19/190 (10%)

Query: 171 SEDDDLETMGCQNAK----GLPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAIS 225
           SE  D E    ++ K     LP  FDAR+K+  C   + H+ DQ  CG+CWAV     ++
Sbjct: 13  SESSDEEIRLVESTKPVVENLPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLN 72

Query: 226 DRLCIASNGYFTGQISAQHIVAC---TPNCW---GCNGGWPQLAWRFWGHNGVVTGGDYN 279
           DRLCI S+G     +SA ++ +C      C    GCNGG    A  F   +GVVTG D+ 
Sbjct: 73  DRLCIKSSGKIQEILSAGYVTSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDFK 132

Query: 280 SQ------EGCQPYTLAPCEH-HVQGP-LQNCTLLGKLKTPECKQNCYNPSYESTYRFDL 331
            Q      +GC PY    C H   +G     C  + +   P C+  C N +Y+ +   D+
Sbjct: 133 PQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDVVQQPVPPCRTTCTNKAYKKSLEKDV 192

Query: 332 KKGKKAHMVL 341
            + K    VL
Sbjct: 193 HRAKSWRKVL 202



 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 57/90 (63%), Gaps = 2/90 (2%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ + F +Y DF  YKSGVY     +   LH ++++GWG ++   YWL  N+W
Sbjct: 210 QEIFDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWGADSVREYWLAMNAW 269

Query: 138 NDHWGDHGTFKILRGENEADIEMGFNNRVE 167
           N+ WGDHG  K+  G+N   +E G  +R +
Sbjct: 270 NEEWGDHGLIKMAFGKNR--LENGTFHRAD 297


>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 581

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 58/273 (21%)

Query: 75  NAMRQIYEHGPL-VAIFSVYADFLQYK--SGVYQHNFGDSIGLHAVRVLGWGVENDIPYW 131
           N M++I+  GP+   I S   D+L+Y    G+Y +        HA+ V+GWGVEN   YW
Sbjct: 186 NMMQEIFNRGPIGCGIAS--NDYLRYNYTGGIYVNTTEVDYHNHAISVVGWGVENGTKYW 243

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEM-------------GFNNRVEANSSEDDDLET 178
           +V NSW  +WG+ G F+++RG N  +IE                 N   +N++   +   
Sbjct: 244 IVRNSWGSYWGEKGYFRLVRGINSLNIESDCAWAVPKDTWTNDVRNTTASNTNSQSNFRQ 303

Query: 179 M-GCQ--------------------NAKGLPRNFDAREKWPECPSLRHIADQSN------ 211
           +  C                     N   LP+++D    W     + +++   N      
Sbjct: 304 LHDCVRQENNQKDQVILSPLPHQYLNGAVLPKSWD----WRNISGVNYLSVTRNQHIPQY 359

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
           CGSCWA    ++I+DR+ IA N  F   ++S Q I+ C      CNGG P   + F    
Sbjct: 360 CGSCWAHGTTSSIADRINIARNRTFPDIELSVQAIINCKAGG-SCNGGQPISVYSFAHKK 418

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           GV        +E CQ Y     +      +Q C
Sbjct: 419 GV-------PEESCQNYVAKNPQKFSCSDIQRC 444



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 7/90 (7%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +I+  GP+    +V   F  Y  GVY       I  H + V+GWGV+   +  YW+  NS
Sbjct: 488 EIFARGPISCGIAVTNKFEAYTGGVYSEKSLTRIN-HEIAVVGWGVDETTNTEYWIGRNS 546

Query: 137 WNDHWGDHGTFKI-LRGEN---EADIEMGF 162
           W  +WG+ G F+I +  EN   E D   G 
Sbjct: 547 WGTYWGEDGFFRIKMHSENLKIETDCSWGV 576



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 18/108 (16%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTG-Q 239
           LP NF     W +   + ++    N      CGSCWA +  + +SDR+ IA    F    
Sbjct: 42  LPENF----FWGDVDGVNYLTVTKNQHIPQYCGSCWAFTATSTLSDRIKIARKAAFPDIL 97

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           IS Q +++C     GC+GG    ++++   N +       + E C PY
Sbjct: 98  ISPQVLISCDDFSNGCHGGNILTSYQWIAQNNI-------TDETCSPY 138


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 59/110 (53%), Gaps = 14/110 (12%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWG--CNGGWPQLAWRFWGHNG 271
           SCWAVS A  ISDR+CIASN      ISA  I AC     G  CNGG+P  AWR +   G
Sbjct: 1   SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60

Query: 272 VVTGGDYNSQEGCQPYTLAPCEHHVQGPL------------QNCTLLGKL 309
            VTGG Y  + GC+PY   PCEHHV G              QN   LGKL
Sbjct: 61  YVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKL 110



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/60 (53%), Positives = 40/60 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           + I  HG L    +V+ DF  Y  GVY H  G S+G HAV++LGWGV+N  PYWL+ANSW
Sbjct: 139 KGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLIANSW 198


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 56/89 (62%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEND-IPY 130
           + M +IY  GP+ A   +Y DF  Y  G+Y+    N G   G H+V+++GWG E+D + Y
Sbjct: 325 DIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKY 384

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 385 WIAANSWGPWWGEHGYFRILRGSNECGIE 413



 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 53/102 (51%), Gaps = 9/102 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LPR F+A EKW     +  + DQ  CGS W +S  +  SDR  I S G    Q+SAQ+I+
Sbjct: 187 LPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQNIL 244

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +CT    GC GG    AWR+    GV+        E C PYT
Sbjct: 245 SCTRRQQGCEGGHLDAAWRYLHKKGVL-------DEKCYPYT 279


>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 620

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 108/265 (40%), Gaps = 63/265 (23%)

Query: 75  NAMRQIYEHGPLVA-IFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
           N M++I+  GP+   I+S       Y  G+Y          H V ++GWGVEN + YW+V
Sbjct: 183 NIMQEIFNRGPVACNIYSTEYLRYNYTGGIYNDTTAYPETNHVVSIVGWGVENGVKYWIV 242

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQN---------- 183
            NSW  +WG+ G ++ LRG N  +IE      V  ++  +D+ + +   N          
Sbjct: 243 RNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTWTNDERDKIQTSNEQEKQESNQE 302

Query: 184 ---------------------------------AKGLPRNFDAREKWPECPSLRHIADQS 210
                                             + +P++FD    W     + +++   
Sbjct: 303 KINNFFKFSNYTCRRESPKNQPQLIKGKQPYQIIQKVPKSFD----WRNVNGVNYLSHTR 358

Query: 211 N------CGSCWAVSVANAISDRLCIASNGYF-TGQISAQHIVACTPNCWGCNGGWPQLA 263
           N      CGSCWA    +++SDR+ IA N  +    +S Q I+ C      C GG PQ  
Sbjct: 359 NQHIPQYCGSCWAHGTTSSLSDRINIARNKTWPDTSLSVQAIINCNAGG-SCEGGNPQTV 417

Query: 264 WRFWGHNGVVTGGDYNSQEGCQPYT 288
           + F  + G+        +E CQ Y 
Sbjct: 418 YEFANNKGI-------PEESCQNYV 435



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +IY  GP+     V + F  Y  G+Y       +  H + V+GWG++      YW+  NS
Sbjct: 494 EIYMRGPISCGIHVSSKFEAYNGGIYSERSILPVINHEIAVVGWGIDEKTKTEYWIGRNS 553

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG+ G F+I   +N   IE
Sbjct: 554 WGTYWGESGFFRIQMHKNNLGIE 576



 Score = 41.2 bits (95), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
           CGSCWA + ++++SDR+ I  N  +    I+ Q +V+C     GC+GG    ++++   N
Sbjct: 66  CGSCWAQAASSSLSDRIKIVRNAQWPDILIAPQVLVSCNKYSNGCHGGSAADSFQWIKEN 125

Query: 271 GVVTGGDYNSQEGCQPY 287
            +       + E C PY
Sbjct: 126 NI-------TDESCSPY 135


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 40/76 (52%), Positives = 49/76 (64%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
            +I  HGP+   F+VY DF  Y+SGVY     D  G HA+++LG+GVEN  PYWL ANSW
Sbjct: 205 SEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGVENGTPYWLCANSW 264

Query: 138 NDHWGDHGTFKILRGE 153
              WG  G FKI +GE
Sbjct: 265 GPAWGMSGFFKIKQGE 280



 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 52/102 (50%), Gaps = 12/102 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P NFDAR++W     +  I DQ  CGSCWA     A SDR  I         +S + +V
Sbjct: 76  VPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDRFAINGKDVI---LSPEDLV 130

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C  N +GCNGG+  +AW +   +G  T       + C PY+
Sbjct: 131 SCDTNDYGCNGGYMDVAWEYLADHGAAT-------DSCFPYS 165


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 8/89 (8%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN---FGDSIGLHA-----VRVLGWGVENDIPY 130
           +I E+GP+ A F V  DF  Y SGVY+H      D+   HA     V++LGWGVEN I Y
Sbjct: 321 EIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENGIKY 380

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           WL ANSW   WG+ G FKILRGENE +IE
Sbjct: 381 WLGANSWGTKWGEDGYFKILRGENECNIE 409



 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 55/105 (52%), Gaps = 10/105 (9%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K +P++FDAR+KW     +  I DQ NC S WA S     SDRL I S+G     +S QH
Sbjct: 177 KKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQH 234

Query: 245 IVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +++C T    GC+GG    AW F    GVV+         C PYT
Sbjct: 235 LLSCNTRGQRGCSGGHIDRAWWFMRKRGVVS-------NDCYPYT 272


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
           +A+  +  HGP+VA F+V  DF+ YKSGVYQH +G  +G HAV ++G+GV ++ + YW V
Sbjct: 265 SAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDSGLDYWTV 324

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NSW   WG+ G F+I+RG +E  IE
Sbjct: 325 RNSWGPDWGEDGYFRIVRGGDECGIE 350



 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 2/88 (2%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           G P ++D R+++P C  +  + DQ NCGSCWA S     +D  C +         S Q++
Sbjct: 139 GAPESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYV 196

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
           + C     GCNGG P  A+ F  + G V
Sbjct: 197 LDCDRKDHGCNGGEPVNAFNFLHNTGTV 224


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLV 133
           +A+  +  HGP+VA F+V  DF+ YKSGVYQH +G  +G HAV ++G+GV ++ + YW V
Sbjct: 268 SAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDSGLDYWTV 327

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NSW   WG+ G F+I+RG +E  IE
Sbjct: 328 RNSWGPDWGEDGYFRIVRGGDECGIE 353



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 2/88 (2%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           G P ++D RE++P C  +  + DQ +CGSCWA S     +D  C +         S Q++
Sbjct: 142 GAPESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYV 199

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVV 273
           + C     GCNGG P  A+ F  + G V
Sbjct: 200 LDCDRKDHGCNGGEPVNAFNFLHNTGTV 227


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 3/86 (3%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND---IPYWLV 133
           M+ +   GP+ A  +VY DFL Y+SGVY+H +G  I  HAV ++G+G  +D    PYW+V
Sbjct: 209 MQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAVEIIGYGAADDEDSTPYWIV 268

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NS    WG+ G F I+RG NE DIE
Sbjct: 269 KNSLGSGWGEEGYFNIVRGSNECDIE 294



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 50/104 (48%), Gaps = 9/104 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P+C  +  + DQ +CGSCWA S  +A  DR C+          S Q+ +
Sbjct: 78  IPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQGLDSAGVPYSQQYTI 135

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           +C     GC GG     W F   +G  T         C PYT A
Sbjct: 136 SCDYLDLGCAGGLSFSVWTFLTEHGTTT-------LECVPYTDA 172


>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 218

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 79/163 (48%), Gaps = 23/163 (14%)

Query: 162 FNNRVEANSSEDDDLETMGCQN---AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAV 218
           F   +    + + DLE +        K LP+ FDAR+ WP+C ++  I DQ +CGSCWA 
Sbjct: 70  FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 129

Query: 219 SVANAISDRLCIASNGYFTGQISAQHIVACTPNCW--GCNGGWPQLAWRFWGHNGVVTGG 276
               ++SDR CI  N   +  +S   ++AC       GC+GG+P  AWR++  +GVVT  
Sbjct: 130 GAVESLSDRFCIHYN--LSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVT-- 185

Query: 277 DYNSQEGCQPY-TLAPCEHHVQGPLQNCTLLGKLKTPECKQNC 318
                E C PY     C H    PL          TP+C + C
Sbjct: 186 -----EECDPYFDTTGCSHPGCEPLY--------PTPKCHRKC 215


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 70/121 (57%), Gaps = 18/121 (14%)

Query: 57  TSIPLSHYFKKAHMVPRC-----------NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
           T+ P  + F+K++ + RC           + M++I + GP+ AI  VY DF  YK G+YQ
Sbjct: 339 TNGPCPNAFEKSNRLYRCASHYRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQ 398

Query: 106 HN--FGDSIGLHAVRVLGWGVEND-----IPYWLVANSWNDHWGDHGTFKILRGENEADI 158
           H+   G     H+V++LGWG   D       +W+ ANSW   WG++G F+ILRG+NE DI
Sbjct: 399 HSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDI 458

Query: 159 E 159
           E
Sbjct: 459 E 459



 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 3/95 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P  F A  +WPE   +    DQ NCG+ WA S A+  +DR+ I S G  T  +SAQ+++
Sbjct: 222 FPAIFSAIYEWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLI 279

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
           +C T N  GCNGG    AWR+   +GVV+   Y S
Sbjct: 280 SCDTRNQHGCNGGSIDGAWRYLKTHGVVSYACYPS 314


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +I  +GP++A ++V+ DF  +KSGVY +  G  +G H+V+V+GWG E  IPYWL+ANSW 
Sbjct: 207 EILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGTEEGIPYWLIANSWG 266

Query: 139 DHWGD-HGTFKILRGENEADIE 159
             WG+  G FK+ RG NE  IE
Sbjct: 267 SEWGELGGFFKMRRGTNECWIE 288



 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 67/104 (64%), Gaps = 2/104 (1%)

Query: 187 LPRNFDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           +P +FDAREKWPEC   +  I +Q NCGSCWA +    ++DRLCI+S G      S +++
Sbjct: 76  IPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENL 135

Query: 246 VACTPNCWGCNGG-WPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           + C  +C     G + + AW ++ + G+ +GGDYNS EGCQPY+
Sbjct: 136 LTCCKDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPYS 179


>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
 gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
          Length = 488

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 53/94 (56%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL---------HAVRVLGWGVE 125
           N M ++Y  GPL   F VY DF  YK GVY H+      +         HAV ++GWG E
Sbjct: 384 NMMYELYHGGPLAIAFEVYDDFFNYKGGVYTHSTALKTKIAEPGWEETNHAVLLVGWGEE 443

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           N +PYWLV NSW   WG +G FKI RG +E D E
Sbjct: 444 NGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCE 477



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 15/110 (13%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSN----CGSCWAVSVANAISDRLCIASNGYFT 237
           ++   LP+ F     W     +  +    N    CGSC+A S ++    R+ + +NG  T
Sbjct: 251 EDVNALPKEF----SWTNVNGMNLVVPVRNQGVFCGSCYAFSSSDMFGSRVRVITNGTKT 306

Query: 238 GQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
              S Q IV C+    GC+GG+  L  ++    G+       ++E C PY
Sbjct: 307 PVYSPQDIVECSAYSQGCDGGFMYLVSKYAEDYGL-------AEESCDPY 349


>gi|308811264|ref|XP_003082940.1| cysteine proteinase (ISS) [Ostreococcus tauri]
 gi|116054818|emb|CAL56895.1| cysteine proteinase (ISS) [Ostreococcus tauri]
          Length = 362

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 65/123 (52%), Gaps = 15/123 (12%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  FD REKWP+C +L     DQ  CGSCWAV+ A A++DRLCIA+NG     +SA  +
Sbjct: 88  LPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQL 147

Query: 246 VACTPNCWGC--------------NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           ++C  +                   GG+P  A+      GVV+GG    Q+ C PY  AP
Sbjct: 148 LSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGGLNGDQDTCMPYPFAP 207

Query: 292 CEH 294
           C H
Sbjct: 208 CHH 210



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 14/97 (14%)

Query: 79  QIYEHGPLVA-IFSVYADFLQYKSGVYQHN-----FGDSIGLHAVRVLGWGVEND-IPYW 131
           +I+E GP+   +  VY +F QY+ GVY+ +      G + G H + V+GWG   + + YW
Sbjct: 256 EIFERGPVTTFVGDVYDEFYQYERGVYKLSKDPAARGKNHGGHVMEVIGWGKSAEGVRYW 315

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
            V NSW + WG+ G  +I  G      E+   + VEA
Sbjct: 316 KVYNSWLN-WGERGYGEIAVG------ELSIGDNVEA 345


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M+ +   GPL   F+VY+DF+ Y+SGVYQH +G   G HAV ++G+G ++D + YW++ N
Sbjct: 206 MKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVDYWIIKN 265

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+I+R  NE  IE
Sbjct: 266 SWGPDWGEDGYFRIIRMTNECGIE 289



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 9/108 (8%)

Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
           G  +A   P +FD RE++P C  +  + DQ  CGSCWA S   ++ DR C A       +
Sbjct: 67  GTVSATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVK 124

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            S Q++V+C      C+GGW    WRF    G  T       + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLTKTGTTT-------DECVPY 165


>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
          Length = 188

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/83 (49%), Positives = 58/83 (69%), Gaps = 5/83 (6%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN----FGDSIGLHAVRVLGWGVENDIPYW 131
           AM+ I+++GP+   F +Y D + YKSGVYQ++    F D   +H+V++ GWG EN +PYW
Sbjct: 93  AMKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDF-DFFTVHSVKIFGWGEENGVPYW 151

Query: 132 LVANSWNDHWGDHGTFKILRGEN 154
           LVANS+   WG +GTFKI RG +
Sbjct: 152 LVANSFGTDWGYNGTFKISRGND 174



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 39/55 (70%), Gaps = 1/55 (1%)

Query: 282 EGCQPYTLAPCE-HHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           +GCQPYT+ PC+  + + P  +CT   + +TP C++ CYNP+Y +++R D+ KGK
Sbjct: 30  QGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK 84


>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
 gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
          Length = 268

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/109 (41%), Positives = 63/109 (57%), Gaps = 15/109 (13%)

Query: 185 KGLPRN------FDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG 238
           K +P+N      FDAREKW +C  +  I +Q  CGSCWA S + A SDRLCIA+NG    
Sbjct: 81  KKMPKNLKAASHFDAREKWEDC--IHEIRNQEECGSCWAFSASEAFSDRLCIATNGSVNI 138

Query: 239 QISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            +S Q++V+C    +GC+GG+   AW F  + G+ +       + C PY
Sbjct: 139 VLSPQYMVSCDATDYGCDGGYLNNAWNFLANTGIPS-------DECVPY 180



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 30/47 (63%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           +  + I E+G + + FSVY DF  YKSGVY H  G   G HA++V+G
Sbjct: 221 DIQKDIQENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGGHAIKVIG 267


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/111 (42%), Positives = 68/111 (61%), Gaps = 14/111 (12%)

Query: 63  HYFKKAHMVP--RCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI------- 112
           H+    + VP    + M++IY +GP+ A+  V  DF  Y+SGVY+H    +S+       
Sbjct: 327 HFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRS 386

Query: 113 GLHAVRVLGWGVEND----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G H+VR+LGWGV+      I YWL ANSW   WG++G F+I+RGE+E+ IE
Sbjct: 387 GWHSVRILGWGVDRSQYRPIKYWLCANSWGHGWGENGYFRIVRGEDESQIE 437



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 50/104 (48%), Gaps = 13/104 (12%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  FDAR +W     L H + DQ +C + WA S A   SDRL I S G    ++S Q +
Sbjct: 200 LPEEFDARIRWS---GLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDL 256

Query: 246 VACTPNC--WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           ++C        C GG P   WRF  + G V+       E C PY
Sbjct: 257 MSCLNGGRRVVCQGGHPDRGWRFLLNYGGVS-------EECYPY 293


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 57/90 (63%), Gaps = 1/90 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLV 133
           + M ++Y++GP+   F +Y DF  YKSGVY+   G  +G HA +++GWG  +    YWL+
Sbjct: 240 DIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLL 299

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGFN 163
           AN WN  WGD G FKI+RG NE  IE   N
Sbjct: 300 ANQWNRGWGDDGYFKIIRGTNECGIEGDVN 329



 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR KW  C ++  I DQ +CG+CWA      + DR CI  +      +S   +V
Sbjct: 97  LPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECLQDRFCIHHS--VNVSLSVNDLV 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTG--GDYNSQEGCQ 285
           AC       GC+GG+P  AW+++  NGVVT     +  Q GCQ
Sbjct: 155 ACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ 197


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 8/93 (8%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
           + M++I   GP+ A   VY DF  YKSG+Y+H+    +   G H+VR++GWG E      
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGP 399

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            + YWLVANSW  +WGD+G FKI +G NE +IE
Sbjct: 400 PLKYWLVANSWGYNWGDNGLFKIQKGTNECEIE 432



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 3/92 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +   LPR FD+R +W     +  I DQ  CG+ WAVS A+  SDR  I S G    ++SA
Sbjct: 199 DPDALPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGAEAPELSA 256

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
           Q +++C      GC GG+   AW F    G+V
Sbjct: 257 QQLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV 288


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/89 (44%), Positives = 56/89 (62%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF---GDSIGLHAVRVLGWGVEND-IPY 130
           + M +I+  GP+ A  +VY DF  Y  G+Y+H     G  +G H+V+++GWG E+D   Y
Sbjct: 279 DIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKY 338

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+  NSW   WG+HG F+ILRG NE  IE
Sbjct: 339 WIATNSWGTWWGEHGNFRILRGSNECGIE 367



 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 54/102 (52%), Gaps = 9/102 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LPR+F++ +KW     +  + DQ  CGS W +S A+  SDR  I S G    Q+S Q+I+
Sbjct: 142 LPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNIL 199

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +CT    GCNGG    AWR+    GVV        E C PY 
Sbjct: 200 SCTRRQQGCNGGHLDAAWRYLHKQGVV-------DESCYPYV 234


>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 159

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 57/90 (63%), Gaps = 1/90 (1%)

Query: 65  FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           F +   +P+ N  ++I+ +GP++ + S+Y D   YK+GVY H  G   G+H ++++GWGV
Sbjct: 59  FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 117

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRGEN 154
           E+   YWL  NSWN+ WGDHG  K+  G  
Sbjct: 118 ESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 147


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 2/103 (1%)

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A+SDRLCI S+G F   +SA  +++C  +C +GC+GG+P +AW FW  +G+VTGG  
Sbjct: 2   AVEAMSDRLCIHSSGAFNKSLSAVDLLSCCKDCGYGCDGGFPPMAWDFWKTHGIVTGGSK 61

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNP 321
               GC+PY    C+HH QG    C       TP+C ++C  P
Sbjct: 62  EEPAGCRPYPFPKCQHHSQGHYPPCPRR-IYPTPKCVKHCDTP 103



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 27/52 (51%), Positives = 39/52 (75%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
           M++I  +GP+ A F V+ DF +YKSG+Y H +G S+G HA+R+LGWG EN +
Sbjct: 128 MKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGV 179


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M+ +   GPL   F VY+DF+ Y+SGVYQH +G   G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRN 266

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+++RG N+  IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290



 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P C  +  + DQ  CGSCWA S      DR C+A       + S Q++V
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C      CNGGW    W+F    G  T       + C PY
Sbjct: 133 SCDHGDMACNGGWLPNVWKFLTKTGTTT-------DECVPY 166


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 18/121 (14%)

Query: 57  TSIPLSHYFKKAHMVPRC-----------NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
           T+ P  +  +K++ + RC           N M++I + GP+ AI  VY DF  YK G+Y+
Sbjct: 339 TNGPCPNALEKSNRLYRCASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYR 398

Query: 106 HN--FGDSIGLHAVRVLGWGVEND-----IPYWLVANSWNDHWGDHGTFKILRGENEADI 158
           H+   G     H+V++LGWG   D       +W+ ANSW   WG++G F+ILRG+NE DI
Sbjct: 399 HSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDI 458

Query: 159 E 159
           E
Sbjct: 459 E 459



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 3/95 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P  F A   WPE   +    DQ NCG+ WA S A+  +DR+ I S G  T  +S Q+++
Sbjct: 222 FPVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLI 279

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
           +C T N  GCNGG    AWR+   +GVV+   Y S
Sbjct: 280 SCDTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPS 314


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M+ +   GPL   F VY+DF+ Y+SGVYQH +G   G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRN 266

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+++RG N+  IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 50/101 (49%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P C  +  + DQ  CGSCWA S      DR CIA       + S Q++V
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKKPVKYSPQYVV 132

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C      CNGGW   AW+F    G  T       + C PY
Sbjct: 133 SCDHGNMACNGGWLPNAWKFLTKTGTTT-------DECVPY 166


>gi|239788200|dbj|BAH70790.1| ACYPI000013 [Acyrthosiphon pisum]
          Length = 165

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 53/75 (70%), Gaps = 1/75 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +PR FDAR +W  C ++  + DQ +CGSCWA++ ++A +DRLC+A+NG F   +SA+ I 
Sbjct: 88  IPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 147

Query: 247 ACTPNC-WGCNGGWP 260
            C   C +GCNGG+P
Sbjct: 148 FCCHTCGFGCNGGYP 162


>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 1/88 (1%)

Query: 65  FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           F +   +P+ N  ++I+ +GP++ + S+Y D   YK+GVY H  G   G+H ++++GWGV
Sbjct: 30  FGRLPAIPQ-NIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 88

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
           E+   YWL  NSWN+ WGDHG  K+  G
Sbjct: 89  ESGQDYWLAVNSWNEEWGDHGMIKLAVG 116


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 59/93 (63%), Gaps = 8/93 (8%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
           + M++I   GP+ A   VY DF  YK+GVY+H+    +   G H++R++GWG E      
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGP 399

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            + YWLVANSW  HWG++G F+I RG NE +IE
Sbjct: 400 PLKYWLVANSWGRHWGENGLFRIQRGTNECEIE 432



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +   LPR FDAR +WP    +  I DQ  CG+ WAVS A+  SDR  I S G    ++SA
Sbjct: 199 DPDALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSA 256

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVV 273
           QH+++C      GC GG+   AW F    G+V
Sbjct: 257 QHLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV 288


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 55/84 (65%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
           M+ +   GPL   F+VY+DF+ Y+ GVYQH +G   G HAV ++G+G  E D+ YW++ N
Sbjct: 205 MKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVDYWIIRN 264

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+I+R  NE  IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 52/108 (48%), Gaps = 9/108 (8%)

Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
           G  +A   P +FD RE++P C  +  + DQ  CGSCWA S   ++ DR C A       +
Sbjct: 67  GTVSATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVK 124

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            S Q++V+C      C+GGW    WRF    G  T       + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLTKTGTTT-------DECVPY 165


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 65/117 (55%), Gaps = 3/117 (2%)

Query: 46  KKKKKKRLYLPTSIPLSHYFKKAHMVP---RCNAMRQIYEHGPLVAIFSVYADFLQYKSG 102
           +++    +  P S     +F KAH        +   +I  +GP++A F +Y DF  YKSG
Sbjct: 77  EERCTSNITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSG 136

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +Y H  GD  G    +++GWGV+N +PYWL  + W   +G++G  +ILRG NE  IE
Sbjct: 137 IYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFMRILRGVNEVHIE 193



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 50/87 (57%), Gaps = 3/87 (3%)

Query: 253 WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTP 312
           WGC+G WP+   ++W  +G+ TGG+Y+ Q GC+PYT+ PC+        +    G   TP
Sbjct: 16  WGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVPCPG-YHTP 74

Query: 313 ECKQNCY-NPSYESTYRFDLKKGKKAH 338
            C++ C  N ++  +Y+  +K   KAH
Sbjct: 75  VCEERCTSNITWPISYK-QVKHFGKAH 100


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
           M+ +   GPL   F+VY+DF+ Y+ GVYQH +G + G HAV ++G+G  E D+ YW++ N
Sbjct: 205 MKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGYGTDEYDVDYWIIRN 264

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+I+R  NE  IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288



 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 54/108 (50%), Gaps = 9/108 (8%)

Query: 180 GCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQ 239
           G  +A  +P +FD RE++P C  +  + DQ  CGSCWA S   ++ DR C+A       +
Sbjct: 67  GTVSATQVPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVAGLDKKAVR 124

Query: 240 ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            S Q++V+C      C+GGW    WRF    G  T       + C PY
Sbjct: 125 YSPQYVVSCDRGDMACDGGWLPSVWRFLVKTGTTT-------DECVPY 165


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 51/72 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ A+ ++Y DF  YKSGVY H  G  +  H ++++GWGVE+   YWL  N+W
Sbjct: 320 QEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAW 379

Query: 138 NDHWGDHGTFKI 149
           N+ WGDHG  K+
Sbjct: 380 NEEWGDHGMIKL 391



 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 96/220 (43%), Gaps = 21/220 (9%)

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLET--MGCQNA--KGLPRN 190
           NS    W         +G +  D+  G +N    +S+ D+  E   +G  N     LP +
Sbjct: 89  NSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEGGENRLLGPTNPVLTTLPSS 148

Query: 191 FDAREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC- 248
           FDAR+K+  C   + H+ +Q  C +CWA +     +DR+CI S G  T  +S  ++ +C 
Sbjct: 149 FDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYLTSCC 208

Query: 249 -----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQE------GCQPYTLAPCEH--H 295
                 P   GC  G       F  ++G+VTGG+Y   E      GC PY    C H   
Sbjct: 209 NRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPG 268

Query: 296 VQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGK 335
           ++     C  +  L  P C   C N +Y ++ + D  + K
Sbjct: 269 LESKYPRCAQVRDL--PACATTCPNKAYGTSMQKDTHRAK 306


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 53/90 (58%), Gaps = 5/90 (5%)

Query: 76  AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIG---LHAVRVLGWGVENDIPYWL 132
           A   IY +GP++A+F +Y D   YKSGVY  +  DS      HA RV+GWGVE+ + YWL
Sbjct: 164 AKADIYLNGPIIAVFDLYTDIYNYKSGVYIKS--DSATYKETHAGRVIGWGVEDGVQYWL 221

Query: 133 VANSWNDHWGDHGTFKILRGENEADIEMGF 162
            ANSW   WG  G FKI  G NE   E  F
Sbjct: 222 AANSWGTGWGQQGLFKIRSGTNEVGFEANF 251



 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSN-CGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           +A  LP +FD+R+KW +C S   + DQ   C SCWA++    ++DRLC+AS G     +S
Sbjct: 29  DASNLPASFDSRQKWSDCFS--PVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKKVLS 86

Query: 242 AQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q ++ C  N   GC GG       ++  NGVVT       E C+ Y
Sbjct: 87  PQELIDCDRNGNLGCGGGRLDTPLAYFRDNGVVT-------EKCESY 126


>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
 gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
          Length = 257

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 62/103 (60%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P++FDAR +WP C  +  I +Q  CGSCWA S +  +SDRLCIASNG     +S Q +V
Sbjct: 31  IPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALV 88

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C      GCNGG PQLAW +   +G+ T        GC PYT
Sbjct: 89  SCDIFGNQGCNGGIPQLAWEYMELHGIPT-------YGCFPYT 124



 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 46/75 (61%), Gaps = 3/75 (4%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE--NDIPYWLVA 134
           + I + GP+     VY+DF+ Y SGVY    G S +G HA++++GWG +  ++  YW+VA
Sbjct: 165 QDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSLLGGHAIKIVGWGFDQASNQNYWIVA 224

Query: 135 NSWNDHWGDHGTFKI 149
           NSW   WG  G F I
Sbjct: 225 NSWGPSWGIDGFFWI 239


>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 1/88 (1%)

Query: 65  FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           F +   +P+ N  ++I+ +GP++ + S+Y D   YK+GVY H  G   G+H ++++GWGV
Sbjct: 30  FGRLPAIPQ-NIKQEIFTNGPVIGMLSLYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 88

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
           E+   YWL  NSWN+ WGDHG  K+  G
Sbjct: 89  ESGQDYWLAVNSWNEEWGDHGMIKLAVG 116


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/96 (46%), Positives = 61/96 (63%), Gaps = 4/96 (4%)

Query: 67  KAHMVPRCNAMRQIYEHGPLVAIFSV--YADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           + H  P  + M ++Y++GP+   F+     DF  YKSGVY+H  G  +G HAV+++GWG 
Sbjct: 233 RVHSNPH-DIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGT 291

Query: 125 EN-DIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            +    YWL+AN WN  WGD G FKI+RGENE  IE
Sbjct: 292 SDAGEDYWLLANQWNRGWGDDGYFKIIRGENECGIE 327



 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 70/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR +W  C ++ +I DQ +CG+CWA +   A+ DR CI  N   +  +S   ++
Sbjct: 97  LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDLL 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++  +GVVT       E C PY     C+H    P    
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 204 ----AYPTPKCQRKC 214


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 57/96 (59%), Gaps = 7/96 (7%)

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFW 267
           DQS CGSCWA  V  A +DRLCI S+G FT  +SA  + ACT   +GC GG P  AW + 
Sbjct: 1   DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLF-FGCGGGDPYSAWSWV 59

Query: 268 GHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQ 297
              G+ TGGDY ++      +GC PY   PC HH+ 
Sbjct: 60  HDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHIN 95



 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 51/74 (68%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           +A   I   GP+ A F+VY DFL Y+SGVY+H  G  +G HAV+++GWG ++   YWL  
Sbjct: 127 DAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAV 186

Query: 135 NSWNDHWGDHGTFK 148
           NSWN+ WGDHG F+
Sbjct: 187 NSWNEDWGDHGLFR 200


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 68/117 (58%), Gaps = 12/117 (10%)

Query: 55  LPTSIPLSHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS 111
           LPT +  ++ +K        N    M +I +HGP+ AI  V+ DF  YKSG+Y+H+   S
Sbjct: 299 LPTKVDRTNMYKMGPAFSLNNETDIMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASS 358

Query: 112 IG-----LHAVRVLGWGVEND----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            G      H+VR++GWG E +      YW+  NSW   WG++G F+I+RG+NE +IE
Sbjct: 359 AGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIE 415



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 50/104 (48%), Gaps = 9/104 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA   WP    +  + DQ  CGS WA+S A+  SDR  I S G    Q++ Q I+
Sbjct: 186 LPTHFDATTYWPG--FIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQII 243

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           +C     GC+GG    AW +    G V        + C PY  A
Sbjct: 244 SCVRRSQGCSGGHLDTAWNYVRKVGTV-------NDECYPYISA 280


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 56/94 (59%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWG--VEND-- 127
           + MR+I   GP+ A   VY DF  Y+SG+Y H     +   G H+VR++GWG  +  D  
Sbjct: 334 DIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSG 393

Query: 128 --IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             I YWLV NSW   WG++G F+I RG NE DIE
Sbjct: 394 LPIKYWLVVNSWGQEWGENGLFRIRRGINECDIE 427



 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 3/92 (3%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + + LPR FDAR +W     +  + DQ  CG+ WA+S A   SDR  + S G  +  +SA
Sbjct: 193 DPESLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGTDSVLLSA 250

Query: 243 QHIVACTPNCW-GCNGGWPQLAWRFWGHNGVV 273
           QH+++C      GC+GG+   AW F    G+V
Sbjct: 251 QHLLSCNKKGQRGCDGGYLDRAWLFMRKFGLV 282


>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
          Length = 91

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 41/78 (52%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 83  HGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVENDIPYWLVANSWNDHW 141
           H   V +  V+ DF   +  V   +  D + G HA+R++GWGVE D+PYWLVANSWN  W
Sbjct: 4   HSAGVRLSPVFTDFGHLQGQVCTSDTVDVLMGGHAIRIIGWGVEEDVPYWLVANSWNREW 63

Query: 142 GDHGTFKILRGENEADIE 159
           GD+G FKILRG NE  IE
Sbjct: 64  GDNGYFKILRGSNECGIE 81


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 53/77 (68%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ A+ ++Y DF  YKSGVY H  G  +  H ++++GWGVE+   YWL  N+W
Sbjct: 139 QEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGVESGQEYWLAMNAW 198

Query: 138 NDHWGDHGTFKILRGEN 154
           N+ WGDHG  K+  G+ 
Sbjct: 199 NEEWGDHGMIKLAVGKT 215



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 16/126 (12%)

Query: 224 ISDRLCIASNGYFTGQISAQHIVAC------TPNCWGCNGGWPQLAWRFWGHNGVVTGGD 277
            +DR+CI S G  T  +S  ++ +C       P   GC  G       F  ++G+VTGG+
Sbjct: 2   FNDRVCIKSGGKTTDILSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGE 61

Query: 278 YNSQE------GCQPYTLAPCEH--HVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRF 329
           Y   E      GC PY    C H   ++     C  +  L  P C   C N +Y ++ + 
Sbjct: 62  YKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDL--PACATTCPNKAYGTSMQK 119

Query: 330 DLKKGK 335
           D  + K
Sbjct: 120 DTHRAK 125


>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
          Length = 351

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 61/101 (60%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD R KWP+C  LR I DQ+NCG+CWA + +  ++DR+CI +NG    ++S Q +V
Sbjct: 120 IPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSPQDMV 177

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            C+ + +GC GG+   A  +  + GV       ++E C PY
Sbjct: 178 DCSHDNFGCEGGYLMNALDYLMNEGV-------TKESCTPY 211



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGW-GVENDIPYWLVANS 136
           R + ++GPL+   +VY DF+ Y +G Y+   G+ +G HAV+++GW   +     WL+ N 
Sbjct: 250 RDLMQNGPLMVGLTVYEDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTSWLIQNQ 309

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLE 177
           WND WG+ G   IL  ENE  I+      +    + D DLE
Sbjct: 310 WNDDWGEQGFGYIL--ENEVGID-----SIGVGCTPDIDLE 343


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 40/75 (53%), Positives = 51/75 (68%), Gaps = 3/75 (4%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN---DIPYWLVA 134
           + I  +GP+ A FSVY DF  YKSGVY+H  G   G HA++++GWGV +   D PYW+VA
Sbjct: 215 KDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWIVA 274

Query: 135 NSWNDHWGDHGTFKI 149
           NSWN +WG  G F I
Sbjct: 275 NSWNTNWGQEGFFWI 289



 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 55/99 (55%), Gaps = 9/99 (9%)

Query: 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT 249
           +FDAR KW +C  +  I DQ  CGSCWA S +  +SDR CIASNG     +S ++++ C 
Sbjct: 86  SFDARTKWGKC--VHPIRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCD 143

Query: 250 PNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
              +GC+GG+   AW F    G+         + C PYT
Sbjct: 144 STDYGCDGGYLNNAWAFLAGTGI-------PSDKCDPYT 175


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 52/78 (66%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           I + GP+   F+VYADF  YKSG+Y H  G + G HAV++LGWG +    YW+VANSW +
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271

Query: 140 HWGDHGTFKILRGENEAD 157
            WG+ G F I +G++  D
Sbjct: 272 SWGEKGFFNIRQGDSGID 289



 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 10/117 (8%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           S   D+      NA  +P +FD+R +W  C  +  I DQ+ CGSCWA + + ++SDR CI
Sbjct: 63  SHSSDIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCI 119

Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AS G     +S Q +V+C  N +GC+GG+  LAW++    GV +       + C+PY
Sbjct: 120 ASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY 169


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
           + M +IY  GP+ A   VY DF  Y SGVY+    N G   G H+V+++GWG E N   Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+ G F+ILRG NE  IE
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIE 414



 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 53/103 (51%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP  F+A EKW     +  + DQ  CGS W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 188 GLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 245

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DESCYPYT 281


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
           + M +IY  GP+ A   VY DF  Y SGVY+    N G   G H+V+++GWG E N   Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+ G F+ILRG NE  IE
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIE 414



 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 53/103 (51%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP  F+A EKW     +  + DQ  CGS W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 188 GLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 245

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DESCYPYT 281


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 52/78 (66%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           I + GP+   F+VYADF  YKSG+Y H  G + G HAV++LGWG +    YW+VANSW +
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271

Query: 140 HWGDHGTFKILRGENEAD 157
            WG+ G F I +G++  D
Sbjct: 272 SWGEKGFFNIRQGDSGID 289



 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 10/117 (8%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           S   D+      NA  +P +FD+R +W  C  +  I DQ+ CGSCWA + + ++SDR CI
Sbjct: 63  SHSSDIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCI 119

Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AS G     +S Q +V+C  N +GC+GG+  LAW++    GV +       + C+PY
Sbjct: 120 ASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPY 169


>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
 gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
          Length = 218

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 4/85 (4%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV----ENDIPYWLVAN 135
           I   G L A   +Y DF+QY+ GVY+H  G+ +  H+VR++GWG+    +  IPYW+  N
Sbjct: 124 IMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHSVRIVGWGITSPQQGSIPYWICGN 183

Query: 136 SWNDHWGDHGTFKILRGENEADIEM 160
           +W + WG  G F ILRG NE +IE+
Sbjct: 184 NWTEEWGMQGWFWILRGSNECNIEL 208



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 14/99 (14%)

Query: 200 CPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGW 259
           C  L  I D+  CG CWA  VA  +SDR C++S       +S Q++++C  N  GC+ G+
Sbjct: 1   CKQLSLIRDEQQCG-CWAFVVAEVVSDRFCVSSKTKVNEVLSPQYLISCDSNNGGCSYGY 59

Query: 260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYT------LAPC 292
              A++F  + G+VT       E C P+       + PC
Sbjct: 60  FDTAFQFVENQGIVT-------ENCFPFVSGEGNYIPPC 91


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 64/122 (52%), Gaps = 4/122 (3%)

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A++DRLCI SN      IS+  +++C  +C +GC+GG+P  AW FW  NG+VTGG  
Sbjct: 2   AVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGSK 61

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            +  GC+ Y    C HH +GP   C       TP C + C  P  E  Y  D  K K ++
Sbjct: 62  ENPSGCRSYPFPKCNHHGKGPDAPCPEK-IFPTPACNKTCDTP--EVNYILDKTKAKSSY 118

Query: 339 MV 340
            V
Sbjct: 119 NV 120



 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 31/52 (59%), Positives = 40/52 (76%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
           M++I ++GP+ A F VY DFL Y+SGVY H+FG  IG HA+R+LGWG EN I
Sbjct: 128 MKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEENGI 179


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 40/88 (45%), Positives = 59/88 (67%), Gaps = 3/88 (3%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI--GLHAVRVLGWGVENDIPYW 131
           + M  I E GP+ A+ +V+ DF  Y  G+Y+ + +GD+   GLH+VR++GWG +    YW
Sbjct: 322 DIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDRGDKYW 381

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
           +VANSW   WG++G F+I RG NE+ IE
Sbjct: 382 VVANSWGCDWGENGYFRIARGSNESGIE 409



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 54/104 (51%), Gaps = 10/104 (9%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           PR+FDAR +WP    +  + DQ  CGS WAV++A   SDR  I SNG     +S Q +++
Sbjct: 187 PRDFDARRRWPN--FISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQVLLS 244

Query: 248 C-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           C      GC GG   +AW F   +G+V        E C PY  A
Sbjct: 245 CNIRRQQGCRGGHIDVAWNFARGHGLV-------DEECFPYKAA 281


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 12/93 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
           ++  +GP+ A F V+ DF  Y  GVYQH+         S+  G H+VRVLGWGV++    
Sbjct: 404 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 463

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            I YWL ANSW   WG+ G FKILRGEN  +IE
Sbjct: 464 PIKYWLCANSWGTQWGEDGYFKILRGENHCEIE 496



 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 71/149 (47%), Gaps = 16/149 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW     +  IADQ +CGS WAVS     SDRL I S G     +S+Q ++
Sbjct: 258 LPEHFDARDKWGHL--IHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 315

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH--HVQGPLQNC 303
           +C  +   GC GG+   AW +    GVV  GD+     C PY         H   P ++ 
Sbjct: 316 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYVSGQSREPGHCLIPKRDY 368

Query: 304 TLLGKLKTPECKQNC----YNPSYESTYR 328
           T    L+ P   Q+       P Y+ + R
Sbjct: 369 TNRQGLRCPSGSQDSTAFKMTPPYKVSSR 397


>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 58/105 (55%), Gaps = 1/105 (0%)

Query: 56  PTSIPLSHYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL 114
           P S    +Y      V   +AM  QIY +GP+    SV  D   Y++G++  N   S+  
Sbjct: 138 PISSYHKYYISSFDAVDGISAMMDQIYYNGPITCKISVTNDLQNYRNGIFSRNTSSSLYD 197

Query: 115 HAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           H V ++GWG EN+ PYW+V NSW   WG+ G F+ILRG N   IE
Sbjct: 198 HYVNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIE 242



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 1/82 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
           +IYE GP+     V   F +Y  GV+       +G H V V+GWG  E  + YW+  N+W
Sbjct: 471 EIYERGPITCFMVVTEQFQRYTGGVFVEEDHHYLGGHIVEVVGWGRTEEGVEYWIGRNNW 530

Query: 138 NDHWGDHGTFKILRGENEADIE 159
            ++WG+ G F+I+ G N   IE
Sbjct: 531 GENWGEKGWFRIMMGGNNLLIE 552



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 47/108 (43%), Gaps = 12/108 (11%)

Query: 183 NAKGLPRNFDAR--EKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI 240
           N   LP  +D R  +       +R       C +CWA +  +A+SDRL + S G +   +
Sbjct: 325 NLSSLPTQYDIRSLDGVDYSTPIRTQRAPQFCNACWAQAAVSALSDRLQLQSRGAWPMVV 384

Query: 241 -SAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            S Q +V C      C+GG P   +RF   + +       S E CQ Y
Sbjct: 385 LSTQMVVNCATG--SCDGGDPGEVYRFAYMSSI-------SDESCQVY 423


>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 234

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 51/72 (70%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ A+ ++Y DF  YKSGVY H  G  +  H ++++GWGVE+   YWL  N+W
Sbjct: 107 QEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAW 166

Query: 138 NDHWGDHGTFKI 149
           N+ WGDHG  K+
Sbjct: 167 NEEWGDHGMIKL 178


>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 305

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 52/83 (62%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M  +   GP+   F V+ DFL Y  G+Y   +G SIG HAV ++G+G  N+  YW+V NS
Sbjct: 214 MTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNNHDYWIVRNS 273

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG++G F+ILRG NE  IE
Sbjct: 274 WGSDWGENGYFRILRGTNECGIE 296



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 44/101 (43%), Gaps = 9/101 (8%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P   D R+  PEC       DQS+C  C+A +   A+S R CIA        +SAQH+V+
Sbjct: 82  PDRLDYRQTHPEC--FFEPEDQSDCSCCYAFATLGALSTRRCIAKLDASVVPLSAQHMVS 139

Query: 248 CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           C     GC GG    +W F    G +          C PY 
Sbjct: 140 CDHGEAGCQGGGFNTSWAFLETEGAI-------MRDCLPYV 173


>gi|340382603|ref|XP_003389808.1| PREDICTED: hypothetical protein LOC100632176 [Amphimedon
           queenslandica]
          Length = 570

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 34/228 (14%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138
           +IY  GP+           QY  G++       +  H V V+GWGVEN + YW+V NSW 
Sbjct: 194 EIYARGPIGCGIDATEKLEQYTGGIFSERKLLPMINHEVSVVGWGVENGVEYWIVRNSWG 253

Query: 139 DHWGDHGTFKILRGENEADIEM----GFNNRVEANSSEDDDL-----------------E 177
            +WG++G F+I+  ++   IE     G     E N                        +
Sbjct: 254 TYWGENGFFRIMMHKDNLAIETECDWGVPLLKEPNKQHQVHKQQQQQQQEYKCSCVKKSD 313

Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIA--DQSN----------CGSCWAVSVANAIS 225
           ++        P  +   E  P    +R+I   D S           CGSCWA+   +A+S
Sbjct: 314 SVKTHVHTPEPHTYIKLEDIPAAYDIRNINGNDYSTVNRNQHIPQYCGSCWAMGTTSALS 373

Query: 226 DRLCIASNG-YFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGV 272
           DR+ +   G Y    +S Q +V C  N  GC+GG P  A+ +   NGV
Sbjct: 374 DRIKLMRKGAYPVINLSPQVLVDCANNSHGCDGGDPTAAYSYIYENGV 421



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           + +I+  GP+    +V + F QY  GV+    G     H + + GWGV +  + YW+  N
Sbjct: 475 LSEIFARGPIACTIAVTSAFEQYTGGVFNDTTGAKSLDHEISIAGWGVTSGGVKYWIGRN 534

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW  +WG+ G F+++RG +   +E
Sbjct: 535 SWGTYWGEAGWFRLIRGVDNLGVE 558



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 19/123 (15%)

Query: 183 NAKGLPRNFDAREK----WPECPSLRHIADQSNCGSCWAVSVANAISDRLCIA-SNGYFT 237
           N   LP  F   +K    +   P  +HI     CGSCWA+   +A+SDR+ I  +N Y  
Sbjct: 47  NLADLPSQFSWEDKDGQNYLTPPRNQHIPQY--CGSCWAMGTTSALSDRISIMRNNTYPM 104

Query: 238 GQISAQHIVACTPNCWG---CNGGWPQLAWRFWGHNGVV--TGGDYNSQEG-CQPYTLAP 291
            Q++ Q I+    NC G   C GG P   + +   +G+   T  +Y ++ G C P  +  
Sbjct: 105 VQLATQVII----NCRGGGSCQGGNPGGVYEYIHRHGLPDETCQNYEARNGECTPIEI-- 158

Query: 292 CEH 294
           CE+
Sbjct: 159 CEN 161


>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 305

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 61/106 (57%), Gaps = 3/106 (2%)

Query: 57  TSIPLSHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIG 113
           T +  + ++K A   P  N    M  +   GP+   F V+ DFL Y  G+Y   +G S+G
Sbjct: 191 TLVEDAFHYKAASASPLNNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGSSLG 250

Query: 114 LHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            HAV ++G+G  ND  YW+V NSW   WG++G F+ILRG NE  IE
Sbjct: 251 GHAVLIVGYGSMNDHDYWIVRNSWGPDWGENGYFRILRGTNECGIE 296



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 51/121 (42%), Gaps = 15/121 (12%)

Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
            N +EDD           G P   D R+  PEC       DQ  C  C+A +   A+S R
Sbjct: 68  VNITEDD------LYPPDGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATIGALSTR 119

Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            CIA        +S QH+V+C     GC GG  + +W F    GVV       +  C PY
Sbjct: 120 RCIAKLDSQAVSLSVQHMVSCDNGEAGCLGGEFESSWAFLETEGVV-------KSDCLPY 172

Query: 288 T 288
           T
Sbjct: 173 T 173


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVAN 135
           M+ +   GPL   F V++DF+ Y+SGVYQH +G   G HAV ++G+G ++D + YW++ N
Sbjct: 207 MKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKN 266

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+++RG N+  IE
Sbjct: 267 SWGPDWGEDGYFRMIRGINDCSIE 290



 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 49/101 (48%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P C  +  + DQ  CGSCWA S      DR C+A       + S Q++V
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C      CNGGW    W+F    G  T       + C PY
Sbjct: 133 SCDHGDMACNGGWLPNVWKFLTKTGTTT-------DECVPY 166


>gi|56757237|gb|AAW26790.1| unknown [Schistosoma japonicum]
          Length = 170

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 55/81 (67%), Gaps = 1/81 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C S+  I DQS C S WAVS   A+SDR+CI S G  + ++SA  ++
Sbjct: 90  IPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLI 149

Query: 247 ACTPNCW-GCNGGWPQLAWRF 266
           +C  NC  GC+GG+P  AW +
Sbjct: 150 SCCENCGSGCDGGFPGPAWDY 170


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 57/94 (60%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD----SIGLHAVRVLGWGVENDI-- 128
           + M +I + GP+ A   VY DF  YKSGVY  +  +    + G H+V++LGWG E +I  
Sbjct: 329 DIMEEIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYG 388

Query: 129 ---PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               YWL ANSW   WG++G FKI RG NE +IE
Sbjct: 389 QPIKYWLAANSWGQQWGENGFFKIRRGTNECEIE 422



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR KWP    +   ADQ  CG+ WAVS A+  SDR  I S G     +S QH++
Sbjct: 190 LPNSFDARNKWPG--WISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           +C     GC GG    AW F    G+V   DY     C P+T  P
Sbjct: 248 SCNKGQRGCQGGHLSRAWTFIRKFGLVD--DY-----CYPWTGTP 285


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 12/93 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
           ++  +GP+ A F V+ DF  Y  GVYQH+         S+  G H+VRVLGWGV++    
Sbjct: 348 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 407

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            I YWL ANSW   WG+ G FKILRGEN  +IE
Sbjct: 408 PIKYWLCANSWGTQWGEDGYFKILRGENHCEIE 440



 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 56/103 (54%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW     +  +ADQ +CGS WAVS     SDRL I S G     +S+Q ++
Sbjct: 202 LPEHFDARDKWGHL--IHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 259

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C  +   GC GG+   AW +    GVV  GD+     C PY 
Sbjct: 260 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 295


>gi|308488534|ref|XP_003106461.1| CRE-CPR-5 protein [Caenorhabditis remanei]
 gi|308253811|gb|EFO97763.1| CRE-CPR-5 protein [Caenorhabditis remanei]
          Length = 153

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 53/84 (63%)

Query: 175 DLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG 234
           D + +  + A  +P +FDAR+KW  C S+ +I DQS+CGSCWA + A AISDR CIASNG
Sbjct: 70  DEDIVATEVADAIPDSFDARDKWSSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNG 129

Query: 235 YFTGQISAQHIVACTPNCWGCNGG 258
                +S+Q +++C      C  G
Sbjct: 130 AVNTLLSSQDLLSCCVGVLSCGNG 153


>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 118

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 56/88 (63%), Gaps = 1/88 (1%)

Query: 65  FKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           F +   +P+ N  ++I+ +GP++   ++Y D   YK+GVY H  G   G+H ++++GWGV
Sbjct: 18  FGRLPAIPQ-NIKQEIFTNGPVIGALTIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV 76

Query: 125 ENDIPYWLVANSWNDHWGDHGTFKILRG 152
           E+   YWL  NSWN+ WGDHG  K+  G
Sbjct: 77  ESGQDYWLAVNSWNEEWGDHGMIKLAVG 104


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 58/93 (62%), Gaps = 8/93 (8%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
           + M++I   GP+ A   VY DF  YK+G+Y+H+    +   G H+VR++GWG E      
Sbjct: 398 DIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGP 457

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            + YWLV NSW  +WG++G FKI RG NE +IE
Sbjct: 458 PLKYWLVVNSWGYNWGENGLFKIQRGTNECEIE 490



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 10/107 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +   LPR FD+R +W     + ++ DQ  CG+ WA+S A+  +DR  I S G    ++SA
Sbjct: 257 DPDALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGAEDAELSA 314

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           QH+++C      GC GG+   AW F    G+V        + C P+T
Sbjct: 315 QHLLSCNNRGQQGCRGGYLDRAWLFMRKFGLV-------DKDCYPWT 354


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 11/96 (11%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD---SIGLHAVRVLGWGVEND---- 127
           + M +I   GP+ A   VY DF  Y+SG+Y+H       + G H+VR++GWG +      
Sbjct: 339 DIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRY 398

Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               I YWLV NSW   WG+ G F+I RG NE DIE
Sbjct: 399 RNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIE 434



 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 10/106 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + + LPR FDAR +WP    +  I DQ  CG+ WA+S     SDR  + S G  +  +SA
Sbjct: 198 DPESLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSA 255

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           QH+++C       C+GG+   AW +    G+V        E C P+
Sbjct: 256 QHLLSCNNRGQQACSGGYLDRAWLYMRKFGLV-------DEDCYPW 294


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/115 (40%), Positives = 64/115 (55%), Gaps = 10/115 (8%)

Query: 55  LPTSIPLSHYFKKA---HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---F 108
           LPT++     +K A    +    + M +I   GP+ A   VY DF  YK G+Y+H+    
Sbjct: 319 LPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPIST 378

Query: 109 GDSIGLHAVRVLGWGVENDIP----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
            D  G H+VR++GWG E        YW VANSW   WG++G F+ILRG NE +IE
Sbjct: 379 NDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 433



 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +   LPR FD+  KWP    +  I DQ  CGS WA++ A   SDR  I S G     +SA
Sbjct: 201 DPNSLPREFDSEFKWPG--WMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSA 258

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           QH+++C       CNGG+   AW +    G+V        E C PY+
Sbjct: 259 QHLLSCDRRGQQSCNGGYLDRAWSYIRKIGLV-------DEQCFPYS 298


>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
 gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
          Length = 815

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 8/107 (7%)

Query: 59  IPLSHYFKKAHMVP-RCNAMRQIYEHGPLVAIFSVYADFLQY----KSGVYQHNFGD-SI 112
           +PL H+     + P     MR + E G ++  F  +A+F ++    + G+Y    G   I
Sbjct: 525 LPLYHFHPI--LAPNEAVMMRTVQETGSVIVSFRAHANFQEFFMFNRFGLYTTTAGSPEI 582

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           G HAVR++G+GVE ++P+WL+ NSW D WG+HG F++LRG N   IE
Sbjct: 583 GNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIE 629



 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 3/44 (6%)

Query: 188 PR-NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCI 230
           PR +FDAR +WP+CP    +A Q  CGSC+A+ V+   +DR+C+
Sbjct: 386 PREHFDARIEWPQCPF--PVAMQGMCGSCFAIVVSTVGTDRVCV 427


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 55/88 (62%), Gaps = 3/88 (3%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSI--GLHAVRVLGWGVENDIPYW 131
           + M  I   GP + I +VY DF  Y+ G+Y+H   GD +  GLH+VR++GWG + +  YW
Sbjct: 306 DIMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAEDKYW 365

Query: 132 LVANSWNDHWGDHGTFKILRGENEADIE 159
           +VANSW   WG+ G F+I RG +   IE
Sbjct: 366 IVANSWGTSWGEKGYFRIARGHSGTGIE 393



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 55/101 (54%), Gaps = 10/101 (9%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P  FDAR +W     +  IADQ  CGS WAVS+A+ + DR  I S G    ++S+Q +++
Sbjct: 186 PDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLS 243

Query: 248 C-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           C      GCNGG   +A+ F   +G+V+       E C PY
Sbjct: 244 CHLKGQRGCNGGNLDIAFDFVKTHGLVS-------EQCFPY 277


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 11/96 (11%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD---SIGLHAVRVLGWGVEND---- 127
           + M +I   GP+ A   VY DF  Y+SG+Y+H       + G H+VR++GWG +      
Sbjct: 339 DIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRH 398

Query: 128 ----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               I YWLV NSW   WG+ G F+I RG NE DIE
Sbjct: 399 HNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIE 434



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 10/106 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + + LPR FDAR +WP    +  I DQ  CG+ WA+S     SDR  + S G  +  +SA
Sbjct: 198 DPESLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSA 255

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           QH+++C       C+GG+   AW +    G+V        E C P+
Sbjct: 256 QHLLSCNNRGQQACSGGYLDRAWLYMRKFGLV-------DEDCYPW 294


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/88 (45%), Positives = 54/88 (61%), Gaps = 2/88 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--GLHAVRVLGWGVENDIPYWL 132
           N  ++I  HGP+   F+VY  F+ YKSGVY   + + +  G HAV+++GWG E    YWL
Sbjct: 466 NMQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWL 525

Query: 133 VANSWNDHWGDHGTFKILRGENEADIEM 160
           VANSWN  WGD G FKI  G     +++
Sbjct: 526 VANSWNTSWGDEGYFKIAVGAESISLDV 553



 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 10/107 (9%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
            N   +P  F+A  +W     ++ I DQ  CGSCWA S A  +SDR  I  N      +S
Sbjct: 335 DNITDVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKA-EPVLS 391

Query: 242 AQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
            + +V+C     GCNGG    AW +  + G+VT       + C PYT
Sbjct: 392 PEDLVSCDRVDQGCNGGNLGTAWTYLKNTGIVT-------DACFPYT 431


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 51/78 (65%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           I E GP+   F+VY DF  Y SGVY H  GD+ G HAV++LGWG +    YW+VANSW +
Sbjct: 212 IQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGLENYWIVANSWGE 271

Query: 140 HWGDHGTFKILRGENEAD 157
            WG+ G F I +G++  D
Sbjct: 272 DWGEKGYFNIRQGDSGID 289



 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/101 (40%), Positives = 60/101 (59%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+R +W +C  +  I DQ+ CGSCWA + A ++SDR CIAS G     +S Q +V
Sbjct: 78  LPDSFDSRTQWKDC--VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMV 135

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C  + +GC GG+   AW++    GV       S + C+PY
Sbjct: 136 SCDTSNFGCFGGYLDQAWQYLEQQGV-------SSDSCEPY 169


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-----IGLHAVRVLGWGVEND-- 127
           + M +I +HGP+ AI  V+ DF  YKSG+Y+H+   +      G H+VR++GWG E    
Sbjct: 321 DIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGY 380

Query: 128 --IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               YW+  NSW   WG++G F+ILRG NE +IE
Sbjct: 381 EVTKYWIAVNSWGTWWGENGRFRILRGSNECEIE 414



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 51/104 (49%), Gaps = 9/104 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA   WP    +  + DQ  CGS WAVS A+  SDR  I S G  T Q++ Q IV
Sbjct: 185 LPTHFDATNYWPG--FIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIV 242

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           +C     GC+GG    AW +    G V        E C PY  A
Sbjct: 243 SCVRRSQGCSGGHLDTAWSYLRKVGTV-------NEECYPYISA 279


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 59/103 (57%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FDARE+WP C  +  + +Q  CGSCWA S + A+SDRLCIAS G     +S Q +V
Sbjct: 95  IPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQALV 152

Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           AC      GCNGG PQLAW +    G+ T         C PYT
Sbjct: 153 ACDDIGNQGCNGGVPQLAWEYMEWKGLPT-------FECYPYT 188



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 61/107 (57%), Gaps = 8/107 (7%)

Query: 61  LSHYFKKAHMVPRCNAM----RQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLH 115
           +++Y  K   +  CN++     +I  +GP+V    VY DF+ Y SGVY ++   + +G H
Sbjct: 207 MTYYRAKPFSMTTCNSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGH 266

Query: 116 AVRVLGWGVE--NDIPYWLVANSWNDHWGD-HGTFKILRGENEADIE 159
           A+ ++GWG +  + + YW+V NSW+  WG   G F I RG N   I+
Sbjct: 267 AIEIVGWGTDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTNMCGID 313


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 59/93 (63%), Gaps = 8/93 (8%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVEND---- 127
           + M++I   GP+ A   VY DF  Y+SGVY+H+    +   G H+VR++GWG E      
Sbjct: 340 DIMQEILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGP 399

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            + YWLVANSW  +WG++G F+I +G NE +IE
Sbjct: 400 PLKYWLVANSWGHNWGENGLFRIQKGTNECEIE 432



 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 57/107 (53%), Gaps = 10/107 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           +   LPR F++R +WP    +  I DQ  CG+ WAVS A+  SDR  I S G  T ++SA
Sbjct: 199 DPDALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSA 256

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           QH+++C      GC GG+   AW F    G+V        E C P+T
Sbjct: 257 QHLLSCNNRGQQGCKGGYLDRAWLFMRKFGLV-------DEECYPWT 296


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/115 (40%), Positives = 64/115 (55%), Gaps = 10/115 (8%)

Query: 55  LPTSIPLSHYFKKA---HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---F 108
           LPT++     +K A    +    + M +I   GP+ A   VY DF  YK G+Y+H+    
Sbjct: 193 LPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPIST 252

Query: 109 GDSIGLHAVRVLGWGVENDIP----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
            D  G H+VR++GWG E        YW VANSW   WG++G F+ILRG NE +IE
Sbjct: 253 NDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIE 307



 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 52/103 (50%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LPR FD+  KWP   S   I DQ  CGS WA++ A   SDR  I S G     +SAQH++
Sbjct: 79  LPREFDSEFKWPGWMS--EIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 136

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C       CNGG+   AW +    G+V        E C PY+
Sbjct: 137 SCDRRGQQSCNGGYLDRAWSYIRKIGLV-------DEQCFPYS 172


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 54/84 (64%), Gaps = 1/84 (1%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV-ENDIPYWLVAN 135
           M+ +   GPL   F+VY+DF+ Y+ GVYQH  G   G HAV ++G+G  E D+ YW++ N
Sbjct: 205 MKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEMVGYGTDEYDVDYWIIRN 264

Query: 136 SWNDHWGDHGTFKILRGENEADIE 159
           SW   WG+ G F+I+R  NE  IE
Sbjct: 265 SWGPDWGEDGYFRIIRMTNECGIE 288



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 2/88 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P C  +  + DQ +CGSCWA S   ++ DR C A         S Q++V
Sbjct: 74  VPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQYVV 131

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           +C      C+GGW Q  WRF    G  T
Sbjct: 132 SCDHGDMACDGGWLQSVWRFLTKTGTTT 159


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 72/134 (53%), Gaps = 14/134 (10%)

Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
           + FDAR++WP+C ++  + ++ N    WA +     +DR+CIA+NG +   +S + +++C
Sbjct: 89  KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 148

Query: 249 TPNCWGCNGGWPQ--LAWRFWGHNGVVTGGD-YNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +      N GW +  LAW ++  +G+V+GG  YN+ +GCQP  + P           C L
Sbjct: 149 SGIKASAN-GWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPV----------CNL 197

Query: 306 LGKLKTPECKQNCY 319
             K+    C   CY
Sbjct: 198 PTKINKRTCVDYCY 211



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 55/92 (59%), Gaps = 2/92 (2%)

Query: 69  HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVEND 127
           H+ P+ +  +++  +GP+ A  ++Y D   +KSGVY        + L  V+++GWGVEN 
Sbjct: 230 HVKPK-DIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENG 288

Query: 128 IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           + YWL+ NSW + WG +G  KI RG+    +E
Sbjct: 289 VDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVE 320


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
           ++  +GP+ A F V+ DF  Y  GVYQH+         S+  G H+VRVLGWGV++    
Sbjct: 330 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGK 389

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            I YWL ANSW   WG+ G FK+LRGEN  +IE
Sbjct: 390 PIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422



 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 57/103 (55%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KW   P +  +ADQ +CGS W+VS     SDRL I S G     +S+Q ++
Sbjct: 184 LPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLL 241

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C  +   GC GG+   AW +    GVV  GD+     C PY 
Sbjct: 242 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 277


>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
          Length = 349

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
           ++  +GP+ A F V+ DF  Y  GVYQH+         S+  G H+VRVLGWGV++    
Sbjct: 227 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 286

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            I YWL ANSW   WG+ G FKILRG+N  +IE
Sbjct: 287 PIKYWLCANSWGTQWGEDGYFKILRGDNHCEIE 319


>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
 gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
          Length = 158

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/86 (45%), Positives = 53/86 (61%), Gaps = 2/86 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND--IPYWL 132
           + M  +  +GPL A   VY DF  YKSGVY H  G  +G HA++++GWGV++   +PYW+
Sbjct: 62  DMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVGWGVDSASKLPYWI 121

Query: 133 VANSWNDHWGDHGTFKILRGENEADI 158
            ANSW + WG  G F I RG  E  +
Sbjct: 122 CANSWGEDWGLDGYFWIARGRGECGL 147


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 12/93 (12%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHN------FGDSI--GLHAVRVLGWGVEND--- 127
           ++  +GP+ A F V+ DF  Y  GVYQH+         S+  G H+VRVLGWGV++    
Sbjct: 344 ELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGR 403

Query: 128 -IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            I YWL ANSW   WG+ G FKILRG+N  +IE
Sbjct: 404 PIKYWLCANSWGTQWGEDGYFKILRGDNHCEIE 436



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 55/103 (53%), Gaps = 10/103 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+R+KW     +  + DQ +CGS WAVS     SDRL I S G     +S+Q ++
Sbjct: 198 LPEHFDSRDKWGHL--INPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLL 255

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C  +   GC GG+   AW +    GVV  GD+     C PY 
Sbjct: 256 SCNQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYPYV 291


>gi|12958837|gb|AAK09441.1|AF339098_1 cathepsin b-like precursor protein [Ancylostoma ceylanicum]
          Length = 180

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 55/81 (67%), Gaps = 2/81 (2%)

Query: 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA 247
           P +FDAR +WPEC ++  I DQS+CGSCWAV+ A+A+SD +C+ SN      IS   I++
Sbjct: 90  PESFDARTQWPECRAIGTIRDQSSCGSCWAVASASAMSDEMCVQSNSSIKLMISDTDILS 149

Query: 248 CT-PNC-WGCNGGWPQLAWRF 266
           C    C +GC GGWP  A+R+
Sbjct: 150 CCGLECGYGCQGGWPIEAYRW 170


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 57/95 (60%), Gaps = 10/95 (10%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI---GLHAVRVLGWGVE------ 125
           + M++I   GP+ A   VY DF  Y+SGVY+H+    +     H+VR++GWG E      
Sbjct: 337 DIMQEILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSR 396

Query: 126 -NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
              + YWLVANSW   WG++G F+I +G NE +IE
Sbjct: 397 NTPLKYWLVANSWGQQWGENGLFRIQKGTNECEIE 431



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 55/106 (51%), Gaps = 10/106 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           + + LPR FD+R +WP    +  I DQ  CG+ WA+S A   SDR  I S G    ++SA
Sbjct: 196 DPESLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSA 253

Query: 243 QHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           QH+++C      GC+GG    AW F    G+V        E C P+
Sbjct: 254 QHLLSCNNRGQQGCSGGHLDRAWMFMRRFGLV-------DENCYPW 292


>gi|348690656|gb|EGZ30470.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 647

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 64/250 (25%), Positives = 105/250 (42%), Gaps = 57/250 (22%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY  GP+    +V   FL+Y  G++      +   HA+ ++GWG E+ +P+W++ NS
Sbjct: 212 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATETDHAISIVGWGEEDGVPFWVLRNS 271

Query: 137 WNDHWGDHGTFKILRGENEADIE---------------------------MGFNNRVEAN 169
           W   WG+ G  +++RG N   +E                                 V  N
Sbjct: 272 WGSFWGEDGWMRLVRGVNNVGVEGECAFGVPKDDGWPTPTKIEEEEPVQEEEEKKDVVEN 331

Query: 170 SSEDDDLETM--GCQ--------------------NAKGLPRNFDARE----KWPECPSL 203
           + ED  +E+   GC+                    + K LP+ +D R+     +      
Sbjct: 332 TEEDTSVESKLGGCRQKLHFAGGERVISPLPHETIDVKDLPKAWDWRDVNGRNFVTWDKN 391

Query: 204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQL 262
           +HI     CGSCWA    +A+SDR+ I  N  +    +S Q ++ C      CNGG P L
Sbjct: 392 QHIPQY--CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGL 448

Query: 263 AWRFWGHNGV 272
            + +   +G+
Sbjct: 449 VYEYAHRHGI 458



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 8/137 (5%)

Query: 63  HYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLG 121
           +Y  +   V   + M+ +IY+ GP+         F  Y  G+Y  +    +  H + V G
Sbjct: 503 YYVSEYGSVSGADRMKAEIYKRGPIGCGVHATEKFEAYTGGIYSEHVMFPLINHEISVAG 562

Query: 122 WGV--ENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE-ANSSEDDD--- 175
           WG   E D  YW+  NSW  +WG++G F+I    N   IE   +  V   + S+ DD   
Sbjct: 563 WGYDEETDTEYWIGRNSWGTYWGENGWFRIQMHHNNLGIEQDCDWGVPLPDGSKPDDFVI 622

Query: 176 -LETMGCQNAKGLPRNF 191
            ++  G +  +   RNF
Sbjct: 623 TVDYEGNEEGQATARNF 639



 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 27/117 (23%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCIASNGYFTGQ- 239
           LP+NFD    W       ++    N      CGSCW+ +  +A++DR+ IA     + + 
Sbjct: 57  LPKNFD----WRNVNGTNYVTISRNQHIPHYCGSCWSFAATSALADRIMIAKERSPSNKP 112

Query: 240 ---------ISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
                    +S Q I+ C     GC+GG    A+R+   NGV        +EGCQ Y
Sbjct: 113 SVEVHREVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKKNGV-------PEEGCQRY 162


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLV 133
           +A  +I  +GP++A F +Y+DF  +K  VY  +    +  HAVRV+GWG  +D + YW+ 
Sbjct: 183 DAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIA 242

Query: 134 ANSWNDHWGDHGTFKILRGENEADIEMGF 162
           ANSW   WGD G FKI RG +EA  E GF
Sbjct: 243 ANSWGTGWGDKGYFKIRRGSDEAAFEEGF 271



 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 51/97 (52%), Gaps = 11/97 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD R  W +C  +  + +Q +CGSCWA   +  ++DR+CI S+      +S Q+++
Sbjct: 46  IPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103

Query: 247 ACTPNCW---------GCNGGWPQLAWRFWGHNGVVT 274
            C  +C          GC GG+  LA     + G+V+
Sbjct: 104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVS 140


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 55/97 (56%), Gaps = 12/97 (12%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--------NFGDSIGLHAVRVLGWGVEN 126
           + M +I  +GP+ A F VY DF  Y  GVYQH              G H+VR++GWG + 
Sbjct: 326 DIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDY 385

Query: 127 D----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                + YWL ANSW + WG+ G F+ILRGEN  +IE
Sbjct: 386 STGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIE 422



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 55/104 (52%), Gaps = 10/104 (9%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           + LP +FDAREKWP    +  + DQ +C S W+ S     +DRL I ++G     +SAQ 
Sbjct: 182 RELPSSFDAREKWP--LYIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQ 239

Query: 245 IVACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +++C  +   GC GG+   AW +    GVV+       E C PY
Sbjct: 240 LLSCNQHRQRGCEGGYLDRAWWYIRKLGVVS-------ELCYPY 276


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 64/105 (60%), Gaps = 3/105 (2%)

Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
           + FDAR++WP+C ++  + ++ N    WA +VA  ++DR CIA+NG +   +S + +++C
Sbjct: 67  KEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNGGYNKLLSTEELISC 126

Query: 249 TPNCWGCNGGWP--QLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           +      NG  P  +  W +   +GVV+GG YNS +GCQP+   P
Sbjct: 127 S-GIKENNGSVPSERSIWEYLKSHGVVSGGKYNSNDGCQPFKFPP 170



 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 1/86 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVLGWGVENDIPYWLV 133
           +  +++  +GP+V  F V  DF  YKSGVY + +    I     +++GWGVEN + YWLV
Sbjct: 212 DIQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKSDKAKGIRTQYAKLIGWGVENGVDYWLV 271

Query: 134 ANSWNDHWGDHGTFKILRGENEADIE 159
            NSW   WG  G FKI  G N+  +E
Sbjct: 272 INSWGHEWGQKGLFKIKSGTNQCGVE 297


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 75/151 (49%), Gaps = 32/151 (21%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R  WP C  +  + +Q  CGSCWA + + ++SDRLCIAS G     +S Q +V
Sbjct: 81  VPNSFDSRTNWPGC--VHAVLNQGQCGSCWAFAASESLSDRLCIASQGAINVTLSPQALV 138

Query: 247 ACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C      GCNGG PQ+AW +   +G+ T       + C PYT                 
Sbjct: 139 SCDIEFNQGCNGGIPQMAWEYLELHGIPT-------DSCFPYT----------------- 174

Query: 306 LGKLKTPECKQNCYNPSYESTYRFDLKKGKK 336
            G    P+C++ C + S     ++ L KGK 
Sbjct: 175 SGNGTAPDCQKECSDGS-----KYQLYKGKT 200



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 64  YFKKAHMVPRCNAMRQI----YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVR 118
           Y  K   +  C+++  I    + +GP+     VY DF+ Y SGVY    G  + G HA++
Sbjct: 196 YKGKTFTLKTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIK 255

Query: 119 VLGWGVEND--IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           ++GWG ++   + YW+V NSW   WG +G F I RG N   I+
Sbjct: 256 IVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGID 298


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 59/95 (62%), Gaps = 13/95 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF--GDSIGLHAVRVLGWGVENDIP--- 129
           + M +I   GP+ AI  VY DF  YK G+Y+H++  G     H+V++LGWG    +P   
Sbjct: 368 DIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWG---SLPGKN 424

Query: 130 -----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                +W+ ANSW  +WG++G F+ILRG+NE DIE
Sbjct: 425 GQKQKFWIAANSWGKYWGENGYFRILRGQNECDIE 459



 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 60/117 (51%), Gaps = 13/117 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P  F A   WP+   +    DQ NCG+ WA S A+  +DR+ I S+G  T  +S Q+++
Sbjct: 222 FPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLI 279

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
           +C T N  GCNGG    AWR+   +GVV+         C P      +HH+  P +N
Sbjct: 280 SCDTGNQRGCNGGSIDGAWRYLTTHGVVS-------YACYPSFW---KHHLDSPSEN 326


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 59/95 (62%), Gaps = 13/95 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF--GDSIGLHAVRVLGWGVENDIP--- 129
           + M +I   GP+ AI  VY DF  YK G+Y+H++  G     H+V++LGWG    +P   
Sbjct: 368 DIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWG---SLPGKN 424

Query: 130 -----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                +W+ ANSW  +WG++G F+ILRG+NE DIE
Sbjct: 425 GQKQKFWIAANSWGKYWGENGYFRILRGQNECDIE 459



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 53/95 (55%), Gaps = 3/95 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P  F A   WP+   +    DQ NCG+ WA S A+  +DR+ I S+G  T  +S Q+++
Sbjct: 222 FPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLI 279

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNS 280
           +C T N  GC GG  + AWR+   +GVV+   Y S
Sbjct: 280 SCDTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPS 314


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YK+G+Y+H              G H+VR+ GWG + ++
Sbjct: 377 MKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNV 436

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 YW+ ANSW  +WG++G F+I+RGENE +IE
Sbjct: 437 DGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIE 472



 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 77/163 (47%), Gaps = 22/163 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F++ EKWP    +    DQ NC + WA S A   SDR+ I S G+ T ++S Q+++
Sbjct: 227 LPPYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSPQNLI 284

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C T N  GC GG    AW +    GVVT       E C PY      H     +  C +
Sbjct: 285 SCDTRNQGGCAGGRIDGAWWYLRRRGVVT-------EDCYPYQPP---HQTPAEVGRCMM 334

Query: 306 ----LGKLK---TPEC--KQNCYNPSYESTYRFDLKKGKKAHM 339
               +G+ K   T  C   QN +N  Y+ST  + L   +K  M
Sbjct: 335 QSRSVGRGKRQATQRCPNTQNYHNDIYQSTPPYRLSSNEKEIM 377


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 52/78 (66%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           I + GP+   F++Y DFL Y SG+Y H  G ++G HAV++LGWG +    YW+VANSW +
Sbjct: 212 IQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGLENYWIVANSWGE 271

Query: 140 HWGDHGTFKILRGENEAD 157
            WG+ G F I +G++  D
Sbjct: 272 DWGEKGYFNIRQGDSGID 289



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FD+R +W +C  +  I DQ+ CGSCWA +   ++SDR CIAS G     +S Q ++
Sbjct: 78  LPDSFDSRTQWKDC--VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDML 135

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C  + + C GG+   AW++    GV         + C+PY
Sbjct: 136 SCDASNFCCFGGYLDTAWQYLEQQGV-------GSDSCEPY 169


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 72/134 (53%), Gaps = 14/134 (10%)

Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
           + FDAR++WP+C ++  + ++ N    WA +     +DR+CIA+NG +   +S + +++C
Sbjct: 34  KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 93

Query: 249 TPNCWGCNGGWPQ--LAWRFWGHNGVVTGGD-YNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +      N GW +  LAW ++  +G+V+GG  YN+ +GCQP  + P           C L
Sbjct: 94  SGIKASAN-GWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPV----------CNL 142

Query: 306 LGKLKTPECKQNCY 319
             K+    C   CY
Sbjct: 143 PTKINKRTCVDYCY 156



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 60/102 (58%), Gaps = 5/102 (4%)

Query: 59  IPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDSIGLHAV 117
           + + +Y+   H+ P+ +  +++  +GP+ A  ++Y D   +KSGVY        + L  V
Sbjct: 168 VKVRYYY---HVKPK-DIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYV 223

Query: 118 RVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +++GWGVEN + YWL+ NSW + WG +G  KI RG+    +E
Sbjct: 224 KLIGWGVENGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVE 265


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
           M++I E+GP+ AI  V+ DF  YK+G+Y+H    S         G H+VR+ GWG + D 
Sbjct: 352 MKEIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDY 411

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 YW+ ANSW  +WG++G F+I RG NE +IE
Sbjct: 412 DGTPRKYWIAANSWGKNWGENGFFRIARGANECEIE 447



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 78/162 (48%), Gaps = 20/162 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LPR F++ EKWP    +    DQ NC + WA S A   SDR+ I S G+ T Q+S Q+++
Sbjct: 202 LPRYFNSSEKWPN--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 259

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY---TLAPCEHHVQGPLQN 302
           +C T N  GC GG    AW +    GVVT       E C PY     AP E  V   +  
Sbjct: 260 SCDTRNQGGCAGGRIDGAWWYLRRRGVVT-------ENCYPYQPPQQAPAE--VGRCMMQ 310

Query: 303 CTLLGKLK---TPECKQ--NCYNPSYESTYRFDLKKGKKAHM 339
              +G+ K   T  C    N +N  Y+ST  + L   +K  M
Sbjct: 311 SRAVGRGKRQATQRCPNTYNYHNDIYQSTPPYKLSSNEKEIM 352


>gi|301119245|ref|XP_002907350.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262105862|gb|EEY63914.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 710

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 104/242 (42%), Gaps = 49/242 (20%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY  GP+    +V   FL+Y  G++      +   HA+ ++GWG EN +P+W++ NS
Sbjct: 211 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATDVDHAISIVGWGEENGVPFWVLRNS 270

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS-------------------EDDDLE 177
           W   WG+ G  +++RG N   +E      V  +                     E+  +E
Sbjct: 271 WGSFWGESGWMRLVRGVNNVGVEGECAFGVPRDDGWPTPTKIEEKEEDKVKEPQEETSVE 330

Query: 178 TM--GCQ--------------------NAKGLPRNFDARE----KWPECPSLRHIADQSN 211
           +   GC+                    +   LP+++D R+     +      +HI     
Sbjct: 331 STLGGCRQKLHFAGGERVISPLPHETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKY-- 388

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
           CGSCWA    +A+SDR+ I  N  +    +S Q ++ C      CNGG P L + +   +
Sbjct: 389 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGLVYEYAHRH 447

Query: 271 GV 272
           G+
Sbjct: 448 GI 449



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
           +IY+ GP+       + F  Y  G+Y  +    +  H + V GWG   E D  YW+  NS
Sbjct: 511 EIYKRGPIGCGVHATSKFESYTGGIYSEHVMFPLINHEISVAGWGYDEETDTEYWIGRNS 570

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG++G F+I    N   IE
Sbjct: 571 WGTYWGENGWFRIQMHHNNLGIE 593



 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 27/118 (22%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCI---------- 230
           LP+NFD    W      R+++   N      CGSCW+ +  +A++DR+ I          
Sbjct: 56  LPKNFD----WRNVNGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKP 111

Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +   +    +S Q I+ C     GC+GG    A+R+   +GV        +EGCQ Y 
Sbjct: 112 SVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGV-------PEEGCQRYA 162


>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
 gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
          Length = 498

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 62/114 (54%), Gaps = 2/114 (1%)

Query: 187 LPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LPR+FDAR+++P+C  L   + DQ  CGSCWAV+    ++DRLCI+S G    ++S Q  
Sbjct: 257 LPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFA 316

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP 299
           ++C  +  GC GG            GV  GG  + +  C PY   PC+H    P
Sbjct: 317 LSCYNSGAGCEGGDVVDTLTLALAKGVPHGGMLD-KGACLPYQFEPCDHPCMIP 369



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 47/80 (58%), Gaps = 5/80 (6%)

Query: 78  RQIYEHGPLVAIFS-VYADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGVENDIP-YWLV 133
           ++I   G +   F  V+ DF  +K GVY+   + G  +G HA +++GWGV  +   YW++
Sbjct: 408 KEIKNRGSVAVTFGPVHEDFYGHKEGVYKVTESSGRELGNHATKLIGWGVTQEGDHYWIM 467

Query: 134 ANSWNDHWGDHGTFKILRGE 153
            NSW  +WG++G  K+  GE
Sbjct: 468 VNSWR-NWGENGVGKVRMGE 486


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 52/83 (62%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M  +   GP+   F V+ DFL Y  G+Y   +G S+G HAV ++G+G  N+  YW+V NS
Sbjct: 214 MVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGTSLGGHAVLIVGYGSMNNHDYWIVRNS 273

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG++G F+ILRG NE  IE
Sbjct: 274 WGSDWGENGYFRILRGTNECGIE 296



 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 50/121 (41%), Gaps = 15/121 (12%)

Query: 168 ANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDR 227
            N +EDD           G P   D R+  PEC       DQ  C  C+A +   A+S R
Sbjct: 68  VNITEDD------LYPPAGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTR 119

Query: 228 LCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            CIA        +S QH+V+C     GC GG  + +W F    G V       +  C PY
Sbjct: 120 RCIAKLDPQAVSLSVQHMVSCDSGEAGCQGGEFESSWAFLETEGAV-------KSDCLPY 172

Query: 288 T 288
           T
Sbjct: 173 T 173


>gi|390367767|ref|XP_787947.3| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 146

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 44/68 (64%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP NFDARE WP CP+++ + DQ +CGSCWA     AISDR+CI S G     ISA+ 
Sbjct: 76  KDLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAED 135

Query: 245 IVACTPNC 252
           ++ C   C
Sbjct: 136 LMTCCKTC 143


>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
          Length = 396

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 71/144 (49%), Gaps = 7/144 (4%)

Query: 186 GLPRNFDAREKWPECPSL-RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           GLPR FDAR++W EC  L   + DQ  CGSCWAV+    ++DR+CIA     T ++S Q+
Sbjct: 145 GLPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIAHGK--TEELSPQY 202

Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY-NSQEGCQPYTLAPCEHHVQGP---L 300
            ++C     GC GG      +     GV TGG + +S   C PY    C+H  Q P    
Sbjct: 203 ALSCYSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPYEFEACDHPCQVPGTIA 262

Query: 301 QNCTLLGKLKTPECKQNCYNPSYE 324
           + C       TP  +     P+ E
Sbjct: 263 EECPTTCADGTPISETEMMRPTSE 286



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 52/92 (56%), Gaps = 11/92 (11%)

Query: 78  RQIYEHGPLVAIFS-VYADFLQYKSGVY-QHNFGDSIGLHAVRVLGWGVENDI------- 128
           ++++++G +   F  V  DF  +K GVY Q   G  +GLHA +++GWG E D        
Sbjct: 300 QELHKYGSMAVTFGPVCDDFYGHKHGVYEQPEGGKPLGLHATKIIGWGFEGDDEETGKGG 359

Query: 129 -PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            PYW++ NSW  +WG+HG  +I  GE   + E
Sbjct: 360 KPYWIMINSWQ-NWGEHGVGRIGIGEMSIESE 390


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YKSG+Y+H              G H+V++ GWG E ++
Sbjct: 355 MKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNV 414

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 YW+ ANSW  +WG+ G F+I RGENE +IE
Sbjct: 415 DGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIE 450



 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 54/102 (52%), Gaps = 10/102 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F++ EKWP    +    DQ NC + WA S A   SDR+ I S G+ T Q+S Q+++
Sbjct: 205 LPLYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 262

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C T N  GC GG    AW F    GVVT       E C PY
Sbjct: 263 SCDTRNQGGCTGGRIDGAWWFLRRRGVVT-------EDCYPY 297


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 51/77 (66%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++I  +GP+ A F+V+ DFL YKSGVY H  G  +G H V+++G+G  N + Y+   N 
Sbjct: 223 MQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTLNGVDYYAANNQ 282

Query: 137 WNDHWGDHGTFKILRGE 153
           W   WGD+GTF I RG+
Sbjct: 283 WTTSWGDNGTFLIKRGD 299



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 64/136 (47%), Gaps = 15/136 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +F+A+  WP C ++  I +Q+ CGSCWA     + +DRLCI +N     Q+S   +V
Sbjct: 79  IPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENV--QLSFMDMV 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLL 306
            C     GC GG    AW +    G V+       E C PYT+  C      P      L
Sbjct: 137 TCDETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPTC------PPAQQPCL 183

Query: 307 GKLKTPECKQNCYNPS 322
             + TP C + C + S
Sbjct: 184 NFVNTPSCTKECQSNS 199


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDS-----IGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YKSG+Y+H     G+S     +  HAV++LGWG     
Sbjct: 353 MKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGA 412

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 413 QGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 448



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 205 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 261

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW +    G+V+   Y
Sbjct: 262 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 295


>gi|294926967|ref|XP_002779086.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
           marinus ATCC 50983]
 gi|239888027|gb|EER10881.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
           marinus ATCC 50983]
          Length = 283

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 79/175 (45%), Gaps = 15/175 (8%)

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAK--GLPRNFD 192
           NS  + W         +G +  D+  G  N  + +S+ DD+   +G    +   LP +FD
Sbjct: 89  NSMQNSWTASKDQPPFKGMSIKDVPTGCPNGPKPSSTSDDETRLLGSTKPELTNLPSDFD 148

Query: 193 AREKWPECPS-LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC--- 248
           AR+K+  C   + H+ DQ  C +CWA       +DR+CI S G F   +S  +  +C   
Sbjct: 149 ARQKFASCAEVIGHVRDQGACHNCWATGSTGMFNDRVCIKSGGSFQNILSLGYFTSCCNP 208

Query: 249 ---TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYN------SQEGCQPYTLAPCEH 294
               P   GC GG       F  ++G+VTG ++       S +GC PY    C H
Sbjct: 209 ANGCPKAKGCEGGNLLEGLNFLKNHGIVTGNEFKPASQLVSADGCWPYPFPKCNH 263


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 53/89 (59%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPY 130
           + M +IY  GP+ A   VY DF  Y  G+Y+    N G   G H+V+++GWG E N   Y
Sbjct: 323 DIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKY 382

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+ G F+ILRG NE  IE
Sbjct: 383 WIAANSWGPWWGERGYFRILRGSNECGIE 411



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 42/105 (40%), Positives = 56/105 (53%), Gaps = 9/105 (8%)

Query: 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
             GLP +F+A E+WP    +  + DQ  CGS W +S  +  SDR  I S G    ++SAQ
Sbjct: 183 TDGLPSSFNAVERWPS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQ 240

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +I++CT    GC+GG    AWRF    GVV        + C PYT
Sbjct: 241 NILSCTRRQQGCDGGHLDAAWRFLHKKGVV-------DDSCYPYT 278


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 40/94 (42%), Positives = 56/94 (59%), Gaps = 6/94 (6%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           N    I  +G + + F++Y DF+ Y+SGVY+H    ++G HAV ++GWGVE+   YWL  
Sbjct: 262 NIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVESGTNYWLAV 321

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMGFNNRVEA 168
           NSW  +WG  G FKI +G      E G  N+V A
Sbjct: 322 NSWGSNWGMSGYFKIAQG------ECGIENQVYA 349



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 42/118 (35%), Positives = 57/118 (48%), Gaps = 11/118 (9%)

Query: 171 SEDDDLETMGCQNAKGLPRNFDAREKWPEC-PSLRHIADQSNCGSCWAVSVANAISDRLC 229
           S D D   +  +    LP NFDAR +W  C P++R   DQ  CG+CWA S    ++ RLC
Sbjct: 116 STDPDTPRLDIEPRVDLPMNFDARTQWRGCIPAVR---DQQTCGACWAFSATYVLAHRLC 172

Query: 230 IASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           IA+NG     +S ++ V C      C GG+ + AW F    G          + C PY
Sbjct: 173 IATNGKTNVVLSPEYQVQCDTMNKACQGGYLKYAWSFLERTGTTV-------DSCIPY 223


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 40/97 (41%), Positives = 59/97 (60%), Gaps = 15/97 (15%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M+++ E+GP+ AI  V+ DF  Y++G+Y+H    +         G H+V++ GWG E  +
Sbjct: 405 MKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWG-EEQM 463

Query: 129 P------YWLVANSWNDHWGDHGTFKILRGENEADIE 159
           P      YW+ ANSW   WG+HG F+I RGENE +IE
Sbjct: 464 PDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIE 500



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 73/162 (45%), Gaps = 19/162 (11%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F+A +KW     +    DQ NC   WA S A   SDR+ I S G+ T  +S Q+++
Sbjct: 254 LPSYFNAADKWSG--MIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLL 311

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGP--LQNC 303
           +C T +  GCNGG    AW F    GVVT       + C P++     H    P  + + 
Sbjct: 312 SCNTRHQQGCNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNHSPNAPACMMHS 364

Query: 304 TLLGKLKTPECKQNCYNPS------YESTYRFDLKKGKKAHM 339
              G+ K  +    C NP       Y+ST  + L   +K  M
Sbjct: 365 RSTGRGKR-QAIARCPNPRSHANEIYQSTPAYRLSSNEKEIM 405


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGD------SIGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YKSG+Y+H  N  D      ++  HAV++ GWGV    
Sbjct: 357 MKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGA 416

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 417 QGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 452



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 50/102 (49%), Gaps = 3/102 (2%)

Query: 178 TMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFT 237
           T+   +   LP  F +  KWP         DQ NC + WA S A+  +DR+ I S G +T
Sbjct: 200 TVTLPSQTDLPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYT 257

Query: 238 GQISAQHIVA-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
             +S Q++++ C  N  GC GG    AW +    G+V+   Y
Sbjct: 258 DNLSPQNLISCCVKNRHGCKGGSIDRAWWYLRKRGLVSHACY 299


>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
          Length = 635

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 101/236 (42%), Gaps = 49/236 (20%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M +IY  GP+    +V   FL+Y  G++      +   HA+ ++GWG EN +P+W++ NS
Sbjct: 211 MAEIYARGPIACSVAVTDGFLKYSGGIFDDKTNATDVDHAISIVGWGEENGVPFWVLRNS 270

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRVEANSS-------------------EDDDLE 177
           W   WG+ G  +++RG N   +E      V  +                     E+  +E
Sbjct: 271 WGSFWGESGWMRLVRGVNNVGVEGECAFGVPRDDGWPTPTKIEEKEEDKVKEPQEETSVE 330

Query: 178 TM--GCQ--------------------NAKGLPRNFDARE----KWPECPSLRHIADQSN 211
           +   GC+                    +   LP+++D R+     +      +HI     
Sbjct: 331 STLGGCRQKLHFAGGERVISPLPHETMDVTDLPKSWDWRDVNGKNYVTWDKNQHIPKY-- 388

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRF 266
           CGSCWA    +A+SDR+ I  N  +    +S Q ++ C      CNGG P L + +
Sbjct: 389 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHAGG-TCNGGNPGLVYEY 443



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 5/118 (4%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANS 136
           +IY+ GP+       + F  Y  G+Y  +    +  H + V GWG   E D  YW+  NS
Sbjct: 511 EIYKRGPIGCGVHATSKFESYTGGIYSEHVMFPLINHEISVAGWGYDEETDTEYWIGRNS 570

Query: 137 WNDHWGDHGTFKILRGENEADIEMGFNNRV---EANSSEDDDLETMGCQNAKGLPRNF 191
           W  +WG++G F+I    N   IE   +  V   + +   D  ++  G +  +   RNF
Sbjct: 571 WGTYWGENGWFRIQMHHNNLGIEQDCDWGVPLPDGSKPNDFVVDYQGNEAGEATDRNF 628



 Score = 41.2 bits (95), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 27/118 (22%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSN------CGSCWAVSVANAISDRLCI---------- 230
           LP+NFD    W      R+++   N      CGSCW+ +  +A++DR+ I          
Sbjct: 56  LPKNFD----WRNVNGTRYVSISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKP 111

Query: 231 ASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +   +    +S Q I+ C     GC+GG    A+R+   +GV        +EGCQ Y 
Sbjct: 112 SVEVHRGVVLSPQVILNCDKKDNGCHGGDQLEAYRYIKEHGV-------PEEGCQRYA 162


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 62/92 (67%), Gaps = 9/92 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGDSI-GLHAVRVLGWGVE--NDIP 129
           + M +I  +GP+ A F V+ DF  + +GVY+H    G+ I G H+VR+LGWG +    IP
Sbjct: 220 DIMSEILTNGPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIP 277

Query: 130 --YWLVANSWNDHWGDHGTFKILRGENEADIE 159
             YW+ ANSW  +WG++GTF+ILRGEN  +IE
Sbjct: 278 VKYWIAANSWGTNWGENGTFRILRGENHCEIE 309



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 54/102 (52%), Gaps = 10/102 (9%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDAR+KWP+   +  I DQ +C S WA S A   +DRL + + G     +SAQ  +
Sbjct: 80  LPTSFDARQKWPDF--IHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFL 137

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C  +   GC GG+   AW +    GVV+       E C PY
Sbjct: 138 SCNQHRQKGCEGGYLDRAWWYIRKFGVVS-------EECYPY 172


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 54/96 (56%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD--------SIGLHAVRVLGWGVE--- 125
           M+++ E+GP+ A+  VY DF  YKSG+Y H              G H+V++ GWG E   
Sbjct: 354 MKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLP 413

Query: 126 --NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
               + YW  ANSW   WG+ G F+ILRG NE DIE
Sbjct: 414 DGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIE 449



 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 41/112 (36%), Positives = 56/112 (50%), Gaps = 5/112 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ F+A +KWP    +    DQ NC   WA S A   SDR+ I S G+ T  +S Q+++
Sbjct: 203 LPKAFEASKKWPN--MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQNLL 260

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY--NSQEGCQPYTLAPCEHH 295
           +C T +  GC GG    AW F    GVV+   Y  +  E  +     PC  H
Sbjct: 261 SCDTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMH 312


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/107 (38%), Positives = 62/107 (57%), Gaps = 4/107 (3%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSI 112
           Y    + +SHY+   ++    +  +++  +GP+   F VY DF  YKSGVY +      +
Sbjct: 216 YYHDHVKVSHYY---NIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYV 272

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             H  +++GWGVEN + YWL+ NSW + WG +G FKI RG NE  +E
Sbjct: 273 RRHFAKLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTNEVHVE 319



 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +   FDAR+ WP+C ++  + D  N    WA + A  ++DR+CIA+NG +   +S + ++
Sbjct: 86  IHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELI 145

Query: 247 AC----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            C    T       G      W +   +G+V+GG YN+ +GCQP  + P   ++   L N
Sbjct: 146 FCGGIKTKQSGAVRG---DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPIG-NIPTHLYN 201

Query: 303 CTLLGKLKTPECKQNCY 319
            T         C++ CY
Sbjct: 202 HT---------CEERCY 209


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 14/99 (14%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---------NFGDSIGLHAVRVLGWGVE 125
           + M +IY++GP+ A F+V  DF  Y  GVY++         +  D  G H+V+++GWG++
Sbjct: 337 DIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGID 396

Query: 126 -----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                N I YWL  NSW  +WG+ G F+I+RG NE +IE
Sbjct: 397 RSDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIE 435



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 60/128 (46%), Gaps = 10/128 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDARE WP    +  + DQ  CGS WA+S A+  SDRL I S G    ++S QH++
Sbjct: 197 LPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLL 254

Query: 247 ACTPNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C      GC+GG+   AW      G V+         C PY     E  +   L+    
Sbjct: 255 SCNIRGQRGCSGGYLDRAWYHLRRAGAVS-------RACYPYHSGLDEDTIMQKLRCRVA 307

Query: 306 LGKLKTPE 313
            G  + PE
Sbjct: 308 YGSSQCPE 315


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 9/103 (8%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI-GLHAVRVLGWGVE-------N 126
           + M++IY +GP+ A F VY  F+ YKSGVY H   D + G HA++++GWGVE        
Sbjct: 221 DIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQK 280

Query: 127 DIPYWLVANSWNDHWGDHGTFKILRGENE-ADIEMGFNNRVEA 168
              YW+ ANSW   WG +G FKI RG+N     E G  ++V A
Sbjct: 281 PTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFA 323



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 4/110 (3%)

Query: 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQH 244
           K LP +F+  E WP    +  I DQ+ CGSCWA + +  +SDR  IASNG     +S + 
Sbjct: 92  KDLPESFNCYENWPN--YMHPIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPED 149

Query: 245 IVACTPNCWGCNGGWPQLAWRFWGHNGVVTGG--DYNSQEGCQPYTLAPC 292
           +V+C     GC GG+   AW +   NG+VT     Y +Q+G  P     C
Sbjct: 150 LVSCDKGDMGCQGGYLDKAWDYLKTNGIVTESCFPYAAQKGVAPSCRISC 199


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 60/95 (63%), Gaps = 2/95 (2%)

Query: 45  KKKKKKKRLYLPTSIPLSHYFKKAHMVPRCN-AMRQ-IYEHGPLVAIFSVYADFLQYKSG 102
           K +K  +R Y  +     H+  +A+ +P    ++RQ IY++GP+VA F VY DF  YK G
Sbjct: 103 KCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKG 162

Query: 103 VYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           +Y H +G   G HAV+V+GWG EN   YWL+ANSW
Sbjct: 163 IYVHKWGGQTGAHAVKVVGWGRENATDYWLIANSW 197



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 61/130 (46%), Gaps = 4/130 (3%)

Query: 214 SCWAVSVANAISDRLCIASNGYFTGQISAQHIVACT-PNC-WGCNGGWPQLAWRFWGHNG 271
           SCWAVS A A+SD +C+ SN      IS   I++C   +C +GC GGW   A+++     
Sbjct: 1   SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60

Query: 272 VVTGGDYNSQEGCQPYTLAP-CEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
                +   +  C+P   +    +H   P       G   TP+C++ C    Y+S Y+ D
Sbjct: 61  CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKS-YQED 119

Query: 331 LKKGKKAHMV 340
                +A+ +
Sbjct: 120 KHFATRAYYL 129


>gi|452268|emb|CAA80451.1| cathepsin B-like protease [Fasciola hepatica]
          Length = 104

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 209 QSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQLAWRFW 267
           Q  CG+CWA     A+SDR+CI S G     +SA+ +++C   C  GC GG P LAW +W
Sbjct: 1   QGQCGTCWAFGAVGAMSDRVCIHSKGQMKPHLSARDLLSCCEFCGRGCRGGSPALAWDYW 60

Query: 268 GHNGVVTGGDYNSQEGCQPYTLAPCEHH 295
             +G+VTGG      GC PY    C HH
Sbjct: 61  KSSGIVTGGSLEEPTGCAPYPFPKCAHH 88


>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 203

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 52/80 (65%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+ + F +Y DF  YKSGVY     + +  H V+++GWG ++   YWL  NSW
Sbjct: 114 QEIFDNGPVFSAFKMYEDFRYYKSGVYVPTTKEVLSFHLVKIIGWGADSVQEYWLAMNSW 173

Query: 138 NDHWGDHGTFKILRGENEAD 157
           N+ WGDHG  K+  G+N  +
Sbjct: 174 NEEWGDHGLIKMAFGKNRLE 193



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 42/99 (42%), Gaps = 14/99 (14%)

Query: 254 GCNGGWPQLAWRFWGHNGVVTGGDYNSQ------EGCQPYTLAPCEHHVQGPLQN----- 302
           GCNGG    A  F    GVVTG D+  Q      +GC PY    C H    P +N     
Sbjct: 11  GCNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNH---VPTENSEYPK 67

Query: 303 CTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMVL 341
           C  +     P C+  C N +Y+ + + D+ + K    V 
Sbjct: 68  CKDVAHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVF 106


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 13/98 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG--------DSIGLHAVRVLGWGVEN 126
           + M++I E+GP+ A+  VY DF  YKSG+Y+H +              H+++++GWG   
Sbjct: 369 DIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLR 428

Query: 127 DIP-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
           D       +W+ ANSW + WG++G F+ILRG+NE DIE
Sbjct: 429 DAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIE 466



 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 49/93 (52%), Gaps = 3/93 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
            P  F A  +WP    +    DQ NC + WA S A+  +DR+ I S G FT  +S QH++
Sbjct: 224 FPEFFVAWHEWPG--WIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLI 281

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           +C T N +GC GG    AW +    G+V+   Y
Sbjct: 282 SCDTRNQYGCKGGSITGAWSYLKKYGLVSHACY 314


>gi|294893015|ref|XP_002774310.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239879603|gb|EER06126.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 81

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 33/68 (48%), Positives = 47/68 (69%)

Query: 86  LVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHG 145
           ++ + S+Y DF  YKSGVY H  G  +G+H+++++GWGVE+   YWL  NSWN+  GDHG
Sbjct: 1   VLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSWNEESGDHG 60

Query: 146 TFKILRGE 153
             K+  GE
Sbjct: 61  MIKLAVGE 68


>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
          Length = 201

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   ++
Sbjct: 18  LPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 75

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR+  ++GVVT       E C PY     C H    P    
Sbjct: 76  ACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPYFDQTGCSHPGCEP---- 124

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 125 ----AYRTPKCVKKC 135



 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G
Sbjct: 161 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-----DSIGLHAVRVLGWGVE---- 125
           + M +I + G + AI  VY DF  Y+SG+Y+H+       +    H+VR++GWG E    
Sbjct: 323 DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGY 382

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           + + YW+  NSW   WG++G F+ILRG NE DIE
Sbjct: 383 DVVKYWIAINSWGQWWGENGRFRILRGSNECDIE 416



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 48/104 (46%), Gaps = 9/104 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FDA E W     +    DQ  CGS WA S A   SDR  I S G    Q++ Q ++
Sbjct: 187 LPTRFDASEHWTGL--VAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQML 244

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           AC     GC+GG    AW++    GVV        E C PY  A
Sbjct: 245 ACVRRQQGCSGGHLDTAWQYLRRTGVV-------NEECYPYIAA 281


>gi|77744608|gb|ABB02268.1| cathepsin B [Ovis aries]
          Length = 76

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 38/75 (50%), Positives = 51/75 (68%), Gaps = 2/75 (2%)

Query: 224 ISDRLCIASNGYFTGQISAQHIVACT-PNCW-GCNGGWPQLAWRFWGHNGVVTGGDYNSQ 281
           ISDR+CI S G    ++SA+ ++ C    C  GCNGG+P  AW FW   G+V+GG Y+S 
Sbjct: 1   ISDRICIHSKGRVNVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSH 60

Query: 282 EGCQPYTLAPCEHHV 296
            GC+PY++ PCEHHV
Sbjct: 61  VGCRPYSIPPCEHHV 75


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 70/135 (51%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR +W  C ++ +I DQ +CG+CWA +   A+ DR CI  N   +  +S   ++
Sbjct: 97  LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDLL 154

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR++  +GVVT       E C PY     C+H    P    
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRSGVVT-------EECDPYFDQTGCQHPGCEP---- 203

Query: 304 TLLGKLKTPECKQNC 318
                  TP+C++ C
Sbjct: 204 ----AYPTPKCQRKC 214



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 4/79 (5%)

Query: 67  KAHMVPRCNAMRQIYEHGPLVAIFSV--YADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV 124
           + H  P  + M ++Y++GP+   F+     DF  YKSGVY+H  G  +G HAV+++GWG 
Sbjct: 233 RVHSNPH-DIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGT 291

Query: 125 EN-DIPYWLVANSWNDHWG 142
            +    YWL+AN WN  WG
Sbjct: 292 SDAGEDYWLLANQWNRGWG 310


>gi|403343435|gb|EJY71046.1| Papain family cysteine protease containing protein [Oxytricha
           trifallax]
          Length = 619

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/257 (26%), Positives = 104/257 (40%), Gaps = 64/257 (24%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANS 136
           M++IY+ GP+    +V      Y  G+YQ   GD   +H V V+G+GVEN   +W+V NS
Sbjct: 196 MQEIYQRGPIACGIAVPDSLETYTGGIYQDTTGDQNIVHDVSVVGFGVENGTKFWVVRNS 255

Query: 137 WNDHWGDHGTFKILRGENEADIEMG---------FNNRV-------EANSSEDD------ 174
           W  H+G++G  +++RG N   IE           + NRV       E N  ++D      
Sbjct: 256 WGSHYGENGFVRVIRGVNNIAIETDCAWATPVDTWTNRVPHKTTDAEKNDPKNDKYRKNG 315

Query: 175 -----------DLETMGCQ----------------------NAKGLPRNFDARE----KW 197
                        +  GC+                      +A  LP N D R      +
Sbjct: 316 PYPSGMENEFLSTKNHGCRRVAKAAFKAGQVKTEVMPWEEIDAAALPANLDWRNVNNTNF 375

Query: 198 PECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQI--SAQHIVACTPNCWGC 255
                 +HI     CGSCWA    ++++DR  I    +    I  +AQ I+ C      C
Sbjct: 376 LSWSKNQHIPQY--CGSCWAQGTTSSLADRFNILLGDHNPTPIDLAAQTIINCQAGG-SC 432

Query: 256 NGGWPQLAWRFWGHNGV 272
           NGG P   + +    G+
Sbjct: 433 NGGDPSGVYEYAFETGI 449



 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 5/101 (4%)

Query: 63  HYFKKAHMVPRCNAMR-QIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSIGLHAVRVL 120
           +Y    + +   N M+ +I+++GP+    SV   F  Y +G+Y + +F   I  H + V+
Sbjct: 499 YYVSNYYGLSGANKMKAEIFKNGPISCGISVTDGFEAYSTGIYSESSFFPQIN-HEIAVV 557

Query: 121 GWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           GWG++      YW+  NSW  +WG+ G F+I    +   IE
Sbjct: 558 GWGLDEATKTEYWIGRNSWGTYWGEQGFFRIKMHSDNLAIE 598



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 13/120 (10%)

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
           CGSCWA +  ++ISDR+ IA    +    I+ Q +++C+ N  GC+GG    A++F  H 
Sbjct: 77  CGSCWAQAATSSISDRIKIARKAAWPDINIAPQVVISCSMNDDGCHGGEAISAYQFM-HQ 135

Query: 271 GVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFD 330
             VT       E C  Y     ++ V+     C  +   K  +  Q+C+ P   +TYR D
Sbjct: 136 SEVT------DETCSIYQARGHDNGVE-----CAPINVCKNCQPFQDCFVPDEYNTYRVD 184


>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
          Length = 118

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 3/86 (3%)

Query: 255 CNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPEC 314
           CNGG+P  AW FW   G+V+GG YNS  GC+PY++ PCEHHV G    CT  G+  TP+C
Sbjct: 1   CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT--GEGDTPKC 58

Query: 315 KQNCYNPSYESTYRFDLKKGKKAHMV 340
            + C  P Y  +Y+ D   G  ++ V
Sbjct: 59  SKTC-EPGYSPSYKEDKHFGCSSYSV 83



 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 18/28 (64%), Positives = 23/28 (82%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVY 104
           M +IY++GP+   FSVY+DFL YKSGVY
Sbjct: 91  MAEIYKNGPVEGAFSVYSDFLLYKSGVY 118


>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
           unguiculata]
          Length = 195

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 20/135 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP NFDAR  W +C ++  I DQ +CGSCWA     ++SDR CI  +      +S   ++
Sbjct: 18  LPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFD--VNISLSVNDLL 75

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY-TLAPCEHHVQGPLQNC 303
           AC       GCNGG+P  AWR+  ++GVVT       E C PY     C H    P    
Sbjct: 76  ACCGFLCGSGCNGGYPLSAWRYLSNHGVVT-------EECDPYFDQTGCSHPGCEP---- 124

Query: 304 TLLGKLKTPECKQNC 318
                 +TP+C + C
Sbjct: 125 ----AYRTPKCVKKC 135



 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG 109
           + M ++Y++GP+   F+VY DF  YKSGVY+H  G
Sbjct: 161 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195


>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
          Length = 311

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 50/80 (62%)

Query: 80  IYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139
           ++  GP+VA+F V+ DF+ Y  G+Y    GD +G HAV++LG+GVEN   Y++  N W  
Sbjct: 224 LFNKGPMVAVFDVFEDFINYGGGIYNKVSGDKLGKHAVKLLGYGVENSTNYYIGVNQWGK 283

Query: 140 HWGDHGTFKILRGENEADIE 159
            WG+ G F+I  GE   D E
Sbjct: 284 DWGEDGYFRIKAGEVLIDNE 303



 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 10/107 (9%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           Q AK +P ++D R  +P C +   I DQ+ CGSCWA +  N +  R C+A+ G    ++S
Sbjct: 82  QVAKQMPSSYDVRTVYPMCEN--RIKDQAQCGSCWAFATTNVLEYRYCMATKGKKYPELS 139

Query: 242 AQHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
            Q++++C     WGC+GG+    + +    GV T       E C PY
Sbjct: 140 PQNLISCFNSASWGCDGGYIDQTFLYLEMMGVNT-------EQCMPY 179


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)

Query: 60  PLSHYFKKAHMVPRCNA-----------MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           P  + F+K++ + +C+            MR+I ++GP+ AI  V+ DF  YK+G+Y+H  
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395

Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
             +        +  HAV++ GWG           +W+ ANSW   WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455

Query: 156 ADIE 159
           +DIE
Sbjct: 456 SDIE 459



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 216 LPEIFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW F    G+V+   Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)

Query: 60  PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           P  + F+K++ + +C+            MR+I ++GP+ AI  V+ DF  YK+G+Y+H  
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395

Query: 109 GDS--------IGLHAVRVLGWGVENDIP-----YWLVANSWNDHWGDHGTFKILRGENE 155
             +        +  HAV++ GWG           +W+ ANSW   WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455

Query: 156 ADIE 159
           +DIE
Sbjct: 456 SDIE 459



 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 216 LPEVFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW F    G+V+   Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)

Query: 60  PLSHYFKKAHMVPRCNA-----------MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           P  + F+K++ + +C+            MR+I ++GP+ AI  V+ DF  YK+G+Y+H  
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVV 395

Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
             +        +  HAV++ GWG           +W+ ANSW   WG++G F+ILRG NE
Sbjct: 396 STNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455

Query: 156 ADIE 159
           +DIE
Sbjct: 456 SDIE 459



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 216 LPEIFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 272

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW F    G+V+   Y
Sbjct: 273 ISCCAKNRHGCNSGSIDRAWWFLRKRGLVSHACY 306


>gi|146163744|ref|XP_001471259.1| cathepsin z [Tetrahymena thermophila]
 gi|146145941|gb|EDK31861.1| cathepsin z [Tetrahymena thermophila SB210]
          Length = 585

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 111/260 (42%), Gaps = 59/260 (22%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYK--SGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVA 134
           M++I+  GP +A +    ++L+Y    G+Y          H + V+GWG EN+  YW++ 
Sbjct: 189 MQEIFNRGP-IACYIYATEYLRYNYTGGIYNDTSSYPGTNHVIEVVGWGEENNEKYWIIR 247

Query: 135 NSWNDHWGDHGTFKILRGENEADIEMG----------FNNRV-------EANSSEDDDLE 177
           NSW  +WG+ G ++ LRG N  +IE            + N V       E +++  ++  
Sbjct: 248 NSWGSYWGEKGFYRQLRGVNMLNIESSNCNWAVPLDTWTNDVRNTTKVTEVSNNHTNNFR 307

Query: 178 TMGC--------------------QNAKGLPRNFDAREKWPECPSLRHIADQSN------ 211
              C                     NA  LP N+D    W     + +++   N      
Sbjct: 308 HTTCIRESNKNSTQLITGPLPHEYINAASLPANWD----WRNINGVNYLSFTRNQHIPQY 363

Query: 212 CGSCWAVSVANAISDRLCIASNGYFTG-QISAQHIVACTPNCWGCNGGWPQLAWRFWGHN 270
           CGSCWA    ++++DR+ IA N  +    +S Q ++ C      CNGG P   ++F    
Sbjct: 364 CGSCWAHGTTSSLADRINIARNRTWPDIALSVQVVLNCQAGG-SCNGGQPMGVYQFANKQ 422

Query: 271 GVVTGGDYNSQEGCQPYTLA 290
           G+        +E CQ Y  A
Sbjct: 423 GI-------PEESCQNYLAA 435



 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE--NDIPYWLVANS 136
           +IY  GP+     V   F  Y  G+Y+ +    +  H + V+GWG +    + YW+  NS
Sbjct: 492 EIYARGPISCGIYVTNKFEAYTGGIYKESTAFPMINHEIAVVGWGTDPQTGVEYWIGRNS 551

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG++G F+I   +    IE
Sbjct: 552 WGTYWGENGFFRIQMHKQNLAIE 574


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M++I  +GP+ AI  V+ DF  YKSG+Y+H    +        +  HAV++ GWG     
Sbjct: 314 MKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGA 373

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+VANSW + WG++G F+ILRG NE+DIE
Sbjct: 374 QGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIE 409



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 3/93 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F A  KWP         DQ NC + WA S A+  +DR+ I S G +T  +S Q+++
Sbjct: 166 LPEFFVAYYKWPGW--THGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLI 223

Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           +C   N  GC+ G    AW +    G+V+   Y
Sbjct: 224 SCCAKNRHGCSSGSIDRAWWYLRKRGLVSHACY 256


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 69/127 (54%), Gaps = 24/127 (18%)

Query: 57  TSIPLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
            + P  ++ +K++ + +C+            M++I ++GP+ AI  V+ DF  YK+G+Y+
Sbjct: 333 ATTPCPNHIEKSNRIYQCSPPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIYR 392

Query: 106 HNFGDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRG 152
           H    S        +  HAV++ GWG           +W+ ANSW   WG++G FKILRG
Sbjct: 393 HVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGENGYFKILRG 452

Query: 153 ENEADIE 159
            NE+DIE
Sbjct: 453 VNESDIE 459



 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 21/129 (16%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S+G +T  +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNL 273

Query: 246 VACTPN-CWGCNGGWPQLAWRFWGHNGVVTGG------DYNSQEGC----------QPYT 288
           ++C      GC GG    AW +    G+V+        D N+  GC          + + 
Sbjct: 274 ISCCARKRHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRSDGRGKRHA 333

Query: 289 LAPCEHHVQ 297
             PC +H++
Sbjct: 334 TTPCPNHIE 342


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 79/160 (49%), Gaps = 19/160 (11%)

Query: 28  KKKKKEEEKKKKKKKKKKKKKKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLV 87
           KKKK+   +   + +    +  K RL+    +          +      M +I   GP+ 
Sbjct: 298 KKKKETMAQCPSRVRSNNDRTTKTRLHRVGPV--------YRVATEEGIMHEILTSGPVQ 349

Query: 88  AIFSVYADFLQYKSGVYQHN---FGDSIGLHAVRVLGWGVEND----IPYWLVANSWNDH 140
           A+  V  DF  YKSGVY+ +    G   G H+VR++GWG E      + YW+ +NSW   
Sbjct: 350 AVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSW 409

Query: 141 WGDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMG 180
           WG++G F+IL+G +E +IE    + V A  ++ DD +  G
Sbjct: 410 WGENGYFRILKGVDECEIE----DFVIAAWADIDDFDVTG 445



 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 54/120 (45%), Gaps = 19/120 (15%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           L R++DARE W     +    DQ  CG+ WA++     +DR  I S    +  +S QH++
Sbjct: 195 LRRSYDAREVWGN--YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLL 252

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C   N  GC GG    AW +    G++T       E C P+         QG +  C +
Sbjct: 253 SCNNLNQQGCQGGHLTRAWNWIRKFGLIT-------EECYPW---------QGRMSTCAV 296


>gi|56756124|gb|AAW26240.1| unknown [Schistosoma japonicum]
          Length = 159

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 62/112 (55%), Gaps = 14/112 (12%)

Query: 148 KILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIA 207
           +IL G  + D EM  N R   +   D ++E         +P  FD+R+KWP C S+  I 
Sbjct: 61  RILMGARKEDAEMKRNRRPTVDH-HDLNVE---------IPSQFDSRKKWPHCKSISQIR 110

Query: 208 DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGW 259
           DQS CGSCWA     A++DR+CI S G  + ++SA  +++C  +C    GGW
Sbjct: 111 DQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDC----GGW 158


>gi|312083604|ref|XP_003143931.1| hypothetical protein LOAG_08355 [Loa loa]
          Length = 188

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 34/66 (51%), Positives = 47/66 (71%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FDAR+ WPEC SLR++ DQS+CGSCWAV+   A+SDR+CI S G     +SA  ++
Sbjct: 120 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLL 179

Query: 247 ACTPNC 252
           +C   C
Sbjct: 180 SCCKTC 185


>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
 gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 57/104 (54%), Gaps = 4/104 (3%)

Query: 60  PLSHYFKKA--HMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH--NFGDSIGLH 115
           PL  Y  KA  ++    +  + I + G ++    +Y DFL Y SGVYQH  N    I   
Sbjct: 149 PLVLYKTKAVQNLTGEHDMQQAILQGGSIMTELDMYQDFLYYSSGVYQHSANLRQPIAKF 208

Query: 116 AVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
            VR++GWGVEN + YW+V N W   WG  G   I RG NE++IE
Sbjct: 209 VVRIIGWGVENGVKYWIVPNIWGKTWGMQGYIWIRRGNNESNIE 252



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 13/88 (14%)

Query: 216 WAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
           W    +  +SDR+C+ S+  F  ++S Q+I+ C    +GCNGG+    + F    G+ T 
Sbjct: 66  WGHVPSATVSDRMCVQSSAKFQERLSTQYILECDTRDFGCNGGYLNTEFEFELKRGIPT- 124

Query: 276 GDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
                 E C PY+       V G L NC
Sbjct: 125 ------EKCVPYS------AVNGTLANC 140


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 62/122 (50%), Gaps = 4/122 (3%)

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
              A++DRLCI SN      ISA  +++C  +C +GC+GG+P  AW FW  NG+VTGG  
Sbjct: 2   AVEAMTDRLCIHSNATIKKHISATDLLSCCESCGFGCHGGFPPRAWDFWMENGLVTGGSK 61

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAH 338
            +  GC+ Y    C HH +G    C       TP C  +C  P  +  Y  D    K ++
Sbjct: 62  ENPSGCRSYPFPRCSHHGKGKYPPCPKT-IFDTPNCVDHCDKPDID--YAADKTHAKSSY 118

Query: 339 MV 340
            V
Sbjct: 119 NV 120



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 38/52 (73%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDI 128
           M++I  +GP+ A F VY DF++YKSG+Y H+ G  +G HA+R+LGWG E  +
Sbjct: 128 MKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWGEEKGV 179


>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
 gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
          Length = 314

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 61/104 (58%), Gaps = 11/104 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFT-GQISAQHI 245
           +P +FD+R +WP+C  +  I +Q  CGSCWA S +  +SDRLCIASN     G +S Q +
Sbjct: 88  IPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTL 145

Query: 246 VAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           VAC      GC+GG PQLAW +    G+ T       + C PYT
Sbjct: 146 VACDVYGNDGCSGGIPQLAWEYMELKGLPT-------DSCVPYT 182



 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 55/95 (57%), Gaps = 7/95 (7%)

Query: 62  SHYFKKAHMVPRCNAMRQIYE----HGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHA 116
           S Y  K   +  C++++ I E    +GP+V    VY DF+ Y SGVY    G S +G HA
Sbjct: 202 SLYRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHA 261

Query: 117 VRVLGWGVE--NDIPYWLVANSWNDHWGDHGTFKI 149
           ++++GWG +  + + YW+VANSW   WG  G F I
Sbjct: 262 IKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFI 296


>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
          Length = 61

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 35/55 (63%), Positives = 42/55 (76%)

Query: 105 QHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           +H  GD +G HA+R+LGWGVEN  PYWLV NSWN  WGD+G FKILRG++   IE
Sbjct: 4   KHVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIE 58


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 57/94 (60%), Gaps = 9/94 (9%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-----DSIGLHAVRVLGWGVE---- 125
           + M +I E G + AI  VY DF  Y++G+Y+H+       +    H+VR++GWG E    
Sbjct: 432 DIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGY 491

Query: 126 NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           + + YW+  NSW   WG++G F+ILRG NE +IE
Sbjct: 492 DMVKYWIAVNSWGTWWGENGRFRILRGTNECEIE 525



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 9/104 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP +FDA + WP    +    DQ  CGS WA+S     SDR  I S G    Q++ Q ++
Sbjct: 296 LPSHFDAADHWPRL--VGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLA 290
           AC      C+GG    AW++    GVV        + C PY  A
Sbjct: 354 ACVRRQQACSGGHLDTAWQYLRRVGVV-------NDECYPYIAA 390


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 24/124 (19%)

Query: 60  PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           P  + F+K++ + +C+            MR+I ++GP+ AI  V+ DF  YK+G+Y+H  
Sbjct: 229 PCPNNFEKSNRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVT 288

Query: 109 GDS--------IGLHAVRVLGWGVENDIP-----YWLVANSWNDHWGDHGTFKILRGENE 155
             +        +  HAV++ GWG           +W+ ANSW   WG++G F+ILRG NE
Sbjct: 289 STNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNE 348

Query: 156 ADIE 159
           +DIE
Sbjct: 349 SDIE 352



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 47/93 (50%), Gaps = 3/93 (3%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F A  KWP         DQ NC + WA S A+  +DR+ I S G +T  +S Q+++
Sbjct: 109 LPEFFVASYKWPGW--THGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLI 166

Query: 247 ACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           +C   N  GCN G    AW +    G+V+   Y
Sbjct: 167 SCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 199


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 4/101 (3%)

Query: 72  PRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVE-NDIP 129
           P  N   +I  +GP+ A F VY+DF  YKSG+YQ   G + +G HAV+VLGW  + N  P
Sbjct: 218 PITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDSNGTP 277

Query: 130 YWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170
           YW+  N W   WG  G F I RG +  + +  F+N + A +
Sbjct: 278 YWIAQNQWGTSWGMGGYFYIYRGNSTLNCK--FDNYMIAGT 316



 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 9/124 (7%)

Query: 165 RVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAI 224
           R E +SSE+    T   ++   +P  FD+R KWP+C  +  I +Q  CGSCWA +     
Sbjct: 66  RGEESSSEEARYNTRDVKSTVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVF 123

Query: 225 SDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGC 284
           SDRLCI +N      IS + ++ C    + C GG+   +W+F+ + G+         E C
Sbjct: 124 SDRLCITTNNVSNVVISPEFLIECDKTSFACQGGYGYYSWKFFMNTGI-------PLESC 176

Query: 285 QPYT 288
            PYT
Sbjct: 177 VPYT 180


>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 206

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 11/103 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C ++  I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 102 LPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFG--VDVPLSVNDLL 159

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AC       GC+GG+P  AW+++ H+GVVT       E C PY
Sbjct: 160 ACCGFLCGSGCDGGYPISAWKYFAHHGVVT-------EECDPY 195


>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
          Length = 68

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 38/64 (59%), Positives = 45/64 (70%)

Query: 96  FLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENE 155
           F  YKSGVY++  GD +G HAV+++GWG E    YWLVANSWN  WG+ G FKI RG NE
Sbjct: 1   FAHYKSGVYKYIKGDLMGGHAVKLVGWGTEGGTDYWLVANSWNTAWGEDGYFKIARGSNE 60

Query: 156 ADIE 159
             IE
Sbjct: 61  CGIE 64


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSI--------GLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  Y SG+Y+H              G H+V++ GWG E + 
Sbjct: 158 MKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNF 217

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 YW+ ANSW  +WG++G F+I RGENE +IE
Sbjct: 218 DGTTRKYWIAANSWGKNWGENGYFRIARGENECEIE 253



 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 74/163 (45%), Gaps = 22/163 (13%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F++ EKWP    +    DQ NC + WA S A   SDR+ I S G+ T Q+S Q+++
Sbjct: 8   LPLYFNSAEKWPG--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 65

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTL 305
           +C T N  GC GG    AW +    GVVT       E C PY            L  C +
Sbjct: 66  SCDTRNQGGCAGGRLDGAWWYLRRRGVVT-------EDCYPYRPP---QQTPAELSRCMM 115

Query: 306 ----LGKLK---TPEC--KQNCYNPSYESTYRFDLKKGKKAHM 339
               +G+ K   T  C    N  N  Y+ST  + L   +K  M
Sbjct: 116 QSRSVGRGKRQATQRCPNTNNYQNDIYQSTPPYRLSTSEKEIM 158


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M  +   GPL  +  VYAD   Y+SGVY+H +G  ++G HA+ ++G+G  +D   YW++ 
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG++G F+I+RG NE  IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 2/89 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD R+++P+C  ++   DQ +CGSCWA S      DR C           S QH++
Sbjct: 79  IPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
           +C+   +GC+GG  Q  W F    G  T 
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M  +   GPL  +  VYAD   Y+SGVY+H +G  ++G HA+ ++G+G  +D   YW++ 
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG++G F+I+RG NE  IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294



 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD R+++P+C  ++   DQ +CG CWA S      DR C           S QH++
Sbjct: 79  IPPQFDFRDEYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
           +C+   +GC+GG  Q  W F    G  T 
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M  +   GPL  +  VYAD   Y+SGVY+H +G  ++G HA+ ++G+G  +D   YW++ 
Sbjct: 210 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG++G F+I+RG NE  IE
Sbjct: 270 NSWGPDWGENGYFRIVRGVNECRIE 294



 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD R+++P+C  ++   DQ +CG CWA S      DR C           S QH++
Sbjct: 79  IPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTG 275
           +C+   +GC+GG  Q  W F    G  T 
Sbjct: 137 SCSLENFGCDGGDFQPTWSFLTFTGATTA 165


>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
          Length = 163

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 6/103 (5%)

Query: 220 VANAISDRLCIASNGYFTGQISAQHIVACTPNC-WGCNGGWPQLAWRFWGHNGVVTGGDY 278
            ++A +DRLCIA++G F   +SA+ +  C   C +GC+GG+P  AW ++  +G+VTGGDY
Sbjct: 2   TSSAFADRLCIATDGEFNELLSAEELAFCCHKCGFGCHGGYPIKAWEWFKKHGLVTGGDY 61

Query: 279 NSQEGCQPYTLAPCEHHVQGPLQNCTLLGKL--KTPECKQNCY 319
           +S EGCQPY + PC     G   N T  GK   K   C + CY
Sbjct: 62  DSGEGCQPYRVPPCPLDEYG---NNTCRGKPAEKNHRCTRMCY 101



 Score = 37.7 bits (86), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 16/43 (37%), Positives = 24/43 (55%)

Query: 63  HYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQ 105
           H+ + A+ +      + +  +GP+ A F VY DF  YKSGVY 
Sbjct: 113 HWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYM 155


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 51/85 (60%), Gaps = 4/85 (4%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVE-NDIPYWLVA 134
           +I+  GP+ A   VY DF  Y  G+Y+    N G   G H+V+++GWG E N   YW+ A
Sbjct: 329 EIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAA 388

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG+ G F+ILRG NE  IE
Sbjct: 389 NSWGPWWGERGYFRILRGSNECGIE 413



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 53/103 (51%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP +F+A +KW     +  + DQ  CGS W +S  +  SDR  I S G    Q+S Q+I
Sbjct: 188 GLPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNI 245

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GV+        E C PYT
Sbjct: 246 LSCTRRQQGCEGGHLDAAWRYLHKKGVL-------DESCYPYT 281


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 60/105 (57%), Gaps = 2/105 (1%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           + + FDAR+ W +C ++  + +  N    WA +   A +DR+C+A+NG +   +S + ++
Sbjct: 85  ISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATTGAFADRMCVATNGSYNQLLSTEQLI 144

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAP 291
           +C+      N      AW+F+   G+V+GG YN+ +GCQP  + P
Sbjct: 145 SCSG--IKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP 187



 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 50/82 (60%), Gaps = 1/82 (1%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLV 133
           N  R++  +GP+ A FS+Y D   Y SGVY        +   + +++GWGVEN + YWL+
Sbjct: 229 NIQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYWLL 288

Query: 134 ANSWNDHWGDHGTFKILRGENE 155
            NSW + WG +G FKI RG +E
Sbjct: 289 VNSWGNEWGQNGLFKIKRGTDE 310


>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
          Length = 69

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 35/60 (58%), Positives = 45/60 (75%)

Query: 100 KSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
           K GVYQH +G ++G HA+++LGWG E+ +PYWL ANSWN  WGD+G FK LRG +   IE
Sbjct: 1   KFGVYQHVYGSAVGGHAIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIE 60


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
           + M +I+  GP+ A   V  DF  Y  GVY+    +     G H+V+++GWG E N   Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 383

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412



 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP +F+A +KW     +  + DQ  CG+ W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 243

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 58/98 (59%), Gaps = 13/98 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSI-----GLHAVRVLGWGVEN 126
           + M+++ E+GP+ A+  V+ DF  YKSG+Y+H   + G        G H+V++ GWG E 
Sbjct: 350 DIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEER 409

Query: 127 D-----IPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 + YW  ANSW   WG+ G F+ILRG NE DIE
Sbjct: 410 QPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIE 447



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 70/157 (44%), Gaps = 7/157 (4%)

Query: 142 GDHGTFKILRGENEADIEMGFNNRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECP 201
           G+H  F  +  E      +G      +  + ++    M  Q    LP  F+A +KWP   
Sbjct: 158 GNHSAFWGMTLEEGIQYRLGTVRPASSVMNMNEIQMVMAPQET--LPLAFNASDKWPGL- 214

Query: 202 SLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC-TPNCWGCNGGWP 260
            +    DQ NC   WA S A   SDR+ I S G+ T  +S Q++++C T N  GC GG  
Sbjct: 215 -IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSCDTHNQKGCRGGRL 273

Query: 261 QLAWRFWGHNGVVTGGDYNSQEGCQPYT--LAPCEHH 295
             AW F    G+V+   Y    G +  T   APC  H
Sbjct: 274 DGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMH 310


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M+ +   GP+  +  VYAD L Y  GVY+H +G  S GLHA+ ++G+G  +D   YW + 
Sbjct: 210 MQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIK 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG+ G F+I+RG NE  IE
Sbjct: 270 NSWGSDWGEDGYFRIVRGVNECRIE 294



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD R+++P C S   + DQ +CG CWA S       R C           S QH++
Sbjct: 79  LPAQFDFRDEYPHCVS--PVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLI 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           +C+   +GC+GG     W F    G  T
Sbjct: 137 SCSTENFGCSGGDFFPTWSFLTQTGATT 164


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 55/85 (64%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M  +   GPL  +  VYAD   Y+SGVY+H +G  ++G HA+ ++G+G  +D   YW++ 
Sbjct: 176 MGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIK 235

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG++G F+I+RG NE  IE
Sbjct: 236 NSWGPDWGENGYFRIVRGVNECRIE 260



 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P  FD R+++P+C  ++   DQ +CG CWA S      DR C           S QH++
Sbjct: 45  IPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 102

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           +C+   +GC+GG  Q  W F    G  T
Sbjct: 103 SCSLENFGCDGGDFQPTWSFLTFTGATT 130


>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 308

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 53/83 (63%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           R +   GP+ A+F+VY DF  Y  G+Y + +G+ +G  +V ++G+G  ++   YW+V N 
Sbjct: 208 RAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVGFLSVEIVGYGTSDEGQDYWIVKNY 267

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W   WG+ G F+I+RG+NE  IE
Sbjct: 268 WGPGWGEDGYFRIVRGQNECQIE 290



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 5/93 (5%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           +N   +P +FD RE++P+C  +  + D   C S WA S  +A S R C+        + S
Sbjct: 70  ENEDPVPDHFDFREEYPQC--ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYS 127

Query: 242 AQHIVAC--TPNCWGCNGGWPQLAWRFWGHNGV 272
           AQ+I++C  T  C+G +     +AW F    G+
Sbjct: 128 AQYILSCSSTNGCFGFSTR-ESIAWDFIATTGI 159


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 67/124 (54%), Gaps = 24/124 (19%)

Query: 60  PLSHYFKKAHMVPRCN-----------AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNF 108
           P  + F+K++ + +C+            MR+I  +GP+ AI  V+ DF  YK+G+Y+H  
Sbjct: 336 PCPNSFEKSNRIYQCSPPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVI 395

Query: 109 GDS--------IGLHAVRVLGWGVENDI-----PYWLVANSWNDHWGDHGTFKILRGENE 155
             +        +  HAV++ GWG           +W+ ANSW   WG++G F+ILRG NE
Sbjct: 396 STNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455

Query: 156 ADIE 159
           +DIE
Sbjct: 456 SDIE 459



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F +  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 216 LPEVFISSYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNL 272

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C      GCN G    AW F    G+V+   Y
Sbjct: 273 ISCCAKKRHGCNSGSIDRAWWFLRKRGLVSHACY 306


>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 166

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 11/103 (10%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP+ FDAR  WP+C ++  I DQ +CGSCWA     ++SDR CI         +S   ++
Sbjct: 62  LPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFG--VDVPLSVNDLL 119

Query: 247 ACTPNCW--GCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           AC       GC+GG+P  AW+++ H+GVVT       E C PY
Sbjct: 120 ACCGFLCGSGCDGGYPISAWKYFAHHGVVT-------EECDPY 155


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           MR+I ++GP+ AI  V+ DF  YK+G+Y+H    +           HAV++ GWG     
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 46/94 (48%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C      GCN      AW +    G+V+   Y
Sbjct: 274 ISCCAKKRRGCNSESVDRAWWYLRKRGLVSHACY 307


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
           + M +I+  GP+ A   V  DF  Y  GVY+    +     G H+V+++GWG E N   Y
Sbjct: 323 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 382

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 383 WIAANSWGSWWGEHGYFRILRGSNECGIE 411



 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP +F+A +KW     +  + DQ  CG+ W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 186 GLPNSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNI 243

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M+ +   GP+  +  VYAD L Y  GVY+H +G  S GLHA+ ++G+G  +D   YW + 
Sbjct: 210 MQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIK 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG+ G F+I+RG NE  IE
Sbjct: 270 NSWGSDWGEDGYFRIVRGVNECRIE 294



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  FD R+++P C S   + DQ +CG CWA S       R C           S QH++
Sbjct: 79  LPAQFDFRDEYPHCVS--PVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLI 136

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVT 274
           +C+   +GC+GG     W F    G  T
Sbjct: 137 SCSTENFGCSGGDFFPTWSFLTQTGATT 164


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
           + M +I+  GP+ A   V  DF  Y  GVY+    +     G H+V+++GWG E N   Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKY 383

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412



 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP +F+A +KW     +  + DQ  CG+ W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNI 243

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 52/89 (58%), Gaps = 4/89 (4%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS---IGLHAVRVLGWGVE-NDIPY 130
           + M +I+  GP+ A   V  DF  Y  GVY+    +     G H+V+++GWG E N   Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKY 383

Query: 131 WLVANSWNDHWGDHGTFKILRGENEADIE 159
           W+ ANSW   WG+HG F+ILRG NE  IE
Sbjct: 384 WIAANSWGSWWGEHGYFRILRGSNECGIE 412



 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)

Query: 186 GLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           GLP +F+A +KW     +  + DQ  CG+ W +S  +  SDR  I S G    Q+SAQ+I
Sbjct: 186 GLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNI 243

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           ++CT    GC GG    AWR+    GVV        E C PYT
Sbjct: 244 LSCTRRQQGCEGGHLDAAWRYLHKKGVV-------DENCYPYT 279


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 2/116 (1%)

Query: 189 RNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVAC 248
           + FDAR++WP+C ++  + ++ N    WA + A  ++DR CIA+NG +   +S + +++C
Sbjct: 74  KEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNGGYNKLLSTEELISC 133

Query: 249 TPNCWGCNGGWPQLA-WRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNC 303
           +      NG   + + W +   +GVV+GG YNS +GCQP+   P  + +      C
Sbjct: 134 S-GIKETNGNVNERSIWEYLKSHGVVSGGKYNSNDGCQPFKFPPIANILTHLQHTC 188



 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 54  YLPTSIPLSH---YFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFG 109
           Y  TSI  +H     +  + +      +++  +GP+   F V  DFL YKSGVY + +  
Sbjct: 193 YGNTSINYNHDHVRVRNYYTIRTGYIQKEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNA 252

Query: 110 DSIGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             I     +++GWGVEN + YWLV NSW   WG  G FKI RG N+  +E
Sbjct: 253 KVIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVE 302


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 61/107 (57%), Gaps = 4/107 (3%)

Query: 54  YLPTSIPLSHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVY-QHNFGDSI 112
           Y    + +SHY+   ++    +  +++  +GP+   F VY DF  YKSGVY +      +
Sbjct: 216 YYHDHVKVSHYY---NIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYV 272

Query: 113 GLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             H  +++GWGVEN + YWL+ N W + WG +G FKI RG NE  +E
Sbjct: 273 RRHFAKLIGWGVENGVDYWLLVNFWGNEWGQNGLFKIKRGTNEVHVE 319



 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 17/137 (12%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +   FDAR+ WP+C ++  + D  N    WA + A  ++DR+CIA+NG +   +S + ++
Sbjct: 86  IHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELI 145

Query: 247 AC----TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQN 302
            C    T       G      W +   +G+V+GG YN+ +GCQP  + P   ++   L N
Sbjct: 146 FCGGIKTKQSGAVRG---DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPIG-NIPTHLYN 201

Query: 303 CTLLGKLKTPECKQNCY 319
            T         C++ CY
Sbjct: 202 HT---------CEERCY 209


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           MR+I ++GP+ AI  V+ DF  YK+G+Y+H    +           HAV++ GWG     
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 47/94 (50%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 217 LPEFFIASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C      GCN G    AW +    G+V+   Y
Sbjct: 274 ISCCAKKRHGCNSGSVDRAWWYLRKRGLVSHACY 307


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 9/101 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD+R+KWP C  +  I DQ  CGSCWA + +  +SDR CI S G     +S Q +V
Sbjct: 125 IPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDLSPQDLV 182

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPY 287
           +C+   +GC+GG    +  F  + G+V+       E C+PY
Sbjct: 183 SCSYENFGCSGGQLTESVDFLIYEGIVS-------EKCKPY 216



 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 79  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG-VENDIPYWLVANSW 137
           ++  +GP++   SVY D + YK GVY++  G+ +G HA++++GWG  E    +W   N W
Sbjct: 256 ELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQNQW 315

Query: 138 NDHWGDHGTFKILRGENEAD 157
              WG  G   I  GE   D
Sbjct: 316 GKDWGMGGYINIKAGELGMD 335


>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 185

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 53/80 (66%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137
           ++I+++GP+++ F +Y DF  YKSGVY     +S   H+++++GWG  +   YWL  NSW
Sbjct: 96  QEIFDNGPVLSSFKMYEDFRYYKSGVYVPTTKESSTSHSIKIIGWGGASGREYWLAVNSW 155

Query: 138 NDHWGDHGTFKILRGENEAD 157
           N+ WGDHG  K+  G+N  +
Sbjct: 156 NEEWGDHGLIKMAFGKNRLE 175


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 56/98 (57%), Gaps = 13/98 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN--------FGDSIGLHAVRVLGWGVEN 126
           + M+++YE+GP+ AI  V+ DF  YKSG+Y+               G H+V++ GWG E 
Sbjct: 368 DIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEER 427

Query: 127 DIP-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                   YWL ANSW   WG+ G F+I RGENE +IE
Sbjct: 428 GRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIE 465



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 74/163 (45%), Gaps = 15/163 (9%)

Query: 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISA 242
           N   LP +F+A EKWP    +    DQ NC   WA S A   SDR+ I S G+ T  +S 
Sbjct: 217 NNDILPSHFNAAEKWPGL--VHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSP 274

Query: 243 QHIVAC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQ 301
           Q++++C T N  GC GG    AW +    GVV+       E C P+T      H    + 
Sbjct: 275 QNLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGHSAPCMM 327

Query: 302 NCTLLGKLK---TPECKQNCY--NPSYESTYRFDLKKGKKAHM 339
               +G+ K   T  C    Y  N  Y+ST  + L   +K  M
Sbjct: 328 QSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIM 370


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 56/85 (65%), Gaps = 2/85 (2%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD-SIGLHAVRVLGWGVEND-IPYWLVA 134
           M  +   GP+  +  VY+D   Y+SGVY+H +G  S+GLHA+ ++G+G  +D   YW++ 
Sbjct: 210 MHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGYGTTDDGTDYWIIR 269

Query: 135 NSWNDHWGDHGTFKILRGENEADIE 159
           NSW   WG++G F+I+RG NE  IE
Sbjct: 270 NSWGADWGENGYFRIVRGVNECRIE 294



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 6/95 (6%)

Query: 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQIS 241
           + A  +P  FD R+++P+C  +  + DQ +CG CWA S      DR C+A         S
Sbjct: 74  EPADPIPSQFDFRDEYPQC--VTPVMDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYS 131

Query: 242 AQHIVACTPNCWGCNGG--WPQLAWRFWGHNGVVT 274
            Q++++C+    GC+GG  WP   W F    G  T
Sbjct: 132 QQYLISCSTENHGCDGGDFWP--TWSFLTLTGATT 164


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGD--------SIGLHAVRVLGWGVE- 125
           + M+++ E+GP+ A+  V+ DF  YKSG+Y+H              G H+V++ GWG E 
Sbjct: 351 DIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEI 410

Query: 126 ----NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 + YW  ANSW   WG++G F+I+RG NE DIE
Sbjct: 411 QPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIE 448



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/112 (37%), Positives = 53/112 (47%), Gaps = 5/112 (4%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           LP  F A  KWP    +    DQ NC   WA S A   SDR+ I S G+ +  +S Q+++
Sbjct: 202 LPSAFSASNKWPGL--IHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQNLL 259

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQ--PYTLAPCEHH 295
           +C T N  GC GG    AW F    G+V+   Y   EG        APC  H
Sbjct: 260 SCNTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPAAPCMMH 311


>gi|145347486|ref|XP_001418195.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578424|gb|ABO96488.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 61/109 (55%), Gaps = 2/109 (1%)

Query: 187 LPRNFDAREKWPECPSLR-HIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP +FDAR  +P+C  L   + DQ  CGSCWAV+    ++DRLC+A++G    ++S Q+ 
Sbjct: 112 LPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSPQYA 171

Query: 246 VACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTLAPCEH 294
           ++C  +  GC+GG      R     G+  GG  +S   C PY    C+H
Sbjct: 172 LSCFDSGSGCDGGDVLDTLRIAFTKGIPYGGMLDSN-ACLPYEFEACDH 219



 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 6/72 (8%)

Query: 94  ADFLQYKSGVYQ--HNFGDSIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKIL 150
            D     SGVY   ++ G+ +G HA +++GWGV E    YW + NSW + WG++G  K+ 
Sbjct: 256 GDVTHTGSGVYTVPNDAGEPLGQHATKLIGWGVSEEGEHYWWMVNSWRN-WGENGVSKVR 314

Query: 151 RGENEADIEMGF 162
            G  E +IE G 
Sbjct: 315 MG--EMNIESGI 324


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YK+G+Y+H    S        +  HAV++ GWG+    
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGA 424

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG+ G F+ILRG NE+DIE
Sbjct: 425 KGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIE 460



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 50/94 (53%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I SNG +T  +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNL 273

Query: 246 VA-CTPNCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++ CT N  GCN G    AW +    G+V+   Y
Sbjct: 274 ISCCTKNRHGCNSGSVDRAWWYLRKRGLVSHACY 307


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 53/83 (63%), Gaps = 1/83 (1%)

Query: 78  RQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEND-IPYWLVANS 136
           R I   GP+ A+F+VY DF  Y  G+Y H +G + G  +V ++G+G  ++   YW+V N 
Sbjct: 209 RAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGTSDEGQDYWIVKNY 268

Query: 137 WNDHWGDHGTFKILRGENEADIE 159
           W  +WG+ G F+I+RG+NE  IE
Sbjct: 269 WGSNWGEDGYFRIVRGQNECQIE 291



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 8/106 (7%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +FD RE++P+C  +  + D   C S WA S   A   R C+        + SAQ+I+
Sbjct: 75  IPDHFDFREEYPQC--ITEVIDMGTCSSSWAHSPVEAFGHRRCMNGVDQEATRYSAQYIL 132

Query: 247 AC-TPNCWGCNGGWPQLAWRFWGHNGV-----VTGGDYNSQEGCQP 286
           +C T N      G   ++W F    G+     V   DY+  E   P
Sbjct: 133 SCATTNGCLAFPGQGVVSWDFIATTGIPLESCVKYTDYDKTESSYP 178


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YK+G+Y+H    +        +  HAV++ GWG     
Sbjct: 363 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGA 422

Query: 129 -----PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 423 RGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 458



 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 1/69 (1%)

Query: 211 NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWRFWGH 269
           NC + WA S A+  +DR+ I SNG +T  +S Q++++C   N  GCN G    AW +   
Sbjct: 237 NCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRK 296

Query: 270 NGVVTGGDY 278
            G+V+   Y
Sbjct: 297 RGLVSHACY 305


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           MR+I ++GP+ AI  V+ DF  YK+G+Y+H    +           HAV++ GWG     
Sbjct: 365 MREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGA 424

Query: 129 -----PYWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 425 HGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 14/116 (12%)

Query: 164 NRVEANSSEDDDLETMGCQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANA 223
           N V A+ +E  DL           P  F A  KWP         DQ NC + WA S A+ 
Sbjct: 205 NEVTASLAETTDL-----------PEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASV 251

Query: 224 ISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
            +DR+ I S G +T  +S Q++++C      GCN G    AW +    G+V+   Y
Sbjct: 252 AADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDRAWWYLRKRGLVSHACY 307


>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
          Length = 358

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 59/100 (59%), Gaps = 2/100 (2%)

Query: 75  NAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGL--HAVRVLGWGVENDIPYWL 132
           N  R+I  +GP+VA+  V+ DFL YK GVY+   G S     HAV+V+GWG ++ + YW+
Sbjct: 255 NIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWV 314

Query: 133 VANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANSSE 172
           + NSW D WG  G   +  G+N+  +E      + A S+E
Sbjct: 315 IENSWGDSWGLKGLAYVAVGQNQLQLEAYSVAPIVAASTE 354



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 9/102 (8%)

Query: 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV 246
           +P +++ RE  PEC   + I  Q NC S ++++  +A SDRLC + NG F  Q+S Q  +
Sbjct: 131 IPESYNFREAQPECA--QPIYFQGNCSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPI 188

Query: 247 ACTPNCWGCNGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYT 288
           +C    + C GG            G V+         C PY+
Sbjct: 189 SCDDKNYKCGGGSVTRVLEVGKKQGFVS-------TSCLPYS 223


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 63/109 (57%), Gaps = 11/109 (10%)

Query: 62  SHYFKKAHMVPRCN---AMRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-FGDS-----I 112
           ++Y+        CN    MR++ ++GP+   F VY DF  YK G+YQH   GDS     I
Sbjct: 348 TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQI 407

Query: 113 GLHAVRVLGWGVE--NDIPYWLVANSWNDHWGDHGTFKILRGENEADIE 159
             HAV ++G+G +  +   YW+V NSW   WG++G F+ILRG +E  IE
Sbjct: 408 TNHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGVDECSIE 456



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 15/105 (14%)

Query: 187 LPRNFDAREKWPECPSLRHIA---DQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ 243
            P+ FD    W    ++ +++   +Q  CGSC+A S       RL + S       +S Q
Sbjct: 235 FPKQFD----WRNVSNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQ 290

Query: 244 HIVACTPNCWGCNGGWPQLAWRFWGHN-GVVTGGDYNSQEGCQPY 287
            +V+C+    GC GG+P L    +G + G+V       +E C PY
Sbjct: 291 DVVSCSEYAQGCAGGFPYLIAGKYGEDFGLV-------EESCFPY 328


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YK+G+Y+H    +        +  HAV++ GWG     
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGA 424

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW +    G+V+   Y
Sbjct: 274 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 77  MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS--------IGLHAVRVLGWGVENDI 128
           M++I ++GP+ AI  V+ DF  YK+G+Y+H    +        +  HAV++ GWG     
Sbjct: 365 MKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGA 424

Query: 129 P-----YWLVANSWNDHWGDHGTFKILRGENEADIE 159
                 +W+ ANSW   WG++G F+ILRG NE+DIE
Sbjct: 425 QGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 187 LPRNFDAREKWPECPSLRH-IADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHI 245
           LP  F A  KWP      H   DQ NC + WA S A+  +DR+ I S G +T  +S Q++
Sbjct: 217 LPEFFVASYKWP---GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273

Query: 246 VACTP-NCWGCNGGWPQLAWRFWGHNGVVTGGDY 278
           ++C   N  GCN G    AW +    G+V+   Y
Sbjct: 274 ISCCAKNRHGCNSGSIDRAWWYLRKRGLVSHACY 307


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,978,831,710
Number of Sequences: 23463169
Number of extensions: 290740234
Number of successful extensions: 11118598
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 33926
Number of HSP's successfully gapped in prelim test: 4560
Number of HSP's that attempted gapping in prelim test: 8787244
Number of HSP's gapped (non-prelim): 1417178
length of query: 342
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 199
effective length of database: 9,003,962,200
effective search space: 1791788477800
effective search space used: 1791788477800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)