BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy15353
         (344 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 183/349 (52%), Gaps = 22/349 (6%)

Query: 1   MIHILVFLL----GCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF 54
           MI    FLL    G    RG +     S  YID IN+++ TW AG NF   +S  Y+R  
Sbjct: 1   MILKFAFLLTVYAGAAYSRGAVSNGILSKDYIDSINKDSKTWRAGSNFDEEISTSYIRGL 60

Query: 55  LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           +     + D     LP    T        +P+ FD+R++WP+C TI  + D G+C +   
Sbjct: 61  MGVLPNHKDYLPPALPTLLGT------EQIPENFDSRQKWPHCPTISLIRDQGSCGSCWA 114

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F AV A SDR CI S    N  +S E + SCC  C +     C+ G     W+F  K+G 
Sbjct: 115 FGAVEAMSDRLCIHSNKIVN--VSAENLLSCCYSCGF----GCNGGFPGAAWSFWKKKGL 168

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG YG   GCQP  I+PC HH +    P     + PK  CHT C N  Y   + +DK 
Sbjct: 169 VSGGLYGSHKGCQPYAIAPCEHHANGTRPPCSGGGRTPK--CHTFCENEDYSLPYEKDKS 226

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V  +   I+ EI+ +GP  A F++Y DF +YKSGVY+H   + L    H+ ++
Sbjct: 227 FGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGG--HAIRI 284

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG ENGTPYWLV N+W   WGD GT KIL+G   C  E  I AG P+
Sbjct: 285 LGWGVENGTPYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLPQ 333


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/353 (38%), Positives = 192/353 (54%), Gaps = 29/353 (8%)

Query: 1   MIHILVFLLGCTLVRG---ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           M+ + V ++  T   G   + Y  S  +ID+IN +A+TW AGRNF  ++S  Y+R  +  
Sbjct: 1   MVLLAVAVVSGTTAAGSGNKKYALSAKFIDEINSKASTWRAGRNFHPDVSLSYIRGLMGV 60

Query: 58  DAKYFDQSDRPLPGDRKTYDPEY----SATV---PDRFDAREQWPNCGTIGHVPDTGACA 110
               +           K  +PE+    SA V   P+ FD+REQWPNC TI  + D G+C 
Sbjct: 61  HQDAY-----------KFREPEFVHDLSADVDDLPENFDSREQWPNCPTIREIRDQGSCG 109

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +   F AV A SDR CI S G+ +   S E + SCC  C +     C+ G     W++  
Sbjct: 110 SCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWV 165

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
            +G V+GG +G   GCQP  I+PC HH +  T PSCE +     KC  +C + +Y   + 
Sbjct: 166 HKGLVSGGPFGSNLGCQPYAIAPCEHHVNG-TRPSCEGEGGKTPKCVKKCQD-SYTVPYA 223

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
           +DK   + +Y +  +ED I+KEI+ +GP    F +Y+D  HYK GVY+H +   L    H
Sbjct: 224 KDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGG--H 281

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           + +++GWG EN T YWL+ N+W   WGD G  KILRG+     E  IAAG PK
Sbjct: 282 AIRILGWGVENNTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLPK 334


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 138/348 (39%), Positives = 185/348 (53%), Gaps = 21/348 (6%)

Query: 1   MIHILVF-LLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           M  ++ F LL C +    +     SD +ID IN    TW AGRNF  N  ++YL+  L  
Sbjct: 1   MKELIPFSLLICGIFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS-LAG 59

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                 ++   LP      D     T+PD FDAR+QWPNC TIG + D G+C +   F A
Sbjct: 60  GVHKNTKNGFTLP----IRDVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGA 115

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S G+    LS E + SCC  C       C  GS    W + HK G V+G
Sbjct: 116 VEAMSDRLCIHSNGKLQVHLSAENLLSCCDSC----GDGCLGGSPESAWEYWHKFGIVSG 171

Query: 178 GDYGDRTGCQPSTISPCSH--HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           G+YG + GCQP +I+PC H  HGS+P      +    K +C    + P Y + F+  +  
Sbjct: 172 GNYGSKQGCQPYSIAPCEHSIHGSSPACGGVTDTPKCKKQCEKGYSIP-YDKAFYYGQP- 229

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y + ++   I+ EIL +GP  A+F +Y+D + YK GVY+H +   L    H  K+ 
Sbjct: 230 ---GYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGG--HVIKIF 284

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GWG ENGTPYWLV N+W   WG+ G  KI RGK EC  E  ++AG P+
Sbjct: 285 GWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAGLPR 332


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 134/344 (38%), Positives = 187/344 (54%), Gaps = 18/344 (5%)

Query: 1   MIHILVFLLGCT-LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
            + I+  +  C  L    L   SD  I  IN  A TW A R FPAN SEEY    L+   
Sbjct: 4   FVTIVCAIFVCVYLTEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSR 62

Query: 60  KYFDQSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            Y + ++     + K YDP Y     P +FD+RE W +C  IGH+ D G C +   F+  
Sbjct: 63  GYKNYTNE---AEIKKYDPLYVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTT 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
           GAF+DR C+ + G+ N  LS E +A CCK C       C  G   + W +   +G  TGG
Sbjct: 120 GAFADRLCVSTGGKFNELLSPEELAFCCKDC----GNGCEGGYPIKAWRYFRTQGVTTGG 175

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
           DY  + GC+P  ++PC +     T   C  + + +   + +C    YG+   Q +++T  
Sbjct: 176 DYDTKEGCKPYKVAPCYNKQGKNT---CGGKPMER---NHQCPKTCYGKTTDQKRYKTKS 229

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y V ++   I+++I  +GP  A+F +YDDF  YKSG+Y+ T NAK +N  HS K+IGWG
Sbjct: 230 EY-VINSIKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQN-GHSVKIIGWG 287

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWL +N+W   WGD GT KI++GK EC  E  + AG P
Sbjct: 288 QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIP 331


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 185/346 (53%), Gaps = 24/346 (6%)

Query: 1   MIHILVF-LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M  + VF LL  T  R +L+   D  I  IN   +TWTAG NF  N+ +EYL+       
Sbjct: 1   MWRVCVFVLLSVTCARPQLHTH-DEMISFINAARSTWTAGVNF-DNVPKEYLKSLC---- 54

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                    L G R  +  ++S  V  PD FD R+QWPNC T+  + D G+C +   F A
Sbjct: 55  ------GTVLKGPRLPHTVKHSTNVKLPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGA 108

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V + SDR CI SKG+Q+  +S E + SCC  C +     CS G     W++  + G VTG
Sbjct: 109 VESISDRICIHSKGKQSPEISAEDLLSCCDQCGF----GCSGGFPAEAWDYWRRSGLVTG 164

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G Y    GC+P +I+PC HH +    P    Q  PK  C   C  P Y   + QDKH  +
Sbjct: 165 GLYNSDVGCRPYSIAPCEHHVNGTRPPCSGEQDTPK--CTGVCI-PKYSVPYKQDKHFGS 221

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y V  ++  I  E+  +GP  A F +Y+DF  YKSGVY+H + + L    H+ K++GW
Sbjct: 222 KVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGG--HAVKILGW 279

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G ENGTP+WLV N+W   WGD G  KILRG  EC  E  + AG PK
Sbjct: 280 GEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAGLPK 325


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 183/343 (53%), Gaps = 19/343 (5%)

Query: 4   ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           IL  LL C      +     SD +ID IN    TW AGRNF  N  ++YL+   +A    
Sbjct: 5   ILFSLLICGTFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              +   LP  + + D     TVPD FDAR+ WPNC +I  + D G+C +   F AV A 
Sbjct: 63  DANNAFTLPKRQVSVD----VTVPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAM 118

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI S G+    LS E + SCC  C Y     C  GS    W + HK G V+GG+YG
Sbjct: 119 SDRICIHSNGKLQVHLSAENLLSCCDSCGY----GCLGGSAENAWEYWHKFGIVSGGNYG 174

Query: 182 DRTGCQPSTISPCSHHGSAP-TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
            + GCQP +I+PC H  S P + P+CE  +    KC  +C    YG  +  D       Y
Sbjct: 175 SKQGCQPYSIAPCEH--SIPGSRPACEGVR-DTPKCKKQCEK-GYGIPYGDDLCYGQPGY 230

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            ++++   I+ EIL +GP  A+  +Y+D + YK+GVY+H +   L    H  K++GWG E
Sbjct: 231 TIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGG--HVIKILGWGVE 288

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N TPYWLV N+W   WG+ G  KILRG  EC  E  I AG P+
Sbjct: 289 NDTPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGIPR 331


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 136/344 (39%), Positives = 180/344 (52%), Gaps = 18/344 (5%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DA 59
           + + + L+    V    +  SD +I  +  E +TW AGRNF  +LS  Y R+ +    D+
Sbjct: 3   VIVGLLLVAAVAVSANNHFLSDKFIKMLQSEDSTWEAGRNFNRHLSIRYFRRLMGVHPDS 62

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
           KY       +PG      PE +  +P  FD+R  WP C TIG + D G+C +   F AV 
Sbjct: 63  KYH------MPGYEAHKIPE-NFDMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
             SDR+CI SKG+ N   S+E + SCC +C +  N     G+ F+ W  +H  G V+GG 
Sbjct: 116 VMSDRQCIHSKGKSNFHYSSENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-GIVSGGS 171

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           +    GCQP  I+PC HH   P     E    PK  C  RC N  Y   +  D H     
Sbjct: 172 FNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPK--CVKRCEN-GYTVDYESDLHHGGKA 228

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +  +ED IK EI+ +GP    F +Y DF HYKSGVY+H     L    H+ +++GWG 
Sbjct: 229 YSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIRILGWGE 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ENGTPYWL  N+W   WGD G  KILRG   C  E  I+AG PK
Sbjct: 287 ENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPK 330


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 171/325 (52%), Gaps = 14/325 (4%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           +Y  S+ +I+ +N +  TWTAGRNFPAN    +++  + A      + D  L   + T+D
Sbjct: 23  VYPLSEDFINILNSKPKTWTAGRNFPANTPFAHIKMLMGAL-----KDDNILKLPKMTHD 77

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
            E  A++P+ FD R++WPNC T+  + D G+C +   F AV A +DR C  S G ++   
Sbjct: 78  AELIASLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHF 137

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + SCC IC       C+ G     W +    G V+GG Y    GC P  + PC HH
Sbjct: 138 SAEDLLSCCPICGL----GCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHH 193

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                LP   + K P  KC   C    Y   F +DKH     Y V  NED IK E+  +G
Sbjct: 194 VPGNRLPCNGDTKTP--KCQKTC-EAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNG 250

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P    F +Y D   YKSGVY+HT  + L    H+ K++GWG ENG+ YWL+ N+W   WG
Sbjct: 251 PVEGAFTVYSDLLSYKSGVYQHTDGSALGG--HAVKILGWGVENGSKYWLIANSWNSDWG 308

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           D G  KILRG+  C  E  I  G+P
Sbjct: 309 DNGFFKILRGEDHCGIESSIVTGEP 333


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 182/340 (53%), Gaps = 17/340 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    +   L    L   SD  I  IN  A TW A R FPAN SEEY    L+    Y +
Sbjct: 8   VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66

Query: 64  QSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            ++     + K YDP Y     P +FD+RE W +C  IGH+ D G C +   F+  GAF+
Sbjct: 67  YTNEV---EIKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ + G+ N+ LS E +A CC  C     K C  G   + W +   +G  TGGDY  
Sbjct: 124 DRLCVSTGGKFNQLLSPEELAFCCMDC----GKGCGGGYPIKAWKYFRTQGVTTGGDYDT 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC P  + PC       T   C  + + +   + +C    YG+   QD+++T   Y +
Sbjct: 180 KEGCMPYKVPPCYDEQGKNT---CGGKPMER---NHQCPKTCYGKTTVQDRYKTKNEYVI 233

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
           +  E  I+++++ +GP  A+F +YDDF  YKSG+Y+ T  AK E   HS K+IGWG ENG
Sbjct: 234 NSIE-TIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEG-GHSIKIIGWGEENG 291

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYWL +N+W   WGD GT KI++G+ EC  E  + AG P
Sbjct: 292 TPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIP 331


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 177/350 (50%), Gaps = 28/350 (8%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           M  IL  L+    V    +  SD +I Q+  E +TW AGRNF  +LS +Y R+ +     
Sbjct: 1   MRVILGLLVAAVAVNASSHFLSDKFIRQLQSEDSTWEAGRNFNKHLSIKYFRRLMGVHP- 59

Query: 61  YFDQSDRPLPGDRKTYDPEYSA-------TVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
                      D K + P+Y A        +P  FD+R  WP C TIG + D G+C +  
Sbjct: 60  -----------DSKFHMPKYEAHQIPENFEMPKEFDSRAAWPMCPTIGEIRDQGSCGSCW 108

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV   SDR+CI SKG+ N   S E + SCC +C +  N     G+ F+ W  +H  G
Sbjct: 109 AFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-G 164

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            V+GG +    GCQP  I+PC HH S P     E    PK  C   C    Y   +  D 
Sbjct: 165 IVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSEGGGTPK--CAKTCEK-GYIVDYESDL 221

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           H     Y +  +ED IK EI+ +GP    F +Y DF HYKSGVY+H     L    H+ +
Sbjct: 222 HHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIR 279

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ++GWG ENGTPYWL  N+W   WGD G  KILRG   C  E  I+AG PK
Sbjct: 280 VLGWGEENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPK 329


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 15/321 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
           A ID +N     WTAG        E+ +++ +  DAKY       +P  D      E S 
Sbjct: 31  ALIDYVNSAQKLWTAGHQVVPK--EKIMKKLM--DAKYV------VPHKDEDIVATEVSD 80

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PDRFDAREQWP+C +I ++ D   C +   FAA  A SDR CI S G  N  LS+E +
Sbjct: 81  AIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    +     C  G   + W +  K G VTGG Y  + GC+P +I+PC    +  T
Sbjct: 141 LSCCT-GIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199

Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            P C     P  KC   CT N TY   + QDKH     Y V    + I+ EIL +GP   
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DFY Y +GVY HT+ A L    H+ K++GWG +NGTPYWLV N+W  +WG++G 
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNINWGEKGY 317

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +I+RG  EC  E+   AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 130/350 (37%), Positives = 188/350 (53%), Gaps = 22/350 (6%)

Query: 1   MIHILVFLL---GCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
           + H++V  L   G     G  + Y  S  +I++IN +A TW AG+NF  + S  Y+R  +
Sbjct: 2   LFHLVVIALAAVGTNAAAGGSKKYPLSSKFIEEINTKATTWRAGQNFHPDTSLTYIRGLM 61

Query: 56  IA--DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
               DA  F + +         +D      +P+ FD+REQWPNC TI  + D G+C +  
Sbjct: 62  GVHPDADKFREPE-------ILHDLSDGDELPENFDSREQWPNCPTIREIRDQGSCGSCW 114

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR C+ S G+ +   S E + SCC  C +     C+ G     W++  ++G
Sbjct: 115 AFGAVEAMSDRVCVASGGKIHFRFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWVRKG 170

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            V+GG +G   GCQP  I+PC HH +  T PSCE +     KC  +C   +Y   + +DK
Sbjct: 171 LVSGGPFGSNLGCQPYAIAPCEHHVNG-TRPSCEGEGGKTPKCVKKCQE-SYNVPYQKDK 228

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
                +Y +  +E  I+KEI+ +GP    F +Y+D  HYK GVY+H +   L    H+ +
Sbjct: 229 RFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGG--HAIR 286

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ++GWG ENGT YWL+ N+W   WGD G  KILRG+     E  I+AG PK
Sbjct: 287 ILGWGVENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLPK 336


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  233 bits (595), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 182/344 (52%), Gaps = 15/344 (4%)

Query: 1   MIHILVFLLGCTL-VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           ++ +L+F  GC   +R +L   SD +ID IN     W+AGRNF  N    YL+  +    
Sbjct: 9   LVGLLIFSFGCCDDIRVDLDPLSDEFIDHINSIQYYWSAGRNFHKNTPMSYLKGLM---G 65

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
            +   +  P      +Y  +    +P+ FDARE WPNC TI  V D G+C +   F AV 
Sbjct: 66  VHESNAHYPKLEQLVSYT-DTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVE 124

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR CI SKG +N   S E + SCC+ C +     C+ G     W++   +G V+GG 
Sbjct: 125 AMSDRVCIHSKGAKNFHFSAENLVSCCRTCGF----GCNGGFPGAAWHYWKTKGIVSGGP 180

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           YG + GC P  I+PC HH +    P  E  K P   C  +C +  Y   + QD HR    
Sbjct: 181 YGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTP--ACVKKCED-GYKVPYAQDLHRGKSA 237

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y + ++ D I++EI  +GP    F +Y+DF  Y++GVYKH +   L    H+ +++GWG 
Sbjct: 238 YSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGV 295

Query: 300 ENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +NG  PYWLV N+W   WG  G  KILRG  EC  E  I AG P
Sbjct: 296 QNGEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLP 339


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 184/346 (53%), Gaps = 18/346 (5%)

Query: 1   MIHILVF-LLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           M  ++ F LL C +    +     SD +ID IN    TW AGRNF  N  ++YL+   +A
Sbjct: 1   MKELIPFSLLICGIFSASIPTDPLSDEFIDYINSLQTTWRAGRNFAPNTPKKYLKS--LA 58

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                  +   LP  + + D     T+P  FDAR+ WPNC +I  + D G+C +   F A
Sbjct: 59  GVHKDANNAFTLPKRQVSLD----VTLPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGA 114

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S G+    LS E + SCC  C +     C  G     W++    G V+G
Sbjct: 115 VEAMSDRICIHSNGKLQVHLSAENLVSCCDSCGF----GCDGGYPASAWDYWQNVGIVSG 170

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G+YG + GCQP +I+PC HH   P  P+C  +      C  +C   + G  + +D +   
Sbjct: 171 GNYGSKQGCQPYSIAPCEHHVPGPR-PACSGEGSTP-DCRNQCDKRS-GISYDKDLYYGE 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y ++D    I+ EIL +GP  A F +Y+D  +YK GVY+H + + L    H+ K++GW
Sbjct: 228 SAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGG--HAIKILGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G EN TPYWLV N+W   WG+ G  KILRGK EC  E  ++AG P+
Sbjct: 286 GVENDTPYWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLPR 331


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 124/329 (37%), Positives = 181/329 (55%), Gaps = 13/329 (3%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R  ++  S  +IDQIN +A TW AG NF    S  ++R  +         +D+ +P    
Sbjct: 24  RQRIHPLSQKFIDQINSKATTWKAGPNFSPETSMSFIRGLM----GVHKDADKFMP-PVY 78

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            ++ E     P+ FD+R QWPNC TIG + D G+C +   F AV A SDR CI S+G+ +
Sbjct: 79  LHEMEADDDFPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVH 138

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S+E + SCC  C +     C+ G     W++  ++G V+GG +G   GCQP  I+PC
Sbjct: 139 FRVSSEDLVSCCHTCGF----GCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPC 194

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +  + PSCE +     KC  +C   +Y   + +DK     +Y + ++E  I+KEI+
Sbjct: 195 EHHVNG-SRPSCEGEGGKTPKCVKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIM 252

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F +Y+D  +YK GVY H     L    H+ +++GWG E+GT YWL+ N+W  
Sbjct: 253 TNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGG--HAIRILGWGVEDGTKYWLIANSWNS 310

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+     E  IAAG PK
Sbjct: 311 DWGDNGFFKILRGEDHLGIESSIAAGLPK 339


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 180/350 (51%), Gaps = 34/350 (9%)

Query: 4   ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           IL  LL C      +     SD +ID IN    TW AGRNF  N  ++YL+   +A    
Sbjct: 5   ILFSLLICGTFSASIPTDPLSDEFIDYINTLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              +   LP  + + D     T+PD FDAR+QWPNC +I  + D G+C +   F AV A 
Sbjct: 63  NANNAFTLPKRKVSLD----VTIPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAM 118

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI S G+    LS E + SCC  C Y     C  G     W++    G V+GG+YG
Sbjct: 119 SDRICIHSNGKLQVHLSAENLVSCCDSCGY----GCDGGFPASAWDYWQNEGIVSGGNYG 174

Query: 182 DRTGCQPSTISPCSHH--GSAPT------LPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            + GCQP +I+PC HH  GS P        P C NQ            +   G  + QD 
Sbjct: 175 SKQGCQPYSIAPCEHHVPGSRPACSGGGDTPDCRNQ-----------CDEGSGISYDQDH 223

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           +     Y +D+ +  I+ EIL +GP  A F +Y+D  +YK GVY+H +   L    H+ K
Sbjct: 224 YYGETVYTLDEAKQ-IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGG--HAIK 280

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ++GWG EN TPYWLV N+W   WG+ G  KILRG  EC  E  I AG P+
Sbjct: 281 ILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGLPR 330


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 187/352 (53%), Gaps = 21/352 (5%)

Query: 1   MIHILVFLLGCTLVRG-----ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
           M  +LV  + C L  G     ++   SD +I+ +  +  TW AGRNF   +SEEY+R  +
Sbjct: 1   MKLLLVATVACLLAMGSCEENKIPLLSDEFIELVKTKTRTWQAGRNFDEGVSEEYIRGLM 60

Query: 56  IA--DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
               DA  F   D+    +   Y  +    +P  FDARE+WPNC TI  + D G+C +  
Sbjct: 61  GVHPDAYKFALPDKQ---EVLGYLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCW 117

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR CI S G  N   S + + SCC  C +     C+ G     W++  ++G
Sbjct: 118 AFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKG 173

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            V+GG YG +TGC+P  I+PC HH +    P   + K P  KC  +C    Y   + +DK
Sbjct: 174 IVSGGRYGSKTGCRPYEIAPCEHHVNGTRAPCNHDSKTP--KCQHQC-EAGYNVEYSKDK 230

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           H  + +Y V  N   I++EI+ +GP    F +Y+D   YKSGVY+H    +L    H+ +
Sbjct: 231 HFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIR 288

Query: 294 LIGWGT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ++GWG   +   PYWL+ N+W   WGD+G  +ILRG+  C  E  I+AG PK
Sbjct: 289 ILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLPK 340


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 129/326 (39%), Positives = 173/326 (53%), Gaps = 16/326 (4%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           ++  S+  I+ +N    TW AGRNF   ++ +Y+R  L     + D     LP  R    
Sbjct: 24  IHPLSEKMIEYVNFMNTTWKAGRNFHEGVTMKYIRGLL---GVHKDNHKYRLPSIRHAV- 79

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P+ FD+REQWPNC TI  + D G+C +   F A  A SDR CI S G+ N  +
Sbjct: 80  ---PGDLPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEI 136

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + +CC  C       C+ G     W +   +G VTGG Y    GCQP TI+ C HH
Sbjct: 137 SAEDLLTCCDSC----GMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHH 192

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            +   LP C    V   +C   C    Y   +  DK+    +Y +D+ ED IK EI  +G
Sbjct: 193 -TKGKLPPC-GDIVDTPQCVHMCEK-GYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNG 249

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A F +Y DF  YKSGVY+H +  ++    H+ +++GWGTE+GTPYWLV N+W   WG
Sbjct: 250 PVEAAFTVYADFVTYKSGVYRHVTGEEMGG--HAVRILGWGTESGTPYWLVANSWNTDWG 307

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D+G  KILRG  EC  E  I AG PK
Sbjct: 308 DKGYFKILRGSDECGIESSIVAGLPK 333


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 179/345 (51%), Gaps = 18/345 (5%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DA 59
           + IL  +     +   ++  S  +I QIN + +TW AG NF  N+   Y+R+ +    ++
Sbjct: 4   LPILTIICTAASLSVAVHPLSKEFIQQINEKQSTWKAGPNFAENVPMSYIRRLMGVPPNS 63

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
           KY   S +     R   D   +  +PD FDAR+QWPNC TI  + D G+C +   F AV 
Sbjct: 64  KYHMPSVK-----RHLLD---AMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR CI SKG  N  LS + + SCC  C       C+ G     W++   +G V+GG 
Sbjct: 116 AMSDRVCIHSKGAVNVRLSADDLVSCCYSC----GMGCNGGFPGAAWHYWVNKGIVSGGS 171

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           +G   GC+P  I+PC HH +  T P C         C  +C    Y   + +DK+     
Sbjct: 172 FGSNQGCRPYEIAPCEHHVNG-TRPPCTGDDNKTPSCKQQCEK-GYNVPYKKDKNFGKEA 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +      I+KEI+ +GP    F +Y+D   YK GVY+H     L    H+ +++GWGT
Sbjct: 230 YSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGG--HAIRILGWGT 287

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           E GTPYWL+ N+W   WGD GT KILRG+  C  E  I AG PK+
Sbjct: 288 EKGTPYWLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAGIPKD 332


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 128/321 (39%), Positives = 170/321 (52%), Gaps = 15/321 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
           A ID +N     WTAG        E+ +++ +  DAKY       +P  D      E S 
Sbjct: 31  ALIDYVNSAQKLWTAGHQVVPK--EKIMKKLM--DAKYV------VPHKDEDIVATEVSD 80

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PD FDAREQWP+C +I ++ D   C +   FAA  A SDR CI S G  N  LS+E +
Sbjct: 81  AIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    +     C  G   + W +  K G VTGG Y  + GC+P +I+PC    +  T
Sbjct: 141 LSCCT-GIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVT 199

Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            P C     P  KC   CT N TY   + QDKH     Y V    + I+ EIL +GP   
Sbjct: 200 WPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEV 259

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DFY Y +GVY HT+ A L    H+ K++GWG +NGTPYWLV N+W  +WG++G 
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNINWGEKGY 317

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +I+RG  EC  E+   AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 194/344 (56%), Gaps = 16/344 (4%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKY 61
           +L+ ++  ++   + +  SD +I+ +  +ANTWT GRNF  ++SE+Y+R  +    DA  
Sbjct: 8   LLLMVVYLSMFEAKDHLLSDEFIELVRGKANTWTVGRNFHESVSEKYIRGLMGVHPDADK 67

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
           F   D+     +   D +  + +P  FDARE+W NC TIG + D G+C +   F AV A 
Sbjct: 68  FALPDKMEVLGKLVEDSD--SDIPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAM 125

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI S+G+ N  LS + + SCC  C +     C+ G     W++  ++G V+GG++G
Sbjct: 126 SDRVCIHSQGKVNFHLSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGNFG 181

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
            + GC+P  I PC HH +  T P C +   P  +C   C + +Y   + +DK+  + +Y 
Sbjct: 182 SQQGCRPYEIEPCEHHVNG-TRPPCSSGSTP--RCQHVCES-SYKVDYKKDKNFGSKSYS 237

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-- 299
           + +N   I+KEI+ +GP    F +Y+D   YKSGVY+H    +L    H+ +++GWG   
Sbjct: 238 IKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGG--HAIRILGWGVWG 295

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +   PYWL+ N+W   WGD G  +I+RGK  C  E  I+AG PK
Sbjct: 296 DEKIPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAGLPK 339


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 125/328 (38%), Positives = 182/328 (55%), Gaps = 18/328 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD +I+ +  +A+TW  GRNF  ++SEEY+R  +     + D     LP  R      Y
Sbjct: 23  LSDEFIELVRSKASTWQVGRNFKESVSEEYIRGLM---GVHPDAHKFALPEKRIVLGDLY 79

Query: 81  S---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           +     +P+ FDAR+ WPNC TIG + D G+C +   F AV A SDR CI S+G+ N  L
Sbjct: 80  ADDGVDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHL 139

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SCC IC +     C+ G     W++  ++G V+GG YG   GC+P  I+PC HH
Sbjct: 140 SADDLVSCCHICGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHH 195

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            +  T P C +   P   C  +C   +Y   + +DK+  + +Y V  N   I++EI+ +G
Sbjct: 196 VNG-TRPPCSHGSTP--SCQHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNG 251

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWGPH 315
           P    F +Y+D   YKSGVY+H    +L    H+ +++GWG   E+  PYWL+ N+W   
Sbjct: 252 PVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIRILGWGVWGESKVPYWLIGNSWNTD 309

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 310 WGDNGFFRILRGQDHCGIESSISAGLPK 337


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  230 bits (586), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 135/346 (39%), Positives = 179/346 (51%), Gaps = 20/346 (5%)

Query: 4   ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++V LL       E++       SD  I+ IN+   TW AGRNF  ++S  Y+R  +   
Sbjct: 6   LVVGLLAAVCFGREIHPKKWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVH 65

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            K  +        D      E    +P+ FDARE+W +C +I  + D   C +   F A 
Sbjct: 66  PKSKEYRLAEFVHD------EIPDDLPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAA 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI SKG+    +S E +  CC  C       C+ G     W +  + G VTGG
Sbjct: 120 EAMSDRVCIHSKGKIQVDISAEDLLDCCDSC----GAGCNGGYPAAAWEYWKESGLVTGG 175

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            YG   GC+P +++PC HH +  +LP+C    VP  KC   C    YG+ +  DKH    
Sbjct: 176 LYGTSDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQDDKHFGRK 232

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y +  +E  I+ EI  +GP  A F +Y DF  YKSGVY+H S   L    H+ +++GWG
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGG--HAIRILGWG 290

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           TENGTPYWLV N+W   WGD G  KILRGK EC  E  I AG PKN
Sbjct: 291 TENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKN 336


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  230 bits (586), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 177/345 (51%), Gaps = 21/345 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I  L  +L   L R  L   S+  ++ IN+  +TW AG NF  N+   YLR+       
Sbjct: 5   VIPFLAAILSVGLARPPLKTLSNEMVNHINKVNSTWKAGLNF-QNVDYSYLRRL------ 57

Query: 61  YFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                   L G +     +++A V  P  FDAR QWP C T+  V D G+C +   F A 
Sbjct: 58  ----CGTMLKGPKLPVKLQFTADVQLPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAA 113

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI S G  N  +S E + SCC  C       C+ G     W F    G V+GG
Sbjct: 114 EAISDRLCIHSNGLMNVEISAEDLLSCCDSC----GMGCNGGYPSAAWEFWTTDGLVSGG 169

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I+PC HH +  + P C  +     +C  +C    Y  G+ QDKH   L
Sbjct: 170 LYDSHIGCRPYSIAPCEHHVNG-SRPPCTGEGGDTPQCTKKC-EAGYTPGYTQDKHYGKL 227

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y VDD+E  I+ EI  +GP    F +Y+DF  YK+GVY+H + + +    H+ K++GWG
Sbjct: 228 SYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGG--HAIKVLGWG 285

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            ENGTPYWL  N+W   WGD G  KILRG   C  E  I AG PK
Sbjct: 286 EENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAGIPK 330


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 127/333 (38%), Positives = 177/333 (53%), Gaps = 26/333 (7%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           EL+  SD +I+ IN   +TWTAGRNF  + S +Y+ + +             LP D K Y
Sbjct: 16  ELHPLSDEFINSINAAKSTWTAGRNFAQDKSMDYIIKLMGV-----------LP-DHKNY 63

Query: 77  DPEY------SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
            P        +  +P  FDAR+QWP+C TI  + D G+C +   F AV A SDR CI S 
Sbjct: 64  MPPVLTHKLEALEIPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSN 123

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G+ N   S++ + SCC  C       C+ G     W++  ++G V+GG YG + GC+P  
Sbjct: 124 GESNFHFSSDDLVSCCWTC----GMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYE 179

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
           I PC HH +  + P+C+  +    KC   C +  Y   +  D H  +  Y +  +   I+
Sbjct: 180 IPPCEHHTNG-SRPACDASEGNTPKCAKSCES-NYKINYSNDLHFGSKAYSISSDVKQIQ 237

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
            EIL +GP    F++Y DF +YK+GVY+H     L    H+ ++ GWG EN TPYWL+ N
Sbjct: 238 AEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGG--HAIRIFGWGVENNTPYWLIAN 295

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +W   WGD GT KILRG   C  E  I AG PK
Sbjct: 296 SWNTDWGDSGTFKILRGSDHCGIESGIVAGLPK 328


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 133/346 (38%), Positives = 180/346 (52%), Gaps = 20/346 (5%)

Query: 4   ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++V LL       E++       SD  I+ IN+   TW AGRNF  ++S  Y+R  +   
Sbjct: 6   LVVGLLAAVCFGREIHPKKWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVH 65

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            K  +        D      E    +P+ FDARE+WP+C +I  + D   C +   F A 
Sbjct: 66  PKSKEYRLAEFVHD------EIPDDLPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAA 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI SKG+    +S E +  CC  C       C+ G+    W +  + G VTGG
Sbjct: 120 EAMSDRVCIHSKGKIQVNISAEDLLDCCDSC----GAGCNGGTPAAAWEYWKESGLVTGG 175

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            YG   GC+P +++PC HH +  +LP+C    VP  KC   C    YG+ +  DKH    
Sbjct: 176 LYGTNDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQDDKHFGKK 232

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y +  +E  I+ EI  +GP  A F +  DF  YKSGVY+H S+  +    H+ +++GWG
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGG--HAIRILGWG 290

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           TENGTPYWL  N+W   WGD G  KILRGK EC  E  I AG PKN
Sbjct: 291 TENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAGIPKN 336


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 126/323 (39%), Positives = 175/323 (54%), Gaps = 14/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S  +IDQIN +A TW AGRNF  +    Y+R  +         +D+ +P     +D + 
Sbjct: 25  LSGKFIDQINAKATTWRAGRNFHPDTPMSYIRGLM----GVHKDADKFMP-PVMLHDLDE 79

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI SKG+ +  +S E
Sbjct: 80  GDDLPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAE 139

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C +     C+ G     W++  ++G V+GG YG   GCQP  ISPC HH + 
Sbjct: 140 DLVSCCHTCGF----GCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNG 195

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P     K P  KC  +C   +Y   + +DK     +Y +  +E  I+KE+  +GP  
Sbjct: 196 TRGPCNGEGKTP--KCVKKC-QASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVE 252

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+D  +YK GVY+HT+   L    H+ +++GWG EN T +WL+ N+W   WGD G
Sbjct: 253 GAFTVYEDLLNYKEGVYQHTAGKMLGG--HAIRILGWGVENDTKFWLIANSWNSDWGDNG 310

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KILRG      E  IAAG PK
Sbjct: 311 YFKILRGSDHLGIESSIAAGLPK 333


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 180/346 (52%), Gaps = 17/346 (4%)

Query: 1   MIHILVFLLG---CTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           ++ +L+F  G      VR +L   SD +ID IN     W+AGRNF  +    Y++  +  
Sbjct: 9   LVGLLIFSFGRVDGATVRVDLNPLSDEFIDHINSIQYYWSAGRNFHKDTPISYIKGLMGV 68

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
             K    ++ P      TY+ + S  +P+ FDARE+WPNC TI  V D G+C +   F A
Sbjct: 69  HEK---NAEYPKLEQLLTYN-DASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGA 124

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S G +N   S E + SCC  C +     C+ G     WN+   +G V+G
Sbjct: 125 VEAMSDRVCIHSNGTKNFHFSAENLVSCCWTCGF----GCNGGFPGAAWNYWKTKGIVSG 180

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G YG   GC P  I+PC HH +    P  E  K P   C  +C    Y   + QD H   
Sbjct: 181 GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTP--TCVKKCEE-GYKVPYAQDLHHGK 237

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y + ++ D I++EI  +GP    F +Y+DF  Y++GVYKH +   L    H+ +++GW
Sbjct: 238 SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGW 295

Query: 298 GTENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G +NG  PYWLV N+W   WG  G  KILRG  EC  E  I AG P
Sbjct: 296 GVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 341


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 132/323 (40%), Positives = 168/323 (52%), Gaps = 12/323 (3%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD  I  IN+   +W AG+NF     E+ L    I    Y D    P        D E 
Sbjct: 54  LSDEMIWFINKVNTSWKAGQNFHHIKQEDRLDHVKIMCGTYLD---VPPHLQLPVRDIEP 110

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FDAR QW NC TI  + D G+C +   F AV + SDR CIKS GQQN  +S E
Sbjct: 111 RKDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAE 170

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC+ C       C+ G +   W +  + G VTGG Y    GCQP T+  C HH   
Sbjct: 171 DLTSCCRSC----GNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVG 226

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P C  ++     C   C +  Y   + +DKH     Y V   +  I  EI+ +GP  
Sbjct: 227 KLQP-CSKKEEHTPVCKHECES-GYNVSYTKDKHYGATAYSVRGVQQ-IMTEIMTNGPVE 283

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y DF  YKSGVYKHT+ + L    H+ K++GWGTE G  YWLV N+W P WG++G
Sbjct: 284 GAFTVYADFPQYKSGVYKHTTGSPLGG--HAIKIMGWGTEGGDDYWLVANSWNPDWGNQG 341

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
           T KILRG+ EC  E  IAAG+PK
Sbjct: 342 TFKILRGRDECGIESQIAAGEPK 364


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 174/352 (49%), Gaps = 22/352 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKF---------SDAYIDQINREANTWTAGRNFPANLSEEYL 51
           M  +L  +    LV  +  K          SD +I+ IN   +TW AGRNF  N     L
Sbjct: 1   MKIVLSIIFAVVLVTSQAKKLKSNKYFNPLSDEFINHINSMKSTWKAGRNFGKNFPMGAL 60

Query: 52  RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
            Q +         S+  +P  +       +  +P+ FDAREQWP+C TI  + D G+C +
Sbjct: 61  TQMM----GVHPDSNLYMPPLKNVSQMYSNQAIPEAFDAREQWPDCPTIQEIRDQGSCGS 116

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
              F AV A SDR CI SKG+ N  LS E + SCC  C +     C+ G     W+   K
Sbjct: 117 CWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCCYTCGF----GCNGGFPGAAWSHWVK 172

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
           +G VTGG++    GCQP  I  C HH +    P  E    P  KC   C +  Y   + Q
Sbjct: 173 KGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPCSEGGGTP--KCLKTCED-GYTVDYTQ 229

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           D H    +Y V    + I+ EI+ +GP      +Y+DF  YKSGVY+H     L    H+
Sbjct: 230 DLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGG--HA 287

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            +++GWG E G PYWL+ N+W   WGD G +K+LRGK  C  E  I AG PK
Sbjct: 288 IRILGWGVEEGVPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLPK 339


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 124/339 (36%), Positives = 171/339 (50%), Gaps = 14/339 (4%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    L    V   L   +D +I+ IN + N+W AGRNFP N    ++++         D
Sbjct: 12  VCTLALASASVEDLLNPLTDEFINLINTKQNSWKAGRNFPVNTPLTHIKKLT---GVLVD 68

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
                LP  +  +D +  A +P+ FD R++WPNC T+  V D G+C +   F AV A +D
Sbjct: 69  THLSKLP--KVEHDADLIADLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTD 126

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
           R C  S G ++   S E + SCC +C       C+ G     W +    G V+GG Y   
Sbjct: 127 RYCTYSNGTKHFHFSAEDLLSCCPVCGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSS 182

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
            GC+P  I PC HH     +P   + K PK  CH  C + +Y   + +DK      Y V 
Sbjct: 183 QGCRPYEIPPCEHHVPGNRMPCNGDSKTPK--CHKTCES-SYNVDYHKDKRYGKHVYSVS 239

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
             ED IK E+  +GP    F +Y D  +YK+GVYKHT    L    H+ K++GWG ENG 
Sbjct: 240 SKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGG--HAIKILGWGVENGN 297

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            YWL+ N+W   WGD G  KILRG+  C  E  I AG+P
Sbjct: 298 KYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 126/337 (37%), Positives = 181/337 (53%), Gaps = 17/337 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
            L  GE    SD +I+ +  +A TWT GRNF A+++E ++R+ +     + D     LP 
Sbjct: 15  ALTSGEPSLLSDEFIEVVRSKAKTWTVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71

Query: 72  DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
            R+     Y  +V   P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 72  KREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             ISPC HH +    P     + P  KC   C +  Y   + +DKH  + +Y V  N   
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGRTP--KCSHVCQS-GYTVDYAKDKHFGSKSYSVRRNVRE 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
           I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG   E   PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGEEKIPYW 302

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 182/340 (53%), Gaps = 17/340 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    +   L    L   SD  I  IN  A TW A R FPAN SEEY    L+    Y +
Sbjct: 8   VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66

Query: 64  QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            ++     + K YDP Y     P +FD+R  W +C  IGH+ D G C +   F+  GAF+
Sbjct: 67  YTNE---FEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ + G+ N+ LS E +  CCK C     + C  G+  + W +   +G  TGGDY  
Sbjct: 124 DRLCVSTGGKFNQLLSPEELTFCCKDC----GQGCGGGNPMKAWEYFRTQGVTTGGDYNT 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC P  + PC +         C+ Q + +   + +C    YG+   Q++++T   Y++
Sbjct: 180 KEGCMPYKVPPCRNKQGENI---CDEQPMER---NHQCPKTCYGKTTVQNRYKTKSEYYI 233

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
           +  +  I+++I  +GP  A+F  YDD   YKSG+Y+ + NAK +   HS K+IGWG E+G
Sbjct: 234 NSIK-TIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKG-GHSIKIIGWGQEDG 291

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYWL +N+W   WGD GT KI++G+ EC  E  + AG P
Sbjct: 292 TPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIP 331


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 137/351 (39%), Positives = 175/351 (49%), Gaps = 24/351 (6%)

Query: 5   LVFLLGCTLVRGELYKF------------SDAYIDQINREANTWTAGRNFPANLSEEYLR 52
           +  L+ C LV G +               SD  I  IN+   TW AG+NF     ++ L 
Sbjct: 1   MKVLVLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINKLNTTWKAGQNFHHIAKDDRLA 60

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
              +    Y +        ++K    E    +P  FD+R QWPNC T+  V D GAC + 
Sbjct: 61  HVKMMCGTYLNTPPELRLPEKKM---EPLKDLPASFDSRTQWPNCPTLKEVRDQGACGSC 117

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F AV A SDR CIKS+G++N  +S E + SCC+ C       C  G     W++  + 
Sbjct: 118 WAFGAVEAMSDRICIKSQGKENVHISAEDLTSCCRTC----GNGCEGGFPSAAWSYYKRD 173

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG Y    GCQP TI  C HH      P C     P  KC   C    Y   + +D
Sbjct: 174 GLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKCKHTC-EAGYNVTYEKD 231

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           KH     Y V   E  I  EI+ +GP    F +Y DF  YKSGVYKHT+   L    H+ 
Sbjct: 232 KHYGMSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGG--HAI 288

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           K++GWGTENG  YWLV N+W P WGD+G  KILRG+ EC  E  I+AG+PK
Sbjct: 289 KILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPK 339


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 135/346 (39%), Positives = 179/346 (51%), Gaps = 23/346 (6%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  LY    SD  ++ +N++  TW AG NF  N+   Y+++   A 
Sbjct: 4   LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A V  P+ FDAREQWPNC TI  + D G+C +   F 
Sbjct: 63  ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI S G+ N  +S E + +CC     +    C+ G     WNF  K+G V+
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDMLTCCD---GECGDGCNGGFPSGAWNFWTKKGLVS 169

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GG Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH  
Sbjct: 170 GGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFG 226

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ +++G
Sbjct: 227 CSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRILG 284

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 285 WGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 137/351 (39%), Positives = 175/351 (49%), Gaps = 24/351 (6%)

Query: 5   LVFLLGCTLVRGELYKF------------SDAYIDQINREANTWTAGRNFPANLSEEYLR 52
           +  L+ C LV G +               SD  I  IN+   TW AG+NF     ++ L 
Sbjct: 1   MKVLVLCALVAGAMSALVEFRDKDIFEPLSDEMIWFINKMNTTWKAGQNFHHIAKDDRLA 60

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
              +    Y +        ++K    E    +P  FD+R QWPNC T+  V D GAC + 
Sbjct: 61  HVKMMCGTYLNTPPELRLPEKKM---EPLKDLPATFDSRTQWPNCPTLKEVRDQGACGSC 117

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F AV A SDR CIKS+G++N  +S E + SCC+ C       C  G     W++  K 
Sbjct: 118 WAFGAVEAMSDRICIKSQGKENTHISAEDLTSCCRTC----GNGCEGGFPSAAWSYYKKD 173

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG Y    GC P TI  C HH      P C     P  KC   C    Y   + +D
Sbjct: 174 GLVTGGQYNSHQGCLPYTIKACDHHVVGKLQP-CSKSIGPTPKCKHTC-EAGYNVTYEKD 231

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           KH  +  Y V   E  I  EI+ +GP    F +Y DF  YKSGVYKHT+   L    H+ 
Sbjct: 232 KHYGSSAYSVHGVEK-IMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGG--HAI 288

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           K++GWGTENG  YWLV N+W P WGD+G  KILRG+ EC  E  I+AG+PK
Sbjct: 289 KILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEPK 339


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 169/321 (52%), Gaps = 15/321 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTYDPEYSA 82
           A ID +N     WTAG      + +E + + L+ D KY       +P  D      E S 
Sbjct: 31  ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYL------VPHKDEDIVATEVSD 80

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PD FDAR+QWPNC +I ++ D   C +   FAA  A SDR CI S G  N  LS+E +
Sbjct: 81  AIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDL 140

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    +     C  G   + W +  K G VTGG Y  + GC+P +I+PC    +   
Sbjct: 141 LSCCT-GMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVK 199

Query: 203 LPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            P+C     P  KC   CT+   Y   + QDKH  +  Y V    + I+ EIL +GP   
Sbjct: 200 WPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEV 259

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DFY Y +GVY HT+ A L    H+ K++GWG +NGTPYWLV N+W   WG++G 
Sbjct: 260 AFTVYEDFYQYTTGVYVHTAGASLGG--HAVKILGWGVDNGTPYWLVANSWNVAWGEKGY 317

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +I+RG  EC  E+   AG P
Sbjct: 318 FRIIRGLNECGIEHSAVAGIP 338


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  227 bits (578), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 182/343 (53%), Gaps = 15/343 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++   V ++   + R  +   S+ +IDQIN +A TW AGRNF  +    Y R  +     
Sbjct: 5   LLTATVIVVLWAMYRVSINPLSEKFIDQINAKATTWHAGRNFHPDTPLSYFRGLM----G 60

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               +D+ +P     +D +    +P+ FD+REQWPNC TI  + D G+C +   F AV A
Sbjct: 61  VHKDADKFMP-PVMLHDLDEGDDLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEA 119

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI SKG+    +S E + +CC  C +     C  G+    W    ++G V+GG +
Sbjct: 120 MSDRVCIHSKGKVLFRVSAEDLLTCCTNCGH----GCDGGAPGAGWKHWIEKGLVSGGPF 175

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
           G   GC+P TI PC H  +    P C++   P  KC  +C  P Y   + +DK     TY
Sbjct: 176 GSDQGCRPYTIEPCVHVENGAQSP-CKDSITP--KCIKKCL-PGYNVPYAKDKSFGKSTY 231

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            + ++E  I+KEI  +GP  ATF ++DDF  YK G+Y+HTS        H+ +++GWG E
Sbjct: 232 SIANDERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGE--HAVRILGWGVE 289

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           NGT YWL  N+W   WGD G  KILRG      E  I AG PK
Sbjct: 290 NGTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLPK 332


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 134/346 (38%), Positives = 178/346 (51%), Gaps = 23/346 (6%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  L+    SD  ++ +N++  TW AG NF  N+   Y+++   A 
Sbjct: 4   LLATLSCLLVLTSARSSLHFPPLSDEMVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGA- 61

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A   +PD FDAREQWPNC TI  + D G+C +   F 
Sbjct: 62  ---------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI SKG+ N  +S E + +CC     +    C+ G     WNF  K+G V+
Sbjct: 113 AVEAISDRICIHSKGRVNVEVSAEDMLTCCG---SECGDGCNGGFPSGAWNFWTKKGLVS 169

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GG Y    GC+P +I PC HH +    P       P  KC   C  P Y   +  DKH  
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPSYKDDKHFG 226

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V  NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ +++G
Sbjct: 227 CSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGG--HAIRILG 284

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG EN TPYWLV N+W   WGD+G  KILRG+  C  E  I AG P
Sbjct: 285 WGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 177/345 (51%), Gaps = 24/345 (6%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           L+  L C  V      R E    SD  ++ +N++  TW AG NF  N+   Y+++     
Sbjct: 4   LLATLSCLAVLTTARSRLEFQPLSDELVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGTK 62

Query: 59  AKYFDQSDR-PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                   R  L GD           +P+ FDAREQWP C TI  + D G+C +   F A
Sbjct: 63  LGGPKLPQRLSLAGD---------IALPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGA 113

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI+S G QN  +S E + +CC    +   + C+ G     WNF  K+G V+G
Sbjct: 114 VEAISDRICIRSNGLQNVEVSAEDLLTCCG---FQCGEGCNGGFPSGAWNFWKKQGLVSG 170

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G Y    GC+P +I PC HH +  + P C  +     KC   C  P Y   + +DKH   
Sbjct: 171 GLYDSHVGCRPYSIPPCEHHVNG-SRPPCSGEGGDTPKCSKIC-EPGYSPSYKEDKHFGC 228

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
            TY V  +E  I  EI  +GP  A F++Y DF  YKSGVY+H +   +    H+ +++GW
Sbjct: 229 DTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGG--HAVRILGW 286

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 GVENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 331


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/337 (37%), Positives = 184/337 (54%), Gaps = 17/337 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSD-RP 68
            L  GE    SD +I+ +  +A TWT GRNF A+++E ++R+ +    DA  F  +D R 
Sbjct: 15  ALTAGEPSLLSDEFIELVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKRE 74

Query: 69  LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
           + GD      +    +P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 75  VLGDLYMNSVD---EIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             ISPC HH +    P       P  KC   C + +Y   + +DKH  + +Y V  N   
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGATP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVRRNVRD 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
           I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG   +   PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGDEKIPYW 302

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L+ N+W   WGD+G  +ILRG+  C  E  I+AG PK
Sbjct: 303 LIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLPK 339


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    +   L    L   SD  I  IN  A TW A R FPAN SEEY    L+    Y +
Sbjct: 8   VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66

Query: 64  QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            ++     + K YDP Y     P +FD+R  W +C  IGH+ D G C +   F+  GAF+
Sbjct: 67  YTNEF---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ + G+ N+ LS E +A CCK C     + C  G   + W +   +G  TGGDY  
Sbjct: 124 DRLCVSTGGKFNQLLSPEELAFCCKDC----GQGCGGGYPIKAWKYFRTQGVTTGGDYDT 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC P  + PC +     T   C  Q + +   + +C    YG+   Q++++T   Y +
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQPMER---NHQCPKTCYGKTTVQNRYKTKSEYSI 233

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
           +  +  I++++  +GP  A+F +YDDF  YKSG+Y+ T  AK E   HS K+IGWG ENG
Sbjct: 234 NSIK-TIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEG-RHSIKIIGWGQENG 291

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           T YWL +N+W   WG+ GT KI++G+ EC  E  + AG P
Sbjct: 292 TTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/346 (38%), Positives = 178/346 (51%), Gaps = 23/346 (6%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  L+    SD  ++ +N++  TW AG NF  N+   Y+++   A 
Sbjct: 4   LLATLSCLLVLTSARSSLHFPPLSDEMVNYVNKQNTTWKAGHNF-YNVDLSYVKKLCGA- 61

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A   +PD FDAREQWPNC TI  + D G+C +   F 
Sbjct: 62  ---------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI SKG+ N  +S E + +CC     +    C+ G     WNF  K+G V+
Sbjct: 113 AVEAISDRICIHSKGRVNVEVSAEDMLTCCG---SECGDGCNGGFPSGAWNFWTKKGLVS 169

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GG Y    GC+P +I PC HH +    P       P  KC   C  P Y   +  DKH  
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPSYKDDKHFG 226

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V  NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ +++G
Sbjct: 227 CSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGG--HAIRILG 284

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG EN TPYWLV N+W   WGD+G  KILRG+  C  E  I AG P
Sbjct: 285 WGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVAGMP 330


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 126/320 (39%), Positives = 168/320 (52%), Gaps = 13/320 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A ID +N     WTAG      + +E + + L+ D KY          D      E    
Sbjct: 32  ALIDYVNSAQKLWTAGHQV---IPKEKITKKLM-DVKYLVPHK-----DEDIVATEVFDA 82

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDAR+QWP+C +I ++ D   C +   FAA  A SDR CI S G  N  LS++ + 
Sbjct: 83  IPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLL 142

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC       N  C  G   + W +  K G VTGG Y  + GC+P +I+PC    +  T 
Sbjct: 143 SCCTGLLSCGN-GCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTW 201

Query: 204 PSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           P C +   P  KC   CT N TY   + QDKH     Y V    + I+ EIL +GP    
Sbjct: 202 PKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVA 261

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DFY Y +GVY HTS A L    H+ K++GWG +NGTPYWLV N+W  +WG++G  
Sbjct: 262 FTVYEDFYQYTTGVYVHTSGASLGG--HAVKILGWGVDNGTPYWLVANSWNVNWGEKGYF 319

Query: 323 KILRGKYECAFEYLIAAGKP 342
           +I+RG  EC  E+   AG P
Sbjct: 320 RIIRGLNECGIEHSAVAGIP 339


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/323 (38%), Positives = 167/323 (51%), Gaps = 16/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S+++I  +N EA  W AG NF    S  Y+R  +     + D    PLP    T     
Sbjct: 25  LSESFIASVNEEAQIWKAGPNFHPETSSNYIRSLMGVLPNHRDYLPPPLPNLLGT----- 79

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
             ++PD FDARE WPNC +I  + D G+C +   F A  A SDR CI +   +N  +S E
Sbjct: 80  -ESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHT--HKNVNISAE 136

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C +     C+ G     W F   +G V+GG YG   GCQP  I PC HH + 
Sbjct: 137 NLLSCCYTCGF----GCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNG 192

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P  E  + PK  CH  C N  Y   + +D      +Y +  +   I+ +I+ +GP  
Sbjct: 193 TRKPCAEGGRTPK--CHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVE 250

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F++Y DF  YKSGVY+H   + L    H+ +++GWG E GTPYWLV N+W   WGD G
Sbjct: 251 AAFSVYSDFMSYKSGVYRHVKGSLLGG--HAIRILGWGMEKGTPYWLVANSWNTDWGDNG 308

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
           T KILRG   C  E  + AG P+
Sbjct: 309 TFKILRGSDHCGIEDSVVAGLPR 331


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 132/341 (38%), Positives = 172/341 (50%), Gaps = 18/341 (5%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           +  LV L G    R      SD  +D +N+   TW AG NF  N+   YLR+       +
Sbjct: 8   LSCLVMLTG-AQSRLPFRALSDELVDYVNKRNTTWKAGHNF-HNVDPSYLRRLC---GTF 62

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
                 P     +      +  +P+ FDAREQWPNC TI  + D G+C +   F AV A 
Sbjct: 63  LGGPKLP-----QRVQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAI 117

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI++ G  N  +S E + +CC     D    C+ G     WNF  K+G V+GG Y 
Sbjct: 118 SDRICIRTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGGLYD 174

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    +Y 
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPSYKEDKHYGCSSYS 231

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           V DNE  I  EI  +GP  A F +Y DF  YKSGVY+H +   +    H+ +++GWG E+
Sbjct: 232 VSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGG--HAVRILGWGVED 289

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 290 GTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 180/337 (53%), Gaps = 17/337 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
            L  GE    SD +I+ +  +A TW  GRNF A+++E ++R+ +     + D     LP 
Sbjct: 15  ALTSGEPSLLSDEFIEVVRSKAKTWKVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71

Query: 72  DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
            R+     Y  +V   P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 72  KREVLGDLYMNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             ISPC HH +    P       P  KC   C + +Y   + +DKH  + +Y V  N   
Sbjct: 188 YEISPCEHHVNGTRPPCAHGGGTP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVKRNVRE 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
           I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG   +   PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGDEKIPYW 302

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 133/346 (38%), Positives = 179/346 (51%), Gaps = 20/346 (5%)

Query: 4   ILVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++V LL       E++       SD  I+ IN+   TW AGRNF  ++S  Y+R  +  +
Sbjct: 6   LVVGLLAAVCFGREIHPKRWHPLSDQMINFINKINTTWKAGRNFDKSISMSYIRGLMGVN 65

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            K     +  LP   +    E    +P+ FDARE+W +C +I  + D   C +   F A 
Sbjct: 66  PK---SKEYRLP---EFVHEEIPDDLPESFDAREKWSHCASINLIRDQSTCGSCWAFGAA 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI S+G     +S E +  CC  C       C  G     W +  + G V+ G
Sbjct: 120 EAMSDRVCIHSEGGIQVNISAEDLLDCCDSC----GAGCDGGYPAAAWEYWKESGLVSDG 175

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            YG   GC+P +++PC HH +  +LP+C    VP  KC   C    YG+ +  DKH    
Sbjct: 176 LYGTPDGCKPYSLAPCEHH-TKGSLPNCTGT-VPTPKCVHLCRK-GYGKDYQHDKHFGKK 232

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y +  NE  I+ EI  +GP  A F +Y DF  YKSGVY+H S   L    H+ +++GWG
Sbjct: 233 VYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGG--HAIRILGWG 290

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           TENGTPYWLV N+W   WGD G  KILRGK EC  E  I AG PK+
Sbjct: 291 TENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKD 336


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 127/339 (37%), Positives = 181/339 (53%), Gaps = 21/339 (6%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPL 69
            L  GE    SD +I+ +  +A TWT GRNF ++++E Y+R+ +    DA  F  +D+  
Sbjct: 15  ALTSGEPSFLSDEFIELVRSKAKTWTVGRNFDSSVTEGYIRRLMGVHPDAHKFALADK-- 72

Query: 70  PGDRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
              R+     Y  TV   P+ FD+R+QWPNC TIG + D G C +   F AV A SDR C
Sbjct: 73  ---REVLGDLYMNTVDQIPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVC 129

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
           I S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC
Sbjct: 130 IHSGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGC 185

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
           +P  I+PC HH +    P       P  KC   C +  Y   + +DKH  + +Y V  N 
Sbjct: 186 RPYEIAPCEHHVNGTRPPCGHGGGTP--KCSHVCES-GYTVDYAKDKHFGSKSYSVKRNV 242

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTP 304
             I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG   E   P
Sbjct: 243 RDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGG--HAIRILGWGVWGEEKIP 300

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWL+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 301 YWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLPK 339


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 169/325 (52%), Gaps = 18/325 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
            SD +I  +  E +TW AGRNF  +LS  Y R+ +    D+KY       +P       P
Sbjct: 21  LSDKFIKLLQSEDSTWEAGRNFNKHLSIRYFRRLMGVHPDSKYH------MPKYEVHQIP 74

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E +  +P  FD+R  WP C TIG + D G+C +   F AV   SDR+CI SKG+ N   S
Sbjct: 75  E-NFELPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYS 133

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + SCC +C +  N     G+ F+ W  +H  G V+GG +    GCQP  I+PC HH 
Sbjct: 134 AENLVSCCHLCGFGCNGGFP-GAAFKYW--VHS-GIVSGGSFNSTQGCQPYEIAPCEHHV 189

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P     E    PK  C   C    Y   +  D H     Y +  +ED IK EI+ +GP
Sbjct: 190 PGPRPKCSEGGGTPK--CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 246

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y DF HYKSGVY+H     L    H+ +++GWG ENGTPYWL  N+W   WGD
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGG--HAIRVLGWGEENGTPYWLCANSWNTDWGD 304

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRG   C  E  I+AG PK
Sbjct: 305 NGLFKILRGSDHCGIESEISAGLPK 329


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 123/325 (37%), Positives = 172/325 (52%), Gaps = 16/325 (4%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKTYD 77
           Y  SD +I+ IN + N+W AGRNFP + S  +L++ + + + ++F  +  P+    KT+ 
Sbjct: 23  YPLSDEFINTINLKQNSWKAGRNFPRDTSFAHLKKIMGVIEDEHF--ATLPI----KTHK 76

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
            +  A +P+ FD R++WP+C T+  V D G+C +   F AV A +DR C  S G ++   
Sbjct: 77  IDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHF 136

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + SCC IC       CS G     W +    G V+GG Y    GC+P  I PC HH
Sbjct: 137 SAEDLLSCCPIC----GLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHH 192

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                +P   + K P  KC  +C +  Y   + QDK      Y V  +ED I+ E+  +G
Sbjct: 193 VPGNRMPCSGDTKTP--KCTKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNG 249

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P    F +Y D   YKSGVYKHT    L    H+ K++GWG EN   YWL+ N+W   WG
Sbjct: 250 PVEGAFTVYSDLLSYKSGVYKHTQGDALGG--HAVKILGWGVENDNKYWLIANSWNSDWG 307

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           D G  KILRG+  C  E  I  G+P
Sbjct: 308 DNGFFKILRGEDHCGIESSIVTGEP 332


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 179/337 (53%), Gaps = 17/337 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
            L  GE    SD +I+ +  +A TW  GRNF A+++E ++R+ +     + D     LP 
Sbjct: 15  ALTSGEPSLLSDEFIEVVRSKAKTWKVGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 71

Query: 72  DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
            R+     Y  ++   P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 72  KREVLGDLYMNSLDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 187

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             ISPC HH +    P       P  KC   C + +Y   + +DKH  + +Y V  N   
Sbjct: 188 YEISPCEHHVNGTRPPCANGSGTP--KCSHVCQS-SYTVDYAKDKHFGSKSYSVKRNVRE 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
           I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG       PYW
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGNEKIPYW 302

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 303 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 339


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 184/346 (53%), Gaps = 25/346 (7%)

Query: 4   ILVFLLGCTLVRGELYK----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IA 57
           I++ L      +G  +      S   ID +N  + +W AG NF A L   Y++     + 
Sbjct: 6   IVITLFAVFSAQGAYFPNHQPLSQDLIDYVNLVSTSWKAGTNF-AGLPVSYVKYLCGALE 64

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
           D  +F      LP     +  E ++ +P  FD+R++W  C +I  + D G+C +   F A
Sbjct: 65  DPNHFQ-----LP----IHVHEDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGA 115

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V + +DR CI S G+    +S E + +CC  C       C+ G + + W++    G VTG
Sbjct: 116 VESITDRICIHSNGKVKVHISAEDLMTCCTSC----GMGCNGGFLPQAWHYWVNNGIVTG 171

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G Y    GCQP  I  C HH   P   +C  +++P  KC  +C  P Y + F QDKH   
Sbjct: 172 GQYHSHKGCQPYEIPKCEHHVKGP-FKAC-GKELPTPKCSQKC-QPGYNKTFNQDKHFGK 228

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
            +Y + +N   I+KEI+ +GP  A F +Y DF  YKSGVY+HT+   L    H+ K++GW
Sbjct: 229 KSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGG--HAVKILGW 286

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GTEN TPYWL+ N+W P WGD+G  KI+RGK EC  E  I AG PK
Sbjct: 287 GTENNTPYWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMPK 332


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 126/334 (37%), Positives = 173/334 (51%), Gaps = 21/334 (6%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           +L R  L+  S   ++ IN+   TW AG NF  N+   Y+R+               L G
Sbjct: 16  SLARPHLHPLSSEMVNHINKLNTTWKAGHNF-HNVDYSYVRKLC----------GTMLKG 64

Query: 72  DRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
            +     +Y+  V  P  FDAR+QWPNC T+  + D G+C +   F A  A SDR CI S
Sbjct: 65  PKLPVMVQYAGDVKLPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
            G+ N  +S+E + +CC  C       C+ G     W+F    G V+GG Y    GC+P 
Sbjct: 125 NGKVNVEISSEDLLTCCDSC----GMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPY 180

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
           TI+PC HH +  + P C  +     +C  +C +  Y   + QDKH    +Y V  +E  I
Sbjct: 181 TIAPCEHHVNG-SRPPCTGEGGDTPECVRQCES-GYTPSYIQDKHYGKTSYSVPSDEQQI 238

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           + EI  +GP    F +Y+DF  YK+GVY+H S + +    H+ K++GWG ENGTPYWL  
Sbjct: 239 QTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGG--HAIKVLGWGEENGTPYWLCA 296

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+W   WGD G  KILRG   C  E  I AG PK
Sbjct: 297 NSWNTDWGDNGYFKILRGSDHCGIESEIVAGIPK 330


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 122/322 (37%), Positives = 168/322 (52%), Gaps = 14/322 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD +I+ IN + +TW AGRNFP +   +++++ +        + DR        ++ + 
Sbjct: 24  LSDDFINLINSKQDTWKAGRNFPVDTPVKHIQKLMGTL-----KDDRFTTLVTLQHEVDL 78

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
            A++P+ FD R++WPNC T+  V D G+C +   F AV A +DR C  S G ++   S E
Sbjct: 79  IASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 138

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC IC       C+ G     W +    G V+GG Y    GC+P  I PC HH   
Sbjct: 139 DLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPG 194

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             LP   + K PK  C  +C +  Y   + QDKH     Y V   ED IK E+  +GP  
Sbjct: 195 NRLPCSGDTKTPK--CIKKCED-NYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVE 251

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y D   YKSGVYKH +   L    H+ K++GWG ENG  YWL+ N+W   WGD G
Sbjct: 252 GAFTVYADLLSYKSGVYKHVAGDALGG--HAIKIMGWGVENGNKYWLIANSWNSDWGDNG 309

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             KILRG+  C  E  I AG+P
Sbjct: 310 FFKILRGEDHCGIESSIVAGEP 331


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 166/325 (51%), Gaps = 20/325 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            +   ID +N    TW AG NF    +  Y++          D ++  LP      + + 
Sbjct: 24  LTQEIIDYVNTIDTTWKAGWNF-QGATVSYVKGLC---GVIRDPNNHKLPLKLHELNAQ- 78

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FD+R QW NC TI  V D G+C +    AAV A SDR C+ SKG     +S E
Sbjct: 79  --DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAE 136

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH--G 198
            + SCCK C       C+ G     W +  + G VTGG YG   GCQP  I PC HH  G
Sbjct: 137 DLNSCCKSC----GNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEHHING 192

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           S P     E    P  +C   C +  Y   F +DKH     Y V      I+ EI+ +GP
Sbjct: 193 SRPACGKLE----PTPRCKKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGP 247

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y DF HYKSGVY+H S A+L    H+ K+IGWGTE  TPYWL+ N+W   WG+
Sbjct: 248 VEAAFTVYADFPHYKSGVYQHESGAELGG--HAVKMIGWGTEGSTPYWLIANSWNTDWGN 305

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRG+ EC  E  I AG+PK
Sbjct: 306 MGFFKILRGQDECGIERDIVAGEPK 330


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 121/328 (36%), Positives = 171/328 (52%), Gaps = 16/328 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD +++ + ++A TWT GRNF       + RQ +     + D  +  LP  R     E 
Sbjct: 23  LSDKFMEIVRQKAKTWTVGRNFHKLTPMSHYRQLM---GVHPDAHNYALPDKRMVLREEE 79

Query: 81  -----SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
                +  +P  FD+R+QWP+C TI  + D G+C +   F AV A SDR CI S G  N 
Sbjct: 80  LVGLGNNMIPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNF 139

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
             S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P  I+PC 
Sbjct: 140 HFSADDLVSCCHTCGF----GCNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCE 195

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +  T P CE +     +C  +C   +Y   +  DKH  +  Y +  N   I++EI+ 
Sbjct: 196 HHVNG-TRPPCEKEYGKTPRCQHKC-QASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMT 253

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           HGP    F +Y+D   YK GVY+H    +L    H+ ++IGWG E   PYWLV N+W   
Sbjct: 254 HGPVEGAFTVYEDLILYKDGVYEHVHGKELGG--HAIRIIGWGVEKDIPYWLVANSWNTD 311

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG+ G  KILRGK  C  E  I+AG PK
Sbjct: 312 WGNNGFFKILRGKDHCGIESSISAGLPK 339


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 129/328 (39%), Positives = 170/328 (51%), Gaps = 18/328 (5%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKT 75
           ++  SD +I  +  E  TW AGRNF  NL   YL+  +   AD+K+       +    K 
Sbjct: 18  IHPLSDKFIQLLQNEKTTWKAGRNFNKNLPMRYLKSLMGVHADSKFH------MSPVHKH 71

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
             PE    +P  FD+R  W  C TI  + D G+C +   F AV   +DR CI S G +N 
Sbjct: 72  KIPE-GFKIPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNF 130

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
             S E + SCC +C +  N     G+ F+ W  +H  G V+GG +    GCQP  I+PC 
Sbjct: 131 HYSAENLVSCCHLCGFGCNGGFP-GAAFQYW--VHS-GIVSGGAFNSTQGCQPYEIAPCE 186

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH S P     E    PK  CH  C +  Y   +  D H  +  Y VD +E  IK +I+ 
Sbjct: 187 HHVSGPRPKCAEGGSTPK--CHKNCES-NYVVDYESDLHHGSKHYSVDKDETQIKYDIMT 243

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP    F +Y DF HYKSGVY+HT    L    H+ +++GWG E+GTPYWL  N+W   
Sbjct: 244 NGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGG--HAIRVLGWGEEDGTPYWLCANSWNTD 301

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD G  KILRG   C  E  I+AG PK
Sbjct: 302 WGDNGYFKILRGSDHCGIESEISAGLPK 329


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 166/323 (51%), Gaps = 15/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD  I+ IN+   TW AGRNF  N    YL+  +     + D  +  LP     Y  + 
Sbjct: 27  LSDEMINFINKLNTTWKAGRNFDKNTPVSYLKGLM---GVHPDSKNYRLP---LFYHEDI 80

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FDARE+W +C +I  + D   C +   F A  A SDR CI SKG+    +S E
Sbjct: 81  PKDLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAE 140

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CC  C       C+ G     W F    G VTGG YG   GCQP    PC HH   
Sbjct: 141 DLLTCCDSC----GAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEHHTVG 196

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P LP+C   K P  +C   C    Y + + +DKH     Y +  +E  IK EI  +GP  
Sbjct: 197 P-LPNCTGIK-PTPQCVRDCRK-GYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVE 253

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y DF  YKSGVY+  S+  L    H+ +++GWGTENG PYWLV N+W   WGD+G
Sbjct: 254 ADFTVYADFVSYKSGVYQRHSDDALGG--HAIRILGWGTENGVPYWLVANSWNEDWGDKG 311

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KILRG  EC  E  I AG PK
Sbjct: 312 YFKILRGNDECGIEDDINAGIPK 334


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 172/337 (51%), Gaps = 14/337 (4%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + L    +    L   SD +I+ IN + ++W AGRNFP++   +++++ +        + 
Sbjct: 9   LLLCAFAVTADTLDPLSDDFINLINSKQDSWKAGRNFPSDTPFKHIKKLMGTL-----RD 63

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
           DR        ++ E  A++P+ FD R++WPNC T+  V D G+C +   F AV A +DR 
Sbjct: 64  DRFTTLVTMQHEVELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRI 123

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           C  S G ++   S E + SCC IC       C+ G     W +    G V+GG Y    G
Sbjct: 124 CTYSNGTKHFHFSAEDLLSCCPIC----GLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQG 179

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P  I PC HH     LP   + K PK  C   C +  Y   + QDKH     Y V   
Sbjct: 180 CRPYEIPPCEHHVPGNRLPCSGDTKTPK--CVKECES-GYKVPYKQDKHYGKHVYSVRGG 236

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           ED IK E+  +GP    F +Y D   YKSGVYKH +   L    H+ K++GWG ENG  Y
Sbjct: 237 EDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGG--HAIKIMGWGVENGNKY 294

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WL+ N+W   WGD G  KILRG+  C  E  I AG+P
Sbjct: 295 WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 119/325 (36%), Positives = 170/325 (52%), Gaps = 10/325 (3%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
            SD +++ + ++A TWT GRNF       + RQ +    DA Y+   D+ +    +    
Sbjct: 23  LSDRFMEIVRQKAKTWTVGRNFHKLTPMSHYRQLMGVHPDAHYYALPDKRMVLREEELVG 82

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
             +  +P  FD+R QWP+C TI  + D G+C +   F AV A SDR CI S G  N   S
Sbjct: 83  LGNDMIPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFS 142

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + SCC  C +     C+ G     W +  ++G V+GG YG   GC+P  I+PC HH 
Sbjct: 143 ADDLVSCCHTCGF----GCNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEHHV 198

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  T P CE +     +C  +C   +Y   +  DKH  +  Y +  N   I+ EI+ +GP
Sbjct: 199 NG-TRPPCEKEYGKTPRCQHKC-QASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGP 256

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y+D   YK GVY+H    +L    H+ ++IGWG E  TPYWL+ N+W   WG+
Sbjct: 257 VEGAFTVYEDLILYKDGVYEHVHGKELGG--HAIRIIGWGVEKDTPYWLIANSWNTDWGN 314

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRGK  C  E  I+AG PK
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPK 339


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 129/333 (38%), Positives = 174/333 (52%), Gaps = 17/333 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL-IADAKYFDQSDRPLP 70
            +V   ++  SD  ID IN+   TW AGRNF  N+   Y++  + +A  K      R LP
Sbjct: 18  VMVPPSVHPLSDEMIDFINKLNTTWKAGRNFDKNVPFSYIKGLMGVARNK-----TRRLP 72

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
               +  P+    +P+ FDAR+ W  C +I  + D  +C A   F AV A SDR CI +K
Sbjct: 73  TLMHSSIPD---NLPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTK 129

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G     +S + + +CC  CR      C  G     W F  ++G VTGG YG   GCQP +
Sbjct: 130 GSVQVNISAQDLLTCCDYCR----TGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYS 185

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
           I   + + +   LP   N   P   C   C   +YG+ + +DKH     Y +  +E  IK
Sbjct: 186 IH-TTRYTTTGLLPPPINDLSPMPPCKRECRK-SYGKKYSEDKHYGEKVYTLSGDEAQIK 243

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
            EI  +GP  A FA+Y DFY YKSGVY+  S  +  +  H+ +++GWGTENG PYWL  N
Sbjct: 244 TEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGS--HAIRILGWGTENGVPYWLAAN 301

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +W  HWGD+G  KI RG  EC  E  I AG PK
Sbjct: 302 SWTEHWGDKGYFKIRRGNNECGIEEDINAGIPK 334


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 134/354 (37%), Positives = 179/354 (50%), Gaps = 26/354 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKF-------SDAYIDQINREA--NTWTAGRNFP--ANLSEE 49
           ++ + V+ +     + EL+KF       S+  I+ +N      TW AG NFP   NL ++
Sbjct: 7   LLLLGVWTVSAIPPKDELFKFIRVFRPMSEEMINFLNMPGPGATWKAGNNFPFIRNLDDK 66

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
            L    +   K     + P P   K  +P     +P  FDAR QWPNC T+  V D G C
Sbjct: 67  LLYAKRLCGTKL----NNPNPLPVKNIEPLRD--LPTNFDARTQWPNCPTVKEVRDQGDC 120

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +   F AV A SDR CI S G+ N  +S E + +CC  C     + C  G     W + 
Sbjct: 121 GSCWAFGAVEAMSDRICIASNGKVNAEISAEDLLACCSSC----GEGCQGGFPAEAWRYY 176

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
            + G VTGG Y    GCQP  I  C HH      P C  ++    KC  +C    Y   +
Sbjct: 177 EREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKCSKKC-EANYNVTY 234

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
             DKH    +Y VD  E  I  EI+ +GP  A F +Y+DF  YKSGVY+H +  +L    
Sbjct: 235 KDDKHYGKNSYSVDSVEK-IMTEIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGG-- 291

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           H+ K++GWG +NGTPYW+V N+W P WG++G   ILRGK EC  E  I AG PK
Sbjct: 292 HAVKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDECGIESQIVAGLPK 345


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 120/322 (37%), Positives = 166/322 (51%), Gaps = 14/322 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            +D +I+ IN + N+W AGRNFP N    ++++         D     LP  +  +D + 
Sbjct: 29  LTDEFINLINSKQNSWKAGRNFPVNTPLTHIKKLT---GVLVDTHLSKLP--KAEHDMDL 83

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
            A++P+ FD R++WPNC T+  V D G+C +   F AV A +DR C  S G ++   S E
Sbjct: 84  IASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAE 143

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC +C       C+ G     W +    G V+GG Y    GC+P  I PC HH   
Sbjct: 144 DLLSCCPVCGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPG 199

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             +P   + K P  KCH  C   +Y   + +DK      Y V   ED IK E+  +GP  
Sbjct: 200 NRVPCNGDSKTP--KCHKTC-EASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVE 256

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y D  +YK+GVYKHT    L    H+ K++GWG ENG  Y L+ N+W   WGD G
Sbjct: 257 GAFTVYSDLLNYKNGVYKHTVGNALGG--HAIKILGWGVENGNKYRLIANSWNSDWGDNG 314

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             KILRG+  C  E  I AG+P
Sbjct: 315 FFKILRGEDHCGIESSIVAGEP 336


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 126/338 (37%), Positives = 174/338 (51%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNFL ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFLTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 187/348 (53%), Gaps = 25/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I I V LL       + +  S  YI++IN  A TW A +NFP N  +E + + L+   +
Sbjct: 5   VILISVILLSVYFTE-QAHFLSKDYINKINEVAKTWKAKQNFPENTPKEQIVR-LLGSKR 62

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               S  P+  + + Y    ++ VP+ FD+R +W  C TIGHV + G C +       GA
Sbjct: 63  LLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGA 120

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           F+DR C+ + G+ N  +S E +  CC  C +     C+ G   + W +  + G VTGGDY
Sbjct: 121 FADRLCVATNGEFNELISAEELTFCCHRCVF----GCNGGYPLKAWQYFKRHGVVTGGDY 176

Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
               GCQP  + PC      H S    P+  N K  K KC+   T       + ++ ++T
Sbjct: 177 DTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK-KCYGDDT-----IDYKKNHYKT 230

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
              Y++ +    ++K+ + +GP  A+F +YDDF +Y+SGVY+ T NA   +YL  H+ K+
Sbjct: 231 KDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNA---SYLGGHAVKM 285

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           IGWG E GTPYWL++N+WG  WGD+G  KILRG  EC  E    AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVP 333


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 181/344 (52%), Gaps = 18/344 (5%)

Query: 1   MIHILVFLLGCT-LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
            I I+  +  C  L +  L   SD  I  IN  A TW A R FPAN S+EY+   L+   
Sbjct: 4   FITIVCAIFVCVYLAKPTLQFLSDERIKYINEVAKTWKAERFFPANTSKEYIMG-LLGSR 62

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVP-DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            Y + S      + KTYDP Y      ++FD+RE W +C  IG + D G C +   F   
Sbjct: 63  GYTNYSSEV---EIKTYDPLYEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGTT 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
           GAF+DR C+ + G+ N  LS E VA CC+ C     K C  G   + W +   +G  TGG
Sbjct: 120 GAFADRLCVSTGGKFNELLSPEDVAFCCQNC----GKGCEGGYPIKAWQYFRTQGVPTGG 175

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
           DY  + GC P  I PC       T   C  + + +   + +C    YG    Q +++   
Sbjct: 176 DYDSKEGCAPYKIPPCFDQKGKNT---CAGKPLER---NHQCPKTCYGSTTVQKRYKVKN 229

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y V ++ + ++++++ +GP  A+F L+DD   YKSG+Y+ T  AK  +  HS K+IGWG
Sbjct: 230 EY-VLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLS-GHSIKIIGWG 287

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENG PYWL +N+W   WG++GT +I++G+ EC  E    AG P
Sbjct: 288 KENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 331


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LP +R 
Sbjct: 23  FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNV 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 132/350 (37%), Positives = 185/350 (52%), Gaps = 29/350 (8%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I I V LL   L   + +  S +Y+D+IN  A TW A +NFP  +++E + + L   +K
Sbjct: 5   IILISVVLLSVYLTE-QAHFLSKSYVDKINEVAKTWKAKQNFPEYMTKEQIVRLL--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                 + L    K  D EY   + +P+ FDAR QW +C TIG V + G C +       
Sbjct: 62  NLTSVPKSLI---KENDSEYINDSEIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTT 118

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
           GAF+DR CI + G  N  +S E +  CC  C +     C+ G+  + W +  + G VTGG
Sbjct: 119 GAFADRLCIATNGDFNELISAEELTFCCHRCGF----GCNGGNPLKAWQYFKRHGVVTGG 174

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV-PKLKCHTRCTNPT---YGRGFFQDKH 234
           +Y    GCQP  + PC          SC  Q   P  KC   C       Y +G ++ K+
Sbjct: 175 NYNTTDGCQPYKVPPCVKDEEGHN--SCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKN 232

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
              L      N D ++K+ +A+GP  A+F +YDDF +Y+SGVY+ T +AK   YL  H+ 
Sbjct: 233 AYYL------NIDTMQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAK---YLGGHAV 283

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           K+IGWG E+GTPYWL++N+WG  WG  G  KILRG  EC  E    AG P
Sbjct: 284 KMIGWGEEDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIEGSPTAGVP 333


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 187/348 (53%), Gaps = 25/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I I V LL       + +  S  YI++IN  A TW A +NFP N  +E + + L+   +
Sbjct: 5   VILISVVLLSVYFTE-QAHFLSKDYINKINEVAKTWKAKQNFPENTPKEQIVR-LLGSKR 62

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               S  P+  + + Y    ++ VP+ FD+R +W  C TIGHV + G C +       GA
Sbjct: 63  LLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGA 120

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           F+DR C+ + G+ N  +S E +  CC  C +     C+ G   + W +  + G VTGGDY
Sbjct: 121 FADRLCVATNGEFNELISAEELTFCCHRCGF----GCNGGYPLKAWQYFKRHGVVTGGDY 176

Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
               GCQP  + PC      H S    P+  N K  K KC+   T       + ++ ++T
Sbjct: 177 DTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK-KCYGDDT-----IDYKKNHYKT 230

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
              Y++ +    ++K+ + +GP  A+F +YDDF +Y+SGVY+ T NA   +YL  H+ K+
Sbjct: 231 KDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYESGVYQRTGNA---SYLGGHAVKM 285

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           IGWG E GTPYWL++N+WG  WGD+G  KILRG  EC  E    AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVP 333


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LP +R 
Sbjct: 6   FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 55

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N
Sbjct: 56  GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVN 113

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC
Sbjct: 114 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 170

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI 
Sbjct: 171 EHHVNGARPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 227

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W  
Sbjct: 228 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNA 285

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 286 DWGDNGFFKILRGENHCGIESEIVAGIPR 314


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 180/343 (52%), Gaps = 16/343 (4%)

Query: 4   ILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           IL  L+  T  LV   + K  +A  + +N + + W A    P +++ E +++ L+     
Sbjct: 5   ILAALVAVTAGLVIPLVPKTQEAITEYVNSKQSLWKA--EIPKDITIEQVKKRLMRT--- 59

Query: 62  FDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
             +   P   D +    + +  T+P  FDAR QWPNC +I ++ D   C +   FAA  A
Sbjct: 60  --EFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEA 117

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
             + GC+P +++PC       T PSC +       C  +CTN  Y   +  DKH  +  Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAY 233

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V      I+ EI+AHGP  A F +Y+DFY YK+GVY HT+  +L    H+ +++GWGT+
Sbjct: 234 AVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGG--HAIRILGWGTD 291

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           NGTPYWLV N+W  +WG+ G  +I+RG  EC  E+ +  G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  L+  T  R  L+    SD  ++ IN++  TWTAG NF  N+   Y+++     
Sbjct: 4   LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    R  +  +    +P  FDAREQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGGPKLP---QRAAFAAD--MILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI+S G+ N  +S E + +CC     +    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y +  NE  I  EI  +GP    F +Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 176/344 (51%), Gaps = 19/344 (5%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  L+  T  R  L+    SD  ++ IN++  TWTAG NF  N+   Y+++     
Sbjct: 4   LLATLSCLVLLTSARESLHFQPLSDELVNFINKQNTTWTAGHNF-YNVDLSYVKKLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    R  +  +    +P  FDAREQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGGPKLP---QRAAFAAD--MILPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI+S G+ N  +S E + +CC     +    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRICIRSNGRVNVEVSAEDMLTCCGD---ECGDGCNGGFPSGAWNFWTKKGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHFGCS 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y +  NE  I  EI  +GP    F +Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGG--HAIRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 175/343 (51%), Gaps = 22/343 (6%)

Query: 5   LVFLLGCTLVRGE--LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           L FLL  T        +  SD  ++ IN++  TW AG NF  N+   Y+++         
Sbjct: 8   LCFLLALTGAYNAPWFHPLSDELVNYINKQNTTWQAGHNF-HNVHLSYVKRLC------- 59

Query: 63  DQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
                 L G R     +++  V  P+ FDAR+QWPNC TI  + D G+C +   F AVGA
Sbjct: 60  ---GTYLGGPRLPQRIKFAEIVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGA 116

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI + G  N  +S E + SCC +   +    C+ G     W +  K+G V+GG Y
Sbjct: 117 MSDRVCIHTNGHVNVEVSAEDLLSCCGL---ECGDGCNGGYPSAAWKYWTKKGLVSGGLY 173

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
               GC+P +I PC HH +  T P C  +     KC   C  P Y   + +DKH    +Y
Sbjct: 174 DSHVGCRPYSIPPCEHHVNG-TRPQCTGEGGDTPKCSKTC-EPGYSPSYKEDKHFGYDSY 231

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V  NE  I  EI  +GP    F ++ DF  YK+GVYKH +   L    H+ +++GWG E
Sbjct: 232 SVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGG--HAIRILGWGKE 289

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           NG PYWLV N+W   WGD G  KI+RG+  C  E  I AG P+
Sbjct: 290 NGVPYWLVGNSWNVDWGDSGFFKIVRGEDHCGIESEIVAGIPR 332


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 176/345 (51%), Gaps = 16/345 (4%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           + +L  +        E +  SD +I+ +  +A TWT GRNF A +SE ++R  +     +
Sbjct: 8   VSLLALVAMTKATESEPHMLSDEFIELVKSKATTWTPGRNFDAAVSEHHIRALM---GVH 64

Query: 62  FDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
            D     LP  R+    +     +P+ FD+ + WPNC TI  + D G+C +   F AV A
Sbjct: 65  PDSHKFTLPEKRELLGADGEDKDLPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEA 124

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S    N   S + + +CC  C +     C+ G     W++   RG V+GG Y
Sbjct: 125 MSDRVCIHSNATVNFHFSADDLVTCCHTCGF----GCNGGFPGAAWSYWTTRGIVSGGSY 180

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
               GC+P  + PC HH   P  P C +   P   C  +C  P Y   + +DKH    +Y
Sbjct: 181 NSTEGCRPYEVEPCEHHVDGPR-PPCHSGSTP--HCKHQC-QPNYSVDYEKDKHFGASSY 236

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT- 299
            ++ N   I++EI+ +GP    F +Y+D   YK+GVY+H    +L    H+ ++IGWG  
Sbjct: 237 SINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGG--HAIRIIGWGVW 294

Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            E+  PYWL+ N+W   WGD G  +ILRGK  C  E  I+AG PK
Sbjct: 295 GESKVPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAGLPK 339


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 170/325 (52%), Gaps = 17/325 (5%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
           +  SD  I+ IN++  TW AGRNF  N+   YL++            + P   +R  +  
Sbjct: 24  HPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPNLP---ERVGFSE 76

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  +S
Sbjct: 77  DIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVS 134

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC HH 
Sbjct: 135 AEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHV 191

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI  +GP
Sbjct: 192 NGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 248

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W   WGD
Sbjct: 249 VEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGD 306

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRG+  C  E  I AG P+
Sbjct: 307 NGFFKILRGENHCGIESEIVAGIPR 331


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 128/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LP +R 
Sbjct: 23  FHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LP-ERV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  GFSEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNV 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 127/341 (37%), Positives = 174/341 (51%), Gaps = 25/341 (7%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKY 61
           L+FL        +     +  I+ +N    TWTAG NF   ++E Y++     L      
Sbjct: 6   LLFLFAGVGALPQHRGLFNEEINIVNSLKTTWTAGVNFGPEVTESYIKGLCGTLEEKENI 65

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGA 120
            +    P+            AT+PD +D RE+W   C +   + D G+C +   F AV A
Sbjct: 66  LEVKQIPV-----------IATLPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEA 114

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           F+DR CI+S G +N  +S E + +CC   C +     C+ G +   WNF    G+VTGG 
Sbjct: 115 FTDRICIQSNGAKNPHISAEDLLTCCGFWCGF----GCNGGRLGPAWNFFKYAGAVTGGQ 170

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y    GCQP  I  C HH S    P CE  + P  KC   C    Y   +  DKH+ +  
Sbjct: 171 YNSSEGCQPYEIPSCEHHTSGSKKP-CEGSE-PTPKCKRSCRE-GYNVSYSDDKHKVSSH 227

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y + ++E+ IK EI  +GP  A F +Y DF +YKSGVYK+T+   L    H+ K++GWG 
Sbjct: 228 YSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGG--HAIKILGWGV 285

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           EN  PYWLV N+W P WGD+G  KILRG  EC  E  + AG
Sbjct: 286 ENNVPYWLVANSWNPDWGDKGFFKILRGSNECGIEASVVAG 326


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 185/348 (53%), Gaps = 25/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I I V LL   L   + +  S  Y+++IN  A TW A +NFP N   E + + L+   +
Sbjct: 5   VILISVVLLSVYLTE-QAHFLSKEYVNKINEVAKTWKAKQNFPENTPREDIVR-LLGSKR 62

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               +  P+  +   Y    +  VP+ FD+R +W NC TIG V + G C +       GA
Sbjct: 63  LLGLNKSPIKENDILYVD--NGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGA 120

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           F+DR CI + G+ N  +S E +  CC  C +     C+ G+  + W +  + G VTGG+Y
Sbjct: 121 FADRLCIATDGEFNELISAEELTFCCHTCGF----GCNGGNPLKAWKYFKRHGVVTGGNY 176

Query: 181 GDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
               GCQP  + PC      H S    P+  N K  K KC+   T       + ++ ++T
Sbjct: 177 NTTDGCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSK-KCYGDET-----INYKKNHYKT 230

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
              Y++ +    ++K+ + +GP  A+F +YDDF  Y+SGVY+ T NA   +YL  H+ K+
Sbjct: 231 KDAYYLSNT--TMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENA---SYLGGHAVKM 285

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           IGWG E GTPYWL++N+WG  WGD+G  KILRG  EC  E    AG P
Sbjct: 286 IGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVESSCTAGVP 333


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 134/336 (39%), Positives = 179/336 (53%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINQINTNAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQTSPDMFKTH 74

Query: 77  DPEYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y++    +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     C  G   + W +  K G VTGGDY    GCQP  + P
Sbjct: 135 NELLSAEELAFCCHKCGF----GCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQELDFKEDHHWTRDAYYL--TYTT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+K+++A+GP  A+F +YDDF +YKSGVY  T NA   +YL  H+ KLIGWG E G PYW
Sbjct: 242 IQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENA---SYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KILRG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGGVP 334


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 129/349 (36%), Positives = 184/349 (52%), Gaps = 20/349 (5%)

Query: 2   IHILVFLLGCTLVRGELYKF-SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           I+ L  LL   L   +   F SDA+++++ R+A TW  GRNF  ++SE+YLR  +    +
Sbjct: 5   IYFLWLLLVTFLTINDAADFLSDAFMEKVRRKAKTWNLGRNFHESISEKYLRGLMGVHEE 64

Query: 61  YFDQSDRPLPGDRKTY---DPEYS-ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
            +     PLP  ++     D E S A +P  FDAR +W +C TI  + + G+C +    A
Sbjct: 65  SYKY---PLPDKQEVLGESDDEISLADLPVDFDARLRWTSCPTISEIREQGSCGSCWAIA 121

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
                SDR CI S G  N  LS   + SCC IC +    +C  G     W +  ++G V+
Sbjct: 122 TTSVMSDRLCIGSNGVMNFRLSGLDMLSCCAICGF----ACQGGYPGAAWAYWARKGLVS 177

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GGDYG + GCQP TI PC H G+  + P C       ++C   C  P+Y   F +DK+  
Sbjct: 178 GGDYGSQQGCQPYTIEPCDHSGNG-SRPVCTVGG--GVRCQHLC-EPSYKVDFQRDKNFA 233

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
           +  Y + ++   I+KEI+ +GP  A   +Y+DF  YK+GVY H    K+    H+ +++G
Sbjct: 234 SKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGP--HAVRILG 291

Query: 297 WGT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG       PYWLV N+WG  WGD G   I RG+  C  E  I AG PK
Sbjct: 292 WGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMAGLPK 340


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 125/338 (36%), Positives = 173/338 (51%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 122/323 (37%), Positives = 170/323 (52%), Gaps = 16/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD +ID IN    TW A RNF  ++    +++ L+   +  +    P     K+ + + 
Sbjct: 36  LSDDFIDHINSLNTTWKAHRNFGNDIPLREIKK-LMGVRRSLENFRLP----EKSME-DI 89

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD REQWP C T+  + D G+C +   F AV A SDR CI SKG+ +   S E
Sbjct: 90  DIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAE 149

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CC  C +     C+ G     W++    G V+GG Y    GCQP  I PC HH + 
Sbjct: 150 DLLTCCSSCGF----GCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNG 205

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P C     P+  C  RC    Y   + +D+H     Y V  +  AI+KE+L +GP  
Sbjct: 206 TRKP-CGEGDTPR--CVKRCEE-GYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAE 261

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A   +YDDF HY++GVY+H S   L    H+ +L+GWG E+GTPYWL+ N+W   WGD G
Sbjct: 262 AALTVYDDFLHYRTGVYQHVSGGALGG--HAVRLLGWGVEDGTPYWLLANSWNYDWGDNG 319

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG+ EC  E  I  G PK
Sbjct: 320 YFRILRGQDECGIESDINGGLPK 342


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 118/325 (36%), Positives = 173/325 (53%), Gaps = 12/325 (3%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDP 78
            SD +++ +  +A TWT GRN+  ++   + R+ +    DA  F   ++ L    +    
Sbjct: 29  LSDEFLEIVRSKAKTWTPGRNYDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLA 88

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           +  + VP+ FDAR+ WPNC TIG + D G+C +   F AV A SDR CI S    +   S
Sbjct: 89  D--SDVPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFS 146

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + SCC  C +     C+ G     W +  ++G V+GG YG   GC+P  I+PC HH 
Sbjct: 147 ADDLVSCCHTCGF----GCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHV 202

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  T P C+ +      C   C   +Y   +  DKH  + +Y V  N   I+KEI+ +GP
Sbjct: 203 NG-TRPPCDGEHGKTPSCRHECQK-SYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGP 260

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y+D   YK GVY+H    +L    H+ +++GWG EN TPYWL+ N+W   WG+
Sbjct: 261 VEGAFTVYEDLILYKDGVYQHVHGRELGG--HAIRILGWGVENKTPYWLIANSWNTDWGN 318

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  K+LRG+  C  E  IAAG PK
Sbjct: 319 NGFFKMLRGEDHCGIESAIAAGLPK 343


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 126/334 (37%), Positives = 172/334 (51%), Gaps = 21/334 (6%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           +L R  L   S   ++ IN+   TW AG NF  N+   Y+R+               L G
Sbjct: 16  SLARPHLQPLSSEMVNYINKLNTTWKAGHNF-HNVDYSYVRRLC----------GTMLKG 64

Query: 72  DRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
            +     +Y+    +P  FDAREQWP C T+  + D G+C +   F A  A SDR CI S
Sbjct: 65  PKLPIMVQYAGGLKLPAEFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
            G+ +  +S+E + +CC  C       C+ G     W+F  K G V+GG Y    GC+P 
Sbjct: 125 GGKISVEISSEDLLTCCDSC----GMGCNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPY 180

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
           TISPC HH +  + P C  +     +C +RC    Y   + QDKH    +Y V+ + + I
Sbjct: 181 TISPCEHHVNG-SRPPCTGEGGDTPECISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQI 238

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           + EI  +GP    F +Y+DF  YKSGVY+H S + L    H+ K++GWG E+G PYWL  
Sbjct: 239 QAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGG--HAIKVLGWGEEDGIPYWLCA 296

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+W   WGD G  KILRG   C  E  I AG PK
Sbjct: 297 NSWNTDWGDNGFFKILRGSNHCGIESEIVAGIPK 330


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 177/344 (51%), Gaps = 14/344 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++   +  L   LV   + K  +A  + +N + + W A    P +++ E +++ L+    
Sbjct: 4   VVFASLLALATGLVIPVVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRT-- 59

Query: 61  YFDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
              +   P   D +    +    T+PD FDAR QWP+C +I ++ D   C +   FAA  
Sbjct: 60  ---EFVAPHTPDVEVIKHDIQEDTIPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAE 116

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG 
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGS 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y  + GC+P +++PC       T P C         C  +CTN  Y   +  DKH  +  
Sbjct: 173 YVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTA 232

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V      I+ EILAHGP  A F +Y+DFY YKSGVY HT+  +L    H+ +++GWGT
Sbjct: 233 YAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGG--HAIRILGWGT 290

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +NGTPYWLV N+W  +WG+ G  +I+RG  EC  E+ +  G PK
Sbjct: 291 DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 165/324 (50%), Gaps = 14/324 (4%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
           +  SDA+I  IN + NTW AGRNFP      ++ + + A      Q D      +  +D 
Sbjct: 23  HPLSDAFIRLINSKQNTWRAGRNFPTTTPFAHINKLMGAL-----QDDNVAKMPKVEHDA 77

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           +  A++P+ FD R++WP+C T+  + D G+C +   F AV A +DR C  S G ++   S
Sbjct: 78  DLIASLPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFS 137

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
           +E + SCC IC       C+ G     W +    G V+GG+Y    GC+P  I PC HH 
Sbjct: 138 SEDLLSCCPICGL----GCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHV 193

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               +P   + K P  KC   C N  Y   + +DK      Y V   ED I+ E+  +GP
Sbjct: 194 PGNRMPCSGDTKTP--KCQKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGP 250

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y D   YKSGVYKH     L    H+ K++GWG EN   YWLV N+W   WGD
Sbjct: 251 VEGAFTVYADLLAYKSGVYKHIQGDALGG--HAIKILGWGVENDNKYWLVANSWNTDWGD 308

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  KILRG+  C  E  I AG+P
Sbjct: 309 NGFFKILRGENHCGIEGSIIAGEP 332


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 127/344 (36%), Positives = 176/344 (51%), Gaps = 14/344 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I   +  +   LV     K  +A  + +N + + W A    P  LS E +++ L+    
Sbjct: 4   VIFAALVAVATGLVIPVAPKTPEAITEYVNSKQSLWKA--EIPKGLSIEQVKKRLMRT-- 59

Query: 61  YFDQSDRPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
              +   P   D +  + +    T+P  FDAR QWPNC +I ++ D   C +   FAA  
Sbjct: 60  ---EFVAPHTPDVEVVEHDIQEDTIPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAE 116

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG 
Sbjct: 117 AASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCDGGYPINAWKYLVKSGFCTGGS 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y  + GC+P +++PC       T P C +       C  +CTN  Y   +  DKH  +  
Sbjct: 173 YEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTA 232

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V      I+ EI+AHGP  A F +Y+DFY YKSGVY HT+  +L    H+ +++GWGT
Sbjct: 233 YAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGG--HAIRILGWGT 290

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +NGTPYWLV N+W  +WG+ G  +I+RG  EC  E+ +  G PK
Sbjct: 291 DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 125/341 (36%), Positives = 174/341 (51%), Gaps = 21/341 (6%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYF 62
           + +L     R  +   SD  ++ IN+   TW AG NF  N    Y+++     +  AK  
Sbjct: 11  LVVLTSAKSRLSIPPLSDEMVNHINKLNTTWQAGHNF-LNADMSYVKKLCGTFMGGAKLL 69

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            Q  R +  D        +  +P+ FDAREQWPNC TI  + D G+C +   F AV A S
Sbjct: 70  PQ--RMILAD--------NMKLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAIS 119

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ S G  N  +S E + SCC     +    C+ G     WNF  K+G V+GG Y  
Sbjct: 120 DRICVHSNGNANVEVSAEDLLSCCG---SECGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 176

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             GC+P +I PC HH +  + P+C  ++     C  +C    Y   +  DK+  + +Y V
Sbjct: 177 HVGCRPYSIPPCEHHVNG-SRPACTGEEGDTPTCRKKCEE-GYSTQYKDDKNYGSTSYSV 234

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
             +E  I  EI  +GP    F++Y+DF HYKSGVY+H +   L    H+ +++GWG ENG
Sbjct: 235 PSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGG--HAIRILGWGVENG 292

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             YWL  N+W   WGD G  K LRGK  C  E  I AG P+
Sbjct: 293 IRYWLAANSWNIDWGDNGFFKFLRGKNHCGIESEIIAGIPR 333


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 123/323 (38%), Positives = 164/323 (50%), Gaps = 16/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S  YI  IN  +  W AGRNF    S  YLR  +     + D    PLP    T     
Sbjct: 26  LSSEYIHSINEASEIWKAGRNFHPETSSNYLRSLMGVLPNHKDHLPPPLPSLLGT----- 80

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P  FDARE WPNC +I  + D G+C +   F A  A SDR CI +   +N  +S E
Sbjct: 81  -EALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHT--NKNVNISAE 137

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C +     C+ G     W +   +G V+GG YG  +GCQP  I PC HH + 
Sbjct: 138 NLLSCCYSCGF----GCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNG 193

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P  E  + PK  CH  C N  Y   + +D      +Y +  +   I+ EI+ +GP  
Sbjct: 194 TRQPCAEGGRTPK--CHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVE 251

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F++Y DF + KSGVY+H   + L    H+ +++GWG E GTPYWLV N+W   WGD+G
Sbjct: 252 AAFSVYSDFMNDKSGVYRHVKGSLLGG--HAIRILGWGVEKGTPYWLVANSWNTDWGDKG 309

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
           T KILRG   C  E  +  G P+
Sbjct: 310 TFKILRGSDHCGIEGSVVTGLPR 332


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 139/350 (39%), Positives = 176/350 (50%), Gaps = 30/350 (8%)

Query: 5   LVFLLGCTLVR----GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           L FLL   L       E Y   + YI+QIN  A TW AG NF   LS E   + L   +K
Sbjct: 1   LAFLLSVVLFSVYQTEEAYFLEEDYINQINENAKTWKAGINFDPKLSVENFVKLL--GSK 58

Query: 61  YFDQSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               + +  P   KT D  Y +  +P  FDAR++W  C TIG V D G C +   F    
Sbjct: 59  GVQAAKKASPDMFKTDDKTYENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSS 118

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR CI + G  N  LS E +  CC  C Y     C  G   + W    K G VTGG+
Sbjct: 119 AFADRLCIATDGDFNELLSAEELTFCCHTCGY----GCHGGYPIKAWERFKKHGLVTGGN 174

Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKH 234
           Y    GCQP  +SPC    +G+     +C  +  P  K H RCT   YG   R F +D  
Sbjct: 175 YDSSEGCQPYRVSPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGDQDRDFKEDHR 227

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
            T   Y++      I+K+++ +GP  A++ +YDDF  YKSGVY  T NA    YL  H+ 
Sbjct: 228 FTRDAYYL--TYGTIQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENA---TYLGGHAV 282

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           KLIGWG E G PYWL++N+W   WGDRG  KI RG  EC  +     G P
Sbjct: 283 KLIGWGEEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 184/348 (52%), Gaps = 18/348 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--D 58
           +  +L F+        + +  S+ +++ +  +A TWT GRNF A++SE ++R  +    D
Sbjct: 3   LFLLLAFVAIAAATEDDPHMLSEEFMELVRGKAKTWTVGRNFDASVSEHHIRGLMGVHPD 62

Query: 59  AKYFDQSDRP-LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
           A  F   ++  + G+    D      +P+ FDAR  WP+C TIG + D G+C +   F A
Sbjct: 63  AHKFTLPEKSQVLGNLMEAD---GGDLPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGA 119

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S    N   S + + SCC  C +     C+ G     W++   +G V+G
Sbjct: 120 VEAMSDRVCIHSNATVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTHKGIVSG 175

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G YG + GC+P  + PC HH +  T P C +   P  +C  +C +  Y   + +DKH   
Sbjct: 176 GSYGSKEGCRPYEVEPCEHHVNG-TRPPCHSGSTP--RCMHKCES-GYSVDYAKDKHFGA 231

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y V+ N   I++EI+ +GP    F +Y+D   YK+GVY+H    +L    H+ +++GW
Sbjct: 232 KAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGG--HAIRILGW 289

Query: 298 GT--ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G   +N  PYWL+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 290 GVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISAGLPK 337


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 137/352 (38%), Positives = 179/352 (50%), Gaps = 28/352 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L       + Y   + YI+QIN  A TW AG NF   LS +   + L   +K
Sbjct: 1   LVILLSVVLFSVYRTEQAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSK 58

Query: 61  YFDQSDRPLPGDRKTYDPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
               + +  P   KT+D  Y   S  +P  FDAR++W  C TIG V D G C +   F  
Sbjct: 59  GVQAAKQASPDMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGT 118

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR CI + G+ N  LS E +A CC  C +     CS G   R W    K G VTG
Sbjct: 119 SSAFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTG 174

Query: 178 GDYGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQD 232
           G+Y    GCQP  + PC    +G+     +C  +  P  K H RCT   YG     F +D
Sbjct: 175 GNYDSGEGCQPYRVPPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKED 227

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--H 290
            H T   Y++      I+ +ILA+GP  A+F +YDDF  YKSGVY    NA    YL  H
Sbjct: 228 HHYTRDAYYL--TYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGH 282

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KLIGWG E G PYWL++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 283 AVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 180/329 (54%), Gaps = 18/329 (5%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRP-LPGDRKT 75
           +  SD +I+ +  +A TWT GRNF A++SE ++R  +    DA  F   ++  + G+   
Sbjct: 25  HMLSDEFIELVRSKAKTWTPGRNFDASVSEGHIRGLMGVHPDAHKFTLPEKSQVLGNLVG 84

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            D +    +P+ FDAR  WPNC TIG + D G+C +   F AV A SDR CI S G  N 
Sbjct: 85  DDGD---DLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNF 141

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
             S E + SCC  C +     C+ G     W++   +G V+GG Y    GC+P  I PC 
Sbjct: 142 HFSAEDLVSCCHTCGF----GCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCE 197

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +  T P C+N + P   C  +C + +Y   + +DKH  + +Y +  N   I++EI+ 
Sbjct: 198 HHVNG-TRPPCKNGRTP--SCKHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMT 253

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWG 313
           +GP    F +Y+D   YKSGVYKH    +L    H+ +++GWG   ++  PYWL+ N+W 
Sbjct: 254 NGPVEGAFTVYEDLILYKSGVYKHVHGKELGG--HAIRILGWGVWGDSKVPYWLIGNSWN 311

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WGD G  +I+RG+  C  E  I+AG P
Sbjct: 312 TDWGDNGFFRIVRGEDHCGIESAISAGLP 340


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 168/343 (48%), Gaps = 15/343 (4%)

Query: 1   MIHILVFLLGCTLVRGEL-YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           +I  +  +  C +   E+ +  SD +ID IN + NTW AGRNF    + + +++ + A  
Sbjct: 3   LIRAICLVFLCGIAVSEIPHPLSDKFIDLINSKQNTWIAGRNFDIGRTLKSIKKLMGALE 62

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
             +      +  D  T +      +P+ FD R++WPNC T+  + D G+C +   F AV 
Sbjct: 63  DKYLHKLYTVEHDDDTIN-----NLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVE 117

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A +DR C  S G ++   S E + SCC +C       C+ G     W +    G V+GG+
Sbjct: 118 AMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGL----GCNGGIPSFAWEYWKHFGIVSGGN 173

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y    GC P  I PC HH     +P       PK  CH  C    Y   +  DK      
Sbjct: 174 YNSSQGCLPYEIPPCEHHVPGNRIPCNGETSTPK--CHRSCRK-EYTNSYKSDKKYGKHV 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V   E+ IK EI  +GP    F +Y D   YKSGVYKHT    L    H+ K++GWG 
Sbjct: 231 YSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGG--HAIKIMGWGV 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ENG  YWL+ N+W   WGD G  KILRG+  C  E  I AG+P
Sbjct: 289 ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 18/324 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF--LIADAKYFDQSDRPLPGDRKTYDP 78
            SD +I+ IN + N+W AGRNFP +    ++++   ++ D      S       +  ++ 
Sbjct: 26  LSDDFINLINTKQNSWKAGRNFPEHTPFAHIKKLAGVLPDYHLSKLS-------KVEHED 78

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E  A++P+ FD R++WPNC T+  V D G+C +   F AV A +DR C  S G Q+   S
Sbjct: 79  ELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFS 138

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + SCC IC       C+ G     W +    G V+GG Y    GC+P  I PC HH 
Sbjct: 139 AEDLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHV 194

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               +P   + K P  KC   C +  Y   + +DK      + V   ED I+ E+  +GP
Sbjct: 195 PGNRMPCNGDSKTP--KCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGP 251

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y D  +YK+GVYKHT    L    H+ K++GWG ENG  YWL+ N+W   WGD
Sbjct: 252 VEGAFTVYSDLLNYKTGVYKHTIGDALGG--HAVKILGWGVENGNKYWLIANSWNSDWGD 309

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  KILRG+  C  E  I AG+P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 179/345 (51%), Gaps = 18/345 (5%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  L+  T  +  LY    SD  ++ +N+   TW AG NF  ++   Y+++     
Sbjct: 4   LLATLCCLVVLTSAQSRLYFKPLSDELVNHVNKLNTTWQAGHNF-YDVDMSYVKRLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
               +    P    ++ +  E    +P+ FDARE WPNC TI  + D G+C +   F AV
Sbjct: 60  GTLLNGPKLP----QRVHLAE-EMDLPENFDARENWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI + G  N  +S E + +CC +   +    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRVCIHTNGNVNVEVSAEDLLTCCHM---ECGDGCNGGFPAGAWNFWTKKGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +  + P C+ +     KC   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPPCKGEGGETPKCSKTC-EPGYSPSYKEDKHYGYS 229

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y V  +E  I  EI  +GP    F++Y DF  YKSGVY+H +  ++    H+ +++GWG
Sbjct: 230 SYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGG--HAIRILGWG 287

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            ENGTPYWL  N+W   WGD G  KILRG+  C  E  I AG P+
Sbjct: 288 VENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 332


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 124/339 (36%), Positives = 172/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++   A        
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGAFL------ 63

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 64  GGPKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 127/325 (39%), Positives = 165/325 (50%), Gaps = 20/325 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            +   ID +N    TW AG NF    +  Y++          D ++  LP      + + 
Sbjct: 24  LTQEIIDYVNSIDTTWKAGWNF-QGATVSYVKGLC---GVIRDPNNHKLPLKLHELNAQ- 78

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FD+R QW NC TI  V D G+C +    AA  A SDR C+ S G+    LS+E
Sbjct: 79  --DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSE 136

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH--G 198
            + +CC+ C       C  G     W +  + G VTGG YG   GCQP  I+PC HH  G
Sbjct: 137 NLMACCETC----GMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEHHING 192

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           S P     E    P  +C   C +  Y   F +DKH     Y V      I+ EI+ +GP
Sbjct: 193 SRPACGKIE----PTPRCKKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGP 247

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y DF HYKSGVY+H S A+L    H+ K+IGWG E  TPYWL+ N+W   WGD
Sbjct: 248 VEAAFTVYADFPHYKSGVYQHESGAELGG--HAVKMIGWGMEGSTPYWLIANSWNSDWGD 305

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KILRG+ EC  E  I AG+P+
Sbjct: 306 MGFFKILRGQDECGIERDIVAGEPR 330


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 179/350 (51%), Gaps = 30/350 (8%)

Query: 5   LVFLLGCTLVR----GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           L FLL   L+      + Y   + YI+QIN  A TW AG NF   LS E   + L   +K
Sbjct: 1   LAFLLSVVLLSVYQTEQAYFLEEDYINQINENAKTWKAGINFDPKLSIENFVKLL--GSK 58

Query: 61  YFDQSDRPLPGDRKTYDPEY-SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               + +  P   KT D  Y +  +P  FDAR++W  C TIG V D G C +   F    
Sbjct: 59  GVQAAKKASPDMFKTIDKAYENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSS 118

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR CI + G+ N  LS E +  CC  C +     C  G   + W    K G VTGGD
Sbjct: 119 AFADRLCIATNGEFNELLSAEELTFCCHKCGF----GCHGGYPIKAWERFQKHGLVTGGD 174

Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
           Y    GCQP  +SPC    +G+     +C  +  P  K H RCT   YG     F +D H
Sbjct: 175 YDSGEGCQPYRVSPCPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKKDHH 227

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
            T   Y++      I+++++A+GP  A++ +YDDF  YKSGVY  T NA    YL  H+ 
Sbjct: 228 FTRDAYYL--TFGIIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENA---TYLGGHAV 282

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           KLIGWG E G PYWL++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 283 KLIGWGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGIDNSTTGGVP 332


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 124/327 (37%), Positives = 171/327 (52%), Gaps = 18/327 (5%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
            +  SD  ++ +N+   TW AGRNF  N+   Y+++       Y      P    R  + 
Sbjct: 23  FHPLSDELVNYVNKLNTTWQAGRNF-HNVDISYVKRLC---GTYLGGPRLP---QRVQFA 75

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
            +    +P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI + G  N  +
Sbjct: 76  EDLD--LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEV 133

Query: 138 STEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           S E + SCC  +C     + C+ G     W +  ++G V+GG YG   GC+P +I PC H
Sbjct: 134 SAEDLLSCCGPLC----GEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH 189

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           H +  T P C  +     KC   C  P Y   + +DK+    +Y V   E  I  EI  +
Sbjct: 190 HVNG-TRPKCTGEGGDTPKCSKTC-EPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKN 247

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A F+++ DF  YKSGVYKH +   L    H+ +++GWG ENG PYWLV N+W   W
Sbjct: 248 GPVEAAFSVFSDFLTYKSGVYKHVAGEVLGG--HAIRILGWGKENGVPYWLVGNSWNVDW 305

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
           GD G  KILRG+  C  E  + AG P+
Sbjct: 306 GDNGFFKILRGEDHCGIESEVVAGIPR 332


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 119/324 (36%), Positives = 167/324 (51%), Gaps = 18/324 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF--LIADAKYFDQSDRPLPGDRKTYDP 78
            SD +I+ IN + N+W AGRNFP +    ++++   ++ D      S       +  ++ 
Sbjct: 26  LSDDFINLINTKQNSWKAGRNFPEHTPFAHIKRLAGVLPDYHLSKLS-------KVEHED 78

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E  A++P+ FD R++WPNC T+  V D G+C +   F AV A +DR C  S G Q+   S
Sbjct: 79  ELIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFS 138

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + SCC IC       C+ G     W +    G V+GG Y    GC+P  I PC HH 
Sbjct: 139 AEDLLSCCPICGL----GCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHV 194

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               +P   + K P  KC   C +  Y   + +DK      + V   ED I+ E+  +GP
Sbjct: 195 PGNRMPCNGDSKTP--KCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGP 251

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y D  +YK+GVYKHT    L    H+ K++GWG ENG  YWL+ N+W   WGD
Sbjct: 252 VEGAFTVYSDLLNYKTGVYKHTIGDALGG--HAVKILGWGVENGNKYWLIANSWNSDWGD 309

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  KILRG+  C  E  I AG+P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 123/340 (36%), Positives = 173/340 (50%), Gaps = 21/340 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           ++  L  +  R  L   S   ++ IN+   TWTAG NF  ++   Y+++           
Sbjct: 9   VISALSVSWARPRLAPLSHEMVNFINKANTTWTAGHNF-RDVDYSYVKRLC--------- 58

Query: 65  SDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
               L G +     +Y+    +P  FDAREQWPNC T+  + D G+C +   F A  A S
Sbjct: 59  -GTFLKGPKLPVMVQYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI+S  + +  +S++ + +CC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIQSNAKVSVEISSQDLLTCCDSC----GMGCNGGYPSAAWDFWTTDGLVTGGLYNS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             GC+P TI PC HH +  + P C  +      C  +C  P Y   + +DKH    +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPNCDMKC-EPGYSPLYKEDKHFGKTSYSV 231

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
             N++ I  E+  +GP  A F +Y+DF  YKSGVY+H S + L    H+ K++GWG ENG
Sbjct: 232 PSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGG--HAIKILGWGEENG 289

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            PYWL  N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 129/347 (37%), Positives = 179/347 (51%), Gaps = 19/347 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L       + Y   ++YI+ IN  A TWTAG NF  +  E+ L + L   +K
Sbjct: 4   LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDLIKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             + +        KT+D  Y+    +P  FDAR +W +C TIG V D G C +   F   
Sbjct: 62  GVEAAKNASAHMFKTHDVAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTS 121

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ + G  N  LS E +  CC  C       C+ G   + W +    G VTGG
Sbjct: 122 SAFADRLCVATDGDFNELLSAEELTFCCHTC----GNGCNGGYPIKAWKYFSSHGLVTGG 177

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRT 236
           +Y    GC+P  + PC  +    +  SC  Q + K   + RCT   YG     + D HR 
Sbjct: 178 NYKSGEGCEPYRVPPCPRNEDGTS--SCAGQPIEK---NHRCTRMCYGNQDLDYNDDHRF 232

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLI 295
           T  Y+      +I+K+++ +GP  A+F +YDDFY YKSGVY+ T NA KL    H+ KLI
Sbjct: 233 TRDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGG--HAVKLI 289

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E G PYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 290 GWGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 336


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 126/326 (38%), Positives = 165/326 (50%), Gaps = 18/326 (5%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           L   S   +  IN    TWTAG+NF  N+   Y++       K         P   +   
Sbjct: 22  LPLLSPEMVQYINNADTTWTAGQNF-HNVDISYVKSLCGTLLKG--------PRLPELVQ 72

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
            +   ++PD FDAR QWPNC TI  + D G+C +   F A  A SDR CI S G+ +  +
Sbjct: 73  SDEDMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEI 132

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + SCC  C       C  G     W++  + G VTGG YG   GC+P +I+PC HH
Sbjct: 133 SAEDLLSCCDAC----GMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEHH 188

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            +    P       PK  C + C N  Y   + +DK     TY V   E  I  E+  +G
Sbjct: 189 VNGTRPPCTGEGDTPK--CVSEC-NAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNG 245

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A F++Y+DF  YK+GVY+H +   L    H+ K++GWG EN TPYWLV N+W   WG
Sbjct: 246 PVEAAFSVYEDFLLYKTGVYQHVTGQMLGG--HAIKILGWGKENNTPYWLVANSWNTDWG 303

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D G  KILRGK EC  E  I AG P+
Sbjct: 304 DNGFFKILRGKDECGIESEIVAGIPR 329


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 181/349 (51%), Gaps = 22/349 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L       + Y    +YID IN  A TWTAG NF  ++ E++  + L   +K
Sbjct: 4   LVILLSVVLFSVYQTEQAYFLEKSYIDMINEVATTWTAGVNFDPSIPEDHFIKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
             + + +    + KT D  Y      +P  FDAR++W +C TIG V D G C +   F  
Sbjct: 62  GVESAKQASAHEFKTNDVAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGT 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR C+ + G  N  LS E +  CC  C +     C  G   + W +  K G VTG
Sbjct: 122 SSAFADRLCVATDGDFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKYFSKHGLVTG 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHR 235
           G+Y    GC+P  + PC          +C  + + K   + RCT   YG     + D HR
Sbjct: 178 GNYKSGEGCEPYRVPPCPRDDKGNN--TCAGKPIEK---NHRCTRMCYGDQDLDYNDDHR 232

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGK 293
            T  ++      +I+K+++ +GP  A+F +YDDF  YKSGVY+ T NA   +YL  H+ K
Sbjct: 233 FTRDFYYL-TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENA---SYLGGHAVK 288

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           LIGWG E GTPYWL++N+W   WGD+G  KI RG  EC  +    AG P
Sbjct: 289 LIGWGVEEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECGIDNSTTAGVP 337


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 134/368 (36%), Positives = 185/368 (50%), Gaps = 45/368 (12%)

Query: 1   MIHILVFLLGCTLV-----RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
           M   L+F     +V        L+  SD +I+ IN   NTW AGRNFP     +Y+   +
Sbjct: 1   MFRTLLFTCAICVVCVVASNVHLHPLSDEFIESINFNQNTWIAGRNFPKKTPLKYIYNLM 60

Query: 56  --IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
             ++D++  +   R     RKT         P++FDARE W NC T+  + D G C +  
Sbjct: 61  GTLSDSRMDNLPQRNYTFSRKT-------KYPNQFDAREHWKNCPTLKDIRDQGGCGSCW 113

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
             AAV A +DR CI SKG+++   S + V SCC  C       C  G + R W +  K G
Sbjct: 114 AVAAVSAMTDRMCILSKGKEHFYFSIKDVLSCCGYC----GNGCEGGVLTRAWIYYKKIG 169

Query: 174 SVTGGDYGDRTGCQPSTISPCSHH--------GSAPTLPSCENQKVPKL----------- 214
            V+GG Y  + GCQP TI PC+H          + P  P C+N  +P +           
Sbjct: 170 IVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN--IPVIPEQCKYIPITP 227

Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
           +C  +C N  Y   + +DKHR    Y V  +E  I KEI  +GP T+ F +Y+DF +YK 
Sbjct: 228 ECEKKC-NKNYKVCYSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTSYFTVYEDFLNYKE 284

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR-GKYECAF 333
           G+Y +TS  KL   LHS K+IGWG E G  YWL  N++   WGD+G  KI+R G   C  
Sbjct: 285 GIYNYTSGQKLG--LHSVKIIGWGEERGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGI 342

Query: 334 EYLIAAGK 341
              + AG+
Sbjct: 343 SDNVVAGR 350


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 129/345 (37%), Positives = 180/345 (52%), Gaps = 23/345 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           ++ LLG   V  + Y   + +ID IN +A TW AG NF  N  +EY+ + L   +K    
Sbjct: 11  VILLLG-VCVTEQAYFLEEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLL--GSKGVQV 67

Query: 65  SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
             +      KT D  Y      +P +FDAR++W  C TIG V D G C +    A   AF
Sbjct: 68  PHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAF 127

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR CI +  + N  LS E +  CC +C +    +C  G   + W++  + G VTGGDY 
Sbjct: 128 ADRLCIATNYEFNELLSAEELTFCCHLCGF----ACHGGYPIKAWSYFRRHGIVTGGDYQ 183

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLT 239
              GC P  + PC          +C  Q + K   H RCT   YG     + D HR T  
Sbjct: 184 SGEGCAPYRVPPCFSEEDGNN--TCRGQPMEK---HHRCTRMCYGDQEIDYDDDHRFTRD 238

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
           Y+      +I+K+++ +GP  A+  +YDDF  YKSGVY+ + NA    YL  H+ KLIGW
Sbjct: 239 YYYL-TYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENA---TYLGGHAVKLIGW 294

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G E+G PYWL++N+W   WGD+G  KI RG  EC+ +  + AG P
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 172/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 174/343 (50%), Gaps = 12/343 (3%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++   +  L   LV   + K  +A  + +N + + W A    P +++ E +++ L+    
Sbjct: 4   VVFASLVALATGLVIPIVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRTEF 61

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               S    P            T+P  FDAR QWP+C +I ++ D   C +   FAA  A
Sbjct: 62  VAPHS----PDAEFVKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEA 117

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
             + GC+P +++PC       T P+C         C  +CTN  Y   +  DKH  +  Y
Sbjct: 174 EAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAY 233

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V      I+ EI+AHGP  A F +Y+DFY YKSGVY HT+  +L    H+ +++GWGT+
Sbjct: 234 AVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGG--HAIRILGWGTD 291

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           NGTPYWLV N+W  +WG+ G  +I+RG  EC  E+ +  G PK
Sbjct: 292 NGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 172/340 (50%), Gaps = 21/340 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           ++  L  +  R  L   S   ++ IN+   TWTAG NF  ++   Y+++           
Sbjct: 9   VISALSVSWARPRLPPLSHEMVNFINKANTTWTAGHNF-RDVDYSYVKKLC--------- 58

Query: 65  SDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
               L G +     +Y+    +P  FDAREQWPNC T+  + D G+C +   F A  A S
Sbjct: 59  -GTFLKGPKLPVMVQYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S  + +  +S++ + +CC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIHSDAKVSVEISSQDLLTCCDSC----GMGCNGGYPSAAWDFWATEGLVTGGLYNS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             GC+P TI PC HH +  + P C  +      C  +C  P Y   + QDKH    +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCSGEGGDTPNCDMKC-EPGYSPSYKQDKHFGKTSYSV 231

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
             N+++I  E+  +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG ENG
Sbjct: 232 PSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGG--HAIKILGWGEENG 289

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            PYWL  N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 123/330 (37%), Positives = 171/330 (51%), Gaps = 19/330 (5%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  SD  ++ +N++  TW AG NF  N+   YL++       +      P P  R 
Sbjct: 11  RPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDLSYLKRLC---GTFLGG---PKPPQRV 63

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  + +  +P+ FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +
Sbjct: 64  KFAEDLN--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 121

Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
             +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I P
Sbjct: 122 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPP 177

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +    P       PK  C   C  P Y   + QDKH    +Y V +NE  I  EI
Sbjct: 178 CEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEI 234

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W 
Sbjct: 235 YKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVGNSWN 292

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 322


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 124/324 (38%), Positives = 175/324 (54%), Gaps = 16/324 (4%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           EL   SD +++ +  +  TW AGRNF  ++S+++L+  L    K  D    PL    K  
Sbjct: 16  ELDPLSDEFLELLQSKQMTWKAGRNFAKDISKDFLKS-LNCVRKNPDIPKLPL----KNV 70

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            P  +  +P  FDAREQWP+C  I  + D G C +    +A    +DR CI ++G  +  
Sbjct: 71  TP--TKEIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFR 128

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
            S+E VA+CC  C      +C  G     +     +G V+GG +    GCQP ++  C H
Sbjct: 129 FSSENVAACCTEC----GNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECEH 184

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           H   P  P CE   +P+L C   C +  YG+ + +D       Y +  +   I++EI+ +
Sbjct: 185 HIEGPR-PPCEGD-MPELVCSETC-HEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTN 241

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP TA FA+YDDF  YKSGVY+H +   L+ Y H+ ++IGWG E GTPYWLV N+W   W
Sbjct: 242 GPVTAAFAVYDDFLSYKSGVYQHETGL-LDGY-HAVRVIGWGEEEGTPYWLVANSWNTDW 299

Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
           GD G  KILRG  EC FE  +AA 
Sbjct: 300 GDNGLFKILRGSDECEFEGDMAAA 323


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 130/347 (37%), Positives = 177/347 (51%), Gaps = 29/347 (8%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---L 55
           L+  L C +V      R  L   SD  +D +N+   TW AG NF  ++   YLR+    +
Sbjct: 4   LLATLSCLVVLTNAQSRPPLQLLSDELVDYVNKRNTTWKAGHNF-YHVEPSYLRRLCGTI 62

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           +   K        LP  R ++  +    +P+ FDARE WPNC TI  + D G+C +   F
Sbjct: 63  LGGPK--------LP-QRVSFAED--MVLPENFDAREHWPNCPTIKEIRDQGSCGSCWAF 111

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A SDR CI + G  N  +S E + +CC     D    C+ G     WNF  K+G V
Sbjct: 112 GAVEAISDRICILTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLV 168

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           +GG Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH 
Sbjct: 169 SGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHY 225

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V ++E  I  EI  +GP  A F+++ DF  YKSGVY+H +   +    H+ +++
Sbjct: 226 GCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGG--HAVRIL 283

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG EN TPYWLV N+W   WGD G  KILRG+  C  E  + AG P
Sbjct: 284 GWGVENDTPYWLVGNSWNTDWGDHGFFKILRGRDHCGIESEVVAGIP 330


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 185/347 (53%), Gaps = 21/347 (6%)

Query: 1   MIHILVFL--LGCTLVRGELYKFSDAY-IDQINREANTWTAGRNFPANLS-EEYLRQFLI 56
           M ++++F      T++   L++  D + I+ IN   + WTAG   P+  + +++ R+ + 
Sbjct: 1   MEYLILFFAYFSTTVLASNLHQLLDFHEINHINSIQSLWTAG---PSKFAFQKFQRRLMR 57

Query: 57  ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
           ++     +S+  L  DRK  +     T+P+ +D R+ W  C ++ ++ D   C +    A
Sbjct: 58  SEHVKSHKSEDIL--DRKVLE-----TIPESYDVRDHWSKCISVDNIRDQSDCGSCWAVA 110

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           A    SDR CI S G  N  +S E + SCC  C       C  G   + W +  K+G V+
Sbjct: 111 AAETISDRLCIASNGSINTFVSAEDLLSCCTSC----GDGCDGGYPLQAWRYWVKQGLVS 166

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN-PTYGRGFFQDKHR 235
           GG Y  + GC+P +I+PC    +  T P C  Q+    +C + CT+  +Y   + +DKH 
Sbjct: 167 GGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPECASHCTSKSSYSVAYEKDKHY 226

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V   E  I+ EIL HGP  A F +Y DFY YKSG+Y H S  +L    H+ K++
Sbjct: 227 GLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGG--HAVKIL 284

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG ENGT YWLV N+W  +WG++G  +ILRG+ EC  E  + AG P
Sbjct: 285 GWGVENGTKYWLVANSWNINWGEKGYFRILRGRNECGIESAVVAGIP 331


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/339 (35%), Positives = 172/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C T+  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/333 (39%), Positives = 166/333 (49%), Gaps = 16/333 (4%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD 72
           L+  E    SD  I  IN    TW AGRN P      Y+R  L       +     LP  
Sbjct: 27  LIPAETDASSDKMIQYINYLNTTWQAGRN-PGFEDPAYVRGLLGVSP---ENHRYRLP-- 80

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK-- 130
            +  D      +P+ FD+RE WP C TIG + D G+C +   F AV A SDR CI S   
Sbjct: 81  ERRLDLSSLGPLPENFDSRENWPECTTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPSG 140

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G +   LS + + SCC+ C       C+ G     W+F  K G VTGG+Y    GC P  
Sbjct: 141 GPKRVHLSADDLLSCCRTC----GNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYP 196

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
           I  C HH +  TL  C+ +  P  +C   C    Y   +  DKH    +Y V   E  I+
Sbjct: 197 IKACDHHVNG-TLGPCDKKIPPTPRCVHMCRK-GYDVDYHDDKHYGKSSYSVPSEEKQIQ 254

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
            EI+ +GP  A F +Y DF HYKSGVY+  ++  L    H+ +L+GWG ENG PYWL  N
Sbjct: 255 AEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGG--HAIRLLGWGVENGVPYWLAAN 312

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +W   WGD+G  KILRG  EC  E  + AG PK
Sbjct: 313 SWNTEWGDKGFFKILRGSDECGIEDDVVAGLPK 345


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 175/344 (50%), Gaps = 23/344 (6%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           L+  L C +V      R      SD  ++ +N+   TW AG NF  N+   YLR+     
Sbjct: 4   LLACLSCLVVLAGAQSRPPFQLLSDELVNYVNKRNTTWKAGHNF-HNVDPSYLRRLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    ++ +  E +  +P+ FDAREQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGGPKLP----QRVWFAE-NMVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI++ G  N  +S E + +CC     D    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    
Sbjct: 172 LYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKFC-EPGYTPSYKEDKHYGCS 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y V  +E  I  EI  +GP  A F +Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGG--HAVRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIP 330


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 174/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPVMFKTH 74

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   R W    K G VTGG+Y    GCQP  +SP
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTQMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ ++LA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 121/331 (36%), Positives = 171/331 (51%), Gaps = 21/331 (6%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R  L   S   ++ IN+   TW AG NF  N+   Y+++               L G + 
Sbjct: 19  RPRLQPLSSEMVNYINKFNTTWKAGHNF-HNVDYSYIQRLC----------GTMLKGPKL 67

Query: 75  TYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
               +Y+    +P+ FDAREQWPNC T+  + D G+C +   F A  A SDR CI S  +
Sbjct: 68  PVMVQYTGDLKLPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
            +  +S+E + +CC  C       C+ G     W+F  K G V+GG Y    GC+P TI+
Sbjct: 128 VSVEISSEDLLTCCMSC----GMGCNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIA 183

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
           PC HH +  + PSC  +     +C T+C    Y   + +DKH    +Y V  +E+ I+ E
Sbjct: 184 PCEHHVNG-SRPSCTGEGGDTPQCITKC-EAGYTPSYKEDKHFGKTSYTVLSDEEQIQSE 241

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I  +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG E+G PYWL  N+W
Sbjct: 242 IFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGG--HAIKILGWGVEDGVPYWLCANSW 299

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
              WGD G  K LRG   C  E  + AG PK
Sbjct: 300 NTDWGDNGFFKFLRGSDHCGIESEVVAGIPK 330


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 176/348 (50%), Gaps = 24/348 (6%)

Query: 2   IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           + +L FL      R   +    S   ++ IN+   TW AG NF AN    Y+++      
Sbjct: 5   VVVLCFLASIASARHLPFFAPLSGDMVNYINKMNTTWKAGHNF-ANADLHYVKRLC---- 59

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
                +    P  +K +       +PD FD+R  WPNC TI  V D G+C +   F AV 
Sbjct: 60  ----GTHLNGPQLQKRFGFADGMELPDSFDSRAAWPNCPTIREVRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C+ + G+ N  +S E + SCC    ++    C+ G     W F  + G V+GG 
Sbjct: 116 AISDRVCVHTNGKVNVEVSAEDLLSCCG---FECGMGCNGGYPSGAWKFWTETGLVSGGL 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN---PTYGRGFFQDKHRT 236
           Y    GC+P +I PC HH +  + P+C+ ++    KC  +C +   P YG     DKH  
Sbjct: 173 YDSHLGCRPYSIPPCEHHVNG-SRPACKGEEGDTPKCVKQCEDGYAPVYG----SDKHFG 227

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V  +E  I  EI  +GP    F +Y DF  YKSGVY+H +  +L    H+ K++G
Sbjct: 228 ATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGG--HAIKILG 285

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG ENGTPYWL  N+W   WGD G  KILRGK  C  E  I AG PKN
Sbjct: 286 WGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGIPKN 333


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 171/330 (51%), Gaps = 25/330 (7%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN+   TW AGRNF  N+   YL++    ++   K        LP +R 
Sbjct: 23  FHPLSDDLINYINKRNTTWQAGRNF-HNVDISYLKRLCGTIMGGPK--------LP-ERV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TI  + D G+C +   F AVGA SDR CI + G  N
Sbjct: 73  AFAEDME--LPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC     D    C+ G     WNF  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGSQCGD---GCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            HH +  + P C  +   PK    T+     Y   + +DKH    +Y V +NE  I  EI
Sbjct: 188 EHHVNG-SRPQCTGEGDTPKC---TKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEI 243

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG EN  PYWLV N+W 
Sbjct: 244 YKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGG--HAIRILGWGVENSVPYWLVANSWN 301

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  I AG P+
Sbjct: 302 VDWGDNGLFKILRGEDHCGIESEIVAGIPR 331


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 128/347 (36%), Positives = 180/347 (51%), Gaps = 28/347 (8%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---L 55
           L+  L C +V      R      SD  ++ +N+   TW AG NF  N+   Y+++    +
Sbjct: 27  LLTTLSCLVVLTSARNRPNFPPLSDELVNYVNKRNTTWKAGHNF-HNVDLSYVKRLCGTI 85

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           +   K        LP  ++ +  E    +P+ FDAREQWPNC TI  + D G+C +   F
Sbjct: 86  LGGPK--------LP--QRVWLAE-DLVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAF 134

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A SDR CI + G  N  +S E + +CC    +   + C+ G     WNF  K+G V
Sbjct: 135 GAVEAISDRICILTNGNVNVEVSAEDLLTCCG---FQCGEGCNGGFPSGAWNFWTKKGLV 191

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           +GG Y    GC+P +I PC HH +  + P C  +     KC +R     Y   + +DKH 
Sbjct: 192 SGGLYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGSTPKC-SRICEAGYTPSYKEDKHF 249

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V  +E  I  EI  +GP  A F++Y DF  YKSGVY+H +   +    H+ +++
Sbjct: 250 GCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGG--HAVRIL 307

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E+GTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 308 GWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIESEIVAGLP 354


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 125/329 (37%), Positives = 173/329 (52%), Gaps = 18/329 (5%)

Query: 14  VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
            R +L   S   ID +N+   TWTAG+NF  N    +++       K        LP   
Sbjct: 18  ARPQLPLLSLEMIDFVNKLNTTWTAGQNF-HNKDSSFVKGLCGTILK-----GPKLP--E 69

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
             +D E    +PD FD REQWPNC T+  + D G C +   F A  A SDR CI+S G+ 
Sbjct: 70  LAHDVE-GIKLPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKI 128

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           +  +S E + +CC  C       C  G     W F   +G VTGG +  + GC+P T++P
Sbjct: 129 SLEISAEDLLTCCDEC----GMGCFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP 184

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +  + P C+ + V   KC T+C N  Y   + +DKH    +Y +   ++ I  E+
Sbjct: 185 CEHHVNG-SRPPCQGE-VETPKCVTQCNN-GYSLSYPKDKHFGQRSYSIPSQQEQIMTEL 241

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP  A F++Y DF  YK+GVY+H +   L    H+ K++GWG ENGTPYWLV N+W 
Sbjct: 242 YKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGG--HAVKILGWGEENGTPYWLVANSWN 299

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WGD+G  KI RG  EC  E  + AG P
Sbjct: 300 SDWGDKGFFKIKRGNDECGIESEMVAGAP 328


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/330 (36%), Positives = 169/330 (51%), Gaps = 19/330 (5%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  SD  ++ +N+   TW AG NF  N+   YL++       +      P P  R 
Sbjct: 20  RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDVSYLKKLC---GTFLGG---PKPPQRV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +
Sbjct: 73  MFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 130

Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
             +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I P
Sbjct: 131 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 186

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI
Sbjct: 187 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEI 243

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W 
Sbjct: 244 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 301

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  + AG P+
Sbjct: 302 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 165/324 (50%), Gaps = 18/324 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S   ID IN+   TWTAG+NF  N+   Y++        +      P     +      
Sbjct: 23  LSSEMIDFINKVNTTWTAGQNF-HNVDSSYVKGLC---GTFLKGPKLP-----QVLHNTE 73

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FDAR+QWP+C TI  + D G+C +   F A  A SDR CI S  + +  +S E
Sbjct: 74  GIRLPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAE 133

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C       CS G     W F  K+G VTGG  G   GC+P +I+PC HH + 
Sbjct: 134 DLLSCCDEC----GMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNG 189

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P    Q+ PK  C  +C +  Y   + +DKH    +Y +   ++ I  E+  +GP  
Sbjct: 190 TRPPCQGTQETPK--CEKKCID-GYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVE 246

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y DF  YK+GVY+H +   L    H+ K++GWG E+GTPYWL  N+W   WGD+G
Sbjct: 247 AAFTVYADFLLYKTGVYQHVTGEVLGG--HAIKILGWGEESGTPYWLAANSWNGDWGDKG 304

Query: 321 TVKILRGKYECAFEYLIAAGKPKN 344
             KI RG  EC  E  + AG P N
Sbjct: 305 FFKIKRGNDECGIESEMVAGTPLN 328


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 175/348 (50%), Gaps = 27/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  LY    SD  ++ +N++  TW AG NF  N+   Y+++   A 
Sbjct: 4   LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLCGAI 62

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A V  P+ FDAREQWPNC TI  + D G+C +   F 
Sbjct: 63  ----------LGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
           AV A SDR CI S G+ N  +S E +     +              F +  WNF  K+G 
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 173/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   R W    K G VTGG+Y    GCQP  + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ +ILA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 126/329 (38%), Positives = 171/329 (51%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     W+F  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       P  +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWL  N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 173/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 20  QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 77

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 78  DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 137

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   R W    K G VTGG+Y    GCQP  + P
Sbjct: 138 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 194 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ +ILA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 301

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 302 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 167/325 (51%), Gaps = 16/325 (4%)

Query: 21  FSDAYIDQINREAN-TWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           FS+ ++++ N+  N TW A R   F   +  E L+  L A       +  P+    +T D
Sbjct: 27  FSEKFVEEFNKRYNSTWRAARYQKF-EEMDPETLQGHLGALIDEPLWAKLPIKNVEQTND 85

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           P     +P+ FD+REQWPNC +I  + D   C +   FAA   +SDR CI S  +    +
Sbjct: 86  P-----IPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSI 140

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S+E +  CC  C       C  G     W ++   G  TGG YGD + C+P    PC HH
Sbjct: 141 SSEDLLECCATC----GNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDHH 196

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                 P C   K P  KC  +C +    + + QD H  +  Y + +N +AI++EI+AHG
Sbjct: 197 -VVGQYPPCGPIK-PTPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHG 254

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A+F +  DF  YKSGVY      K E   HS K+IGWG E GTPYWL+ N+W   WG
Sbjct: 255 PVQASFRVASDFLTYKSGVYIRDPKLKYEGG-HSVKIIGWGVEQGTPYWLIANSWNEDWG 313

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           + G  K+LRGK EC  E  + AG P
Sbjct: 314 ENGLFKMLRGKNECGIEAEVVAGLP 338


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 120/344 (34%), Positives = 178/344 (51%), Gaps = 19/344 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I  ++  +  T+ +  +++   ++I  IN    TW AG NF  +++ +Y+R    A   
Sbjct: 4   IIFGVLIAMVFTMPKNSMFQ---SHIHTINNMKTTWEAGENFGPHITSDYIRNLCGALKT 60

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVG 119
              +        ++ +D      +P  FDAR++W + C ++  V D G C +   F A  
Sbjct: 61  PLSKKLPIKDLSKEVHD------LPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAE 114

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A +DR CI +KG+    +STE + +CC  C +     C+ G     W F   +G VTGG 
Sbjct: 115 AMTDRICIATKGKNQVRISTEDLLTCCDSCGF----GCNGGYPQSAWEFFKTKGIVTGGP 170

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y    GCQP  I  C HH      P   N  +P  KC   C    Y   +  DKH    +
Sbjct: 171 YNSHKGCQPYAIPACDHHVPHSKNPC--NGSLPTPKCEKVCEK-GYNITYKNDKHYGVTS 227

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y ++++++ I +EI+ +GP  A F ++ DF +YKSGVY+H S  +L    H+ K++GWG 
Sbjct: 228 YSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGG--HAIKILGWGV 285

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           EN TPYWLV N+W P WGD G  KILRG  EC  E  + AG PK
Sbjct: 286 ENNTPYWLVANSWNPSWGDNGFFKILRGSDECGIEDEVVAGLPK 329


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    +   L    L   SD  I  IN  A TW A R FPAN SEEY    L+    Y +
Sbjct: 8   VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66

Query: 64  QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            ++     + K YDP Y     P +FD+R  W +C  IGH+ D G C +   F+  GAF+
Sbjct: 67  YTNEV---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ + G+ N+ LS E +A     C  D  K C  G   + W +   +G  TGGDYG 
Sbjct: 124 DRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGT 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC P  + PC +     T   C  Q  P  + H +C    YG+   Q++++T   Y V
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYKTKSEY-V 232

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            ++   I+++I+ +GP  A+F +YDD   YKSG+Y+ T  AK +   HS K+IGWG +NG
Sbjct: 233 INSIKTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQG-GHSIKIIGWGQQNG 291

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYWL +N+W   WG+ GT KI++G+ EC  E  + AG P
Sbjct: 292 TPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 123/327 (37%), Positives = 169/327 (51%), Gaps = 24/327 (7%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD  I+ IN+ A TW A R FPAN+S+EY+   L +       ++  +  D    DP Y
Sbjct: 25  LSDERIEYINKIAKTWKAERYFPANMSKEYITGLLGSRGYKNYLNEVEIKKD----DPLY 80

Query: 81  SAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           +        FDARE W  C  IGHV D G C +   F   GAF+DR C+ + G  N  LS
Sbjct: 81  TKNNNKIKHFDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQLS 140

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E +  CC  C       C  G+  + W +  +RG  TGGDYG   GC P  + PC    
Sbjct: 141 AEKLTFCCWTC----GLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYDDQ 196

Query: 199 S---APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
                   P+  N K P+           YG    +++++    Y V D+   I+++I  
Sbjct: 197 GEFLCQGKPTEHNHKCPR---------ACYGNSTVENRYKVESIY-VLDSFKTIEQDIRT 246

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A+F +YDDF  YKSG+Y+ T NA L    HS KLIGWG E+G PYWL++N+W   
Sbjct: 247 YGPVEASFDVYDDFITYKSGIYQKTPNA-LYVGGHSVKLIGWGEEDGIPYWLLVNSWSKF 305

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG++GT +I++G+ EC  E    AG P
Sbjct: 306 WGEQGTFRIIKGRNECGIERSATAGIP 332


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMGYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 171/339 (50%), Gaps = 19/339 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMGYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V +
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSN 234

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           +E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTP
Sbjct: 235 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTP 292

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           YWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 171/338 (50%), Gaps = 17/338 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC   R  D   C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCG-SRCGDG--CNGGYPAEAWNFWTRKGLVSGGLYESHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 294 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 172/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y     YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 20  QAYFLEKDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 77

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 78  DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEF 137

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   R W    K G VTGG+Y    GCQP  + P
Sbjct: 138 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 194 CPLDEYGNN----TCRGK--PAEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 244

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ +ILA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 301

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 302 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 122/330 (36%), Positives = 168/330 (50%), Gaps = 19/330 (5%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  SD  ++ +N+   TW AG NF  N+   YL++       +      P P  R 
Sbjct: 4   RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG---PKPPQRV 56

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +
Sbjct: 57  MFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 114

Query: 135 RPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
             +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I P
Sbjct: 115 VEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPP 170

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI
Sbjct: 171 CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 227

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W 
Sbjct: 228 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 285

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  + AG P+
Sbjct: 286 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 315


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 128/344 (37%), Positives = 176/344 (51%), Gaps = 22/344 (6%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           L+  L C +V      R      SD  ++ +N+   TW AG NF  N+   Y+++     
Sbjct: 4   LLATLSCLVVLTNARSRPYFQPLSDELVNYVNKRNTTWKAGHNF-HNVDLSYVKRLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    ++ +  E    +P+ FDAREQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGGPKLP----QRVWFAE-DVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI++ G  +  +S E + +CC     D    C+ G     WNF  K+G V+GG
Sbjct: 115 EAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTKQGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +  + P C  +     KC   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGDTPKCSKIC-EPGYSPSYKEDKHYGCS 229

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y V  +E  I  EI  +GP  A F +Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 230 SYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGG--HAVRILGWG 287

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 288 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 331


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 122/342 (35%), Positives = 172/342 (50%), Gaps = 19/342 (5%)

Query: 4   ILVFLLGCTLVRGE--LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           +LV     ++ RG   ++  S   I+ IN+   TW AG NF  ++   Y++         
Sbjct: 6   LLVLAASLSVSRGRPHIHPLSSDMINYINKLNTTWKAGHNF-HDVDYGYVKNLC---GTL 61

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
                 P+              +P +FDAREQWP C T+  + D G+C +   F A  A 
Sbjct: 62  LKGPKLPI-----MVQSAGGMKLPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAI 116

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI +KG+ +  +S++ + +CC  C       C+ G     W F  ++G VTGG Y 
Sbjct: 117 SDRICIHTKGKVSVEISSQDLLTCCDSC----GMGCNGGYPANAWEFWTEQGLVTGGLYN 172

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GC+P TI PC HH +  + P C  +     +C T+C    Y   + +DKH    +Y 
Sbjct: 173 SHIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPECVTQC-EAGYTPSYQKDKHYGKTSYG 230

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           V   E+ I+ EI  +GP    F +Y+DF  YKSGVY+H + + L    H+ K+IGWG EN
Sbjct: 231 VPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGG--HAIKMIGWGEEN 288

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G PYWL  N+W   WGD G  KILRG   C  E  + AG PK
Sbjct: 289 GVPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIPK 330


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 179/345 (51%), Gaps = 23/345 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           ++ LLG   V  + Y   + +ID IN +A TW AG NF  N  +EY+ + L   +K    
Sbjct: 11  VILLLG-VCVTEQAYFLEEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLL--GSKGVQV 67

Query: 65  SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
             +      KT D  Y      +P +FDAR++W  C TIG V D G C +    A   AF
Sbjct: 68  PHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAF 127

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR CI +  + N  LS E +  CC +C +    +C  G   + W++  + G VTGG Y 
Sbjct: 128 ADRLCIATNYEFNELLSAEELTFCCHLCGF----ACHGGYPIKAWSYFRRHGIVTGGGYQ 183

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLT 239
              GC P  + PC          +C  Q + K   H RCT   YG     + D HR T  
Sbjct: 184 SGEGCAPYRVPPCFSEEDGNN--TCRGQPMEK---HHRCTRMCYGDQEIDYDDDHRFTRD 238

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
           Y+      +I+K+++ +GP  A+  +YDDF  YKSGVY+ + NA    YL  H+ KLIGW
Sbjct: 239 YYYL-TYASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENA---TYLGGHAVKLIGW 294

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G E+G PYWL++N+W   WGD+G  KI RG  EC+ +  + AG P
Sbjct: 295 GEEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 133/336 (39%), Positives = 174/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G+  + W    K G VTGG+Y    GCQP  + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTRMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ ++LA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 242 IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 131/349 (37%), Positives = 176/349 (50%), Gaps = 24/349 (6%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L FL      R   Y    S   ++ IN+   TW AG NF AN    Y+++     
Sbjct: 4   LVVALCFLASIASSRHLPYFAPLSHDMVNYINKVNTTWKAGHNF-ANADLHYVKRLCGTL 62

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
            K         P  +K +       +PD FD+R  WPNC TI  + D G+C +   F AV
Sbjct: 63  LKG--------PQLQKRFGFADGLELPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR C+ + G+ N  +S E + SCC     +    C+ G     W F  + G V+GG
Sbjct: 115 EAISDRVCVHTNGKVNVEVSAEDLLSCCGD---ECGMGCNGGYPSGAWQFWTETGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTYGRGFFQDKHR 235
            Y    GC+P +I PC HH +  + P+C+ ++    KC  +C    +P YG     DKH 
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPACKGEEGDTPKCVKQCEEGYSPAYG----TDKHF 226

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
            T +Y V  +E  I  EI  +GP    F +Y DF  YKSGVY+H +  +L    H+ K++
Sbjct: 227 GTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELGG--HAIKIL 284

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           GWG ENGTPYWL  N+W   WGD G  KILRGK  C  E  I AG PKN
Sbjct: 285 GWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGVPKN 333


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/341 (36%), Positives = 167/341 (48%), Gaps = 21/341 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           ++  L  +  R      S   ++ IN+   TW AG NF  ++   Y+++           
Sbjct: 9   VISALSVSWARPRFAPLSREMVNFINKANTTWKAGHNF-HDVDYSYVKRLC--------- 58

Query: 65  SDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
               L G R     +Y+    +P  FDAREQWPNC T+  + D G+C +   F A  A S
Sbjct: 59  -GTLLKGPRLPVMVQYADDLKLPTNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S  + +  +S + + +CC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIHSNAKVSVEISAQDLLTCCDGC----GMGCNGGYPSAAWDFWSSDGLVTGGLYNS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             GC+P TI PC HH +  + P C  +      C   C  P Y   + QDKH    +Y V
Sbjct: 174 HIGCRPYTIEPCEHHVNG-SRPPCTGEGGDTPNCDMSC-EPGYSPSYKQDKHFGKTSYSV 231

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
             N+  I KE+  +GP    F +Y+DF  YKSGVY+H S   L    H+ K++GWG ENG
Sbjct: 232 PSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGG--HAIKILGWGEENG 289

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            PYWL  N+W   WGD G  KILRG+  C  E  I AG P+
Sbjct: 290 VPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIPQ 330


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/329 (38%), Positives = 170/329 (51%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     WNF  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       P  +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++ WG ENG PYWL  N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILVWGVENGVPYWLAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 181/340 (53%), Gaps = 17/340 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +    +   L    L   SD  I  IN  A TW A R FPAN SEEY    L+    Y +
Sbjct: 8   VCAIFVSVYLAEPTLQFLSDERIKYINEVAKTWKAERYFPANTSEEYFIG-LLGSRGYKN 66

Query: 64  QSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            ++     + K YDP Y     P +FD+R  W +C  IGH+ D G C +   F+  GAF+
Sbjct: 67  YTNEV---EIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFA 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C+ + G+ N+ LS E +A     C  D  K C  G   + W +   +G  TGGDYG 
Sbjct: 124 DRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGT 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC P  + PC +     T   C  Q  P  + H +C    YG+   Q++++T   Y V
Sbjct: 180 KEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYKTKSEY-V 232

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            ++   I++++  +GP  A+F +YDDF  YKSG+Y+ T  AK +   HS K+IGWG +NG
Sbjct: 233 MNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQG-GHSIKIIGWGQQNG 291

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYWL +N+W   WG+ GT KI++G+ EC  E  + AG P
Sbjct: 292 TPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIP 331


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 172/344 (50%), Gaps = 18/344 (5%)

Query: 2   IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           + IL  L+     R   Y    S   ++ IN+   TW AG NF  N    Y+++      
Sbjct: 5   VSILCVLVAFANARSIPYYPPLSSDLVNHINKLNTTWKAGHNF-HNTDMSYVKKLC---G 60

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
            +      P     +  D      +PD FD+R+QWPNC TI  + D G+C +   F AV 
Sbjct: 61  TFLGGPKLP-----ERVDFAADIDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C+ +  + +  +S E + SCC    ++    C+ G     W +  +RG V+GG 
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y    GC+P TI PC HH +  + P C  +     +C   C  P Y   + +DKH    +
Sbjct: 173 YDSHVGCRPYTIPPCEHHVNG-SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITS 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V  +E  I  EI  +GP    F +Y+DF  YKSGVY+H S  ++    H+ +++GWG 
Sbjct: 231 YGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGV 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ENGTPYWL  N+W   WGD G  KILRG+  C  E  I AG P+
Sbjct: 289 ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGVPR 332


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 125/340 (36%), Positives = 170/340 (50%), Gaps = 21/340 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           LV  L  +     L   S   +D IN+   TW AG NF  N+   Y+++           
Sbjct: 9   LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLC--------- 58

Query: 65  SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
               L G +     +Y+  V  PD FD R+QWPNC T+  + D G+C +   F A  A S
Sbjct: 59  -GTLLKGPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S  + +  +S+E + SCC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             GC+P +I PC HH +  T P C  ++    +C  +C    Y  G+ QDKH    +Y +
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEEGDTPQCSNQCET-GYTPGYKQDKHFGKNSYSL 231

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
              E  I  E+L +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG E G
Sbjct: 232 PSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGG--HAIKVLGWGEEGG 289

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYWL  N+W   WG+ G  KILRGK  C  E  + AG P
Sbjct: 290 TPYWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAGVP 329


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 122/331 (36%), Positives = 169/331 (51%), Gaps = 21/331 (6%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  S   ++ IN+   TW AG NF  N    Y+++               L G + 
Sbjct: 19  RPRFHPLSSDMVNYINKLNTTWKAGHNF-KNADYSYVQKLC----------GTMLKGPKL 67

Query: 75  TYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
               +Y+  V  P  FDAR QWPNC T+  + D G+C +   F A  A SDR CI S  +
Sbjct: 68  PIMVQYAGDVKLPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAR 127

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
            +  +S+E + +CC+ C       C+ G     W+F  K G VTGG Y    GC+P TI 
Sbjct: 128 VSVEISSEDLLTCCESC----GMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIP 183

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
           PC HH +  T P C  +     +C  +C +  Y   + +DKH    +Y V+ NE+ I+ E
Sbjct: 184 PCEHHVNG-TRPPCTGEGGDTPQCINQCES-GYTPSYKKDKHYGKTSYSVEANENQIQTE 241

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I  +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG E+G PYWL  N+W
Sbjct: 242 IYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGG--HAIKILGWGVEDGVPYWLCANSW 299

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
              WGD G  KILRG   C  E  + AG PK
Sbjct: 300 NTDWGDNGYFKILRGSDHCGIESEVVAGIPK 330


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/324 (35%), Positives = 169/324 (52%), Gaps = 21/324 (6%)

Query: 26  IDQINREANTWTAG-----RNFPANLSEEYL-RQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           ++ IN+    +TA       N P ++    +  +++   AKY          + KT++  
Sbjct: 38  VNYINKAQKLFTAKLSPRFANLPRDIKHRLMGSKYVALPAKY--------RMNEKTHNDI 89

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            ++T+P  FDAR  WP C ++  V D  AC +    AAVGA  DR CI S+G+Q   LS 
Sbjct: 90  DNSTIPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSA 149

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC  C Y     C  G  ++ WN+    G VTG +Y  ++GC+P    PC H+  
Sbjct: 150 DDILSCCTECGY----GCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYID 205

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A     C     P   C  +C +  Y   + +DKH     Y +  +   I++EI+ HGP 
Sbjct: 206 AGRYKKCPKDLYPTNTCEYKCQD-NYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPV 264

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
             TF +Y+DF HY SG+YKH +   +   +H+ K++GWGTENG  YW+  N+W   WG+ 
Sbjct: 265 EVTFDVYEDFEHYSSGIYKHMAGEYVG--VHAVKMLGWGTENGVDYWICANSWNSDWGEN 322

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  +ILRG+ EC  E  + AGKPK
Sbjct: 323 GFFRILRGENECGIESNVVAGKPK 346


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/329 (38%), Positives = 173/329 (52%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     W+F  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +  + P C  +     +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNG-SRPPCTGEG-DTHRCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWL  N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVENGVPYWLAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 181/348 (52%), Gaps = 24/348 (6%)

Query: 1   MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
           +  ++ FL     V+ E ++  SD  I  IN   N  W A         E+  R   + D
Sbjct: 8   IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58

Query: 59  AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
           A+    + R  P  R+T  P     +++  +P  FD+R++WP C +I  + D   C +  
Sbjct: 59  ARIQMGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCW 118

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR CI+S G+QN  LS   + SCC+ C       C  G +   W++  K G
Sbjct: 119 AFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDYWVKEG 174

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VTG    + TGC+P     C HH +    P C ++     +C   C    Y   + QDK
Sbjct: 175 IVTGSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDK 232

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           HR   +Y V ++E AI+KEI+ +GP  A F +Y+DF +YKSG+YKH +   L    H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGG--HAIR 290

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           +IGWG EN TPYWL+ N+W   WG+ G  +I+RG+ EC+ E  + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAGR 338


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 170/324 (52%), Gaps = 18/324 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD  I+ IN+ A TW A R FPAN+S+EY+   L +       ++  +  D    DP Y
Sbjct: 25  ISDERIEYINKIAKTWKAERYFPANMSKEYIMGLLGSRGYKNYLNEVEIKKD----DPLY 80

Query: 81  SAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           +        FDARE W  C  IGHV D G C +   F   GAF+DR C+ + G  N  LS
Sbjct: 81  TKNNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQLS 140

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E +  CC  C       C  G+  + W +  + G  TGGDYG   GC P  + PC +  
Sbjct: 141 AEKLTFCCWTC----GLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-YDD 195

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               L  C+ +  P    H +C    YG    +++++    Y V D+   I+++I  +GP
Sbjct: 196 QGEFL--CQGK--PTEHNH-KCPRACYGNSTVENRYKVKSIY-VLDSSKTIEQDIRKYGP 249

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A+F +YDDF  YKSG+Y+ T NA      HS KLIGWG E+G PYWL++N+W   WG+
Sbjct: 250 VEASFDVYDDFITYKSGIYQKTPNAFYVG-GHSVKLIGWGEEDGIPYWLLVNSWSKFWGE 308

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
           +GT +I++G+ EC  E    AG P
Sbjct: 309 QGTFRIIKGRNECGIERSATAGVP 332


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 170/355 (47%), Gaps = 42/355 (11%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           +L R  L   S   ++ IN+  +TWTAG NF  N+   Y+++               L G
Sbjct: 16  SLARPHLKPLSSEMVNYINKLNSTWTAGHNF-HNVDYSYVKKLC----------GTLLKG 64

Query: 72  DRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
            +      Y+  +  P  FD+REQWPNC T+  + D G+C +   F A  A SDR CI S
Sbjct: 65  PKLPLMIRYAGDIKLPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHS 124

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG---- 185
             + +  LS + + +CC  C       C+ G     WNF    G V+GG Y    G    
Sbjct: 125 NAKVSVELSAQDLLTCCNSC----GMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQV 180

Query: 186 -----------------CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
                            C+P TI PC HH +  + PSC  +     +C  RC    Y   
Sbjct: 181 SLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNG-SRPSCSGEGGDTPECIFRC-EAGYSPS 238

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
           + QDKH    +Y V   ED IK+EI  +GP    F +Y+DF  YKSGVY+H S + L   
Sbjct: 239 YKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGG- 297

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            H+ K++GWG ENG PYWL  N+W   WGD G  KILRG   C  E  I AG PK
Sbjct: 298 -HAIKMLGWGEENGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 123/340 (36%), Positives = 175/340 (51%), Gaps = 25/340 (7%)

Query: 11  CTLVRGELYK------FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           C LV  + ++       S+  ++ +N++  TW AG NF  N+   YL++       +   
Sbjct: 22  CLLVLADSWRGPSFHPLSEELVNYVNKQNTTWQAGHNF-YNVDLSYLKRLC---GTFLGG 77

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
              P P  R  +  + +  +P+ FDAREQWP C TI  + D G+C +   F AV A SDR
Sbjct: 78  ---PKPPQRVKFAEDLN--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR 132

Query: 125 RCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
            CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y   
Sbjct: 133 ICIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYDSH 188

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
            GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V 
Sbjct: 189 VGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYTPTYKQDKHYGYNSYSVS 245

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           ++E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGT
Sbjct: 246 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGT 303

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           PYWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 304 PYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/348 (37%), Positives = 174/348 (50%), Gaps = 27/348 (7%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L  LL  T  R  LY    SD  ++ +N++  TW AG NF  N+   Y+++     
Sbjct: 4   LLATLSCLLVLTSARSSLYFPPLSDELVNFVNKQNTTWKAGHNF-YNVDLSYVKKLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                     L G +      ++A V  P+ FDAR+QWPNC TI  + D G+C +   F 
Sbjct: 60  -------GTILGGPKLPQRDAFAADVVLPESFDARKQWPNCPTIKEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGS 174
           AV A SDR CI S G+ N  +S E +     +              F +  WNF  K+G 
Sbjct: 113 AVEAISDRICIHSNGRVNVEVSAEDM-----LTCCGGECGDGCNGGFPSGAWNFWTKKGL 167

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH
Sbjct: 168 VSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKH 224

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   +    H+ ++
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRI 282

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 283 LGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 169/340 (49%), Gaps = 28/340 (8%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +LV +L  +L   E+   S  +ID INR  ++W AGRNFP N + EYL +       + D
Sbjct: 9   VLVAVLSASL--AEIDVLSSEFIDSINRIQSSWVAGRNFPENTTNEYLYKLNGFIGLHPD 66

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
            + +P P    T++      VP+ FDAR +WPNC ++  + D GAC +   FA++ + SD
Sbjct: 67  PNYKP-PVLVHTFNAR---DVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSD 122

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
           R CI S G      S E + SCC  C       C  G +    +F    G V+GGD    
Sbjct: 123 RICIHSSGSAQFMFSPEDLLSCCTSC-----GDCGGGYMMSALDFYINEGIVSGGDVNSN 177

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
            GC+P T +     G  P              C   C N  Y   +  DKH  +  Y V 
Sbjct: 178 EGCRPYT-ADAHDQGQTPA-------------CTKSCRN-GYSTSYSADKHYGSNDYVVS 222

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
              D I+ E++ +GP    F ++ DFY+Y SGVY+H S   +    H  K++GWG ENG 
Sbjct: 223 SVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVG--FHVVKIVGWGVENGV 280

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           PYWL+ N+WG  WGD G  K+LRG+ EC  E    A  P+
Sbjct: 281 PYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAVMPR 320


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 170/346 (49%), Gaps = 18/346 (5%)

Query: 1   MIHILVFLLGCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
            +H  V L    LV G   L+  SD +I++IN   +TW AGRNF  +    +++Q L   
Sbjct: 4   FLHFAVVLATVALVYGGVHLHPLSDDFINRINSRKSTWKAGRNFDIDTPISHIKQLLGVL 63

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAA 117
            +  +    P     K      +  +PD FDARE WP+C   IG++ D   C +   F A
Sbjct: 64  PETENTPKLP-----KKIHSINAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGA 118

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S       +S E    CC IC       C+ G     W      G VTG
Sbjct: 119 VEAMSDRICIHSNATVKVNISAEDPLDCCTIC----GMGCNGGMPAMAWLHWTVNGIVTG 174

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G+Y D  GC+  + +PC HH     LP C   K P   C   C + +     +Q+     
Sbjct: 175 GNYEDTNGCKAYSFAPCEHHVDG-DLPPCGPTK-PTPDCKKECDSGS--SLTYQNDLTHG 230

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y +D     I+ EI+ +GP  A+F++Y+DF  YKSGVY+H          H+ K++GW
Sbjct: 231 SNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGG--HAIKILGW 288

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G EN TPYWLV N+W   WGD+G  KILRG  EC  E  I AG P+
Sbjct: 289 GVENDTPYWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIPE 334


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 176/349 (50%), Gaps = 19/349 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFS-----DAYIDQINREANTWTAGRNF-PANLSEEYLRQF 54
           ++++ + +L    V  E Y  S     +A +  +N+   TW A  NF P     E L+  
Sbjct: 28  VLNMKLLVLLSAFVLSECYVISKEDNFNAIVKTVNKANTTWKASLNFDPTYYVPEDLK-- 85

Query: 55  LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           L+   K        L     +Y       +P++FD+R+QWP+C +I ++ D G+C +   
Sbjct: 86  LLCGVKEDKHGYSKL---ETSYHNLEGIKIPNQFDSRKQWPHCPSISYIRDQGSCGSCWA 142

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F AV A SDR CI+S G+    +S E + SCC    ++    C+ G     W + +  G 
Sbjct: 143 FGAVEAMSDRYCIRSNGKIQVEISAEDLLSCCG---FECGDGCNGGFPGSAWKYWNSDGL 199

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG YG +TGC P  I PC HH         E    P   C ++C   T    + QDKH
Sbjct: 200 VTGGLYGSKTGCLPYQIKPCEHHVPGDRPKCSEGGGTPS--CVSKCKGNTTIH-YNQDKH 256

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V  +   I+ EI+ HGP    F +Y DF  YKSGVYKH +   L    H+ ++
Sbjct: 257 YGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGG--HAIRI 314

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG+ENG  YWLV N+W   WGD+G  KILRG  EC  E  + AG P+
Sbjct: 315 LGWGSENGVAYWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAGIPQ 363


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 181/352 (51%), Gaps = 25/352 (7%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
           +I  +  L    L   E+     SD  I  IN+  +  WTA R+          R   + 
Sbjct: 8   IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSD---------RFKSLK 58

Query: 58  DAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
           DA+    + R     RK   P     + S  +P  FD+R++WP C +I ++ D   C A 
Sbjct: 59  DARILLGAMREDEELRKKRRPTVDHQDVSLEIPTSFDSRKEWPQCKSISNIRDQSRCGAG 118

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             FAAV A SDR CI+SKG+++  LS   + SCC  C       C  G     W++  + 
Sbjct: 119 WAFAAVQAMSDRICIESKGKKSVELSAVDLLSCCIEC----GLGCQMGFPGIAWDYWVQE 174

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG   + TGCQP     C HH +    P C      K KCH +C    Y   + +D
Sbjct: 175 GIVTGGSKENHTGCQPYPFPKCEHH-TKGRYPECGEIIYMKPKCHQKCQK-GYKTPYEKD 232

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           K+   ++Y +  NED+IKKEI+ HGP  A+F ++ DF +YKSG+YKH +   + +  H  
Sbjct: 233 KYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGS--HVV 290

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           ++IGWG E  TPYWL+ N+W   WG++G  ++LRGK EC  E  + +G P++
Sbjct: 291 RIIGWGVEKETPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLPRD 342


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 119/326 (36%), Positives = 166/326 (50%), Gaps = 20/326 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRKTYD 77
            SD  ++ +N+   TW AG NF  N+   Y+++    ++  AK   Q       D K   
Sbjct: 26  LSDEMVNYVNKLNTTWKAGHNF-RNVDMSYVKKLCGTVMGGAKQLPQRVMLADDDMK--- 81

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P+ FDAREQWP C TI  + D G+C +   F AV A SDR C+ + G     +
Sbjct: 82  ------LPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHTNGYITIEV 135

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + SCC +      + C+ G     W +  K+G V+GG Y    GC+P +I PC HH
Sbjct: 136 SAEDLLSCCGL---QCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEHH 192

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            +  + P+C  +     KC+ +C    Y   +  DKH  T  Y V  +E  I  EI  +G
Sbjct: 193 VNG-SRPACTGEGGDTPKCNKKC-EAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNG 250

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P    F +Y DF  YKSGVY+H +   L    H+ +++GWG E+G PYWL  N+W   WG
Sbjct: 251 PVEGAFIVYADFLQYKSGVYQHVTGDMLGG--HAIRVLGWGVEDGVPYWLAANSWNTDWG 308

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D G  KILRGK  C  E  + AG P+
Sbjct: 309 DNGFFKILRGKDHCGIESEMVAGIPR 334


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 122/337 (36%), Positives = 175/337 (51%), Gaps = 27/337 (8%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
            L  GE    SD +I+           GRNF A+++E ++R+ +     + D     LP 
Sbjct: 15  ALTSGEPSLLSDEFIE----------VGRNFDASVTEGHIRRLM---GVHPDAHKFALPD 61

Query: 72  DRKTYDPEYSATV---PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
            R+     Y  +V   P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 62  KREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 121

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V+GG YG   GC+P
Sbjct: 122 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRP 177

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             ISPC HH +    P     + P  KC   C +  Y   + +DKH  + +Y V  N   
Sbjct: 178 YEISPCEHHVNGTRPPCAHGGRTP--KCSHVCQS-GYTVDYAKDKHFGSKSYSVRRNVRE 234

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYW 306
           I++EI+ +GP    F +Y+D   YK GVY+H    +L    H+ +++GWG   E   PYW
Sbjct: 235 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGG--HAIRILGWGVWGEEKIPYW 292

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L+ N+W   WGD G  +ILRG+  C  E  I+AG PK
Sbjct: 293 LIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLPK 329


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 167/325 (51%), Gaps = 23/325 (7%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYDP 78
            S   I  INR   TW AG+NF  N+   Y++     + +     + + P          
Sbjct: 25  LSSEMIQYINRLNTTWKAGQNF-YNVDLSYVQGLCGTLQNKPTLPELEHPA--------- 74

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
                +PD FDAR+QWPNC TI  + D G+C +   F A  A SDR CI S  +    +S
Sbjct: 75  --GVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEIS 132

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            E + SCC+ C       C  G     W +  K G VTGG YG   GC+P +I PC HH 
Sbjct: 133 AEDLLSCCEEC----GMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEHHV 188

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  T P C+ +     KC T+C +  Y   + +DK+    TY V   ++ I  E+  +GP
Sbjct: 189 NG-TRPPCQGEG-DTPKCQTKCID-GYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGP 245

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F++Y+DF  YKSGVY+H +   L    H+ K++GWG EN TPYWL  N+W   WG+
Sbjct: 246 VEAAFSVYEDFLLYKSGVYQHLTGDMLGG--HAIKILGWGKENNTPYWLAANSWNTDWGN 303

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  KILRG  EC  E  + AG P+
Sbjct: 304 QGFFKILRGGDECGIESEVVAGIPQ 328


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 162/317 (51%), Gaps = 19/317 (5%)

Query: 29  INREANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPD 86
           IN+   TW AG N  F      +  RQ  +      D     LP   K   P     VPD
Sbjct: 28  INKLGTTWKAGVNKRFEGLSEVDIRRQMGVLQGGPLDIK---LP--EKDITP--LKDVPD 80

Query: 87  RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
            FDAR QWP+C TI  + D GAC +   F AV + SDR CI     Q+  +S E + +CC
Sbjct: 81  MFDARMQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHF--NQSAHISAEDLMACC 138

Query: 147 KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC 206
           + C       C+ G +   W +    G VTGG Y  + GCQP  I+ C HH      P C
Sbjct: 139 ETC----GMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-C 193

Query: 207 ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALY 266
            +++    +C   C    Y   F +DKH     Y V  + +AI+ EI+ +GP    F +Y
Sbjct: 194 ASKEEHTPRCSKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY 252

Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR 326
            DF  YKSGVY+HTS A L    H+ +++GWGTENGTPYWLV N+W   WG  G  KI+R
Sbjct: 253 ADFPTYKSGVYQHTSGAMLGG--HAIRILGWGTENGTPYWLVANSWNEDWGAMGYFKIIR 310

Query: 327 GKYECAFEYLIAAGKPK 343
           GK +C  E  I AG PK
Sbjct: 311 GKDDCGIESQITAGMPK 327


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 122/347 (35%), Positives = 174/347 (50%), Gaps = 24/347 (6%)

Query: 2   IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LI 56
           + IL  L+     R   Y    S   ++ IN+   TW AG NF  N    Y++Q     +
Sbjct: 5   VSILCVLVAFANARSVPYYRPLSSDLVNHINKLNTTWKAGHNF-YNTDMSYVKQLCGTFL 63

Query: 57  ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
              K  ++ D    GD +         +PD FD+R QWPNC TI  + D G+C +   F 
Sbjct: 64  GGPKLPERVD--FAGDME---------LPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFG 112

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR C+ +  + +  +S E + SCC    ++    C+ G     W +  ++G V+
Sbjct: 113 AVEAISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTEKGLVS 169

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GG Y    GC+P +I PC HH +  + P C  +     +C   C  P Y   + +DKH  
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNG-SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYG 227

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V  +E  I  EI  +GP    F +Y+DF  YKSGVY+H +  ++    H+ +L+G
Sbjct: 228 ITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGG--HAIRLLG 285

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG +NGTPYWL  N+W   WGD G  KILRG+  C  E  I AG P 
Sbjct: 286 WGVDNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAGIPS 332


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 162/323 (50%), Gaps = 13/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S   I  IN EANT       P   S   +R+ L A     D +   LP     Y P  
Sbjct: 33  LSSELIHFINHEANTTWKAAPSPRFKSVSDIRRMLGALP---DPNGGHLPTLCTGYTPSL 89

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P  FDAR+ WP+C +I  + D  +C +   F AV A SDR CI+SKG     LS E
Sbjct: 90  D-ELPKEFDARKYWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAE 148

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CC  C       C+ G     W++  + G VTG  Y    GCQP    PC HH   
Sbjct: 149 NLVACCSSC----GMGCNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEHHVVG 204

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P  PSCE   V   KC T C  P Y   + +DK      Y V  N++AI KE+  HGP  
Sbjct: 205 PR-PSCEGD-VETPKCKTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEHGPVE 261

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y DF +YKSGVY+H S   L    H+ +L+GWG ENG PYWL+ N+W   WGD G
Sbjct: 262 VDFEVYADFPNYKSGVYQHVSGGLLGG--HAVRLLGWGEENGVPYWLIANSWNSDWGDNG 319

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KI+RG+ EC  E  + AG PK
Sbjct: 320 YFKIIRGRNECGIESDVNAGIPK 342


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/349 (36%), Positives = 173/349 (49%), Gaps = 24/349 (6%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  L FL      R   Y    S   ++ IN+   TW AG NF AN    Y+++     
Sbjct: 4   LVVALCFLASIANSRHLPYFAPLSHDMVNYINKVNTTWKAGHNF-ANADVHYVKRLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                 +    P  +K +       +PD FD+R  WPNC TI  + D G+C +   F AV
Sbjct: 60  -----GTHLNGPQLQKRFGFADDLDLPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR C+ + G+ N  +S E + SCC    +     C+ G     W F  + G V+GG
Sbjct: 115 EAISDRVCVHTNGKVNVEVSAEDLLSCCG---FKCGMGCNGGYPSGAWRFWTETGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN---PTYGRGFFQDKHR 235
            Y    GC+P +I PC HH +  + PSC+ ++    KC   C     P YG     DKH 
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNG-SRPSCKGEEGDTPKCMKTCEEGYTPAYG----SDKHF 226

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V  +E  I  +I  +GP    F +Y DF  YKSGVY+H +  +L    H+ K++
Sbjct: 227 GATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGG--HAIKIL 284

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           GWG ENGTPYWL  N+W   WGD G  KILRGK  C  E  + AG PKN
Sbjct: 285 GWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAGIPKN 333


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 123/345 (35%), Positives = 178/345 (51%), Gaps = 24/345 (6%)

Query: 4   ILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKY 61
           ++  L     ++ E +K  SD  I  IN   N  W A         E+  R   + DA+ 
Sbjct: 16  LITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDDARI 66

Query: 62  FDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
              + R  P  R+   P     E++  +P  FD+R++WP C +I  + D   C +   F 
Sbjct: 67  QMGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFG 126

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI+S G+QN  LS   + SCC+ C       C  G +   W+F  K G VT
Sbjct: 127 AVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDFWVKEGIVT 182

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           G    + TGC+P     C HH +    P C ++     +C   C    Y   + QDKHR 
Sbjct: 183 GSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDKHRG 240

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
             +Y V ++E AI+KEI+ +GP  A+F +Y+DF +YKSG+YKH +   L    H+ ++IG
Sbjct: 241 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIRIIG 298

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           WG EN TPYWL+ N+W   WG+ G  +I+RG+ EC  E  + AG+
Sbjct: 299 WGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAGQ 343


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 115/324 (35%), Positives = 169/324 (52%), Gaps = 17/324 (5%)

Query: 22  SDAYIDQINREANTWTAG-RNFPAN-LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           ++  I  IN   + W AG ++ P   +    ++   +A  K+++    P+          
Sbjct: 22  TELLIQHINSVQSLWRAGYQDVPKEKMMGNLMKPEHVAPHKFYEV--EPI---------S 70

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +PD FDAREQWPNC +I ++ D   C +    AA    SDR CI S G+ N  +S 
Sbjct: 71  VAENIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISA 130

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC    Y+    C  G   + W +    G VTGG Y  + GC+P +I+PC    +
Sbjct: 131 EDLLSCCT-GGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVN 189

Query: 200 APTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             T P C   +V   +C  +CT+ + Y   + QDKH  +  Y +  N   I+ EI+ +GP
Sbjct: 190 GVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGP 249

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y DFY YKSG+YKH +  +L    H+ K++GWG ENGTPYWL  N+W  +WG+
Sbjct: 250 VEVGFLVYSDFYQYKSGIYKHVAGRELGG--HAVKILGWGVENGTPYWLAANSWNVNWGE 307

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
           +G  +I RG  EC  E  + AG P
Sbjct: 308 KGYFRIRRGTNECGIESSVVAGIP 331


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 117/343 (34%), Positives = 168/343 (48%), Gaps = 15/343 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           M+ +L  LL        +   +  +I+ IN     WTA         E Y   F + +  
Sbjct: 1   MLKLLPSLLFILAASAVVLPRNKLFINHINSAQKLWTA---------EHYTTPFEVKNLM 51

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
             +     L  D K    E + ++PD +D R+ WP C ++ ++ D   C +    AA  A
Sbjct: 52  KVEHVAAHLDKDIKL--AETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEA 109

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S G  N  LS E + +CC   +++    C  G   + W +  K G VTGG +
Sbjct: 110 ISDRTCIASNGDVNTLLSAEDILTCCT-GKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSF 168

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLT 239
             + GC+P +I+PC       T P C  +     KC   CT N +Y   + QDKH     
Sbjct: 169 ESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASA 228

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +  +   I+ EILAHGP    F +Y+DFY YK+G+Y H +  +L    H+ K++GWG 
Sbjct: 229 YAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGG--HAVKMLGWGV 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +NGTPYWL  N+W   WG++G  +ILRG  EC  E    AG P
Sbjct: 287 DNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMP 329


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 134/330 (40%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANTWTA--GRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
            S A ID +NR   TW A   R F    S   +RQ L A     D   R LP        
Sbjct: 39  LSSAIIDYVNRINTTWKAEPSRRF---TSPSQVRQQLGA---LPDPMGRRLPVLYSL--S 90

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP-- 136
           E   ++P  FD R++WPNC T+  + D G+C +   F A  A SDR CI+ +    R   
Sbjct: 91  ENYKSLPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVM 150

Query: 137 --LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS + + SCC+ C       C+ G   + WNF    G V+GG YG +  C+   I PC
Sbjct: 151 VRLSADDLLSCCRDC----GMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPC 206

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +  T P CE    P  KC   C    Y   + +DKH     Y V  NEDAIK E++
Sbjct: 207 EHHVNG-TRPPCEGD-APTPKCKNVCQE-EYKVPYKKDKHYAVKVYSVHSNEDAIKHELI 263

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A F +Y DF  YKSGVY+H S A L    H+ KL+GWG E+G PYWL  N+W  
Sbjct: 264 THGPVEADFEVYADFPTYKSGVYQHVSGALLGG--HAIKLMGWGEEDGVPYWLCANSWNT 321

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG+ G  KILRGK  C  E  I AG P+N
Sbjct: 322 DWGEGGFFKILRGKNHCGIESDIVAGIPQN 351


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/352 (35%), Positives = 178/352 (50%), Gaps = 28/352 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L    +  + Y     YID IN +A TW AG NFP +  +E + + L +   
Sbjct: 4   VLILLSVILFSVYMTEQAYFLEKDYIDSINAQATTWKAGVNFPPSTPKEAILRLLGSRGV 63

Query: 61  YF-DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
              ++++  +   R +        +P +FDAR++W  C TIG V D G C +    A   
Sbjct: 64  QIPNKANYKMYKSRDSNYDNLFGRIPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSS 123

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR C+ +    N  LS E +  CC  C Y     C+ G   + W      G VTGGD
Sbjct: 124 AFADRLCVATDADFNEFLSPEELTFCCHTCGY----GCNGGYPIKAWERFKSHGLVTGGD 179

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK--HRTT 237
           Y    GC+P  + PC HH       SC ++ + K   + RCT   YG         HR T
Sbjct: 180 YKSGEGCEPYRVPPCRHHAEGNN--SCSDKPMEK---NHRCTRMCYGDQDLDFDDDHRYT 234

Query: 238 -----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--H 290
                LTY       +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA   +YL  H
Sbjct: 235 RDSYYLTY------GSIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNA---SYLGGH 285

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KLIGWG E+G PYWL++N+W   WGD+G  KI RG  EC  +    AG P
Sbjct: 286 AVKLIGWGEESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVDNSTTAGVP 337


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 128/344 (37%), Positives = 174/344 (50%), Gaps = 23/344 (6%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           L+  L C +V      R      SD  +  +N++  TW AG NF  N+ + YL++     
Sbjct: 4   LLATLSCLVVLTSAQRRPPFQPLSDELVHYVNKQNTTWKAGHNF-HNVDQSYLKKLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P P  R  +    +  +P+ FD+REQWPNC TI  + D G+C +   F AV
Sbjct: 60  GTFLGG---PKPPQRLWF--AENMILPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI++ G  +  +S E + +CC     D    C+ G     WNF    G V+GG
Sbjct: 115 EAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGD---GCNGGFPAEAWNFWTXXGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYTPSYKEDKHYGCS 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y V  +E  I  EI  +GP  A F++Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGG--HAVRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 330


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 127/347 (36%), Positives = 175/347 (50%), Gaps = 20/347 (5%)

Query: 4   ILVFLLGCTL-VRGELYKFSDA---YIDQINREAN-TWTAGRNFPANLSEEYLRQFLIAD 58
           + + +LGC        +KF +     + ++N   N TW A R +P    E+  R+ L+  
Sbjct: 6   LSILILGCLFSTSANCFKFGEMSPFIVFEVNSNPNSTWKAAR-YPH--FEKMTREQLLGH 62

Query: 59  AKYFDQSD-RPLPGDRKTYDPEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
               D+ D   LP   K +DP  +A  +P+ FDAREQWPNC +I  + D   C +   FA
Sbjct: 63  LGSLDEPDWVKLP--TKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFA 120

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           A   FSDR CI S       +S+E +  CC   C       C  G     W ++ ++G  
Sbjct: 121 ATETFSDRICIASNQTLQTSISSEDLLECCADYC----GMGCKGGYPSAAWGYMKRQGVS 176

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGG YGD T C+P    PC HH +    P    Q  P+  C   C +      + +D H 
Sbjct: 177 TGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQ--CVKECNSEYTQNTYEKDLHF 234

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
            + TY +  N  AI++EI+AHGP  A+F +  DF  YKSGVY      K E   HS K+I
Sbjct: 235 ASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGG-HSVKII 293

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E  TPYWL+ N+W   WG++G  ++LRG+ EC  E  I AG P
Sbjct: 294 GWGKEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAGLP 340


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 128/334 (38%), Positives = 170/334 (50%), Gaps = 24/334 (7%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+QIN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINQINANAKTWKAGANFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74

Query: 77  DPEYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y++    +P  FDAR++W  C T+G V D G C     F    AF+DR CI + G+ 
Sbjct: 75  DEAYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C       C  G   + W    K G VTGGDY    GCQP  + P
Sbjct: 135 NELLSAEELAFCCHKC----GSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPP 190

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIK 250
           C          +C  +  P  K H RCT   YG     F +D   T   Y++  N   I+
Sbjct: 191 CPFDEYGNN--TCRGK--PAEKNH-RCTRMCYGNQNLDFKEDHRYTRDAYYL--NYQIIQ 243

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLV 308
            +++ +GP  A++ +YDDF +YKSGVY  T NA   +YL  H+ KLIGWG E G PYWL+
Sbjct: 244 NDLMTYGPIEASYDVYDDFPNYKSGVYMKTENA---SYLGGHAVKLIGWGEEYGVPYWLL 300

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 301 VNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  210 bits (534), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 175/346 (50%), Gaps = 16/346 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
           ++ +L  +L       + Y   ++YI+ IN  A TWTAG NF P+   +++++       
Sbjct: 4   LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDFIKMLGSKGV 63

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
           +    +   +       +   +  +P  FDAR +W +C TIG V D G C +    A   
Sbjct: 64  EAAKNASAHMFKTHDVANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSS 123

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR C+ + G  N  LS E +  CC  C +     C+ G   + W +    G VTGG+
Sbjct: 124 AFADRLCVATNGDFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKYFSSHGIVTGGN 179

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
           Y    GC+P  + PC       +  SC  + + K   + RCT   YG     + D HR T
Sbjct: 180 YKSGEGCEPYRVPPCPQDEEGKS--SCAGKPIEK---NHRCTRMCYGNQDLDYNDDHRFT 234

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIG 296
             Y+      +I+K+++ +GP  A+F +YDDF  YKSGVY+ T NA KL    H+ KLIG
Sbjct: 235 RDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGG--HAVKLIG 291

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG E GTPYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 292 WGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECGIDSAATAGVP 337


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 125/332 (37%), Positives = 164/332 (49%), Gaps = 15/332 (4%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDR 67
           L G T+       FS A ID++N     WTAG NF    + E +R +L A    +   D 
Sbjct: 25  LFGFTIGIAAASDFS-AIIDEVNTANAGWTAGENFHEQTTLEDVRSWLGA----WSNKDY 79

Query: 68  PLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
             P  +K    +    +P  FD+R  W +C  IG + D G C +   F A  A SDR CI
Sbjct: 80  DWP--QKYPHDDLVGDIPATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICI 137

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
            SKG  +   + E V SCC  C       C+ G       +   RG VTGG YG +  CQ
Sbjct: 138 ASKGATDVMYAAEDVLSCCLTC----GNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQ 193

Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
           P T+  C HH      P  E    PK  C  +C      + +  DK      Y V ++  
Sbjct: 194 PYTLEACEHHVPGDRPPCTEGGGTPK--CSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVG 251

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
            I++EI+ +GP  A F +Y DF  YKSGVY+HTS ++L    H+ K+IGWGTE G  YWL
Sbjct: 252 KIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGG--HAIKIIGWGTEGGDDYWL 309

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           + N+W   WGD+GT KILRG  EC  E  + A
Sbjct: 310 INNSWNSDWGDKGTFKILRGSNECGIEGEVVA 341


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 120/328 (36%), Positives = 173/328 (52%), Gaps = 19/328 (5%)

Query: 21  FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
            SD  I  IN+  +  WTA R+  F +      L   +  D K   +  RP      T D
Sbjct: 30  LSDEIIAYINQHPDAGWTASRSDRFKSVEDARILLGVMREDEK-LRKKRRP------TVD 82

Query: 78  PE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            +  S  +P  FD+R++W  C +I  + D   C +   FAAV   SDR CI+SKG+++  
Sbjct: 83  HQNVSLEIPSTFDSRKKWSQCKSISSIHDQSRCGSGWAFAAVEVMSDRICIQSKGEKSVE 142

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS   + SCC+ C       C  G     W++  + G VTG    + TGCQP     C H
Sbjct: 143 LSAVDLLSCCREC----GLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH 198

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           + +    P+C  +     KC  +C    Y   + +DKH   + Y V +NED+IKKEI+ H
Sbjct: 199 NTTG-KYPACGQKIYETPKCQKKCQK-GYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH 256

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  + F +Y DF +YKSG+YKH    ++   +H+ +++GWG E GTPYWL+ N+W   W
Sbjct: 257 GPVGSFFTVYSDFLNYKSGIYKHMKGTEIG--VHTVRIVGWGVEKGTPYWLIANSWNEGW 314

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPKN 344
           G++G  +ILRGK EC  E L+  G P+N
Sbjct: 315 GEKGYFRILRGKDECDIESLVIGGLPRN 342


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/329 (37%), Positives = 169/329 (51%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     W+F  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       P  +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNGSRPPCTGEGDTP--RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            + P    F ++ DF  YKSGVYKH +   +    H+ +++GWG  NG PYWL  N+W  
Sbjct: 245 KNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILGWGVGNGVPYWLAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 125/329 (37%), Positives = 172/329 (52%), Gaps = 23/329 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDRK 74
            +  SD  I+ IN++  TW AGRN P N+   YL++    ++   K        LPG R 
Sbjct: 23  FHPLSDDLINYINKQNTTWQAGRN-PYNVDISYLKKLCGTVLGGPK--------LPG-RV 72

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
            +  +    +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ N
Sbjct: 73  AFGEDID--LPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC I   D    C+ G     WNF  K+G V+GG Y    GC P TI PC
Sbjct: 131 VEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPC 187

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +  + P C  +   + +C+  C    Y   + +DKH    +Y V ++   I  EI 
Sbjct: 188 EHHVNG-SRPPCTGEGDTR-RCNKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIY 244

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F ++ DF  YKSGVYKH +   +    H+ +++ WG ENG PYW   N+W  
Sbjct: 245 KNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG--HAIRILVWGVENGVPYWAAANSWNL 302

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  I AG P+
Sbjct: 303 DWGDNGFFKILRGENHCGIESEIVAGIPR 331


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/329 (36%), Positives = 165/329 (50%), Gaps = 23/329 (6%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           +    SD  ID +N    TWTA R+  FP+    +      + D K+       LP   K
Sbjct: 23  DFQALSDDVIDYVNSLNTTWTAARSPRFPSGNEVDVKDLCGVLDVKH------TLPYKEK 76

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
                    +PD FDAR++W +C +I  + D G+C +     AV A SDR C+    Q+N
Sbjct: 77  VS----VGAIPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCVSF--QEN 130

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CCK C       C+ G + + W +  K G VTGG YG   GCQP  I  C
Sbjct: 131 VHISAENLMTCCKFC----GNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKC 186

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
           +HH   P        K P+  C   C +  Y   +  D H     Y V    +AI+ EI+
Sbjct: 187 NHHEPGPYENCTGEGKTPQ--CERTCRS-GYTTSYEADLHYGEKAYAVHREVEAIQTEIM 243

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F +Y DF  YKSGVY+H     L    H+ +++GWGTENG PYWL+ N+W P
Sbjct: 244 TNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGG--HAIRILGWGTENGVPYWLIANSWNP 301

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD+G  K++RGK +C  E  I AG PK
Sbjct: 302 SWGDKGYFKMIRGKDDCGIESNIVAGTPK 330


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/347 (37%), Positives = 180/347 (51%), Gaps = 19/347 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L       + Y   ++YI+ IN  A TW AG NF  +  E    + L   +K
Sbjct: 4   LVILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWKAGVNFDPSTPETDFIKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             + +        KT+D  Y+  + +P  FDAR++W +C TIG V D G C +   F   
Sbjct: 62  GVEAAKNASAHMFKTHDVAYNKFSYIPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTS 121

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ + G  N  LS E +  CC  C +     C+ G   + W +    G VTGG
Sbjct: 122 SAFADRLCVATDGDFNELLSAEELTFCCHACGH----GCNGGYPIKAWKYFSTHGLVTGG 177

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRT 236
           +Y    GC+P  + PC  +    +  SC  +  PK K H RCT   YG     + D HR 
Sbjct: 178 NYKSGKGCEPYRVPPCPRNEDGKS--SCAGK--PKEKNH-RCTRMCYGNQDLDYDDDHRF 232

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLI 295
           T  ++      +I+K++L +GP  A+F +YDDF  YKSGVY+ T NA KL    H+ KLI
Sbjct: 233 TRDFYYL-TYGSIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGG--HAVKLI 289

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E GTPYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 290 GWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 336


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 168/324 (51%), Gaps = 17/324 (5%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   +D +N +A+T W A     +  S     + L    K  D +   LP  R   +  
Sbjct: 65  LSQEIVDYVNTKADTTWKA--EVTSKWSSVAEVKNLCGSLK--DPNGSRLPIMRHKLE-- 118

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +PD FDAR++W  C TI  V D G+C +   F AV A SDR CI SKG  +  +S+
Sbjct: 119 -AVNLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISS 177

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +    G V+GG YG   GC+P +I+PC HH +
Sbjct: 178 EDLLSCCSSC----GMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVN 233

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
              LP C  +  P  KC   C    Y   +  DK+     Y VD++E  I  EI+ +GP 
Sbjct: 234 GTRLP-CSGEG-PTPKCERTCEK-GYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPV 290

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF  YKSGVY+H S  +L    H+ +++GWG E+GTPYWLV N+W   WGD 
Sbjct: 291 EGAFTVYADFPTYKSGVYQHVSGGELGG--HAIRVLGWGVEDGTPYWLVANSWNSDWGDN 348

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KILRG+ EC  E  I AG PK
Sbjct: 349 GFFKILRGQNECGIEGEIVAGLPK 372


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 116/260 (44%), Positives = 142/260 (54%), Gaps = 9/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VPD FD+REQWP+C TI  V D GAC +   F AV A SDR CIKS+G+    +S E + 
Sbjct: 4   VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC+ C       C+ G     W+    +G VTGG Y    GCQP  I+ C HH      
Sbjct: 64  SCCETC----GMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLK 119

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C+    P  KC  +C    Y   +  DKH     Y V  +   I+KEI+ +GP    F
Sbjct: 120 P-CKGDS-PTPKCERKC-EAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAF 176

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y DF  YKSGVY+HTS + L    H+ K++GWG ENGTPYWLV N+W   WGD G  K
Sbjct: 177 TVYADFPTYKSGVYQHTSGSALGG--HAIKILGWGEENGTPYWLVANSWNSDWGDEGFFK 234

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           I RG  EC  E  I  G PK
Sbjct: 235 IKRGNDECGIESGIVGGLPK 254


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 118/325 (36%), Positives = 173/325 (53%), Gaps = 27/325 (8%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD +I+ +  +A+TW  GRNF  ++SEEY+R  +     + D     LP  R      Y
Sbjct: 23  LSDEFIELVRSKASTWQVGRNFKESVSEEYIRGLM---GVHPDAHKFALPEKRIVLGDLY 79

Query: 81  S---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           +     +P+ FDAR+ WPNC TIG + D G+C +   F AV A SDR CI S+G+ N  L
Sbjct: 80  ADDGIDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHL 139

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SCC IC +     C+ G     W++  ++G V+GG YG   GC+P  I+PC HH
Sbjct: 140 SADDLVSCCHICGF----GCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEHH 195

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            +  T P C +   P   C  +C   +Y   + +DK+  + +Y V  N   I++EI+ +G
Sbjct: 196 VNG-TRPPCSHGSTP--SCQHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNG 251

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--ENGTPYWLVINTWGPH 315
           P    F +Y+D   YKSGVY+H    +L    H+ +++GWG   E+  PYWL+ N+W   
Sbjct: 252 PVEGAFTVYEDLILYKSGVYQHEHGKELGG--HAIRILGWGVWGESKVPYWLIGNSWNTD 309

Query: 316 WGDRGTVKILRGKYECAFEYLIAAG 340
           WGD            C  E  I+AG
Sbjct: 310 WGDND---------HCGIESSISAG 325


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 131/353 (37%), Positives = 175/353 (49%), Gaps = 32/353 (9%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           + +L  +     V  + Y     +ID IN  A TW AG NF  N  +EY  + L   +K 
Sbjct: 5   LMLLSVIFVSVYVTEQAYFLQKDFIDNINNHATTWKAGVNFDPNTPKEYFLKML--GSKG 62

Query: 62  FDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
               D+      KT+D  Y      +P  FDAR++W  C TIG V D G C +    A  
Sbjct: 63  VQIPDKHNIHMYKTHDAAYDNLFGRIPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATS 122

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ +    N  LS E +  CC  C Y     C+ G   + W   + RG VTGG
Sbjct: 123 SAFADRLCVATNADFNELLSAEEITFCCSSCGY----GCNGGYPIKAWESFNNRGLVTGG 178

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRT 236
           DY    GC+P  + PC +   A    +C  +  P+ K H RCT   YG     + D HR 
Sbjct: 179 DYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PREKNH-RCTRTCYGNQDLDYNDDHRF 233

Query: 237 T-----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL-- 289
           T     LTY       +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA   +YL  
Sbjct: 234 TRDSYYLTY------SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENA---SYLGG 284

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           H+ KLIGWG E+G  YWL++N+W   WGD G  KI RG  EC  +     G P
Sbjct: 285 HAVKLIGWGEEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 170/334 (50%), Gaps = 21/334 (6%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           +L R  L   S   ++ IN+   TW AG NF  ++   Y+R+               L G
Sbjct: 16  SLARPHLQPLSKEMVNYINKMNTTWKAGHNF-RDVDYSYVRRLC----------GTMLKG 64

Query: 72  DRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
            +     +Y+    +P +FD+REQWP C T+  + D G+C +   F A  A SDR CI S
Sbjct: 65  PKLPIMVQYAGGLKLPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHS 124

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
             + +  +S+E + +CC  C       C+ G     W+F  K G V+GG Y    GC+P 
Sbjct: 125 GSKVSVEISSEDLLTCCDAC----GMGCNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPY 180

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
           TI PC HH +  + P C  +     KC   C    Y   + +DKH    +Y V+ + + I
Sbjct: 181 TIPPCEHHVNG-SRPHCSGEGGDTPKCVHSC-EAGYSPTYTKDKHYGKSSYSVEASVEQI 238

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           + EI  +GP    F +Y+DF  YKSGVY+HT+ + L    H+ K++GWG E+G PYWL  
Sbjct: 239 QAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGG--HAIKVLGWGEEDGVPYWLCA 296

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+W   WG+ G  KILRG   C  E  I AG PK
Sbjct: 297 NSWNTDWGENGFFKILRGSDHCGIESEIVAGIPK 330


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 132/336 (39%), Positives = 172/336 (51%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI+ IN  A TW AG NF   LS +   + L   +K    + +  P   KT+
Sbjct: 17  QAYFLEEDYINHINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASPDMFKTH 74

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   + W    K G VTGG+Y    GCQP  + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCSGK--PTEKNH-RCTRMCYGNQDLDFKEDHHYTRDAYYL--TYGT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ ++LA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 133/336 (39%), Positives = 171/336 (50%), Gaps = 28/336 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y     YI+QIN  A TW AG NF   LS +   + L   +K    + +      KT+
Sbjct: 17  QAYFLEVDYINQINANAKTWKAGVNFDPKLSIDSFVKLL--GSKGVQAAKQASLVMFKTH 74

Query: 77  DPEY---SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           D  Y   S  +P  FDAR++W  C TIG V D G C +   F    AF+DR CI + G+ 
Sbjct: 75  DEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEF 134

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS E +A CC  C +     CS G   R W    K G VTGG+Y    GCQP  + P
Sbjct: 135 NELLSPEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPP 190

Query: 194 C--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDA 248
           C    +G+     +C  +  P  K H RCT   YG     F +D H T   Y++      
Sbjct: 191 CPLDEYGNN----TCSGK--PAEKNH-RCTQMCYGNQNLDFKEDHHYTRDAYYL--TYGT 241

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYW 306
           I+ ++LA+GP  A+F +YDDF  YKSGVY    NA    YL  H+ KLIGWG E G PYW
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENA---TYLGGHAVKLIGWGEEYGVPYW 298

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 169/336 (50%), Gaps = 16/336 (4%)

Query: 12  TLVRGELYKFSDA-YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
           T +  E    SD   ID +N     W A  N   NL    ++  L+         D    
Sbjct: 54  TKIAPEAENLSDQELIDYVNSHQTLWKAEMN-KFNLYSNTVKYGLLGVNNMKQSVD---- 108

Query: 71  GDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
             +K   P   +T+  P+ FDAR+ WP C ++ +V D  +C +    AAV A SDR CI 
Sbjct: 109 -GKKNLSPTRHSTIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIM 167

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           SKG++   LS + + SCCK C +     C  G     W +   RG VTG +Y + +GC+P
Sbjct: 168 SKGKKQVTLSADDLLSCCKTCGF----GCFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRP 223

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
               PC HH +      C++   P  KC  +C +  YG+ +  DK+     Y V+ N ++
Sbjct: 224 YPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNYGKSYKADKYYGEQVYNVESNVES 282

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
           I+KEI+  GP  A+F +Y DF +Y  G+YKH + +      H+ K++GWG + G PYWL 
Sbjct: 283 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGG--HAVKVLGWGIDQGVPYWLA 340

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            N+W   WG+ G  +ILRG  EC  E  I AG PK 
Sbjct: 341 ANSWNTDWGEDGYFRILRGVNECGIESGIIAGIPKQ 376


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 122/346 (35%), Positives = 177/346 (51%), Gaps = 24/346 (6%)

Query: 1   MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
           +  ++  L     ++ E +K  SD  I  IN   N  W A         E+  R   + D
Sbjct: 8   IASLITHLDAHISIKNEKFKPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58

Query: 59  AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
           A+    + R  P  R+   P     E++  +P  FD+R++WP C +I  + D   C +  
Sbjct: 59  ARIQMGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR CI+S G+QN  LS   + SCC+ C       C  G +   W+F  K G
Sbjct: 119 AFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESC----GLGCEGGILGPAWDFWVKEG 174

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VTG    + TGC+P     C HH +    P C ++     +C   C    Y   + QDK
Sbjct: 175 IVTGSSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYKTPRCKQTCQK-KYKTPYTQDK 232

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           HR   +Y V ++E AI+KEI+ +GP  A+F +Y+DF +YKSG+YKH +   L    H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           +IGWG EN TPYWL+ N+W   WG+ G  +I+RG+ EC  E  + A
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 122/333 (36%), Positives = 173/333 (51%), Gaps = 28/333 (8%)

Query: 24  AYIDQINREANTWTA----GRNFPANL--SEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           + +D++N + N WTA    GR +  +L  +++    FL    +           + K Y 
Sbjct: 44  SLVDEVNSKQNLWTASTEQGRFYGRSLGDAKKLCGTFLNGTEEL----------EEKVYP 93

Query: 78  PEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            E    +PD FDAR+ +  C   IGHV D  AC +   F  V AF+ R CIKS G+ N+ 
Sbjct: 94  AEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQL 153

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT------GCQPST 190
           LS   + +CC I  +  +  CS G+   +W FLH  G V+GG +          GC P  
Sbjct: 154 LSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYN 213

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
              C+HH        C  +      C + C N  YG  F +D+H T   +     +  +I
Sbjct: 214 FPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSI 273

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           KKEI+ +GPT+A F++Y+DF  YKSGVYKHTS   L    H+ ++IGWGTE G  YWLV+
Sbjct: 274 KKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGG--HAVEIIGWGTEKGVDYWLVM 331

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           N+W   WGD GT KI++G  +C  + +I AG P
Sbjct: 332 NSWNEEWGDHGTFKIVQG--DCGIDDMILAGTP 362


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 24/348 (6%)

Query: 1   MIHILVFLLGCTLVRGELYK-FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIAD 58
           +  ++ FL     V+ E ++  SD  I  IN   N  W A         E+  R   + D
Sbjct: 8   IASLITFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRA---------EKSNRFHSLDD 58

Query: 59  AKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
           A+    + R  P  R+   P     +++  +P  FD+R++WP C +I  + D   C +  
Sbjct: 59  ARIQMGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV A SDR CI+S G+QN  LS   + +CC+ C       C  G +   W++  K G
Sbjct: 119 SFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCESC----GLGCEGGILGPAWDYWVKEG 174

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VT     + TGC+P     C HH +    P C ++     +C   C    Y   + QDK
Sbjct: 175 IVTASSKENHTGCEPYPFPKCEHH-TKGKYPPCGSKIYNTPRCKQTCQR-KYKTPYTQDK 232

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           HR   +Y V ++E AI+KEI+ +GP  A+F +Y+DF +YKSG+YKH +   L    H+ +
Sbjct: 233 HRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGG--HAIR 290

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           +IGWG EN TPYWL+ N+W   WG+ G  +I+RG+ EC+ E  + AG+
Sbjct: 291 IIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAGR 338


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 173/348 (49%), Gaps = 35/348 (10%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           + VF   C+    + Y  +  YI  IN  A TW AG NF      +++   L +      
Sbjct: 10  VFVFFSSCSE---QTYFLNKDYISTINSVAKTWKAGINFHPETPLKFILGLLGSKG---- 62

Query: 64  QSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
             D    G  K++DP YS    +P+ FDAR++W NC TIG + D G C +   F+  GAF
Sbjct: 63  -VDVSSAGPFKSHDPLYSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAF 121

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR CI S G  N+ LS E+V SCC  C       C  G   R W +  K G VTGG++ 
Sbjct: 122 ADRLCIASNGSFNQLLSAEHVTSCCYRC----GLGCQGGYPIRAWRYYSKHGLVTGGNFN 177

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTY-GRGFFQDKHRTT 237
              GCQP    PC+ +       SC  Q     KC  +C   T+ +Y G   + ++    
Sbjct: 178 SFEGCQPYMFPPCTGNN------SCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYV 231

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
           L Y      D ++ +I+ +GP  ++F +YDDF  YKSGVY  + NA    YL  HS K I
Sbjct: 232 LAY------DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNA---TYLGGHSVKCI 282

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GWG E    YWL++N+W   WGD G  KI RG  EC  E    AG P+
Sbjct: 283 GWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAGMPE 330


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 165/323 (51%), Gaps = 21/323 (6%)

Query: 26  IDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           ID IN +ANT W AG+N  F   LS +     L      F+     LP           A
Sbjct: 29  IDYINNKANTTWRAGKNKRFTDALSAKSQMGSL------FNPGGSMLPTKSFYLSSTQKA 82

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FDAR+ WP+C TIG + D G C +   F A  A SDR CI S+G++   +S + +
Sbjct: 83  ALPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDL 142

Query: 143 ASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
            SCC + C +     C+ G     W +    G V+GG YG   GC+P  I PC HH S  
Sbjct: 143 LSCCGLFCGF----GCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEHHTSG- 197

Query: 202 TLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             P C+ N K PK  C  +C     G+ +  DKH  +  Y V  +E+ I  EIL +GP  
Sbjct: 198 NRPDCKGNSKTPK--CQRQCVESFDGK-YQADKHFASNVYNVRASEEDIMNEILVYGPVE 254

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y DF  YKSGVY+H     L    H+ K++GWG ENG PYWL  N+W   WGD G
Sbjct: 255 ADFIVYADFLTYKSGVYQHVKGGFLGG--HAVKILGWGEENGVPYWLCANSWNTDWGDGG 312

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KILRG   C  E  I AG PK
Sbjct: 313 FFKILRGYNHCKIEADINAGIPK 335


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 129/324 (39%), Positives = 164/324 (50%), Gaps = 15/324 (4%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W A  +     S   +R+ L A     D +   LP     Y P 
Sbjct: 33  LSSELIHFINHEANTTWKAAPSSRFK-SVSDIRRMLGALP---DPNGGYLPTLCTGYTPS 88

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR+ WP+C +I  + D  +C +   F AV A SDR CI+SKG     LS 
Sbjct: 89  LD-ELPKEFDARKHWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSA 147

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + +CC  C       C+ G     W++  + G VTG  Y    GCQP    PC HH  
Sbjct: 148 ENLVACCSSC----GMGCNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEHHVV 203

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P  PSC    V   KC T C  P Y   + +DK      Y V  N++AI KE++ HGP 
Sbjct: 204 GPR-PSC-GGDVETPKCKTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDHGPV 260

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF +YKSGVY+H S   L    H+ +L+GWG ENG PYWL+ N+W   WGD 
Sbjct: 261 EVDFEVYADFPNYKSGVYQHVSGGLLGG--HAVRLLGWGEENGVPYWLIANSWNSDWGDN 318

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI+RG+ EC  E  + AG PK
Sbjct: 319 GYFKIIRGRNECGIESDVNAGIPK 342


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 128/335 (38%), Positives = 170/335 (50%), Gaps = 27/335 (8%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
           + Y   + YI QIN  A TW AG NF   LS +     L   +K    + +  P   KT 
Sbjct: 17  QAYFLEEDYIKQINANAKTWEAGVNFDPKLSIDSFVNLL--GSKGVQAAKKASPDMFKTG 74

Query: 77  DPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           D  Y+    +P  FDAR++W  C +IG V D G C +   F    AF+DR CI ++G+ N
Sbjct: 75  DKAYNLAQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEFN 134

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS E +  CC  C +     C+ G   R W    K G VTGG+Y    GCQP  + PC
Sbjct: 135 ELLSAEELTFCCHKCGF----GCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPC 190

Query: 195 --SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAI 249
               +G+     +C  + + K   + RCT   YG     F  D H T   Y++      I
Sbjct: 191 PLDEYGNN----TCHGKPMEK---NHRCTRMCYGDQDLDFNNDHHYTRDAYYL--TYGTI 241

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWL 307
           + ++L +GP  A+F +YDDF  YKSGVY  T NA   +YL  H+ KLIGWG E G PYWL
Sbjct: 242 QNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENA---SYLGGHAVKLIGWGEEYGVPYWL 298

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 299 LVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 333


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 162/321 (50%), Gaps = 17/321 (5%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           I+ INR   TW AG N   +  E+ +    +      + S   LP   +T D      +P
Sbjct: 66  IEYINRLNTTWKAGHNSGYDNPEDVIPLLGVRP----ENSRYRLP--ERTLDVSALRVLP 119

Query: 86  DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP---LSTEYV 142
           + FDARE WP+C TI  + D G+C +   F AV A SDR CI S   + R    L+ + V
Sbjct: 120 ENFDAREHWPDCPTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDV 179

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C       C+ G     W++   +G VTGG+Y    GC P  I  C HH +  T
Sbjct: 180 LSCCTEC----GAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPYPIKACDHHVNG-T 234

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           L  C+    P  +C   C    Y   F  DKH     Y V      I+ EI+ +GP  A 
Sbjct: 235 LGPCDKTIPPTPRCVRMCRK-GYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEAD 293

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF HYKSGVY+  +++ L    H+ +L+GWG ENG PYWL  N+W   WGD+G  
Sbjct: 294 FTVYEDFLHYKSGVYQRHTDSALGG--HAIRLLGWGVENGVPYWLAANSWNTEWGDKGFF 351

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG  EC  E  I AG PK
Sbjct: 352 KILRGSDECGIESDIVAGLPK 372


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 122/342 (35%), Positives = 169/342 (49%), Gaps = 17/342 (4%)

Query: 4   ILVFLLGCTLVRGELYK--FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           IL  LL C      +     SD +ID IN    TW AGRNF  N  ++YL+   +A    
Sbjct: 5   ILFSLLICGTFSASIPTDPLSDEFIDYINTLQTTWRAGRNFAPNTPKKYLKS--LAGVHK 62

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              +   LP  + + D     T+PD FDAR+QWPNC +I  + D G+C +      +   
Sbjct: 63  NANNAFTLPKRKVSLD----VTIPDEFDARKQWPNCPSITDIRDQGSCGSCWALELLRLC 118

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
                  S G+    LS E + +CC  C       C  G     W +    G V+GG+YG
Sbjct: 119 LIVFVSHSNGKLQVHLSAENLVTCCGSC----GAGCFGGDPGSAWEYWRDVGIVSGGNYG 174

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
            + GCQP +I+PC HH    + P C  +      C  +C    Y   + +D H     Y 
Sbjct: 175 SKEGCQPYSIAPCEHHIPG-SRPPCRGEG-HTADCRKQCEK-GYSIPYDKDLHYAEFVYS 231

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
            + +   I+ EIL +GP  A F +Y+D   YK GVYKH + A +    H+ K++GWG EN
Sbjct: 232 TERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGG--HAIKILGWGVEN 289

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GTPYWL+ N+W   WG+ G  KILRG  EC  E  ++AG P+
Sbjct: 290 GTPYWLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAGLPR 331


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  207 bits (526), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 120/332 (36%), Positives = 166/332 (50%), Gaps = 21/332 (6%)

Query: 14  VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
            R  L   S   ++ IN+   TW AG NF  N+   Y+++               L G +
Sbjct: 18  ARPRLKPLSSEMVNYINKVNTTWKAGHNF-HNVDFSYVQRLC----------GTMLKGPK 66

Query: 74  KTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
                +Y+    +P  FD+REQWPNC T+  + D G+C +   F A  A SDR CI S  
Sbjct: 67  LPIMVQYAGDMKLPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNA 126

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
           + +  +S E + +CC  C       C+ G     W+F  K G V+GG Y    GC+P TI
Sbjct: 127 KVSVEISAEDLLTCCDSC----GMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTI 182

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
            PC HH +  + P C  +     +C ++C    Y   + +DKH    +Y V  +E  I+ 
Sbjct: 183 PPCEHHVNG-SRPPCTGEGGDTPQCLSQC-EAGYTPSYREDKHYGKTSYSVLSDEAEIQY 240

Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
           EI  +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG ENG PYWL  N+
Sbjct: 241 EIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGG--HAIKVLGWGEENGVPYWLCANS 298

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           W   WGD G  K LRG   C  E  I AG PK
Sbjct: 299 WNTDWGDNGFFKFLRGSDHCGIESEIVAGIPK 330


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 174/348 (50%), Gaps = 35/348 (10%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           + VF   C+    + Y  +  YI  IN  A TW AG NF      +++   L +      
Sbjct: 10  VFVFFSSCSE---QTYFLNKDYISTINSVAKTWKAGINFHPETPLKFILGLLGSKGVEVS 66

Query: 64  QSDRPLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            +     G  K++DP YS T  +P+ FDAR++W NC TIG + D G C +   F+  GAF
Sbjct: 67  SA-----GPFKSHDPLYSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAF 121

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR CI S G  N+ LS E+V SCC  C       C  G   R W +  K G VTGG++ 
Sbjct: 122 ADRLCIASNGSFNQLLSAEHVTSCCYRC----GLGCQGGYPIRAWRYYSKHGLVTGGNFN 177

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTY-GRGFFQDKHRTT 237
              GCQP    PC+ +       SC  Q     KC  +C   T+ +Y G   + ++    
Sbjct: 178 SFEGCQPYMFPPCTGNN------SCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYV 231

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
           L Y      D ++ +I+ +GP  ++F +YDDF  YKSGVY  + NA    YL  HS K I
Sbjct: 232 LAY------DNMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNA---TYLGGHSVKCI 282

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GWG E    YWL++N+W   WGD G  KI RG  EC  E    AG P+
Sbjct: 283 GWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAGVPE 330


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 166/346 (47%), Gaps = 35/346 (10%)

Query: 4   ILVFLLGCTLV-RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           I+  LL   L  +G     S+ +I+ IN + +TW AG+NF  NLS + ++  L A     
Sbjct: 6   IITLLLPIVLSYKGSPNPLSNDFINYINSKQSTWVAGKNFDENLSIQEIKNLLGAKKGKL 65

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAF 121
                   G  K +       VP+ FDARE W  C   I  V D   C +    AA  A 
Sbjct: 66  --------GVAKEFTHSEDIQVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAM 117

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDRRCI S+G+   P+S E + SCC  C Y     C  G     W++    G  TGG YG
Sbjct: 118 SDRRCIASQGKLKVPVSAENLLSCCDSCGY----GCEGGYPTMAWSYWIDTGITTGGLYG 173

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP--------TYGRGFFQDK 233
            + GCQP ++ PC HH     +  C         C  +C +         T+G G  ++ 
Sbjct: 174 SKQGCQPYSLQPCEHHTEGNKV-QCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNF 232

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           +              I+KEIL +GP  A F +Y DF +YKSGVY+H +   L    H+ +
Sbjct: 233 YSVA----------NIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGG--HAVR 280

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           ++GWG E+G PYWLV N+W   WGD+G  KI RG  E  FE  I A
Sbjct: 281 ILGWGEESGVPYWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIVA 326


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 177/347 (51%), Gaps = 20/347 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +     +  + Y     +ID IN +A TW AG NF    S+E++ + L   ++
Sbjct: 4   VLMLLSVIFVSVYMTEQAYFLEKDFIDNINAQATTWKAGVNFDPKTSKEHIMKLL--GSR 61

Query: 61  YFDQSDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
                ++      K+ D EY  T +P  FDAR +W +C TIG V D G C +    A   
Sbjct: 62  GVQIPNKNNMNLYKSEDAEYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSS 121

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR C+ +    N  LS E +  CC  C +     C+ G   + W    K+G VTGGD
Sbjct: 122 AFADRLCVATNADFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKRFSKKGLVTGGD 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
           Y    GC+P  + PC +        +C  +    ++ + RCT   YG     F + HR T
Sbjct: 178 YKSGEGCEPYRVPPCPNDDQGNN--TCAGKP---MESNHRCTRMCYGDQDLDFDEDHRYT 232

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
             Y+      +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA   +YL  H+ KLI
Sbjct: 233 RDYYYL-TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---SYLGGHAVKLI 288

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E G PYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 289 GWGEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVDNSTTAGVP 335


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 178/346 (51%), Gaps = 22/346 (6%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           +L  +L    +  + Y   + YI++IN +A TW AG NF     +E++ + L +      
Sbjct: 7   LLSVILFSVYMTEQAYFLEEDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGVQIP 66

Query: 64  Q--SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              + +    + + YD  +   +P +FDAR++W NC TIG + D G C +    A   AF
Sbjct: 67  SKLNHKMYKSEDENYDNLF-GRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAF 125

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR C+ S    N+ LS E +  CC  C +     C+ G   + W    K G VTGGDY 
Sbjct: 126 ADRLCVVSNEDFNQLLSAEELTFCCHKCGF----GCNGGYPIKAWEHFKKHGLVTGGDYK 181

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTL 238
              GC+P  + PC +  S     +C  +    ++ + RCT   YG     F +D   T  
Sbjct: 182 SGEGCEPYRVPPCPYDESGNN--TCAGKP---MEANHRCTRMCYGDQDLDFDEDHRYTRD 236

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIG 296
           +Y++     +I+K++L +GP  A+F +YDDF  YKSGVY  + NA   +YL  H+ KLIG
Sbjct: 237 SYYL--TYGSIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENA---SYLGGHAAKLIG 291

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG E G PYWL++N+W   WGD G  KI RG  EC  +     G P
Sbjct: 292 WGEEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGIDNSTTGGVP 337


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 174/349 (49%), Gaps = 38/349 (10%)

Query: 2   IHILVFLL--GCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF----- 54
           ++ L+FLL    ++ R E+   S  +ID IN++ + W A RNFP N + EYL +      
Sbjct: 1   MYFLIFLLLASISVSRAEIDIQSQDFIDSINQKQSHWVARRNFPENTTNEYLYKLNGFLG 60

Query: 55  LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           L  D  Y  +  +        ++P+    +P  FDAR++WP C ++  + D G+C +   
Sbjct: 61  LHPDPNYMPEKIK------HNFNPQ---DIPKTFDARKKWPKCDSLNRIRDQGSCGSCWA 111

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           FAAV   SDR CI S G +    S E + SCC  C      SCS G +   ++F  K+G 
Sbjct: 112 FAAVETMSDRICIHSSGAKKFFFSAEDLLSCCTAC-----GSCSGGYMMAAFDFYIKQGV 166

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GGD     GC+P T      H    T PSC           T+     Y   +  DKH
Sbjct: 167 VSGGDLNSNEGCRPYTADA---HDKGVT-PSC-----------TKSCRKGYPTSYSSDKH 211

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
             +  Y VD     I+ EI+ +GP   +F +Y DFY+Y SGVY H S     N  H  K+
Sbjct: 212 YGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGN--HIVKI 269

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWGTE    YWL+ N+WG  WG+ G  KILRGK EC  E    A  PK
Sbjct: 270 VGWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYAVLPK 318


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 174/344 (50%), Gaps = 25/344 (7%)

Query: 5   LVFLLGCTL----VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           L  +LG  L     R  L   S   ++ IN+   TW AG NF  N+   Y+++       
Sbjct: 5   LFLVLGSGLSISWARPHLPPLSHEMVNFINKANTTWKAGHNF-HNVDYSYVKRLC----- 58

Query: 61  YFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                   L G + +   +Y+    +P  FD R QWPNC T+  V D G+C +   F A 
Sbjct: 59  -----GTLLKGPKLSTMVQYTEDMELPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAA 113

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI S  + +  +S+E + SCC+ C       C+ G      +F  K G V+GG
Sbjct: 114 EAISDRVCIHSNAKVSVEISSEDLLSCCESC----GMGCNGGYPSAACDFWTKEGLVSGG 169

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +  T P C+ ++    +C  +C  P Y  G+ QDKH    
Sbjct: 170 LYDSHIGCRPYSIPPCEHHVNG-TRPPCKGEEGDTPQCTNQC-EPGYTPGYKQDKHFGKR 227

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y V  +E  I KE+  +GP    F +Y+DF  YKSGVY+H S + +    H+ K++GWG
Sbjct: 228 SYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGG--HAIKVLGWG 285

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            E G PYWL  N+W   WG+ G  KI+RG+  C  E  + AG P
Sbjct: 286 EEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGIP 329


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  ID +N   N WTA   R F +   E    ++ +    +   S +      KT D + 
Sbjct: 43  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 102

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+R+ WP C +I  + D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 103 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 160

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCK C +     C+ G     W +  K G VTG +Y    GC+P    PC HH   
Sbjct: 161 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 216

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  +C +    + + +DK      Y V D+ +AI+KE++ HGP  
Sbjct: 217 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 276

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KLIGWG ++G PYW V N+W   WG+ G
Sbjct: 277 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 334

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 335 FFRILRGVDECGIESGVVGGIPK 357


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  ID +N   N WTA   R F +   E    ++ +    +   S +      KT D + 
Sbjct: 44  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+R+ WP C +I  + D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 104 D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCK C +     C+ G     W +  K G VTG +Y    GC+P    PC HH   
Sbjct: 162 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 217

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  +C +    + + +DK      Y V D+ +AI+KE++ HGP  
Sbjct: 218 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 277

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KLIGWG ++G PYW V N+W   WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 335

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGIPK 358


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 115/323 (35%), Positives = 160/323 (49%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  ID +N   N WTA   R F +   E    ++ +    +   S +      KT D + 
Sbjct: 34  DDLIDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 93

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+R+ WP C +I  + D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 94  D--IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 151

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCK C +     C+ G     W +  K G VTG +Y    GC+P    PC HH   
Sbjct: 152 DLLSCCKSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKK 207

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  +C +    + + +DK      Y V D+ +AI+KE++ HGP  
Sbjct: 208 THFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLE 267

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KLIGWG ++G PYW V N+W   WG+ G
Sbjct: 268 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIDDGIPYWTVANSWNTDWGEDG 325

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 326 FFRILRGVDECGIESGVVGGIPK 348


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/348 (34%), Positives = 177/348 (50%), Gaps = 17/348 (4%)

Query: 1   MIHILVFLLGC--TLVRGELYK------FSDAYIDQINREANTWTAGRNFPANLSEEYLR 52
           + +IL+F L C  + V G  +       + +     IN    TW AGRN        ++ 
Sbjct: 12  LFYILLFSLPCFYSTVFGIPFGSRNQRLYFNKMATYINNLQTTWKAGRNPYFETVPSHVI 71

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
           Q ++   +        +P    +Y+      +P  FD+R+QWP C TIG + D   C + 
Sbjct: 72  QGMMGVRRSSKLETNSIPLPVISYE-HIDMEIPVEFDSRKQWPYCPTIGEIRDQSNCGSC 130

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F AV A SDR CI + G+Q   +S+  + SCCKIC +     C  G   + W+F  K 
Sbjct: 131 WAFGAVEAISDRICIATDGRQKPHISSTDLLSCCKICGF----GCQGGDPHQAWSFWVKY 186

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG+Y    GC+P   +PC+HH +    P C +   P   C   C + TY   + +D
Sbjct: 187 GLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVCKKACQS-TYKIQYNKD 244

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           K+     Y + +    ++KE++ +GP    F +Y+DF  YK+GVY+H + + L    H+ 
Sbjct: 245 KYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGG--HAV 302

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +L+GWG ENG PYWL+ N+W   WGD+G  KI RG+ EC  E    AG
Sbjct: 303 RLLGWGEENGVPYWLLANSWNTEWGDKGFFKIYRGRNECGIESEAVAG 350


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 112/349 (32%), Positives = 171/349 (48%), Gaps = 20/349 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFS-DAYIDQINREANTWTAG-----RNFPANLSEEYLRQF 54
           ++ +  F+     + G+  + + D  +D +N+  N +TA        +P  +    +   
Sbjct: 11  LVAVAAFVPQSERILGKNVELTGDDLVDYVNKAQNLFTAKLSPRFSEYPTAIKRRLMGSK 70

Query: 55  LIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            +A    +  ++        T+D    + +P  FD+R QWPNC +I  + D  +C +   
Sbjct: 71  YVAIPSKYRVNEV-------THDDIDDSAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWA 123

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F A  A +DR CI SKG     +S + + SCC  C +     C  G  +  WN+  ++G 
Sbjct: 124 FGAAEAMTDRICIASKGAIQFTVSADDLLSCCDECGF----GCDGGFPYAAWNYWVEKGI 179

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y  ++GC+P    PC HH +      C     P   C  +C +  Y   +  DK 
Sbjct: 180 VSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQS-GYATAYTNDKR 238

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y V     AI+KEI+ HGP    + +Y+DF HY  G+YKHT+ + L    H+ K+
Sbjct: 239 YGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGG--HAVKM 296

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           IGWGTENG PYW+  N+W   WG+ G  +ILRG  EC  E  + AG PK
Sbjct: 297 IGWGTENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVVAGLPK 345


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 111/299 (37%), Positives = 158/299 (52%), Gaps = 14/299 (4%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           W AGRNFP +    ++++ +       D +   LP  + T+D +  A++P+ FD R++WP
Sbjct: 1   WRAGRNFPIHTPFAHIKKLM---GSLKDDNILKLP--KVTHDADLIASLPENFDPRDKWP 55

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
           +C T+  + D G+C +   F AV A +DR CI S   ++   S E + SCC IC      
Sbjct: 56  DCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPICGL---- 111

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C+ G     W +    G V+GG+Y    GC+P  I PC HH     +P   + K P  K
Sbjct: 112 GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTP--K 169

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C   C + +Y   F +DK      Y V  +ED IK E+  +GP    F +Y D   YKSG
Sbjct: 170 CEKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSG 228

Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           VY+HT    L    H+ K++GWG ENG+ YWL+ N+W   WGD G +KILRG+  C  E
Sbjct: 229 VYQHTHGNALGG--HAIKILGWGVENGSKYWLIANSWNSDWGDNGFLKILRGEDHCGIE 285


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 124/342 (36%), Positives = 173/342 (50%), Gaps = 22/342 (6%)

Query: 6   VFLLGCTLVRGEL--YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           V L    L  G L  +  SD +I+ IN +  TW AGRNF  +     +++ L    K  +
Sbjct: 9   VVLATIALSYGGLNPHPLSDEFINAINSKKTTWKAGRNFDIHTPLANIKKLLGVLPKKAN 68

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAAPHIFAAVGAFS 122
                L    K +  + +A +P+ FDARE WP C +I G + D  +C +   F A  A S
Sbjct: 69  ARQLEL----KVHSVDVNA-IPESFDAREAWPECASIIGDIRDQASCGSCWAFGAAEAMS 123

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S       +STE + +CC    Y+    C+ G     W +  + G VTGG Y  
Sbjct: 124 DRICIHSNATVKVSISTEDLNTCC----YECGDGCNGGWPAEAWAYWAETGIVTGGKYET 179

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           + GC+  T+ PC HH +   LP+C    VP  +C   C         ++   R    Y  
Sbjct: 180 KDGCKAYTVPPCEHH-TEGDLPAC-GDIVPTPQCKKECDAGVDIE--YKSDLRKGSAYQT 235

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTE 300
             +E  I+ EI+ +GP  A F +Y+DF +YKSGVY+ T+     NY   H+ K++GWG E
Sbjct: 236 SSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTG----NYAGGHAIKILGWGVE 291

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +GTPYWL  N+W   WGD+G  KILRG+ EC  E  I  G P
Sbjct: 292 DGTPYWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIP 333


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 180/346 (52%), Gaps = 17/346 (4%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
           +I  +  L    L   E+     SD  I  IN+  +  WTA R+      E+     ++ 
Sbjct: 8   IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64

Query: 58  DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
            A + D+  R     R T D +  S  +P  FD+R++W  C +I ++ D   C +   FA
Sbjct: 65  GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGSCWAFA 122

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI+SKG+++  LS   + SCC  C       C  G     W++  + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           G    + TGCQP     C HH +    P C  +     KCH +C    Y   + +DK+  
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYGKDKYYG 236

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
            ++Y V +NE+AIKKEI+ HGP  A F ++ DF +YKSG+YK+ + A++    H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIG 294

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG E  TPYWL+ N+W   WG++G  +ILRGK EC  E  +  G P
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 180/346 (52%), Gaps = 17/346 (4%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
           +I  +  L    L   E+     SD  I  IN+  +  WTA R+      E+     ++ 
Sbjct: 8   IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64

Query: 58  DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
            A + D+  R     R T D +  S  +P  FD+R++W  C +I ++ D   C +   FA
Sbjct: 65  GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFA 122

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI+SKG+++  LS   + SCC  C       C  G     W++  + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           G    + TGCQP     C HH +    P C  +     KCH +C    Y   + +DK+  
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYG 236

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
            ++Y V +NE+AIKKEI+ HGP  A F ++ DF +YKSG+YK+ + A++    H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIG 294

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG E  TPYWL+ N+W   WG++G  +ILRGK EC  E  +  G P
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 171/345 (49%), Gaps = 14/345 (4%)

Query: 1   MIHILVFLLGCTL-VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M  IL+ L+G      G      D  I  +N +  TWTAG   PA LS   + + L+ DA
Sbjct: 1   MRKILICLIGVLFQADGVPPSEIDRIIHYVNSQKTTWTAG--IPA-LSRNSMLKTLVTDA 57

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
                  +     +   D      +   FDARE+WP C +I  + D   C     FAA  
Sbjct: 58  ATIGFKIQNFGVSQANSD------LSPSFDARERWPECMSIPQINDISECKTSWAFAAAE 111

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           + SDR CI S G +N  LS E + SCC    +   + C  G+ F+ W ++ K G  TGG 
Sbjct: 112 SMSDRLCINSGGFKNTILSAEELLSCCT-GMFSCGEGCEGGNPFKAWQYIQKHGIPTGGS 170

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTL 238
           Y  + GC+P +I PC       T P+C N   P   C  +CT+   Y     +D+H    
Sbjct: 171 YESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVS 230

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
              + +++  I+ +++ +GP  ATF +YDDF  Y +G+Y H +  K + +L S ++IGWG
Sbjct: 231 VDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRIIGWG 288

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
              G PYWL  N+WG  WG+ GT ++LRG  EC  E    +G PK
Sbjct: 289 VWQGVPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMPK 333


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 182/348 (52%), Gaps = 17/348 (4%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
           +I  +  L    L   E+     SD  I  IN+  +  WTA R+      E+     ++ 
Sbjct: 8   IISFMSILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLED---ARILL 64

Query: 58  DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
            A + D+  R     R T D +  S  +P  FD+R++W  C +I ++ D   C +   FA
Sbjct: 65  GAMHEDEELRK--KRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFA 122

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI+SKG+++  LS   + SCC  C       C  G     W++  + G VT
Sbjct: 123 AVEAMSDRICIESKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVT 178

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           G    + TGCQP     C HH +    P C  +     KCH +C    Y   + +DK+  
Sbjct: 179 GSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYG 236

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
            ++Y V +NE+AIKKEI+ HGP    F ++ DF +YKSG+YK+ + A++    H+ ++IG
Sbjct: 237 RMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGE--HAVRIIG 294

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG E  TPYWL+ N+W   WG++G  ++LRGK EC  E  + +G P++
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLPRD 342


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 176/347 (50%), Gaps = 36/347 (10%)

Query: 4   ILVFLLG-CTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           ++VF+L   + +  +    SD +I+ IN + +TWTAGRNFP +   E+L++   A     
Sbjct: 6   VVVFVLTFSSALSAQNPILSDEFINSINAQQSTWTAGRNFPEDTPIEHLKRLNGALIT-- 63

Query: 63  DQSDRPLPGDRKTY----DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                 L G  +T+     PE    +P+ FD R  W  C ++ ++ + G C +   F +V
Sbjct: 64  ----PDLVGKNQTHVINVIPE---AIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSV 116

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
              +DR CI SKG+     S + + +CC  C     K C  G+ +R + +   +G V+GG
Sbjct: 117 EVMTDRLCIASKGKTKFEFSADDLLACCTAC----GKGCDGGAPYRAFEYWVAKGIVSGG 172

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR-TT 237
           DY    GCQP       + GSA       N   P  KC T+C N  Y   + +DKH  T 
Sbjct: 173 DYNSNEGCQP-------YEGSAFL-----NSVTP--KCSTKCLNSKYTTPYAKDKHYGTD 218

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y    N   I+ EI+ +GP      +Y+DFY YKSGVY+H S   +    H+ K+IGW
Sbjct: 219 FIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGG--HAVKIIGW 276

Query: 298 GTENGTPYWLVINTWGPHWGDR-GTVKILRGKYECAFEYLIAAGKPK 343
           GTE G PYWL+ N+WG  W D  G  KILRGK  C  E  I  G P+
Sbjct: 277 GTEKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTPQ 323


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 125/348 (35%), Positives = 174/348 (50%), Gaps = 17/348 (4%)

Query: 1   MIHILVFLLGCTLVRGELY--KFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIA 57
           ++ ++  L    L   E+     SD  I  IN+  +  WTA R+     S E  R  L A
Sbjct: 8   IVSLMSILTAHILTDNEVQFEPLSDEMIAYINQHPDAGWTASRSDRFK-SVEDARILLGA 66

Query: 58  DAKYFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                 + +      R T D +  S  +P  FD+R++W  C +I ++ D   C     FA
Sbjct: 67  ----MSEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFA 122

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           AV A SDR CI+SKG+++  LS   + SCC  C       C  G     W++  + G VT
Sbjct: 123 AVEAMSDRICIQSKGKKSVELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEEGIVT 178

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           G    + TGCQP     C HH +    P+C  +     KC  +C    Y   + +DK+  
Sbjct: 179 GSSKENHTGCQPYPFPKCEHH-TKGKYPACGEKIYKTPKCQQKCQK-GYKTPYKKDKYYG 236

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
            L+Y V   EDAIKKEI+ HGP  A F +Y DF +YKSG+YKH     +    H+ ++IG
Sbjct: 237 KLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGG--HAVRIIG 294

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG E  TPYWL+ N+W   WG++G  +ILRGK  C  E  + AG P N
Sbjct: 295 WGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGLPHN 342


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 183/351 (52%), Gaps = 26/351 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
           ++ +L  +L    +  + Y     YI++IN +A+TWTAG NF P+   E+ LR   +  +
Sbjct: 4   VLILLSVILFSVYMTEQAYFLEKDYINKINEKASTWTAGFNFDPSTPKEDILR---LLGS 60

Query: 60  KYFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
           K      +      K+ D EY      +P +FDAR++W +C TIG V D G C +    A
Sbjct: 61  KGVQTPSKINHKMYKSEDKEYDNLFGRIPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIA 120

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
              AF+DR C+ +    N+ LS E +  CC  C Y     C+ G   + W    K G VT
Sbjct: 121 TSSAFADRLCVATNADFNQLLSAEEITFCCHKCGY----GCNGGYPIKAWERFKKHGLVT 176

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK--H 234
           GG+Y    GC+P  + PC +  S     +C  + + +   + RCT   YG         H
Sbjct: 177 GGEYKSGEGCEPYRVPPCPYDESGNN--TCSGKPMEQ---NHRCTRMCYGDQDLDFDDDH 231

Query: 235 RTTL-TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HS 291
           R T  +Y++     +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA   +YL  H+
Sbjct: 232 RHTRDSYYLTIG--SIQKDVMTYGPIEASFDVYDDFLSYKSGVYVRSENA---SYLGGHA 286

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            KLIGWG E GTPYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 287 VKLIGWGEEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGVDNSTTAGVP 337


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 109/260 (41%), Positives = 141/260 (54%), Gaps = 8/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  +S E + 
Sbjct: 7   LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 66

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC HH +    
Sbjct: 67  TCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARP 123

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI  +GP    F
Sbjct: 124 PCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF 180

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W   WGD G  K
Sbjct: 181 TVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNADWGDNGFFK 238

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           ILRG+  C  E  I AG P+
Sbjct: 239 ILRGENHCGIESEIVAGIPR 258


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 112/298 (37%), Positives = 154/298 (51%), Gaps = 15/298 (5%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           W+AGRNFP + S  +++     + +Y+      +     T+D E  AT+P+ FD R++WP
Sbjct: 1   WSAGRNFPTHTSFAHIKILREHERRYY------MEVAYVTHDVELIATLPEIFDPRDKWP 54

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
            C T+  + D G+C +   F AV A +DR CI S   ++   S E + SCC IC      
Sbjct: 55  ECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCPIC----GL 110

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C+ G     W +    G V+GG+Y    GC+P  I PC HH     +P   + K P  K
Sbjct: 111 GCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDTKTP--K 168

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C   C + +Y   F +DK      Y V  +ED IK E+  +GP  A F +Y D   YK+G
Sbjct: 169 CQKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNG 227

Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAF 333
           VYKHT    L    H+ K+IGWG EN   YWL+ N+W   WGD G  KILRG+  C  
Sbjct: 228 VYKHTEGNALGG--HAIKIIGWGVENNNKYWLIANSWNSDWGDNGFFKILRGEDHCGI 283


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 109/260 (41%), Positives = 141/260 (54%), Gaps = 8/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  +S E + 
Sbjct: 1   LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC HH +    
Sbjct: 61  TCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARP 117

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI  +GP    F
Sbjct: 118 PCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF 174

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W   WGD G  K
Sbjct: 175 TVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNADWGDNGFFK 232

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           ILRG+  C  E  I AG P+
Sbjct: 233 ILRGENHCGIESEIVAGIPR 252


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 117/270 (43%), Positives = 151/270 (55%), Gaps = 15/270 (5%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E  A +PD FDAR++WP+C TIG V D GAC +   F AV A SDR CI  K Q N  +S
Sbjct: 81  EVPAVIPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVN--IS 138

Query: 139 TEYVASCCKICRYDDNKSCSHG---SVFRTW-NFLHKRGSVTGGDYGDRTGCQPSTISPC 194
            E + SCC+ C       C  G   + +R W + L   G VTGG Y    GCQP TI  C
Sbjct: 139 AENLLSCCETC----GSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKC 194

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH   P      +Q  P   C   C + +Y + +  DKH    +Y +  +  +I+ EI+
Sbjct: 195 DHHEPGPYENCSGSQSTPS--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIM 251

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F++Y DF  Y SGVY+HT+ + L    H+ K++GWGTENG PYWLV N+W P
Sbjct: 252 TNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGG--HAIKILGWGTENGVPYWLVANSWNP 309

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WGD G  KI+RGK EC  E  I AG P+ 
Sbjct: 310 SWGDSGFFKIIRGKDECGIESSIVAGMPEQ 339


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 108/259 (41%), Positives = 145/259 (55%), Gaps = 10/259 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR +W  C +IG V D G C + +  +   A SDR CI S G     LS + + 
Sbjct: 53  LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQIL 112

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC +C       CS G  F +W+F  + G V+GG+YG   GCQP TI PC H  +A   
Sbjct: 113 SCCYLC----GDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETA-VE 167

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +C N+ +   +C  +C NP YG  + +D H+ T  Y V        KEI  +GP TA+F
Sbjct: 168 NACSNKTLFTPECKVQCYNPDYGTRYVKDNHQGT-HYRVP--AYTAMKEIYENGPITASF 224

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y DF +Y+SGVY + S   +     + K++GWG ENGTPYWL  N++  +WGD G VK
Sbjct: 225 YMYQDFVNYQSGVYAYNSGKYVTT--QAVKILGWGEENGTPYWLAANSFNTYWGDNGFVK 282

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ILRG  EC  E  + AG P
Sbjct: 283 ILRGANECYIEEFMYAGLP 301


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 125/328 (38%), Positives = 172/328 (52%), Gaps = 21/328 (6%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           L+  SD  ++ IN++  TW AG NF  N+   YL++               L G +    
Sbjct: 23  LHPLSDELVNFINKQNTTWQAGHNF-FNVEVSYLKKLC----------GTFLGGPKLPRR 71

Query: 78  PEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            E++  +  P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI + G  N 
Sbjct: 72  VEFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNV 131

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            +S E + +CC     D             WNF  K+G V+GG Y    GC+P +I PC 
Sbjct: 132 EVSAEDMLTCCGGQCGDGCNGGYPSGA---WNFWTKKGLVSGGLYDSHVGCKPYSIPPCE 188

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +  + P+C  +     +C   C  P Y   + +DKH    +Y V  +E+ IK EI  
Sbjct: 189 HHVNG-SRPACTGEG-DTPRCSKTC-EPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYK 245

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP    F +Y DF  YKSGVY+HT+   +    H+ +++GWG ENG PYWLV N+W   
Sbjct: 246 NGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGG--HAIRILGWGEENGVPYWLVANSWNTD 303

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD+G  KILRG+  C  E  I AG P+
Sbjct: 304 WGDKGFFKILRGQDHCGIESEIVAGIPR 331


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 128/348 (36%), Positives = 176/348 (50%), Gaps = 28/348 (8%)

Query: 2   IHILVFLLGCTLVRGELYK--FSDAYIDQIN-REANTWTAG----RNFPANLSEEYLRQF 54
           + +LVF+      R + +   FS+A+++  N R+  +W A     +N P     +Y++  
Sbjct: 4   LLVLVFVGAAWSYRFDFHDDYFSEAFVNYHNSRDDVSWKATTENFKNVPYKGRMDYVKSL 63

Query: 55  LIADAKYFDQSDRPLPGDRK--TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
             A+         P P + K    + E    +PD FDAR QWP+C ++  V D GAC + 
Sbjct: 64  CGAN---------PAPPEMKFPVKEIEVPKDLPDTFDARTQWPDCPSLKEVRDQGACGSC 114

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F  V A +DR CI+SKG  N  LS E + SCC+ C       C+ G +   WN+L + 
Sbjct: 115 WAFGCVEAATDRLCIQSKGIVNAHLSAEDLTSCCRTC----GNGCNGGFLEGAWNYLKRD 170

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG Y    GC P  I  C HH      P C+    P  +C   C +  Y   + +D
Sbjct: 171 GIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRCKKECES-GYNNTYSKD 227

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           +H     + V+  E  I  EI+ +GP  A F +Y DF  YKSGVY+H S   L    H+ 
Sbjct: 228 EHHAKTVHAVEGVEQ-IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGG--HAI 284

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           K +GWG E+G  YWLV N+W P WGD G  KILRG+ EC  E  I AG
Sbjct: 285 KTLGWGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAG 332


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 109/261 (41%), Positives = 141/261 (54%), Gaps = 8/261 (3%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  +S E +
Sbjct: 11  NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 70

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC HH +   
Sbjct: 71  LTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 127

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P       PK  C+  C    Y   + +DKH    +Y V D+E  I  EI  +GP    
Sbjct: 128 PPCTGEGDTPK--CNKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGA 184

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F ++ DF  YKSGVYKH +   +    H+ +++GWG ENG PYWLV N+W   WGD G  
Sbjct: 185 FTVFSDFLTYKSGVYKHEAGDVMGG--HAIRILGWGIENGVPYWLVANSWNVDWGDNGFF 242

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG+  C  E  I AG P+
Sbjct: 243 KILRGENHCGIESEIVAGIPR 263


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 162/319 (50%), Gaps = 17/319 (5%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           DA +D +N +   + A    PA   EE   +  I  +K+  +S +P    R     E   
Sbjct: 42  DALVDYVNNQQQLFKAE---PAAAIEEL--RMKIMKSKFISRSKKP----RVDEIGEEGF 92

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PD FDAR QWP+C +I ++ D   C +   F +  A SDR CI S G +   LS + +
Sbjct: 93  KIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDI 152

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    YD    C  G     W +  + G VTGG YG +  C+P  I PC HH +   
Sbjct: 153 LSCC----YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETF 208

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             +C  Q      C T C    Y   +  DK     +Y ++ +  AI+KEI+ +GP TA 
Sbjct: 209 YGNC-TQIADTPDCVTTC-QAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAA 266

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF+HY  G+YKH S    E   H+ +++GWG E GT YWLV N+W   WG+ G  
Sbjct: 267 FIVYEDFFHYHRGIYKHVSGG--EEGGHAVRILGWGEEKGTAYWLVANSWNTDWGENGYF 324

Query: 323 KILRGKYECAFEYLIAAGK 341
           +ILRG  EC  E  + AG+
Sbjct: 325 RILRGSNECGIEENVVAGR 343


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 108/260 (41%), Positives = 144/260 (55%), Gaps = 9/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP+C TIG + D  +C +   F AV A SDR CI S G   + LS+  + 
Sbjct: 80  IPKTFDAREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLV 139

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C  G     W+F    G VTGG   D  GC+      CSHHGS    
Sbjct: 140 SCCGYCGF----GCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSK-KY 194

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C ++     KC  +C  P     +  DK R  +TY V  ++ AI KEI+ +GP  A F
Sbjct: 195 PPCPHRIYDTPKCVPKCDTPNID--YETDKTRANITYNVQRSQMAIMKEIMINGPVEAAF 252

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF+ YK GVY H++   +    H+ +++GWG ENGTPYWL+ N+W   WG+ G  K
Sbjct: 253 EVYEDFFGYKQGVYFHSTGEFIGG--HAIRILGWGEENGTPYWLIANSWNEGWGEDGYFK 310

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           +LRGK EC  E  + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 114/323 (35%), Positives = 164/323 (50%), Gaps = 15/323 (4%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDPEY 80
           ++  I+Q+N     WTAG        ++  ++ ++   KY  +++   P  +      + 
Sbjct: 28  TELLINQVNSAQQLWTAGH-------QDAPKERIL---KYLMKAEHVKPHREEDVVQVDV 77

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  +PD +D R+ +  C ++ ++ D   C +    AA  A SDR CI S G  N  LS E
Sbjct: 78  ADVIPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAE 137

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CC I  Y     C  G   + W +  K G VTGG Y  + GC+P +I+PC    + 
Sbjct: 138 DILTCC-IGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNG 196

Query: 201 PTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            T P C N      KC   CT N +Y   + +DKH     Y V    D I+ EIL +GP 
Sbjct: 197 VTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPV 256

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DFY YKSGVY H +  +L    H+ KL+GWG +NGTPYWL  N+W  +WG+ 
Sbjct: 257 EVGFTVYADFYQYKSGVYVHVAGPELGG--HAVKLLGWGVDNGTPYWLAANSWNTNWGEN 314

Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
           G  +ILRG  EC  E  + AG P
Sbjct: 315 GYFRILRGVNECGIESQVVAGMP 337


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 174/349 (49%), Gaps = 24/349 (6%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           + +L  +     V  + Y     +ID IN +A TW AG NF  +  +E+  + L   +K 
Sbjct: 5   LMLLSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKG 62

Query: 62  FDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
               ++      KT+D  Y      +P  FDAR +W  C TIG V D G C +    A  
Sbjct: 63  VQIPNKHNIHMYKTHDAAYDKLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATS 122

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTGG
Sbjct: 123 SAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTGG 178

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHR 235
           DY    GC+P  + PC +   A    +C  +  P+   H RCT   YG     F +D   
Sbjct: 179 DYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHRY 233

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGK 293
           T  +Y++     +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA    YL  H+ K
Sbjct: 234 TRDSYYL--TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---TYLGGHAVK 288

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           LIGWG E G PYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 289 LIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 109/328 (33%), Positives = 163/328 (49%), Gaps = 28/328 (8%)

Query: 26  IDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD--- 77
           +D +N++  ++ A       ++P  + ++ +   +I            +P + + ++   
Sbjct: 41  VDYVNKQQTSFKAKLGSYFSSYPDTIKKQLMGAKMIE-----------IPDEYRVFEMTH 89

Query: 78  PE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           PE   A +PD FD+R QWPNC +I  + D  +C +    +A    SDR CI S G+    
Sbjct: 90  PEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLS 149

Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
           +S + + +CC  +C       C+ G     W    K+G VTGG Y ++TGC+P    PC 
Sbjct: 150 ISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCE 205

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +      C +   P  KC   C    Y   + QD H     Y V      I+KEI+ 
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSC-QAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMT 264

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           HGP    F++Y+DF HY  GVY HT+ A L    H+ K++GWG +NGTPYWL  N+W   
Sbjct: 265 HGPVEVAFSVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNED 322

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG+ G  +I+RG  EC  E  +  G PK
Sbjct: 323 WGENGYFRIIRGVNECGIESGVVGGIPK 350


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 116/323 (35%), Positives = 163/323 (50%), Gaps = 15/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           F+D++++Q+   A TWT    F   +      +F      Y    D  LP  R       
Sbjct: 31  FNDSFLEQVLARAKTWTPDTAFRGGIR---FGEFRSIKGIYESPLDFTLPSKRLHASSLD 87

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PDRFDARE+WP C +I  V + G C +    A V   SDR CI S G+ N  L+TE
Sbjct: 88  EVVIPDRFDAREKWPFCQSIHSVRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATE 147

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            +  CCK C    N     G+ F+ W      G V+G  Y    GC+P    PCS+    
Sbjct: 148 DLMGCCKDCGNGCNGGFLDGTAFQYWV---DAGLVSGAPYNSSEGCKPYPFEPCSY---- 200

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P +     +K P  KC   C N  Y R + +DK      Y + ++   I+ EI+ +GP  
Sbjct: 201 PFVGCHHEKKNP--KCLHHCIN-GYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVA 257

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +++DFY Y SGVYKH    K+   +H+ +++GWGTENGTPYWL+ N++G  WGD+G
Sbjct: 258 TGFEVFEDFYFYHSGVYKHVVGKKVG--MHAIRIVGWGTENGTPYWLIANSYGDTWGDKG 315

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             K+LRG      E  + AG P+
Sbjct: 316 FFKMLRGSNHLGIESTVIAGLPQ 338


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 142/263 (53%), Gaps = 9/263 (3%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR +WP+C +I  + D  +C +   F AV A SDR CIKSKG+    LS E
Sbjct: 91  SDELPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAE 150

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC HH   
Sbjct: 151 NLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHVIG 206

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P LPSC+   V    C T C  P Y   + +DK      Y +  N +AI  E++ +GP  
Sbjct: 207 P-LPSCDGD-VETPSCKTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVE 263

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+ N+W   WGD+G
Sbjct: 264 VDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNSDWGDKG 321

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KI+RGK EC  E  + AG PK
Sbjct: 322 YFKIVRGKNECGIESDVNAGIPK 344


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 158/320 (49%), Gaps = 12/320 (3%)

Query: 26  IDQINREANTWTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
           +D +N++  T+TA   ++ ++  +   +Q + A      +  R       T+       V
Sbjct: 41  VDYVNKQQTTFTAKLGSYFSSYPDTIKKQLMGAKMVEIPEEYRVF---EMTHPEVLDTAV 97

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           PD FD+R QWPNC +I  + D  +C +    +A    SDR CI S G+    +S + + +
Sbjct: 98  PDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISADDINA 157

Query: 145 CC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           CC  +C       C+ G     W    K+G VTGG Y +++GC+P    PC HH +    
Sbjct: 158 CCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHY 213

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C +   P  KC   C    Y   + QD H     Y V      I+KEI+ HGP    F
Sbjct: 214 KPCPSNMYPTDKCEHSC-QAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAF 272

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF HY  GVY HT+ A L    H+ K++GWG +NGTPYWL  N+W   WG+ G  +
Sbjct: 273 TVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNEDWGENGYFR 330

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           I+RG  EC  E  +  G PK
Sbjct: 331 IIRGVNECGIESGVVGGTPK 350


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/344 (37%), Positives = 170/344 (49%), Gaps = 25/344 (7%)

Query: 5   LVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADA 59
           L  LL  T  R   Y    SD  ++ IN++  TW AG NF  N    Y+R+     +   
Sbjct: 8   LCCLLALTSARNRPYFHPLSDDLVNYINKQNTTWQAGHNF-RNADMSYVRKLCGTFLGGP 66

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
           K        LP   K  +      +P+ FDAREQW +C TI  + D G+C +   F AV 
Sbjct: 67  K--------LPHRIKFAE---DMNLPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           + SDR CI + G  N  +S E + +CC     +        +    WNF  K+G V+GG 
Sbjct: 116 SISDRICIHTNGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAA---WNFWTKKGLVSGGL 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y    GC+P +I PC HH +    P       PK  C   C  P Y   + +DKH    +
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKSC-EPGYSSSYKEDKHYGYSS 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V   E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWGT
Sbjct: 230 YSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGT 287

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P+
Sbjct: 288 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPR 331


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 131/350 (37%), Positives = 168/350 (48%), Gaps = 26/350 (7%)

Query: 4   ILVFLLGCTL-VRGE---------LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQ 53
           +  FLLG    VR E         L   SD  +D IN    TW AG N      E   R+
Sbjct: 5   VAFFLLGVLASVRAEEGRLMVPTYLAPLSDKMVDYINFINTTWKAGHNEGHRDLETVRRK 64

Query: 54  FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
             ++     D     LP   +         +P +FD+R+QW +C TI  + D GAC +  
Sbjct: 65  LGVSR----DNHKYRLP---ELVHDTLEMDIPAQFDSRQQWQDCPTIREIRDQGACGSCW 117

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F AV + SDR CI S  +    L+ + V SCC  C       C+ G     W++  ++G
Sbjct: 118 AFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGC----GSGCNGGFPGAAWSYWVEKG 173

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VTGG+Y    GC P  +  C HH +  TL  C  Q  P  KC  R     Y   F  DK
Sbjct: 174 IVTGGNYDTDEGCMPYPVPSCDHHVNG-TLGPC-GQDPPTPKC-VRLCRKGYNIDFKDDK 230

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           H    +Y V  NE  I+ EI+ +GP    F +Y DF  YKSGVYK  S   L    H+ +
Sbjct: 231 HYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGG--HAIR 288

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           ++GWG ENG P+WLV N+W   WGD+G  KILRG  EC  E  I AG PK
Sbjct: 289 ILGWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 338


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 109/327 (33%), Positives = 161/327 (49%), Gaps = 26/327 (7%)

Query: 26  IDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           +D IN++  T+TA       ++P  + ++ +   ++            +P + + ++ E+
Sbjct: 41  VDYINKKQTTFTAKLGAYFSDYPDTIKKQLMGAKMVE-----------IPEEYRVFEMEH 89

Query: 81  ----SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
                A +PD FD+R QWPNC +I  + D  +C +    +A    SDR CI SKGQ    
Sbjct: 90  PEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVS 149

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S + + +CC +        C+ G     W    K G VTGG Y ++TGC+P    PC H
Sbjct: 150 ISADDINACCGMAC---GNGCNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEH 206

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           H +      C +   P  KC   C    Y   + QD H     Y V      I+KEI+ +
Sbjct: 207 HVNGTHYKPCPSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTN 265

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP    F +Y DF  Y  GVY HT+ A L    H+ K++GWG +NGTPYWL  N+W   W
Sbjct: 266 GPVEVAFTVYADFEVYSGGVYVHTAGASLGG--HAVKMLGWGVDNGTPYWLCANSWNEDW 323

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
           G+ G  +I+RG  EC  E+ +  G PK
Sbjct: 324 GENGYFRIIRGVNECGIEHGVVGGIPK 350


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 159/323 (49%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  I+ +N   + W A   R F +   E    ++ +    +   S +      KT D + 
Sbjct: 44  DELINYVNNNQDLWRAKKQRRFTSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDM 103

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+RE WP C +I ++ D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 104 D--IPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 161

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC+ C +     C+ G     W +  K G VTG +Y   +GC+P    PC HH   
Sbjct: 162 DLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKK 217

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  +C      + + +DK      Y V D+ +AI+KE++ HGP  
Sbjct: 218 THFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLE 277

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KL+GWG ENG PYW   N+W   WG+ G
Sbjct: 278 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLVGWGIENGIPYWTCANSWNTDWGEDG 335

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 336 FFRILRGVDECGIESGVVGGVPK 358


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 117/327 (35%), Positives = 165/327 (50%), Gaps = 12/327 (3%)

Query: 20  KFSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
           + ++ +ID IN    +TW AG NF  +    YL+  L       + +D     + +  + 
Sbjct: 27  EIANKWIDAINNNPKSTWKAGHNFHPDTPMSYLQGLLGVSELESNLADLDKYEEMEENEE 86

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
                VP  FDAR++W  C ++  + D G C +    +   AF+DR CI S  + N  +S
Sbjct: 87  NKKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHIS 146

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH- 197
           +  + SCC  C +     C  G     W F+ + G VTGGDY    GCQP  I+PC HH 
Sbjct: 147 SRELMSCCSYCGF----GCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEHHM 202

Query: 198 -GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            GS P   +   +  P   C T CT+ +    + +D+ +    Y V   E   + EI  +
Sbjct: 203 EGSKPNCSASPTEPTPA--CETTCTHGS-SLAYQKDRQKGKSAYLVPVGEKQTQLEIFKN 259

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A F +Y+DF+ YKSGVYK    +      H+ K+IGWG +NG PYWLV N+W   W
Sbjct: 260 GPIVAAFKVYEDFFMYKSGVYKRHPESPFRGR-HAVKVIGWGEQNGLPYWLVQNSWDYDW 318

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
           GD+G  KI RG  EC FE  + AG PK
Sbjct: 319 GDKGLFKIARGN-ECDFEKSMTAGLPK 344


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 126/353 (35%), Positives = 168/353 (47%), Gaps = 33/353 (9%)

Query: 4   ILVFLLGC--------TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL 55
           +L F++G          LV  ++  F D  I+ IN    TW AGRN        Y+R  L
Sbjct: 8   LLAFVIGVWGDVLEDRYLVPVDMDNFPDKMIEYINYLNTTWQAGRNLGYE-DPRYVRTLL 66

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEY-----SATVPDRFDAREQWPNCGTIGHVPDTGACA 110
                         P + K   PE      +  +PD FD+R +W +C TI  + D G+C 
Sbjct: 67  GVH-----------PNNHKYRLPEIEIDTSNVQIPDHFDSRHRWHDCPTIREIRDQGSCG 115

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +   F AV A SDR CI S  +    L+ + V SCC  C       C+ G     W++  
Sbjct: 116 SCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVLSCCMSC----GSGCNGGFPGAAWSYWV 171

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
            +G VTGG+Y    GC P  I  C HH +  TL  C+    P  +C  R     Y   F 
Sbjct: 172 HKGIVTGGNYDSDEGCMPYPIKACDHHVNG-TLGPCDKSIPPTPRC-VRMCRKGYNVDFA 229

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
            DKH    +Y V  N   I+ EI+ +GP  A F +Y DF  YKSGVY+  ++  L    H
Sbjct: 230 DDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGG--H 287

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           + +L+GWG E G PYWL  N+W   WGD+G  KILRG  EC  E  + AG P+
Sbjct: 288 AIRLLGWGVEKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGIPR 340


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 113/323 (34%), Positives = 158/323 (48%), Gaps = 10/323 (3%)

Query: 23  DAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           D  I+ +N     W A   R F     E    ++ +    +   S +      KT D + 
Sbjct: 59  DELINYVNNNQQLWKAKKQRRFSMYKGENDKHKWGLMGVNHVRLSVKGKQHLSKTKDLDM 118

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+RE WP C +I  + D  +C +   F AV A SDR CI S G+    LS +
Sbjct: 119 D--IPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 176

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC+ C +     C+ G     W +  K G VTG ++   +GC+P    PC HH   
Sbjct: 177 DLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKK 232

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C +   P  KC  RC      + + +DK   +  Y V D+ +AI+KE++ HGP  
Sbjct: 233 THFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLE 292

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY HT   KL    H+ KLIGWG E+G PYW V N+W   WG+ G
Sbjct: 293 IAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIEDGIPYWTVANSWNTDWGEDG 350

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  +  G PK
Sbjct: 351 FFRILRGVDECGIESGVVGGIPK 373


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 116/322 (36%), Positives = 165/322 (51%), Gaps = 13/322 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYS 81
           + +D+IN + N WTA  +      +E      + DAK    +  +     +++ Y P   
Sbjct: 3   SLVDEINSKQNLWTASTD------QERFYGRSLGDAKKLCGTLLEETEGLEKRVYPPGEL 56

Query: 82  ATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           A +P+ FDAR+ +  C   IGHV D  ACA+    A V AF+ R CIKS G+ N+ LS  
Sbjct: 57  ADIPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAG 116

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CC        + C  G +   W+FL   G  T G      GC P     C+HH   
Sbjct: 117 EMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKK 176

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C  +      C  RC N  YG    +D+H T  +  + +  D IKKEI+ +GPT+
Sbjct: 177 SKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTS 236

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           ATF++Y+DF  YKSGVYKHT+   +   +HS ++IGWGTE G  YWLV+N+W   WGD G
Sbjct: 237 ATFSVYEDFVSYKSGVYKHTNGTLMG--IHSVEIIGWGTEKGVDYWLVMNSWNEGWGDHG 294

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
           T KI +G  +C  +  +    P
Sbjct: 295 TFKIAQG--DCGIDDAVLGSPP 314


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 118/328 (35%), Positives = 169/328 (51%), Gaps = 23/328 (7%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  +  WTA R+          R   + DA+    + R     RK   P 
Sbjct: 30  LSDEMIAYINQHPDAGWTASRSD---------RFKSLEDARILLGAMREDEELRKKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
                 S  +P  FD+R++W  C +I ++ D   C +   F AV A SDR CI+SKG+++
Sbjct: 81  VDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFTAVEAMSDRICIESKGKKS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC  C       C  G     W++  + G VTG    + TGCQP     C
Sbjct: 141 VELSAVDLLSCCTEC----GLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P C  +     KCH +C    Y   + +DK+   ++Y V +NE+AIKKEI+
Sbjct: 197 EHHTTG-KYPECGEKIYKTPKCHQKCQK-GYKTPYKKDKYYGRMSYNVLNNENAIKKEIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A F ++ DF +YKSG+YK+ + A++    H+ ++IGWG E  TPYWL+ N+W  
Sbjct: 255 MHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGG--HAVRIIGWGVEKKTPYWLIANSWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG++G  +ILRGK EC  E  +  G P
Sbjct: 313 DWGEKGYFRILRGKDECGIESEVTGGLP 340


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 103/263 (39%), Positives = 149/263 (56%), Gaps = 8/263 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           +++  +P  FD+R++WP C +I  + D   C +   F AV A SDR CI+S G+QN  LS
Sbjct: 62  DWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELS 121

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C          G     W++  K G VTG    + T CQP     C HH 
Sbjct: 122 AVDLLSCCEHC----GDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEHH- 176

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +      C   C   +Y   + QDKHR    Y V ++E AI+KEI+ +GP
Sbjct: 177 TKGKYPACFEEIYKTPNCENTCQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGP 235

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+YKH +  KL ++ H+ ++IGWG EN TPYWL+ N+W   WG+
Sbjct: 236 VEANFIVYEDFLNYKSGIYKHIT-GKLVSW-HAIRIIGWGVENNTPYWLIPNSWNEDWGE 293

Query: 319 RGTVKILRGKYECAFEYLIAAGK 341
            G  +ILRG++EC+ E  + AG+
Sbjct: 294 NGNFRILRGRHECSIESEVTAGR 316


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 123/325 (37%), Positives = 160/325 (49%), Gaps = 17/325 (5%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W AG         +  R          +Q +    G   T +  
Sbjct: 36  LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR++W +C +I  + D  +C +   F AV A SDR CI+SKG+    LS 
Sbjct: 94  ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC HH  
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206

Query: 200 APTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            P LP C+ + + P  K   R     Y   +  DK    + Y V  N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGDVETPPCK---RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGP 262

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+ N+W   WGD
Sbjct: 263 VEVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGD 320

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  KI+RGK EC  E  + AG PK
Sbjct: 321 NGYFKIIRGKNECGIESDVNAGIPK 345


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 117/331 (35%), Positives = 174/331 (52%), Gaps = 25/331 (7%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 4   LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLRQKRRPT 54

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   CA+    +AVGA SDR CI+S G+Q+
Sbjct: 55  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQS 114

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC+ C       C  G     W++    G VTGG   + TGCQP     C
Sbjct: 115 VELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKC 170

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH S    PSC ++     +C  +C    Y   +  DKH   ++  V  NE AI+KEI+
Sbjct: 171 EHH-SKGKYPSCGDKMYKTPQCKRKCQK-GYKTPYEHDKHYGGISINVIKNESAIQKEIM 228

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWG 313
            +GP  A   +++DF +YKSG+Y++T+ + + E+Y+   ++IGWG ENGT YWL  NTW 
Sbjct: 229 MYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWN 285

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
             WG++G  +I+RG+ EC+ E ++ AG+ K+
Sbjct: 286 EDWGEKGYFRIVRGRNECSVESVVVAGRLKS 316


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 123/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W AG         +  R          +Q +    G   T +  
Sbjct: 36  LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR++W +C +I  + D  +C +   F AV A SDR CI+SKG+    LS 
Sbjct: 94  ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC HH  
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P LP C+   V    C   C    Y   +  DK    + Y V  N++AI KE++ HGP 
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+ N+W   WGD 
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI+RGK EC  E  + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 123/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W AG         +  R          +Q +    G   T +  
Sbjct: 36  LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTVN-- 93

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR++W +C +I  + D  +C +   F AV A SDR CI+SKG+    LS 
Sbjct: 94  ---ELPKSFDARKEWTHCPSISEIRDQSSCGSYWAFGAVEAMSDRICIESKGKYKPFLSA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC HH  
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P LP C+   V    C   C    Y   +  DK    + Y V  N++AI KE++ HGP 
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+ N+W   WGD 
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI+RGK EC  E  + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 108/322 (33%), Positives = 158/322 (49%), Gaps = 13/322 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE--YS 81
           A  + +NR+ N W A  N       + ++  L+        + R     +K   P   Y 
Sbjct: 64  ALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGV-----NNVRLSVKAKKNLSPTRFYD 118

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
             +P+ FDARE+W  C ++ ++ D  +C +   F AV A SDR CI S G+    LS + 
Sbjct: 119 IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADD 178

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SCCK C +     C  G     W +  K G VTG ++  + GC+P    PC HH +  
Sbjct: 179 LLSCCKSCGF----GCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKT 234

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
               C++   P  KC  +C +    + + +DK      Y V+D+  +I+KEIL HGP   
Sbjct: 235 HYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEV 294

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DF  Y  G+Y HT         H+ K++GWG E G PYWLV N+W   WG+ G 
Sbjct: 295 AFEVYEDFLMYDGGIYVHTGGKIGGG--HAVKMLGWGVEQGVPYWLVANSWNTDWGEDGF 352

Query: 322 VKILRGKYECAFEYLIAAGKPK 343
            +I+RG  EC  E  +  G PK
Sbjct: 353 FRIIRGIDECGIESSVVGGLPK 374


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 125/350 (35%), Positives = 174/350 (49%), Gaps = 24/350 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +     +  + Y     +ID IN  A TW AG NF  +  +E+  + L   +K
Sbjct: 4   VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                ++      KT+D  Y      +P  FDAR +W  C TIG V D G C +    A 
Sbjct: 62  GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTG
Sbjct: 122 SSAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
           GDY    GC+P  + PC +   A    +C  +  P+   H RCT   YG     F +D  
Sbjct: 178 GDYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHR 232

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSG 292
            T  +Y++     +I+K+++ +GP  A+F +YDDF  YKSGVY  + NA    YL  H+ 
Sbjct: 233 YTRDSYYL--TYGSIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENA---TYLGGHAV 287

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           KLIGWG E G PYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 288 KLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 119/324 (36%), Positives = 164/324 (50%), Gaps = 17/324 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            SD  I+ IN+   TW AG NF   +S  Y+R  L    K  +        +      E 
Sbjct: 28  LSDQMINYINKINTTWKAGSNFDKCISMSYIRGLLGVHPKSEEYRLAEFVHE------EI 81

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FDAR +W +C +I  + D   C +   F A  A SDR CI SKG+    +S E
Sbjct: 82  PDDLPESFDARAKWSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAE 141

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            +  CC  C +     C  G     W    +RG V+GG YG   GC+P +++PC +H + 
Sbjct: 142 DLLDCCDTCGH----GCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYH-TK 196

Query: 201 PTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
             +P+C      P+   H R     Y + + +DKH     Y +  +E  I+ EI  +GP 
Sbjct: 197 CRIPNCIPIVHTPECVHHCR---KGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPV 253

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A F +Y DF  YKSGVY+  SN      +H+ +++GWGTENGTPYWL  N+W  +WGD+
Sbjct: 254 EADFHVYGDFLCYKSGVYQRHSNDG--RGMHAIRILGWGTENGTPYWLAANSWNENWGDK 311

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KILR   EC  E  I AG PK
Sbjct: 312 GYFKILRRTNECGIEEHIYAGIPK 335


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 108/322 (33%), Positives = 158/322 (49%), Gaps = 13/322 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE--YS 81
           A  + +NR+ N W A  N       + ++  L+        + R     +K   P   Y 
Sbjct: 23  ALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGV-----NNVRLSVKAKKNLSPTRFYD 77

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
             +P+ FDARE+W  C ++ ++ D  +C +   F AV A SDR CI S G+    LS + 
Sbjct: 78  IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADD 137

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SCCK C +     C  G     W +  K G VTG ++  + GC+P    PC HH +  
Sbjct: 138 LLSCCKSCGF----GCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKT 193

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
               C++   P  KC  +C +    + + +DK      Y V+D+  +I+KEIL HGP   
Sbjct: 194 HYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEV 253

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DF  Y  G+Y HT         H+ K++GWG E G PYWLV N+W   WG+ G 
Sbjct: 254 AFEVYEDFLMYDGGIYVHTGGKIGGG--HAVKMLGWGVEQGVPYWLVANSWNTDWGEDGF 311

Query: 322 VKILRGKYECAFEYLIAAGKPK 343
            +I+RG  EC  E  +  G PK
Sbjct: 312 FRIIRGIDECGIESSVVGGLPK 333


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 169/330 (51%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG+ K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGRIKS 342


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           ++     +  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  200 bits (509), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 161/318 (50%), Gaps = 17/318 (5%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       P  KC   C  P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTP--KCSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 293

Query: 306 WLVINTWGPHWGDRGTVK 323
           WLV N+W   WGD G  K
Sbjct: 294 WLVANSWNTDWGDNGFFK 311


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 161/324 (49%), Gaps = 21/324 (6%)

Query: 26  IDQINREANTWTAG-----RNFPANLSEEYL-RQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           +D IN+    +TA       NFP  +    +  +++   AKY          + KT+   
Sbjct: 39  VDYINKAQKLFTAKLSPRFANFPNEIKRRLMGSKYVALPAKY--------RVNEKTHSDI 90

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
              T+P  FD+R  WP C ++  + D  +C +     AV A +DR CI SKG Q   +S 
Sbjct: 91  DDTTIPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC  C +     C  G  +  W++    G VTG +Y  ++GC+P    PC HH  
Sbjct: 151 DDLLSCCDECGF----GCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIP 206

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
                 C     P   C  +C +  Y   +  DKH     Y V  +  +I+KEI+ +GP 
Sbjct: 207 EHHYKKCPKDIYPTNTCEYKCQD-GYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPV 265

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y+DF HY SG+YKHT+   L    H+ K++GWGTENGT YW+  N+W   WG+ 
Sbjct: 266 EVAFDVYEDFEHYSSGIYKHTTGDYLGG--HAVKMLGWGTENGTDYWICANSWNSDWGEN 323

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  +ILRG  EC  E  + AG+PK
Sbjct: 324 GFFRILRGVDECQIESSVVAGEPK 347


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 114/323 (35%), Positives = 161/323 (49%), Gaps = 16/323 (4%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S   ++ IN+   T  AG NF  N    Y+++       +      P     +  D   
Sbjct: 26  LSSDLVNHINKLNTTGRAGHNF-HNTDMSYVKKLC---GTFLGGPKAP-----ERVDFAE 76

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +PD FD R+QWPNC TI  + D G+C +   F AV A SDR C+ +  + +  +S E
Sbjct: 77  DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC    ++    C+ G     W +  +RG V+GG Y    GC+  TI PC HH + 
Sbjct: 137 DLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNG 193

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
            + P C  +     +C   C  P Y   + +DKH    +Y V  +E  I  EI  +GP  
Sbjct: 194 -SRPPCTGEGGETPRCSRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVE 251

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF  YKSGVY+H S  ++    H+ +++GWG ENGTPYWL  N+W   WG  G
Sbjct: 252 GAFIVYEDFLMYKSGVYQHVSGEQVGG--HAIRILGWGVENGTPYWLAANSWNTDWGITG 309

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KILRG+  C  E  I AG P+
Sbjct: 310 FFKILRGEDHCGIESEIVAGVPR 332


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 117/331 (35%), Positives = 172/331 (51%), Gaps = 25/331 (7%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC+ C       C  G     W++    G VTGG   + TGCQP     C
Sbjct: 141 VELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH S    PSC ++     +C  +C    Y   +  DKH   +   V  NE AI+KEI+
Sbjct: 197 EHH-SIGKYPSCGDKMYKTPQCKRKCQK-GYTTPYEHDKHYGGIAINVIKNELAIQKEIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWG 313
            +GP  A   +++DF +YKSG+YK+T+ + + E+Y+   ++IGWG ENGT YWL  NTW 
Sbjct: 255 MYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWN 311

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
             WG++G  +I+RG+ EC+ E ++ AG+ K+
Sbjct: 312 EDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 182/350 (52%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           ++     +  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC+ +  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGLIKS 342


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           ++     +  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 111/306 (36%), Positives = 161/306 (52%), Gaps = 16/306 (5%)

Query: 38  AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
           AG NF  N+   YL++       Y      P   +R  +  +    +PD FD+R+QWP+C
Sbjct: 33  AGHNF-HNVDMSYLKKLC---GTYLHGPKLP---ERFAFADD--VELPDSFDSRKQWPSC 83

Query: 98  GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
            TI  + D G+C +   F AV A SDR C+ + G+ N  +S E + SCC    ++    C
Sbjct: 84  PTINEIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCG---FECGMGC 140

Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCH 217
           + G     W +  ++G V+GG Y    GC+P +I PC HH +  T P C  +     +C 
Sbjct: 141 NGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNG-TRPPCSGEGGETPECV 199

Query: 218 TRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVY 277
            +C +  Y   + QDKH    +Y +  +E  I  EI  +GP    F +Y DF  YKSGVY
Sbjct: 200 KKCED-GYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVY 258

Query: 278 KHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           +H S  ++    H+ +++GWG +NGTPYWL  N+W   WG+ G  +ILRG+  C  E  I
Sbjct: 259 QHVSGEEVGG--HAIRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIESEI 316

Query: 338 AAGKPK 343
            AG PK
Sbjct: 317 VAGIPK 322


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 115/324 (35%), Positives = 161/324 (49%), Gaps = 11/324 (3%)

Query: 23  DAYIDQINREANTWTAGRN--FPANLSE-EYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           D  ID IN   N WTA +   F +   E +   ++ +    +   S +      KT D +
Sbjct: 44  DELIDYINDNQNLWTAKKQKRFTSVYGETDDKAKWGLMGVNHVRLSVKGKQHLSKTKDLD 103

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P+ FD+RE WP C +I ++ D  +C +   F AV A SDR CI S G+    LS 
Sbjct: 104 LD--IPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSA 161

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC+ C +     C+ G     W +  K G VTG +Y   +GC+P    PC HH  
Sbjct: 162 DDLLSCCRSCGF----GCNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSK 217

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
                 C +   P  KC  +C      + + +DK      Y V D+ +AI+KE++ HGP 
Sbjct: 218 KTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPL 277

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y+DF +Y  GVY HT   KL    H+ KLIGWG E+G PYW   N+W   WG+ 
Sbjct: 278 EIAFEVYEDFLNYDGGVYVHTG-GKLGGG-HAVKLIGWGIEDGIPYWTCANSWNTDWGED 335

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  +ILRG  EC  E  +  G PK
Sbjct: 336 GFFRILRGVDECGIESGVVGGIPK 359


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           ++     +  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLRQKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 181/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           ++     +  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RFLLGGRKEDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAIDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 180/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  ++ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           +      R  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/301 (35%), Positives = 159/301 (52%), Gaps = 15/301 (4%)

Query: 42  FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
           F A+++   Y  Q  + D ++ +Q+ +P   +    + +    +P+ FDAR  WPNC +I
Sbjct: 55  FEADVTPHSYNVQHKLMDLRFVNQNRKPAVEN----EDDEGDDIPESFDARTHWPNCTSI 110

Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
            H+ D   C +    +   A SDR CI+S G+    +S+    SCC+ C Y     C  G
Sbjct: 111 RHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCESCSY----GCDGG 166

Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTR 219
                ++F    G+VTGGDYG + GC+P    PC HHG+      C +  K P  KC  R
Sbjct: 167 WPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTP--KCRRR 224

Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
           C   +Y + ++ DK      Y V  +  AI++EI+ +GP    F +Y+DF +YK G+YKH
Sbjct: 225 CQR-SYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283

Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           T+        H+ K+IGWG EN  PYWL+ N+W   WG+ G  +++RG  EC  E  + A
Sbjct: 284 TAGQARGG--HAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVA 341

Query: 340 G 340
           G
Sbjct: 342 G 342


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYETPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 168/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 180/350 (51%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  ++ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERVEPLSDEMISFINEHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           +      R  P  R+   P     + +  +P  FD+R++WP C +I  + D   C +   
Sbjct: 61  RILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++   RG 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 158/324 (48%), Gaps = 15/324 (4%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W AG         +  R          +Q +    G   T +  
Sbjct: 36  LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR++W +C +I  + D  +C +   F AV A SDR CI+SKG+    LS 
Sbjct: 94  ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC H+  
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHNTL 206

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P LP C+   V    C   C    Y   +  DK    + Y V  N++AI KE++ HGP 
Sbjct: 207 GP-LPVCDGD-VETPPCKRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPV 263

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+ N+W   WGD 
Sbjct: 264 EVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDN 321

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI+RGK EC  E  + AG PK
Sbjct: 322 GYFKIIRGKNECGIESDVNAGIPK 345


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 106/301 (35%), Positives = 159/301 (52%), Gaps = 15/301 (4%)

Query: 42  FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
           F A+++   Y  Q  + D ++ +Q+ +P   +    + +    +P+ FDAR  WPNC +I
Sbjct: 55  FEADVTPHSYNVQHKLMDLRFVNQNRKPAVEN----EDDEGDDIPESFDARTHWPNCTSI 110

Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
            H+ D   C +    +   A SDR CI+S G+    +S+    SCC+ C Y     C  G
Sbjct: 111 RHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCESCGY----GCDGG 166

Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTR 219
                ++F    G+VTGGDYG + GC+P    PC HHG+      C +  K P  KC  R
Sbjct: 167 WPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTP--KCRRR 224

Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
           C   +Y + ++ DK      Y V  +  AI++EI+ +GP    F +Y+DF +YK G+YKH
Sbjct: 225 CQR-SYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283

Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           T+        H+ K+IGWG EN  PYWL+ N+W   WG+ G  +++RG  EC  E  + A
Sbjct: 284 TAGQARGG--HAIKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVA 341

Query: 340 G 340
           G
Sbjct: 342 G 342


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARNLLGGRREDPNLRQKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      +  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 115/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
            SD  I  IN+  N       + A+ S+ +     + DA+      R  P  R+   P  
Sbjct: 30  LSDEMILFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRREDPNLRQKRRPTV 81

Query: 79  ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              + +  +P  FD+R++WP C +I  + D   CA+    +AV A SDR CI+S G+Q+ 
Sbjct: 82  DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSV 141

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS   + SCCK C       C  G    +W++  K G VTGG   + TGC+P     C 
Sbjct: 142 ELSAIDLISCCKNC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCD 197

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H        +C ++     +C   C    Y   + QDKH    +Y V   E AI+KEI+ 
Sbjct: 198 HFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW   
Sbjct: 256 YGPVEAYLQIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTSYWLAANTWNED 313

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG++G  +I+RG+ EC  E  I AG+ K+
Sbjct: 314 WGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 118/326 (36%), Positives = 167/326 (51%), Gaps = 14/326 (4%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            SD  I  IN++ N  W A R      S  + +  +       DQ     P     +  +
Sbjct: 32  LSDELITFINKQPNIEWKADRTTRFT-SIHHAKSMMGVLLNSVDQHKLHHP---IIHHND 87

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P  FD+R+ W NC +I  + D  +C +   F AV + SDR CI SKG+ +  LS 
Sbjct: 88  INIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSA 147

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + SCC  C +     C+ G     W++    G VTGG     TGCQP     C HH +
Sbjct: 148 VNLLSCCSRCGF----GCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHST 203

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           +    SCE +     +C+  C  P Y   +  DK+    +Y+V  +E +I KEIL +GP 
Sbjct: 204 SINHSSCEVKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPV 262

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG--TENGTPYWLVINTWGPHWG 317
            ATF ++DDF +YK+GVYK+ + + L    H+ ++IGWG  T N TPYWL  N+W   WG
Sbjct: 263 EATFYVFDDFLNYKTGVYKYVTGSLLGG--HAIRIIGWGVSTLNHTPYWLCANSWNKQWG 320

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D+G  KILRG  EC  E ++ AG PK
Sbjct: 321 DKGYFKILRGSNECGIESMVTAGLPK 346


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +  +S E + 
Sbjct: 1   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC HH +   
Sbjct: 61  TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 116

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P       PK    ++   P Y   + QDKH    +Y V ++E  I  EI  +GP    
Sbjct: 117 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 173

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  
Sbjct: 174 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 231

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG+  C  E  + AG P+
Sbjct: 232 KILRGQDHCGIESEVVAGIPR 252


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E   +K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSGESVFQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 118/343 (34%), Positives = 164/343 (47%), Gaps = 14/343 (4%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           I +L F+  C     +L+  SD YI  IN +A TW AG+NF  +   ++ R   IA    
Sbjct: 7   IAVLAFVAVCHGTSLDLHPLSDEYIASINEKATTWKAGKNFEVD---DWERVKKIAAGVL 63

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
             ++          +D   S  VP+ FDARE WP C ++  + D  +C +   F AV A 
Sbjct: 64  PRKAALRFVTQNNPHDE--SEEVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAM 121

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI S       +S E + SCC    +     C  G V   W++    G VTGG Y 
Sbjct: 122 SDRICIHSDQSNQVYVSAEDLNSCC-FGLFACGLGCDGGYVAEPWDYWRTDGIVTGGAYN 180

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC--TNPTYGRGFFQDKHRTTLT 239
              GC+  ++ PC HH    + P C +      +C   C  ++  Y       +  +T T
Sbjct: 181 SSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFT 240

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
                NE  ++ EIL +GP  A F +Y+DF  YKSGVY+ T+  +     H+ K++GWG 
Sbjct: 241 -----NEKQMQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGG-HAIKVLGWGV 294

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           E GT YWL+ N+W   WGD G  K LRG   C  E   AA  P
Sbjct: 295 EEGTKYWLIANSWNTDWGDNGYFKFLRGVDHCGIESETAASLP 337


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 166/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  IDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +  +S E + 
Sbjct: 2   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 61

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC HH +   
Sbjct: 62  TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 117

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P       PK    ++   P Y   + QDKH    +Y V ++E  I  EI  +GP    
Sbjct: 118 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 174

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  
Sbjct: 175 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 232

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG+  C  E  + AG P+
Sbjct: 233 KILRGQDHCGIESEVVAGIPR 253


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 114/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
            SD  I  IN+  N       + A+ S+ +     + DA+      +  P  R+   P  
Sbjct: 30  LSDEMISFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRKEDPNLRQKRRPTV 81

Query: 79  ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+ 
Sbjct: 82  DHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSV 141

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C 
Sbjct: 142 ELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 197

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+ 
Sbjct: 198 HFVKG-KYRACGDKLYKTPQCKQIC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW   
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNED 313

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 314 WGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 115/329 (34%), Positives = 168/329 (51%), Gaps = 21/329 (6%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
            SD  I  IN+  N       + A+ S+ +     + DA+      R  P  R+   P  
Sbjct: 30  LSDEMILFINKHPNA-----GWKADKSDRF---HSVDDARILLGGRREDPNLREKRRPTV 81

Query: 79  ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              + +  +P  FD+R++WP C +I  + D   CA+    +AVGA SDR CI+S G+Q+ 
Sbjct: 82  DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSV 141

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS   + SCCK C       C  G    +W++  K G VTGG   + TGC+P     C 
Sbjct: 142 ELSAIDLISCCKNC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCD 197

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+KEI+ 
Sbjct: 198 HFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGEFSYNVIGVESVIQKEIMM 255

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW   
Sbjct: 256 YGPVEAYLHIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTSYWLAANTWNED 313

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG++G  +I+RG+ EC  E  I AG+ K+
Sbjct: 314 WGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 104/261 (39%), Positives = 141/261 (54%), Gaps = 10/261 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +  +S E + 
Sbjct: 3   LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 62

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC HH +   
Sbjct: 63  TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSR 118

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P       PK    ++   P Y   + QDKH    +Y V ++E  I  EI  +GP    
Sbjct: 119 PPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 175

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  
Sbjct: 176 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 233

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG+  C  E  + AG P+
Sbjct: 234 KILRGQDHCGIESEVVAGIPR 254


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 111/323 (34%), Positives = 168/323 (52%), Gaps = 18/323 (5%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDPEYSAT 83
           ID +N     W AG N   NL  + ++  L+   + K   +  + L   R +     +  
Sbjct: 67  IDYVNSHQTLWKAGMN-KFNLYSDTVKYGLLGVNNRKKSVEHKKNLSPIRHS-----NIF 120

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR+ WP C ++ ++ D  +C +    AAV A SDR CI SKG++   LS + + 
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 180

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCCK C +     C  G     W +    G VTG DY + +GC+P    PC HH +    
Sbjct: 181 SCCKTCGF----GCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHY 236

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C++   P  KC+ +C +  Y + +  DK+     Y V+++ ++I+KEI+  GP  A+F
Sbjct: 237 EPCKHDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF 295

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD---RG 320
            +Y DF HY SG+YKH + +      H+ K++GWG + G  YWL  N+W   WG+    G
Sbjct: 296 EVYTDFLHYTSGIYKHVAGSVGGG--HAVKILGWGIDQGVSYWLAANSWNNDWGEDVFSG 353

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             +ILRG  EC  E  I AG P+
Sbjct: 354 YFRILRGADECGIESGIVAGIPR 376


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 117/322 (36%), Positives = 167/322 (51%), Gaps = 17/322 (5%)

Query: 22  SDAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSD-RPLPGDRKTYDPE 79
           S  + D +N + +TW +G N      +E  L+  +     + D+ D   LP     ++  
Sbjct: 17  SQTFYDFVNSQQSTWVSGHNQRWEQFNEATLKTQM---GTFLDEPDFMKLPESTVQFE-- 71

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P+ FDAR+QWPNC +I  V D   C +   F A  A SDR CI + G+Q R +ST
Sbjct: 72  -NLEIPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIAT-GKQTR-IST 128

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + +CC I        C+ G     WN+   +G VTG  +GD + C+P T  PC HH  
Sbjct: 129 EDLLTCCGITC---GMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVD 185

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
                 C + + P   C   CT  + GR +  DK R+  +Y V    + I+ EI+  GP 
Sbjct: 186 DGKYGPCGDSQ-PTPACVKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPV 243

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A+F +Y+DF  YKSGVY++ + A L    H+ K+IGWG E   PYWLV+N+W   WG+ 
Sbjct: 244 EASFTVYEDFLTYKSGVYQNVAGANLGG--HAVKIIGWGVEKNVPYWLVVNSWNEGWGEN 301

Query: 320 GTVKILRGKYECAFEYLIAAGK 341
           G  KILRG      E  I AG+
Sbjct: 302 GLFKILRGSNHVGIEGGIYAGR 323


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 98/260 (37%), Positives = 149/260 (57%), Gaps = 8/260 (3%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS  
Sbjct: 24  NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 83

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC+ C     + C  G     W++  KRG VTGG   + TGCQP     C HH + 
Sbjct: 84  DLISCCEDC----GQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH-TK 138

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P+C  +     +C   C    Y   + QDKH    +Y V +NE  I+++I+ +GP  
Sbjct: 139 GKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVE 197

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E  TPYWL+ N+W   WG++G
Sbjct: 198 AAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGEKG 255

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +I+RG+ EC+ E  + AG
Sbjct: 256 LFRIVRGRDECSIESNVVAG 275


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 178/342 (52%), Gaps = 28/342 (8%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           I +L  LL  T     +     A ++ +N   + +TA  +   N++EE+ ++F + D KY
Sbjct: 2   ILVLAVLLEATSAFVPIT--GQALVNYVNSAQSMFTAEYS---NVTEEF-KKFRVMDVKY 55

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
                   P  R +       ++P  FDAR +WPNC +I  + +   C +   F A    
Sbjct: 56  AAPHS---PELRASQVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVM 112

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           SDR CI S G +   +S   + SCC   C Y   K  S    FR WN   K+G VTGGDY
Sbjct: 113 SDRICIASMGTKQPIISPTDLLSCCGNFCGYG-CKGASPLQAFRWWN---KKGVVTGGDY 168

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
              +GC+P   +PC+       LP C   + P+  C   C  P Y + + +DK+  T  Y
Sbjct: 169 -RGSGCKPYPFAPCT------ALP-CTKSETPR--CSLNC-QPAYSKAYSKDKYFGTPAY 217

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V  +  AI+ EI  +GP  A F +YDDF HY+SGVY+H +   +    H+ K+IGWG +
Sbjct: 218 IVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGG--HAVKIIGWGIQ 274

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           NG PYWL+ N+WGP+WG+ G  K+LRG  EC  E  I AGKP
Sbjct: 275 NGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTIVAGKP 316


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 107/291 (36%), Positives = 155/291 (53%), Gaps = 10/291 (3%)

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
           Y +Q L+ D KY DQ++ P          E +  +P+ +D R QW NC ++ H+PD   C
Sbjct: 58  YFKQRLM-DLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANC 116

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +    ++  A SDR CI SKG +   +S + V SCC  C       C  G     + F 
Sbjct: 117 GSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTWC----GDGCEGGWPISAFRFH 172

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
              G VTGGDY  +  C+P  I PC HHG+      C        +C  RC    Y + +
Sbjct: 173 ADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGM-ADTPRCKRRCL-LGYPKSY 230

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
             D++     Y + ++  AI+K+I+ +GP  AT+ +Y+DF HY+SG+YKH +  K    L
Sbjct: 231 PSDRYYKK-AYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTG--L 287

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           H+ K+IGWG E GTPYW+V N+W   WG+ G  ++ RG  +C FE  +AAG
Sbjct: 288 HAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 100/258 (38%), Positives = 144/258 (55%), Gaps = 8/258 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+R++WP C +I  + D   C +   F AV A SDR CI+S G+QN  LS   + 
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC+ C       C  G +   W++  K G VTG    +  GC+P     C HH +    
Sbjct: 63  SCCESC----GLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHH-TKGKY 117

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C ++     +C   C    Y   + QDKHR   +Y V ++E AI+KEI+ +GP  A F
Sbjct: 118 PPCGSKIYKTPRCKQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF 176

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF +YKSG+YKH +   L    H+ ++IGWG EN  PYWL+ N+W   WG+ G  +
Sbjct: 177 TVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKAPYWLIANSWNEDWGENGYFR 234

Query: 324 ILRGKYECAFEYLIAAGK 341
           I+RG+ EC+ E  + AG+
Sbjct: 235 IVRGRDECSIESEVTAGR 252


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 101/267 (37%), Positives = 155/267 (58%), Gaps = 10/267 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +    +AVGA SDR CI+S G+Q+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C       C  G     W++    G VTGG   + TGCQP     C HH 
Sbjct: 145 AIDLISCCENC----GSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           S    PSC ++     +C  +C    Y   +  DKH   ++  V  NE AI+KEI+ +GP
Sbjct: 200 SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
             A   +++DF +YKSG+Y++T+ + + E+Y+   ++IGWG ENGT YWL  NTW   WG
Sbjct: 259 VEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---RIIGWGIENGTAYWLAANTWNEDWG 315

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPKN 344
           ++G  +I+RG+ EC+ E ++ AG+ K+
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 111/259 (42%), Positives = 140/259 (54%), Gaps = 8/259 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI S G+ N  +S E + 
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC     D             WNF  K+G V+GG Y    GC+P +I PC HH +    
Sbjct: 61  TCCGGECGDGCNGGEPSG---AWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRP 117

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P       PK  C   C  P Y   + +DKH    +Y V +NE  I  EI  +GP    F
Sbjct: 118 PCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF 174

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
           ++Y DF  YKSGVY+H S   +    H+ +++GWG ENGTPYWLV N+W   WGD G  K
Sbjct: 175 SVYSDFLLYKSGVYQHVSGEIMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 232

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ILRG+  C  E  I AG P
Sbjct: 233 ILRGQDHCGIESEIVAGMP 251


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 166/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + T C+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKG-KYRACGDKLYETPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIKS 342


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 119/347 (34%), Positives = 169/347 (48%), Gaps = 21/347 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +  +L  +        + Y   + +I+ IN +A TW AG NF  N   + + + L   ++
Sbjct: 4   VFMLLSVIFVSVYATEQAYFLQEDFINNINEQATTWKAGMNFDPNTPHDDIIKLL--GSR 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                D+      KT+D  Y      +P+ FDAR +W  C TIG V D G C +    A 
Sbjct: 62  GVQNPDKVNHKLYKTHDEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVAT 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR C+ + G  N  LS E +  CC  C +     C  G   + W      G VTG
Sbjct: 122 SSAFADRLCVATTGDFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKRFSTHGLVTG 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           GDY    GC+P  + P +   S+ +        + +  C+   +        F D HR T
Sbjct: 178 GDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHCYGNQSID------FNDDHRYT 231

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLI 295
             Y+      +I+K++L +GP  A+F +YDDF  YKSGVY  + NA   +YL  H+ KLI
Sbjct: 232 RDYYYL-TYGSIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNA---SYLGGHAVKLI 287

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWG E+GTPYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 288 GWGEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTAGVP 334


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 114/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN+  N  W A ++          R   + DA+      +  P  R+   P 
Sbjct: 30  LSDEMISFINKHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   CA+    +AV A SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC+ C       C  G    +W++  K G VTGG   + TGC+P     C
Sbjct: 141 VELSAIDLISCCENC----GSGCDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C   C    Y   + QDKH    +Y V   E AI+KEI+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  I AG+ K+
Sbjct: 313 DWGEKGYFRIVRGRDECLIESFIVAGQIKS 342


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 107/294 (36%), Positives = 156/294 (53%), Gaps = 13/294 (4%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           + DA+      R  P  R+   P     + +  +P  FD+R++WP C +I  + D   C 
Sbjct: 24  VDDARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCG 83

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G +  +W++  
Sbjct: 84  SSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKYC----GSGCDGGFLGPSWDYWV 139

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
            RG VTGG   + TGC+P     C H        +C ++     +C+  C    Y   + 
Sbjct: 140 LRGIVTGGSKENHTGCRPYPFPKCDHFVKG-KYRACGDKLYKTPQCNQTC-QKGYNTSYE 197

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
           QDKH    +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H
Sbjct: 198 QDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--H 255

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           + +LIGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 256 AVRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 149/265 (56%), Gaps = 8/265 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C       C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLISCCEDC----GDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  +++RG+ EC+ E  + AG  K
Sbjct: 317 KGLFRMVRGRDECSIESHVVAGLIK 341


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 98/262 (37%), Positives = 148/262 (56%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C       C  G     W++  KRG VTGG   + TGCQP     C HH 
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C  +C    Y   + QDK+     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
            G  +++RG+ EC+ E  + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 105/261 (40%), Positives = 140/261 (53%), Gaps = 10/261 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +  +S E + 
Sbjct: 7   LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC  H +   
Sbjct: 67  TCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI  +GP    
Sbjct: 123 PPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGA 179

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  
Sbjct: 180 FSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFF 237

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KILRG+  C  E  + AG P+
Sbjct: 238 KILRGQDHCGIESEVVAGIPR 258


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 178/350 (50%), Gaps = 14/350 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   + K ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTKRNNQRIEPLSDEMISFINKHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           +      +     R+   P     + +  +P  FD+R++WP C +I  + D   CA+   
Sbjct: 61  RILLGGRKEDSNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            ++VGA SDR CI+S G+Q+  LS   + SCCK C       C  G    +W++    G 
Sbjct: 121 VSSVGAMSDRICIQSGGKQSVELSAIDLISCCKNC----GSGCDGGYFLPSWDYWVSHGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+P     C H        +C ++     +C   C    Y   + QDKH
Sbjct: 177 VTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYETPQCKQTC-QKGYNTSYEQDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V   E  I+K+I+ HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +L
Sbjct: 235 YGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRL 292

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 293 IGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 111/330 (33%), Positives = 158/330 (47%), Gaps = 31/330 (9%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY--------- 76
           ID +NR+ N W A ++          R+F+     Y D++   L G    +         
Sbjct: 66  IDYVNRKQNLWKAKKH----------RRFV----HYPDRTKWGLMGVNNVHLSVKAKQHL 111

Query: 77  --DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P+ FDAR+ W NC +I ++ D  +C +   F AV A SDR CI S  +  
Sbjct: 112 SSTKDLDIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQ 171

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS + + SCC+ C +     C  G     W +    G VTG ++    GC+P    PC
Sbjct: 172 VTLSADDLLSCCRTCGF----GCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPC 227

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +      C +   P  KC  +C      + +  D+      Y V ++  AI+KEIL
Sbjct: 228 EHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEIL 287

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP    F +Y+DF HY  G+Y HT   KL    H+ KLIGWG + GTPYWL+ N+W  
Sbjct: 288 THGPVEVAFEVYEDFLHYAGGIYVHTG-GKLGGG-HAVKLIGWGIDQGTPYWLIANSWNT 345

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG+ G  +ILRG  EC  E  +  G PK+
Sbjct: 346 DWGEEGFFRILRGVDECGIESGVVGGIPKS 375


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 105/273 (38%), Positives = 148/273 (54%), Gaps = 11/273 (4%)

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           +RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S 
Sbjct: 52  NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 111

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           GQ    LS + + SC       +   C  GS F+ W     +G VTGG++    GCQP  
Sbjct: 112 GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 167

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
             PC H+G +  L +C + +  ++  C  +C N  Y   +  D H+T++ Y     N   
Sbjct: 168 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 226

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
           I++EI+ HGP TA   +Y++F  YK G+YK T+  +L  Y H  KLIGWG + +GT YWL
Sbjct: 227 IQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTT-GELIGY-HHVKLIGWGVDGDGTEYWL 284

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +N+W  +WG+ G  KILRG   C+ E L+ AG
Sbjct: 285 AMNSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 104/293 (35%), Positives = 157/293 (53%), Gaps = 13/293 (4%)

Query: 49  EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
           E+LR+ ++  +K+ +++++P   D +       + +PD FDAR  WP+C +I ++ D   
Sbjct: 36  EHLRRKVMK-SKFINRNNKPREDDTEID----GSKIPDSFDARVTWPHCPSISYIRDQSQ 90

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C +   F++    SDR CI S G +   LS + + SCC     D    C  G     W +
Sbjct: 91  CGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCCT----DGGYGCDGGWPVSAWQY 146

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
             + G VTGG YG +  C+P  I PC  H +     +C  Q++    C T C    Y   
Sbjct: 147 FVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDCKTTC-QAGYPIS 204

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
           +  DK      Y V ++  AI+KEI+ +GP  A F +YDDF+HYK+G+YKH S A+    
Sbjct: 205 YDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGG- 263

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
            H+ +++GWG + G PYWLV N+W   WG+ G  +ILRG  EC  E  + AG+
Sbjct: 264 -HAVRILGWGQQGGVPYWLVANSWNTDWGENGYFRILRGSDECGIEDGVVAGQ 315


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 109/261 (41%), Positives = 139/261 (53%), Gaps = 12/261 (4%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI S G+ N  +S E + 
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDM- 59

Query: 144 SCCKICRYDDNKSCSHGSVFRT--WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
               +              F +  WNF  K+G V+GG Y    GC+P +I PC HH +  
Sbjct: 60  ----LTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGS 115

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
             P       PK  C   C  P Y   + +DKH    +Y V +NE  I  EI  +GP   
Sbjct: 116 RPPCTGEGDTPK--CSKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEG 172

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F++Y DF  YKSGVY+H S   +    H+ +++GWG ENGTPYWLV N+W   WGD G 
Sbjct: 173 AFSVYSDFLLYKSGVYQHVSGEIMGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGF 230

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            KILRG+  C  E  I AG P
Sbjct: 231 FKILRGQDHCGIESEIVAGMP 251


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 189/351 (53%), Gaps = 16/351 (4%)

Query: 1   MIHILVFLLGC-TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           M+ I V+++   TL+   +   ++  I+ ++ E  ++          +++  R   + DA
Sbjct: 1   MLKIAVYIVSLFTLLEAHVTTRNNERIEPLSDEMISFINKHPNAGWKADKSDRFHSVDDA 60

Query: 60  KYF---DQSDRPLPGDRK-TYDP-EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           +      + D  +   R+ T D  + +  +P +FD+R++WP+C +I  + D   C +   
Sbjct: 61  RILLGGGKEDAEMKWKRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +AVGA SDR CI+S G+Q+  LS   + SCC+ C       C  G     W++    G 
Sbjct: 121 VSAVGAMSDRICIQSGGKQSVELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSHGI 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGCQP     C HH S    PSC ++     +C  +C    Y   +  DKH
Sbjct: 177 VTGGSKENHTGCQPYPFPKCEHH-SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHDKH 234

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGK 293
              ++  V  NE AI+ EI+ +GP  A   +++DF +YKSG+Y++T+ + + E+Y+   +
Sbjct: 235 YGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV---R 291

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           +IGWG ENGT YWL  NTW   WG++G  +I+RG+ EC+ E ++ AG+ K+
Sbjct: 292 IIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 149/265 (56%), Gaps = 8/265 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S G Q+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCCK C       C  G     W++  KRG VTGG   + TGCQP     C HH 
Sbjct: 145 ALDLISCCKDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH    +Y V +NE  I+++I+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E  TPYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  +++RG+ EC+ E  + AG  K
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGLIK 341


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 117/321 (36%), Positives = 163/321 (50%), Gaps = 23/321 (7%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
            A +D +N   + +   +     ++EE ++ F + D KY       +   R T      A
Sbjct: 31  QALVDYVNSAQSLF---KTEHVEITEEEMK-FKLMDGKYAAAHSDEI---RATEQEVVLA 83

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +VP  FD+R QW  C +I  + D   C +   F A    SDR CI++KG Q   +S + +
Sbjct: 84  SVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDL 143

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T
Sbjct: 144 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 192

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             +C   K P   C   C +  Y   + +DKH     Y V  N  +I+ EI A+GP  A 
Sbjct: 193 SGNCPESKTPS--CSMSCQS-GYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAA 249

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F++Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG +WG+ G  
Sbjct: 250 FSVYEDFYKYKSGVYKHTAGKYLGG--HAIKIIGWGTESGSPYWLVANSWGVNWGESGFF 307

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  +C  E  + AGK K
Sbjct: 308 KIYRGDDQCGIESAVVAGKAK 328


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 167/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      +  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRKEDPNLRQRRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               + +  +P  FD+R++WP C +I  + D   C +    +A+GA SDR CI+S G+Q+
Sbjct: 81  VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC+ C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VKLSAVDLISCCENC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 99/262 (37%), Positives = 147/262 (56%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S G Q+  LS
Sbjct: 47  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELS 106

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCCK C       C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 107 ALDLISCCKDC----GDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 161

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 162 TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 220

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E  TPYWL+ N+W   WG+
Sbjct: 221 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKRTPYWLIANSWNEDWGE 278

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
           +G  +I+RG+ EC+ E  + AG
Sbjct: 279 KGLFRIVRGRDECSIESHVVAG 300


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 106/310 (34%), Positives = 162/310 (52%), Gaps = 15/310 (4%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSDRPLPGDRKTYDPEYSAT 83
           ID +N     W AG N   NL  + ++  L+   + K   +  + L   R +     +  
Sbjct: 23  IDYVNSHQTLWKAGMN-KFNLYSDTVKYGLLGVNNRKKSVEHKKNLSPIRHS-----NIF 76

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR+ WP C ++ ++ D  +C +    AAV A SDR CI SKG++   LS + + 
Sbjct: 77  IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLL 136

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCCK C +     C  G     W +    G VTG DY + +GC+P    PC HH +    
Sbjct: 137 SCCKTCGF----GCFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHY 192

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C++   P  KC+ +C +  Y + +  DK+     Y V+++ ++I+KEI+  GP  A+F
Sbjct: 193 EPCKHDLYPTPKCYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF 251

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y DF HY SG+YKH + +      H+ K++GWG + G  YWL  N+W   WG+ G  +
Sbjct: 252 EVYTDFLHYTSGIYKHVAGSVGGG--HAVKILGWGIDQGVSYWLAANSWNNDWGEDGYFR 309

Query: 324 ILRGKYECAF 333
           ILRG  EC  
Sbjct: 310 ILRGADECGM 319


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 168/336 (50%), Gaps = 28/336 (8%)

Query: 16  GELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
            ELY+        + +++IN + N WTA  +      E +  +  +       +    L 
Sbjct: 19  AELYEDTRPAIMQSLVNEINSKQNLWTASTD-----QERFYGRLKLCGT--LHEGTEGL- 70

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
            + K Y P   A +P  FDAR+ +  C   IGHV D  ACA+    A V AFS R CIKS
Sbjct: 71  -EEKVYPPGELADIPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKS 129

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT----- 184
            G+ N+ LS   + +CC +    + + C  G     W FL+K G  TGGD+  ++     
Sbjct: 130 GGKFNQLLSAGELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAV 189

Query: 185 -GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT--TLTYW 241
            GC P     C+H+        C  +      C  RC N  YG    +D+H T   + YW
Sbjct: 190 DGCWPYNFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYW 249

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
            +    +IKKEI+ HGPT+A+F  Y+DF+ YKSGVYK+TS A +E   H+ +LIGWGTE 
Sbjct: 250 FNGIR-SIKKEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVE--FHTVELIGWGTEK 306

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           G  YWL  N W   W D GT KI +G  +C    L+
Sbjct: 307 GVDYWLAKNDWNEEWADLGTFKIAQG--DCGINDLV 340


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 104/260 (40%), Positives = 140/260 (53%), Gaps = 9/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR+QWP+C TIG + D  +C +   F AV A SDR CI + G   + +S   + 
Sbjct: 80  IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C  G     W+F    G VTGG   + TGC+      CSHHGS    
Sbjct: 140 SCCGYCGF----GCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSK-KY 194

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C ++      C  +C  P     +  DK R  +TY V   ++AI KEI+ +GP  A F
Sbjct: 195 PPCSHRIYDTPNCVQKCDTPD--TDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAF 252

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF  YKSGVY H+    L    H+ +++GWG ENG  YWL+ N+W   WG+ G  K
Sbjct: 253 QVYEDFLGYKSGVYFHSDGTLLGG--HAIRILGWGEENGVAYWLIANSWNDGWGEDGYFK 310

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           +LRGK EC  E  + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 104/260 (40%), Positives = 140/260 (53%), Gaps = 9/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR+QWP+C TIG + D  +C +   F AV A SDR CI + G   + +S   + 
Sbjct: 80  IPKAFDARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLI 139

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C  G     W+F    G VTGG   + TGC+      CSHHGS    
Sbjct: 140 SCCGYCGF----GCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSK-KY 194

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C ++      C  +C  P     +  DK R  +TY V   ++AI KEI+ +GP  A F
Sbjct: 195 PPCSHRIYDTPNCVQKCDTPD--TDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAF 252

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF  YKSGVY H+    L    H+ +++GWG ENG  YWL+ N+W   WG+ G  K
Sbjct: 253 QVYEDFLGYKSGVYFHSDGTLLGG--HAIRILGWGEENGVAYWLIANSWNDGWGEDGCFK 310

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           +LRGK EC  E  + AG P+
Sbjct: 311 MLRGKNECGIEDEVTAGLPE 330


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 104/273 (38%), Positives = 148/273 (54%), Gaps = 11/273 (4%)

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           +RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S 
Sbjct: 52  NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 111

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           GQ    LS + + SC       +   C  GS F+ W     +G VTGG++    GCQP  
Sbjct: 112 GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 167

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
             PC H+G +  L +C + +  ++  C  +C N  Y   +  D H+T++ Y     N   
Sbjct: 168 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 226

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
           I++EI+ +GP TA   +Y++F  YK G+YK T+  +L  Y H  KLIGWG + +GT YWL
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTT-GELIGY-HHVKLIGWGVDGDGTEYWL 284

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +N+W  +WG+ G  KILRG   C+ E L+ AG
Sbjct: 285 AMNSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 162/345 (46%), Gaps = 16/345 (4%)

Query: 3   HILVFLLGCTLVRGELYKFSDAYI-DQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
            IL+ LL C +       F   +I D +NR    W AG N    L     +  +      
Sbjct: 6   EILLLLLFCNIWLSCNANFKLQHIVDHVNRANVPWEAGIN---QLGTSDYKNIVGTWGFQ 62

Query: 62  FDQSDRPLPGDR-KTYD-PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
            +  D  + G +   YD  + S  +P+ FDAR +W  C +I H+ + G CAA    +   
Sbjct: 63  KNGKDIDIIGHKVHNYDLDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTS 122

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A +DR CIKSK       S + + SCC  C       C+ G     W +  KRG VTGGD
Sbjct: 123 AINDRICIKSKKNITAFYSPQKMLSCCDDC----GDGCNGGYSGAAWQYWMKRGLVTGGD 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPS--CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           YG   GCQP  I PC+H       PS  C   K    +C   C NP Y + F +D  +  
Sbjct: 179 YGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDISKGI 238

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
              W       I+ E+  HGP TA   +Y+DF  YKSG+Y+H +   L     + K+IGW
Sbjct: 239 RIDW--HCSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQI--TVKVIGW 294

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G   G  YWL  N+WG  WGD+G  KI RG  EC FE    +G+P
Sbjct: 295 GVYRGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRP 339


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 165/330 (50%), Gaps = 23/330 (6%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A ++          R   + DA+      R  P  R+   P 
Sbjct: 30  LSDEMISFINEHPNAGWKADKSD---------RFHSVDDARILLGGRREDPNLREKRRPT 80

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +    +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+
Sbjct: 81  VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQS 140

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCCK C       C  G +  +W++   RG VTGG   + TGC+P     C
Sbjct: 141 VELSAVDLISCCKYC----GSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKC 196

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H        +C ++     +C+  C    Y   + QDKH    +Y V   E  I+K+I+
Sbjct: 197 DHFVKGK-YRACGDKLYKTPQCNQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIM 254

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIG G ENGT YWL  NTW  
Sbjct: 255 MHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISG--HAVRLIGCGVENGTAYWLAANTWNE 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIKS 342


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  194 bits (492), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 171/352 (48%), Gaps = 33/352 (9%)

Query: 4   ILVFLLGCT-LVRGELYKFS------DAYIDQINREANTWTAGRNFPANLSEEYLRQFLI 56
           + +FL GC+  V  E+   +         +D +N    +W A  N  +    E+  +F +
Sbjct: 7   LALFLAGCSAFVLDEIRGINIGQSPQKVLVDHVNTVQTSWVAEHNEIS----EFEMKFKV 62

Query: 57  ADAKYFD--QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            D K+ +  + D  +  +           +PD FDARE+WP+C TI  + +   C +   
Sbjct: 63  MDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWA 122

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
           F A    SDR CI+S G Q   +S E + SCC   C Y     C  G       F    G
Sbjct: 123 FGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGY----GCKGGYSIEALRFWASSG 178

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
           +VTGGDYG   GC P + +PC+ +    T PSC+          T C +      + +DK
Sbjct: 179 AVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCK----------TTCQSSYKTEEYKKDK 227

Query: 234 HRTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           H     Y V   +    I+ EI  +GP  A++ +Y+DFYHYKSGVY +TS   +    H+
Sbjct: 228 HYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HA 285

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            K+IGWG ENG  YWL+ N+WG  +G++G  KI RG  EC  E  + AG  K
Sbjct: 286 VKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAK 337


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 111/330 (33%), Positives = 165/330 (50%), Gaps = 16/330 (4%)

Query: 17  ELYKFSDAYIDQINREAN-TWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRP-LPGDR 73
           +   FSD  I  +N E+  +W A R+   +N+    L    +++      + RP +  D 
Sbjct: 22  QFEAFSDELIRFVNEESGASWKAARSTRFSNVDHFKLHLGALSETPEERNALRPTIKHDI 81

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
              D      +P+ FDAR QWP C TI  + D  +C +    AA  A SDR CI S GQ 
Sbjct: 82  SKND------LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQM 135

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
              L+     SCC  C     + C  G   + W++  + G VTGG + +RTGCQP   + 
Sbjct: 136 RPRLAAADPLSCCTYC----GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTK 191

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C H G +     C +   P   C   C    Y + + QDK     +Y V ++E  I +EI
Sbjct: 192 CDHVGDSRKYSRCPHYTYPTPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEI 250

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
           + +GP   TFA++ DF  Y+SG+Y H +   +    H+ ++IGWG ENG  YWL+ N+W 
Sbjct: 251 MKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYWLMANSWN 308

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WG+ G  +++RG+ EC  E  + AG P+
Sbjct: 309 EEWGENGYFRMVRGRNECGIESEVVAGMPR 338


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 115/346 (33%), Positives = 173/346 (50%), Gaps = 22/346 (6%)

Query: 2   IHILVFLLGCTLV-RGELYK---FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           + +LVF +G  ++ R E      F+D ++ Q+ R A TWT    F   +  E  +     
Sbjct: 4   VKLLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQ----- 58

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
           + K   +S        K +D  Y+  +P+ FDARE+WP C +I  + + G C A    AA
Sbjct: 59  NMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAA 118

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V   SDR CI S+G+ +  L+ E +  CCK C    N     G+ F+ W  +   G V+G
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDV---GLVSG 175

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
             Y    GC+P    PC +         C  +K P   C   CT   Y   + +DK+  +
Sbjct: 176 AAYNSTDGCKPYPFKPCLY-----PFVGCHPEKTP--SCTHHCTE-GYDGTYRRDKYYGS 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y + ++E  I+ EI+ +GP  + F++Y D Y YK+GVY+H    ++    H+ +LIGW
Sbjct: 228 AAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGK--HAVRLIGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G E G PYWL+ N++G  WG+ G  K LRG      E ++ AG PK
Sbjct: 286 GKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 21/313 (6%)

Query: 28  QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
           ++N++ N+W A  N P      ++            ++ +PLP        E    +P  
Sbjct: 26  RVNKQQNSWVANENTPLRDYSSFIGTL---------KNKKPLPIRSIPIKRE----LPKE 72

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FD+ E+WP C +I  V D  +CA+   F  V   +DR CI+SKG+    LS E V  CCK
Sbjct: 73  FDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLECCK 132

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
            C +     C  G     W +L + G VTGG Y     C+     PCS HG     P C 
Sbjct: 133 DCGFQ----CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCS-HGIEGQYPQCS 187

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +     KC T C    Y   + +D+++ +  Y +++N D IK EI+ +GP  A+F +Y+
Sbjct: 188 TKPPVVPKCETTCQE-GYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYE 246

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           DF  YKSG+Y H    K  N LH+ K+IGWG ENG  YW  +N+W   WG+ G  +I  G
Sbjct: 247 DFMTYKSGIYHHVE-GKFMN-LHTVKIIGWGEENGEAYWKAVNSWNSEWGENGLFRIRLG 304

Query: 328 KYECAFEYLIAAG 340
             EC  E  +  G
Sbjct: 305 TNECTIESQVEGG 317


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/321 (36%), Positives = 162/321 (50%), Gaps = 23/321 (7%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDPEYSA 82
           A +D IN+  ++W A  N    ++EE ++ F + D ++ D      P  D     PE   
Sbjct: 43  ALVDYINKAQSSWVAEHN---EMTEEEMK-FKVMDERFADPLQDGEPELDWGEIVPE--- 95

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +PD FD+REQWP C +I  + +   C +   F A    SDR CI+S   Q   +S E +
Sbjct: 96  PLPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDI 155

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC +      K C  G       F    G+VTGGDY +  GC P + +PC        
Sbjct: 156 LSCCGV---SCGKGCQGGYSIEALRFWKSSGAVTGGDY-NGAGCMPYSFAPCKKD----- 206

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             SC     P   C T C +      + +DKH  T  Y + ++  AI+ EI  +GP  A+
Sbjct: 207 --SCAQGTTPS--CKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEAS 262

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DFY YKSGVY++TS   +    H+ K+IGWGTENG  YWL+ N+WG  +GD G  
Sbjct: 263 FKVYEDFYKYKSGVYQYTSGKLVGG--HAVKIIGWGTENGVDYWLIANSWGTTFGDSGFF 320

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           K+ RG  E   E  + AG  K
Sbjct: 321 KMRRGTNEVGIEGNVVAGTAK 341


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 115/320 (35%), Positives = 164/320 (51%), Gaps = 23/320 (7%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A +D +N   + +T        +SEE+++  ++ D KY       +   R T      A+
Sbjct: 33  ALVDYVNSAQSLFTTEH---VEVSEEFMKSRVM-DVKYAAAHSDEI---RATEVNTVLAS 85

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+R QW  C +I  + +   C +   F A    SDR CI++KG Q   +S + + 
Sbjct: 86  IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T 
Sbjct: 146 SCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------TS 194

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +C   K P   C   C +  Y   + +DKH     Y V  +  AI+ EI+ +GP  A F
Sbjct: 195 GNCPESKTPA--CSLSCQSG-YSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAF 251

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG +WG+ G  K
Sbjct: 252 TVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTNWGESGFFK 309

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           ILRG  +C  E  + AGK +
Sbjct: 310 ILRGDDQCGIEGAVVAGKAR 329


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 171/359 (47%), Gaps = 44/359 (12%)

Query: 2   IHILVFLLGCTLVRGELYKFSDA--YIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIAD 58
           I I++  +    VR      ++A   ID +N +  NTW A                +  D
Sbjct: 7   IFIVLATMVAVAVRESSAVTNEATFIIDSVNADPGNTWRASDT-----------NVIPGD 55

Query: 59  AKYFDQSDRPLPGD---------RKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGA 108
            K F+Q    LP +         +K+ + E +  +P+ FDARE+WP C ++ G + D   
Sbjct: 56  GKNFNQLMGVLPRNFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSN 115

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C +    +A   FSDR CI + G   R LS E + +CC  C       C  GS    W F
Sbjct: 116 CGSCWAVSAASVFSDRLCIATGGAVARNLSAEQLNTCCYRC----GNGCDGGSPESAWYF 171

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA-----PTLPSCENQKVPKLKCHTRCTNP 223
             + G VTGGDYG   GCQP +I PC    +      P  P C  +          CTN 
Sbjct: 172 FMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDTPDCSIKT---------CTNS 222

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            Y + +  D H     Y +  +E+ I K++  +GP  A F +Y DF +YKSGVY +T   
Sbjct: 223 NYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYT-RG 281

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ++E   H+ K++GWG ++GT YWL  N+W   WG+ G  +ILRG  EC  E  + AG P
Sbjct: 282 QIEGG-HAIKILGWGVDDGTKYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMP 339


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 174/358 (48%), Gaps = 37/358 (10%)

Query: 2   IHILVFLLGCTL--------VRGELYKFSD---AYIDQINREANTWTAGRNFPANLSEEY 50
           I I+  LL   L        ++ + +K+SD      +++N    TW AG N         
Sbjct: 6   IFIVAALLSAALTGFYTYEALKHKEFKYSDRLKQLAEEVNNANTTWKAGENI-------- 57

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPN-CGTIGHVPDT 106
             +++ AD          L GD     P  +A    +P  FDAR+QW + C ++  V D 
Sbjct: 58  --KWINADIAGVKAHLGALEGDNGENLPVSNAVKADLPTAFDARQQWGDKCTSLWEVRDQ 115

Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
             C +   F AV + +DR CI   GQ  R LS + + +CC  C     + C+ G      
Sbjct: 116 SNCGSCWAFGAVESLTDRHCIH-LGQDIR-LSAQNMLTCCATC----GQGCNGGYPASAM 169

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
           ++  K G VTG  Y     CQ  + +PC+HH   P  P+C  + +P  KC   C +   G
Sbjct: 170 SYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPACTGE-LPTPKCAKTCDS---G 225

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
            G     H+ +  Y V   ++AI  EI  +GP  A F +Y+DF +YKSGVYKH +   L 
Sbjct: 226 SGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALG 285

Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
              H+ K++GWG EN TPYW+V+N+W   WGD GT KILRGK EC  E  +    P N
Sbjct: 286 G--HAIKIVGWGVENNTPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALPLN 341


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 114/326 (34%), Positives = 159/326 (48%), Gaps = 31/326 (9%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S  +I+ IN++  +W AG NFP N    +LR    A      + D     D +T +   
Sbjct: 22  LSQQFINAINQKHPSWLAGPNFPPNTPHSHLRSLNGA------RDDPAFFTDTETKNVTI 75

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P  FDAR  WP C +I  + + G+C +   F AV   SDR CI S   +    S +
Sbjct: 76  PEQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQ 135

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + +CCK C +     C  G   R W +    G V+GGD+    GC P ++         
Sbjct: 136 DLLACCKECGH----GCGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSVQAFRDS--- 188

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
            T P+C           + CTNP Y + + +DK     +Y +  N + I+ EI+  GP  
Sbjct: 189 -TTPNCS----------SFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQ 237

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENY--LHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
           A++ +YDDFY Y++GVY+H     L N    HS K++GWG ENGT YWLV N+WG  WG 
Sbjct: 238 ASYVVYDDFYSYQNGVYQHV----LGNVSGRHSVKILGWGRENGTDYWLVANSWGRDWGR 293

Query: 319 RGT-VKILRGKYECAFEYLIAAGKPK 343
            G   K LRG+  C  E  I  G PK
Sbjct: 294 LGGFFKFLRGENHCDIESNILGGDPK 319


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 173/346 (50%), Gaps = 22/346 (6%)

Query: 2   IHILVFLLGCTLV-RGELYK---FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           + +LVF +G  ++ R E      F+D ++ Q+ R A TWT    F   +  E  +     
Sbjct: 4   VKLLVFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFENFQ----- 58

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
           + K   +S        K +D  Y+  +P+ FDARE+WP C +I  + + G C A    A 
Sbjct: 59  NMKGIFESKIGFRLPTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAT 118

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V   SDR CI S+G+ +  L+ E +  CCK C    N     G+ F+ W  +   G V+G
Sbjct: 119 VSVMSDRLCIHSEGKFDVELAAEDLMGCCKDCGNGCNGGFLDGTSFQYWVDV---GLVSG 175

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
             Y +  GC+P    PC +         C  +K P   C   CT   Y   + +DK+  +
Sbjct: 176 AAYNNTDGCKPYPFKPCLY-----PFVGCHPEKTP--SCTHHCTE-GYDGTYRRDKYYGS 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y + ++E  I+ EI+ +GP  + F++Y D Y YK+GVY+H    ++    H+ +LIGW
Sbjct: 228 AAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGK--HAVRLIGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G E G PYWL+ N++G  WG+ G  K LRG      E ++ AG PK
Sbjct: 286 GKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 151/318 (47%), Gaps = 16/318 (5%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           +D IN    TW A R       +E  R     +       D+ LP          +  +P
Sbjct: 29  VDHINSLKTTWVAERPTRFGSFDEVARLCGALETP----EDQRLPLKVA----PIAEAIP 80

Query: 86  DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
           D FD+R  WP C TI  V D  AC +   F AV + SDR CI S   +   LS   + SC
Sbjct: 81  DTFDSRTNWPACPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSC 140

Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
           C  C       C  G +  +W++   +G VTG  Y     C+P     C+HH ++P  P 
Sbjct: 141 CTSC----GDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPD 196

Query: 206 CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
           C +      KC   C        +  D H    +Y V   + AI+ EIL HGP  A F +
Sbjct: 197 CPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTV 256

Query: 266 YDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKIL 325
           Y DF  Y+SGVYKHTS + L    H+  ++GWGTE+G+PYWLV N+W P WGD G  KIL
Sbjct: 257 YSDFPTYRSGVYKHTSGSVLGG--HAISIVGWGTESGSPYWLVKNSWNPSWGDGGFFKIL 314

Query: 326 RGKYECAFEYLIAAGKPK 343
           RG  +C     +  G PK
Sbjct: 315 RG--DCGINNDVVGGLPK 330


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 114/347 (32%), Positives = 176/347 (50%), Gaps = 18/347 (5%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINR--EANTWTAGRNFP-ANLSEEYLRQFL--IAD 58
           +L+ L+  ++   +   F+   ++++N     +TW AG N     +S + ++  +  IA 
Sbjct: 6   LLIALIVASVQAFDFKLFTSEIMEEVNNYNTGSTWKAGYNKRFEGMSFDQIQAMMGTIAT 65

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +    +R  P     ++   + ++P+ FD RE +P C ++  V D   C +   F  V
Sbjct: 66  PVHMIPDERYTP-----FETIQNLSLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTV 120

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            A SDR CI S  +    +S+E + SCC+   +     C+ G     WN+  K G V+G 
Sbjct: 121 EAISDRICIASGQKDQTRISSENLLSCCR-GTFACGMGCNGGYTAGAWNYYVKTGLVSGN 179

Query: 179 DYGD-----RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            Y D     +T CQP +  PCSHH         +  +    KC+T C +      + QD 
Sbjct: 180 LYTDDNQNSKTECQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDL 239

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           H+   +Y V  +E+ IK EI  +G TTA+F +Y DF  Y SGVY++TS + +    H+ K
Sbjct: 240 HKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGG--HAIK 297

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           ++GWG ENGTPYWL  N+W   WG+ G  KILRG  EC  E  + AG
Sbjct: 298 MLGWGVENGTPYWLCANSWNSSWGENGFFKILRGSNECGIESGMVAG 344


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 119/331 (35%), Positives = 164/331 (49%), Gaps = 26/331 (7%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPAN--LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           FSD  I  IN ++  +W A    P++  ++ E+ +Q L       +++    P +R+T  
Sbjct: 16  FSDELIHYINEKSGASWKAA---PSSRFINIEHFKQHL----GLLEET----PEERQTRR 64

Query: 78  PEYSATV-----PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           P     V     P+ FDARE+WP C +I  +PD  +C +    A VGA SDR CI S G 
Sbjct: 65  PTVRYNVSDNDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGM 124

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
               LS   + SCC  C       C  GS    W++  + G VTGG   + TGC P    
Sbjct: 125 MQPELSAIDLVSCCSYC----GNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFP 180

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
            C H GS   L  C     P   C+  C    Y + + +DK     +Y VD +E  I +E
Sbjct: 181 QCRHPGSRSQLNPCPRYTYPTPSCYPYC-QAGYDKTYEKDKVYGKTSYNVDRHEYTIMEE 239

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I+ +GP  A F +Y DF  YKSG+Y H S        H+ ++IGWG ENG  YWL  N+W
Sbjct: 240 IMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGK--HAIRIIGWGVENGVKYWLTANSW 297

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
              WG+ G  +ILRG  EC  E ++ AG P+
Sbjct: 298 NVGWGENGYFRILRGTDECRIESIVVAGMPR 328


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/320 (36%), Positives = 159/320 (49%), Gaps = 15/320 (4%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           D  I ++N    +W AG NF +N + ++     +A        D  LP +    D +   
Sbjct: 41  DDIIAKVNSADLSWKAGANFNSNYAPKH-----VAGLCGTIMGDDRLPVNHLLNDADLE- 94

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FD+RE WP+C +I  V D G+C +   F A  A SDR CI S       LS+E +
Sbjct: 95  -LPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDL 153

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC    Y     C+ G     W +  + G V+GG Y   TGCQP  I PC HH +   
Sbjct: 154 LSCCG---YVCGNGCNGGFPQAAWEYWVQNGLVSGGLY-HGTGCQPYAIEPCEHH-TEGD 208

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P C  ++    KC  +C +  Y   F QDKH  ++ Y +  NE AI  EI  +GP    
Sbjct: 209 RPPCTGEEGTTPKCSHKCVD-GYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNGPVEGA 267

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF  YKSGVY H + + L    H+ +++GWG ENG  YWL  N+W   WG+ G  
Sbjct: 268 FIVYEDFPTYKSGVYSHHTGSALGG--HAIRVLGWGEENGEKYWLCGNSWNTDWGNNGFF 325

Query: 323 KILRGKYECAFEYLIAAGKP 342
           KI RG  EC  E  +  G P
Sbjct: 326 KIKRGVNECGIESEMVGGIP 345


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 106/294 (36%), Positives = 153/294 (52%), Gaps = 13/294 (4%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDP-----EYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           + DA+      R  P  R+   P     + +  +P  FD+R++WP C +I  + D   CA
Sbjct: 24  VDDARILLGGRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCA 83

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    +AVGA SDR CI+S G+Q+  LS   + SCCK C       C  G    +W++  
Sbjct: 84  SSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNC----GSGCDGGVTGYSWDYWV 139

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G VTGG   + TGC+P     C H        +C ++     +C   C    Y   + 
Sbjct: 140 SHGIVTGGSKENHTGCRPYPFPKCDHFVKGK-YRACGDKLYKTPQCKQTC-QKGYNTSYE 197

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
           QDKH    +Y V   E  I+K+I+ HG   A   +Y+DF +YKSG+Y++T+   +    H
Sbjct: 198 QDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISG--H 255

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           + +LIGWG ENGT YWL  NTW   WG++G  +I+RG+ EC  E  IAAG  K+
Sbjct: 256 AVRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 161/320 (50%), Gaps = 13/320 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
              +++  + + WTAG      +S  ++   L+ D +    +D        T+ PE S  
Sbjct: 34  GMFEELIPKNSFWTAGI---PKVSRSFMLSTLVKDPEIIGFNDL-----GPTFSPENSDL 85

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
            P  FDARE+WP C +I  + D   C +   FAA  + SDR CI S G  +  LS + + 
Sbjct: 86  SP-FFDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELL 144

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC        + C+ G+  + W +  K G  TGG Y  + GC+P +I+PC       T 
Sbjct: 145 SCCT-GVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTY 203

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C N  +P   C  +C  P Y     +D+H       + + +  I+ +++ +GP  AT 
Sbjct: 204 PPCTNTTLPTPTCEKKC-KPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATM 262

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +YDDF  Y +G+Y H +  K + +L S +++GWG   G PYWL+ N+WG  WG+ GT +
Sbjct: 263 EIYDDFLQYTTGIYVHLAGNK-QGHL-SVRILGWGMFEGVPYWLLANSWGKEWGENGTFR 320

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           +LRG  EC  E    +G PK
Sbjct: 321 VLRGVNECGLEANCISGMPK 340


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 105/262 (40%), Positives = 137/262 (52%), Gaps = 9/262 (3%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  WP+C +I  + D  +C +   F AV A SDR CI S G  N+ LS  
Sbjct: 83  SKLIPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAV 142

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCK C       C  G     W+F    G VTGG   + TGC+P     C HH S 
Sbjct: 143 DLLSCCKDC----GDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHH-SQ 197

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P C  +  P  KC   C  P     + +DK R   +Y V  +E AI KEIL +GP  
Sbjct: 198 GHYPPCPRRIYPTPKCVKHCDTPKID--YQKDKTRANTSYNVHQSEVAIMKEILLNGPVE 255

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           ATF +++DF  YKSG+Y H     +    H+ +++GWG ENG PYWL+ N+W   WG++G
Sbjct: 256 ATFEVHEDFPEYKSGIYFHAWGGSVGG--HAIRILGWGEENGVPYWLIANSWNEDWGEKG 313

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
            ++ LRG  EC  E    AG P
Sbjct: 314 YLRFLRGHNECGIEEEATAGLP 335


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 104/280 (37%), Positives = 142/280 (50%), Gaps = 12/280 (4%)

Query: 69  LPGDRKTYD---PEYS-ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           +P + + ++   PE   A VPD FD+R  WPNC +I  + D  +C +    +A    SDR
Sbjct: 78  IPEEYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDR 137

Query: 125 RCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
            CI S  +    +S + + +CC  +C       C+ G     W    K+G VTGG Y D+
Sbjct: 138 ICIASNAKTILSISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKGYVTGGSYQDK 193

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
           TGC+P    PC HH +      C +   P  KC   C    Y   + QD H     Y V 
Sbjct: 194 TGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSC-QAGYALTYQQDLHFGQSAYAVS 252

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
                I+KEI+ HGP    F +Y+DF HY  GVY HT+ A L    H+ K++GWG +NGT
Sbjct: 253 KKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG--HAVKMLGWGVDNGT 310

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           PYWL  N+W   WG+ G  +I+RG  EC  E  +  G PK
Sbjct: 311 PYWLCANSWNEDWGENGYFRIIRGVNECGIEGGVVGGIPK 350


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 160/318 (50%), Gaps = 13/318 (4%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A++D IN++ + + A  +  A   EE++R   I D K+    ++  P +    + E    
Sbjct: 39  AFVDYINQQQSFFRAEYSPDA---EEFVRN-RIMDVKFAVDPEKTEP-NYVLANTEMKVD 93

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDAR++WPNC ++ H+ D  +C +    AA  A SDR C  + G+ NR LS   V 
Sbjct: 94  IPDTFDARDRWPNCTSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVL 153

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C +     C  G   R + +  + G  TGG YG++  CQP    PC +H   P 
Sbjct: 154 SCCFGSCGF----GCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPY 209

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C ++  P   C   C    Y   F +DK     TY++  NE  IK EI+  GP  AT
Sbjct: 210 YGPCPDELWPTPTCRRTC-QLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVAT 268

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           + +Y DF +YK GVY H         LH+ K+IGWG  N  PYWLV N+W   WGD G  
Sbjct: 269 YKVYRDFDYYKKGVYIHREGEVTG--LHAVKIIGWGKGNDVPYWLVANSWNTDWGDNGYF 326

Query: 323 KILRGKYECAFEYLIAAG 340
           +I+RG   C  E  +  G
Sbjct: 327 RIVRGTDNCEIERQMVGG 344


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 109/324 (33%), Positives = 165/324 (50%), Gaps = 14/324 (4%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           FSD  I  IN E+  +W A  +   N  ++  +   + +    D++ +     R+T    
Sbjct: 26  FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 80

Query: 80  YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
            S   +P+ FDAR++W NC +I  + D  +C++    ++  A +DR CI S GQ+   LS
Sbjct: 81  VSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 140

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC  C Y     C+ G    +W++  + G VTGG   + TGC P     CSH  
Sbjct: 141 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 196

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P LP C     P  KC  +C +  Y + + QDK +   +Y V   E  I  EI+ +GP
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGP 255

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +++DF  YKSG+Y +T+   +    H+ ++IGWG ENG  YWL+ N+W   WG+
Sbjct: 256 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVKYWLIANSWNEGWGE 313

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
           +G  ++ RG  EC  E  I AG P
Sbjct: 314 KGYFRMRRGNNECGIEARINAGLP 337


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 103/271 (38%), Positives = 142/271 (52%), Gaps = 9/271 (3%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S G
Sbjct: 53  RKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 112

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
           +    LS + + SC      D+   C  GS ++ W F   +G VTGG Y    GCQP   
Sbjct: 113 KFTDNLSAQNLMSCGD----DEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 168

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAIK 250
            PC H+G +        ++   + C  +C N  Y   +  D ++T++ Y     N   I+
Sbjct: 169 RPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQ 228

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
           +EI+ +GP TA   +Y++F  YK GVYK T+  +L  Y H  KLIGWG  E G  YWL +
Sbjct: 229 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTA-GELIGY-HHVKLIGWGVDEAGIEYWLAM 286

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           N+W  +WG+ G  KILRG   C+ E L+ AG
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIELLVMAG 317


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 24/328 (7%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
           FSD  I  +N E+  +W A R+   N  E++ +     +           P +R T  P 
Sbjct: 26  FSDELIRYVNEESGASWKAARSTRFNNIEQFKKHLGALEET---------PEERNTRRPT 76

Query: 79  -EYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
             YS +   +P+ FDARE+WPNC +I  +PD  +C++        A +DR CI S G++ 
Sbjct: 77  VRYSVSENDLPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKK 136

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS   + SCC  C Y     C  G     W++  + G V+GG   + TGC P     C
Sbjct: 137 PRLSAVDLVSCCPYCGY----GCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKC 192

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
           SH    P L  C  +     KC  +C    Y +   +DK +   +Y V D E  I  EI+
Sbjct: 193 SHLEETPGLAPCPRELYATPKCEKQC-QAGYSKTSEEDKIKGKSSYNVGDRETDIMMEII 251

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP +  + +++DF  YKSG+Y++TS + +  +     +IGWG ENG  YWL  N+W  
Sbjct: 252 TNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGH----GIIGWGVENGVKYWLAANSWNE 307

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG+ G  +I RG  EC  E  I AG P
Sbjct: 308 GWGENGYFRIRRGTNECGIESRINAGLP 335


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 125/349 (35%), Positives = 175/349 (50%), Gaps = 25/349 (7%)

Query: 1   MIHILVFLLGCTLVR----GELYKFSDAYIDQINREANT-WTAGRNF-PANLSE-EYLRQ 53
           ++ ILV + G   V       +   SDA I  IN  ANT W AGRNF PA +     L  
Sbjct: 3   IMRILVAICGLLAVALATPFHIEPLSDAEIFYINHVANTTWKAGRNFHPAEIKRARALLG 62

Query: 54  FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPH 113
             +A+ K +++    +    K   P     +PD FD R +WP+C ++  + D   C +  
Sbjct: 63  VNMAENKAYNR----IHLKYKQVQPRND--LPDNFDPRTKWPDCASLNEIRDQANCGSCW 116

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG 173
            F +  A +DR CI  KG  N  +S E +  CCK C       C+ G     W +    G
Sbjct: 117 AFGSAEAMTDRICIAGKG--NIHISAEDINDCCKSC----GMGCNGGYPAAAWEWYVDTG 170

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            V+GG YG   GC P ++  C HH +    P      VP  KC  +C    Y + +  DK
Sbjct: 171 VVSGGQYGTNEGCMPYSLPHCDHHTTGKYQPC--PAVVPTPKCEKKCLT-GYPKSYSNDK 227

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
            R   +Y V   + +I +E++ +GP TA F +Y DF  YK+GVY+HT+ +      H+ K
Sbjct: 228 TRGKKSYGVRGVQ-SIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGG--HAVK 284

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +IG+GTE+G  YWLV N+W   WGD+G  KI +GK EC  E  I AG P
Sbjct: 285 IIGYGTESGQDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAGDP 333


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 107/318 (33%), Positives = 162/318 (50%), Gaps = 17/318 (5%)

Query: 27  DQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPD 86
           +++N    TW AG N       +++   +     +         G +       +  +P 
Sbjct: 42  EKVNNSNTTWKAGENI------KWINSDIAGVKAHMGTLLNQKSGVKLEKVNRQANNLPS 95

Query: 87  RFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
            FD+R QW + C ++  V D   C +   F A  + SDR CI   GQ  R LST+ + +C
Sbjct: 96  EFDSRVQWGDKCSSLWEVRDQSNCGSCWAFGAAESLSDRHCIH-LGQDIR-LSTQNLVTC 153

Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
           C  C +     C  G      ++    G VTG  YG+ + CQ  +++PC+HH ++   P 
Sbjct: 154 CDECGF----GCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPP 209

Query: 206 CENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
           C  + +P   C   C +N TY   + +D H+ +  Y +D NE AI  EI  +GP    F 
Sbjct: 210 CTGE-LPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFT 268

Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
           +Y+DF  YKSGVY+H + ++L    H+ K++GWG ENGTPYW+++N+W   WGD+GT KI
Sbjct: 269 VYEDFLTYKSGVYQHVTGSELGG--HAVKMVGWGVENGTPYWIIVNSWNESWGDKGTFKI 326

Query: 325 LRGKYECAFEYLIAAGKP 342
           LRG+ EC  E       P
Sbjct: 327 LRGQNECGIESECVTALP 344


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  190 bits (482), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 165/324 (50%), Gaps = 14/324 (4%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           FSD  I  IN E+  +W A  +   N  ++  +   + +    D++ +     R+T    
Sbjct: 26  FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 80

Query: 80  YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
            S   +P+ FDAR++W NC +I  + D  +C++    ++  A +DR CI S GQ+   LS
Sbjct: 81  VSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 140

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC  C Y     C+ G    +W++  + G VTGG   + TGC P     CSH  
Sbjct: 141 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 196

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P LP C     P  KC  +C +  Y + + QDK +   +Y V + E     EI+ +GP
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGP 255

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +++DF  YKSG+Y +T+   +    H+ ++IGWG ENG  YWL+ N+W   WG+
Sbjct: 256 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVKYWLIANSWNEGWGE 313

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
           +G  ++ RG  EC  E  I AG P
Sbjct: 314 KGYFRMRRGNNECGIEARINAGLP 337


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 174/345 (50%), Gaps = 23/345 (6%)

Query: 5   LVFLLGCTLV------RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           L+  L C LV      +   +  SD  ++ IN++ +TW AG NF  N+   YL++     
Sbjct: 4   LLASLCCLLVLTSAWSKPYFHPLSDELVNFINKQNSTWQAGHNF-RNVDMSYLKRLC--- 59

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
             +      P    R  +  + +  +P  FDAREQW +C TI  + D G+C +   F AV
Sbjct: 60  GSFLGGPKLP---QRVKFAKDMN--LPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAV 114

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            + SDR CI + G  +  +S E + +CC     D             WNF  ++G V+GG
Sbjct: 115 ESISDRICIHTNGHVSVEVSAEDLLTCCGGQCGDGCNGGYPA---EAWNFWTRKGLVSGG 171

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y    GC+P +I PC HH +  + P+C  +     KC   C  P Y   + +DKH    
Sbjct: 172 LYESHVGCRPYSIPPCEHHVNG-SRPACTGEG-DTPKCSKTC-EPGYSPTYKEDKHFGYT 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y +  NE  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG
Sbjct: 229 SYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGG--HAIRILGWG 286

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            ENG PYWLV N+W   WGD G  +ILRG+  C  E  + AG P+
Sbjct: 287 EENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPR 331


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 120/342 (35%), Positives = 166/342 (48%), Gaps = 31/342 (9%)

Query: 15  RGELYKFSDAYIDQINREANTWTAG-----RNFPANLSEEYLRQFLIADAKYFDQSDRPL 69
           R   +  SD  ++ +N+   TW  G      NF  N+   YL++       +      P 
Sbjct: 20  RPSFHPLSDELVNYVNKRNTTWQVGCGAASYNF-YNVDVSYLKRLC---GTFLGG---PK 72

Query: 70  PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHV---PDTGAC----AAPHIFAAVGAFS 122
           P  R T+  + +  +P+ F AREQWP C TI      P  G      +    F AV A S
Sbjct: 73  PPQRVTFTEDLN--LPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAIS 130

Query: 123 DRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           DR CI +    +  +S E + +CC  +C       C+ G     WNF  ++G V+GG Y 
Sbjct: 131 DRICIHTNAHISVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYD 186

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GC+P +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y 
Sbjct: 187 SHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYS 243

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           V ++E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG EN
Sbjct: 244 VSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGG--HAIRILGWGVEN 301

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GTPYWLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 302 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 343


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 110/334 (32%), Positives = 164/334 (49%), Gaps = 22/334 (6%)

Query: 18  LYKFSDAY----IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-- 71
            +  SD++    ID +N +   WTAG   P    E  L+  +          D  L G  
Sbjct: 11  FFAISDSFDPLIIDYVNSQNTLWTAG--IPKIPRESMLKTLV---------KDPHLAGFR 59

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           D     P  ++ +   FDARE+WP C +I  + D   C +   FAA  + SDR CI S G
Sbjct: 60  DHGPSVPTENSDLSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGG 119

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
             N  LS + + SCC        + C  G+ F+ W +  K G  TGG Y  + GC+P +I
Sbjct: 120 MINTILSAQELLSCCTGV-LSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSI 178

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTY-WVDDNEDAI 249
           +PC       T P+C N  +P   C  +CT+   Y     +D+H    +   + + +  I
Sbjct: 179 APCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEI 238

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           + +++ +GP   TF +YDDF  Y +G+Y H +  K + +L S +++GWG   G PYWL+ 
Sbjct: 239 QSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRILGWGMYEGVPYWLLA 296

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+WG  WG+ GT + LRG  EC  E    +  PK
Sbjct: 297 NSWGKEWGENGTFRALRGTNECGLEANCVSAMPK 330


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 172/348 (49%), Gaps = 23/348 (6%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I   V L+   L   + Y     +ID IN++A TW AG N   N  +E++ + L   ++
Sbjct: 5   IILASVILISVYLTE-QAYFLEKDFIDNINKQATTWKAGVNSAPNTPKEHILRLL--GSR 61

Query: 61  YFDQSDRPLPGDRKTYD-PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
                D+      K  D  +    +P +FDAR++W  C TIG V D G C +    +   
Sbjct: 62  GVQIPDKVNYNMYKNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSS 121

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR C+ + G  N+ LS E +  CC  C       C+ G   R W      G VTGG+
Sbjct: 122 AFADRLCVATNGDFNQLLSAEEITFCCHKC----GNGCNGGYPIRAWKRFKNHGLVTGGN 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRT 236
           Y    GC+P  + PC +        +C  Q    ++ + +C+   YG     F +D   T
Sbjct: 178 YKSGEGCEPYRVPPCPYDKDGKN--TCSGQP---MESNHKCSKKCYGDEDIDFNKDHRYT 232

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKL 294
              Y++      I+K+++ +GP   +F +YDDF +YKSG+Y  + NA   +YL  HS KL
Sbjct: 233 RDDYYL--TYRGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENA---SYLGGHSVKL 287

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           IGWG E G  YWL++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 288 IGWGEEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVDNSTTGGVP 335


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/259 (38%), Positives = 136/259 (52%), Gaps = 9/259 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR +WP+C +I  + D   C +   F AV A SDR CI S G  N+ LS   + 
Sbjct: 86  LPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC+ C Y     CS G     W++    G VTGG   D +GC+      C HH      
Sbjct: 146 SCCENCGY----GCSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HY 200

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C +Q  P  +C   C  P  G  + +DK R  ++Y +  +E  I KEI+  GP  A F
Sbjct: 201 PPCPHQYYPTPECVQHCDTP--GIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVF 258

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF  YK GVY H+  A L    H+ +++GWG E   PYWL+ N+W   WG++G +K
Sbjct: 259 TVYEDFLQYKFGVYFHSWGAPLSE--HAIRILGWGEEGDVPYWLIANSWNEDWGEKGYMK 316

Query: 324 ILRGKYECAFEYLIAAGKP 342
            LRG  EC  E  + AG P
Sbjct: 317 FLRGLNECGIEDDVTAGLP 335


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 171/351 (48%), Gaps = 44/351 (12%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK----YFDQSDRPLPGDRKTYDPE 79
           + +D++N + N WTA  +      +E      + DAK       +  + L  ++K Y  E
Sbjct: 3   SLVDEVNSKQNLWTASTD------QERFYGRSLGDAKKLCGTLPEETKGL--EKKVYPTE 54

Query: 80  YSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
             A +P  FDAR+ +  C   IGHV D  AC +    A V AF+ R CIKS G+ N+ LS
Sbjct: 55  ELADIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLS 114

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR------TGCQPSTIS 192
              + +CC      ++  C  G     W+FL   G VTGGD+  +       GC P +  
Sbjct: 115 AGEMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFP 174

Query: 193 PCSHHGSAPTLPSCENQKVPKL--------------------KCHTRCTNPTYGRGFFQD 232
            C+H         C   +VP L                     C  RC N  YG    +D
Sbjct: 175 KCAHDQEDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKD 234

Query: 233 KHRTTLTY-WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           +H T     ++ +  D IKKEI+ +GPT+A+F+ Y+DF  YKSGVYKHTS   L +  HS
Sbjct: 235 RHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGD--HS 292

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ++IGWGTE G  YWLV+N+W   WGD GT KI +G  +C  +  +    P
Sbjct: 293 VEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLP 341


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 163/351 (46%), Gaps = 40/351 (11%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +  + + LL C L   E  K S   + Q N E NT  A  N   N ++E   + L+   K
Sbjct: 5   LFLMSIMLLSCYLT--EQAKLSRDNMIQTNIETNTLKALDNIDLNSAKE---EHLMLLGK 59

Query: 61  YFDQSDRPLPGDRKTYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
               +        KT DP Y A   +   FDAR+ W  C TIG V + G       +A  
Sbjct: 60  RGVAATFKSKLLYKTRDPRYVAYGKISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATT 119

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKI---CRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           GAF+DR C+ + G  N+ LSTE + SC  I      DD          + W F  K+G V
Sbjct: 120 GAFADRMCVATNGSYNQLLSTEQLISCSGIKSNAMADD----------QAWKFFKKQGLV 169

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH- 234
           +GG Y    GCQPS I P  +              +PK   +  C N  YG       H 
Sbjct: 170 SGGKYNTNDGCQPSKIPPIFN--------------LPKKIYNRTCDNFCYGNSLIDYNHD 215

Query: 235 --RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
             + + TY V      I++E+  +GP +A F+LYDD + Y SGVY  T  +K   Y  S 
Sbjct: 216 HVKVSYTYHVLYKN--IQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRY-QSA 272

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           KLIGWG ENG  YWL++N+WG  WG  G  KI RG  EC F     AG PK
Sbjct: 273 KLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTDECQFGRHTYAGVPK 323


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 104/267 (38%), Positives = 147/267 (55%), Gaps = 16/267 (5%)

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           VP  FDAR  +P C   +GHV D G C +   FA+  AF+DR CI+S+G++  PLS ++ 
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
            SCC    +  +  C+ G     W +  ++G VTGGD+   G  T C P  +  C+HH  
Sbjct: 334 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 392

Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
           AP  P C+   VP+   KC   C    Y      F QD H+ T  Y +   +D +K++++
Sbjct: 393 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 450

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP +  F +Y+DF  YKSGVYKH S   +    H+ K+IGWGTENG  YW  +N+W  
Sbjct: 451 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 508

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
           +WGD G  KI  G  +C  +  + AG+
Sbjct: 509 YWGDGGQFKIAMG--QCGIDGEMVAGE 533


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 155/310 (50%), Gaps = 23/310 (7%)

Query: 32  EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAR 91
           E     AG NF   L           D  + +Q+ +P+  D+     +    +P+ FDAR
Sbjct: 52  EVEATPAGHNFDRKL----------MDLSFINQNRKPVFDDKN----DKGEDIPESFDAR 97

Query: 92  EQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICR 150
            +WP C ++ H+ D   C +    +   A SDR CI S G++   +S   + SCC   C 
Sbjct: 98  TKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCG 157

Query: 151 YDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQK 210
           Y     C+ G   + +N+  K+G+VTGGDY   +GC+P    PC HHG       C N+ 
Sbjct: 158 Y----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEA 213

Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFY 270
               KC  +C        + +D+      Y V ++E AI++EI+ +GP    F +Y+DF 
Sbjct: 214 TTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFS 271

Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
           +YK G+YKHT+        H+ K+IGWG E G PYWL+ N+W   WG+ G  +ILRG   
Sbjct: 272 YYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNH 329

Query: 331 CAFEYLIAAG 340
           C  E  + AG
Sbjct: 330 CGIEENVVAG 339


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 114/322 (35%), Positives = 164/322 (50%), Gaps = 31/322 (9%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS-DRPLPGDRKTYDPEY 80
           ++A+I  IN +A TWTA +NF     E+      +AD    ++  +  LP        E 
Sbjct: 28  TEAFIQSINEKATTWTARKNFEGRTPEQLK---ALADVIGINRDPNVTLP----VVFHEA 80

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
            + +PD FDAREQWP C +I  + D GAC +   FAAV   SDR C+ S+G++    S E
Sbjct: 81  ISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAE 140

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            V SCC  C       C  G +   + +    G  +GGDYG + GC+P T    +  G  
Sbjct: 141 EVVSCCTAC----GGGCRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKPYT---AAVSGET 193

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P             +C   C +  Y + + +D    T  Y V+     I++EIL +GP T
Sbjct: 194 P-------------QCQKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVT 239

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A   +Y+DFY Y +G+Y+HTS + +    H+ K+IGWG+EN  PYW+  N+WG  +G+ G
Sbjct: 240 AYMEVYEDFYSYGTGIYQHTSGSFVGG--HAVKIIGWGSENDVPYWIAANSWGTGFGEDG 297

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             +ILRG      E  I AG P
Sbjct: 298 FFRILRGSNCAGIESYIVAGYP 319


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 104/267 (38%), Positives = 147/267 (55%), Gaps = 16/267 (5%)

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           VP  FDAR  +P C   +GHV D G C +   FA+  AF+DR CI+S+G++  PLS ++ 
Sbjct: 274 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHT 333

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
            SCC    +  +  C+ G     W +  ++G VTGGD+   G  T C P  +  C+HH  
Sbjct: 334 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 392

Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
           AP  P C+   VP+   KC   C    Y      F QD H+ T  Y +   +D +K++++
Sbjct: 393 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 450

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP +  F +Y+DF  YKSGVYKH S   +    H+ K+IGWGTENG  YW  +N+W  
Sbjct: 451 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 508

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
           +WGD G  KI  G  +C  +  + AG+
Sbjct: 509 YWGDGGQFKIAMG--QCGIDGEMVAGE 533


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 114/320 (35%), Positives = 159/320 (49%), Gaps = 23/320 (7%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A +D +N   + +T        +SEE ++  ++ D KY       +   R T       T
Sbjct: 69  ALVDYVNSAQSLFTTEH---VEVSEEVMKSRVM-DVKYAAAHSDEI---RATEVDTVLDT 121

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+R  W  C +I  + D   C +   F A    SDR CI++KG Q   +S + + 
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T 
Sbjct: 182 SCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------TS 230

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +C   K P   C   C +  Y   + +DKH  T  Y V     +I+ EI+ +GP  A F
Sbjct: 231 GNCPESKTPS--CSLSCQSG-YTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAF 287

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG  WG+ G  +
Sbjct: 288 TVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGNSWGESGFFR 345

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           I RG  +C  E  + AGK K
Sbjct: 346 IFRGDDQCGIESAVVAGKAK 365


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 116/326 (35%), Positives = 161/326 (49%), Gaps = 20/326 (6%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPE 79
           D+  D +N+   TW A         +E  +   + D K    +    P     +   +  
Sbjct: 69  DSLADALNQGQKTWVASSK------QERFKGASVFDVKALCGTILNGPSKLPKKPASEST 122

Query: 80  YSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR-PL 137
             + +PDRFDARE + NC T IGHV D   C +   FA   AFSDR CI+S G+ +  PL
Sbjct: 123 ALSNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPL 182

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S  + A+CC       +  C  G     W +  + G V+  D    +GC P     CSHH
Sbjct: 183 SAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELD----SGCWPYNFPECSHH 238

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                +  C+    P   C T C N  +   F  D+H T    +  D  D IKKEI+ +G
Sbjct: 239 VETKGMEPCKGNS-PSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNG 297

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A F +Y+DF +YKSGVYKH + ++L    H+ K+IGWGT+    YWLV+N+W  +WG
Sbjct: 298 PVAAAFTVYEDFLYYKSGVYKHVNGSELGG--HAVKIIGWGTDQNEQYWLVMNSWNVNWG 355

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D+G  KI  G  EC  +  + AG PK
Sbjct: 356 DQGIFKIAIG--ECGIDSEVTAGIPK 379


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 168/335 (50%), Gaps = 42/335 (12%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYS 81
           + +D+IN + N W A      ++ +E  +   ++DAK    +  ++P     K Y  +  
Sbjct: 83  SLVDEINSKQNAWMA------SIEQERFKGASMSDAKRLCGTWLEKPENIREKLYTADEL 136

Query: 82  ATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
             +P  F+A E++  C + IGH+ D  AC +   FA   AF+DR CIKS G     LS  
Sbjct: 137 KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPG 196

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTISPC 194
            VA+C K         C  GS    W +LH  G VTGGDY       +  GC P  I PC
Sbjct: 197 NVAACSK------TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPC 250

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE-------D 247
           +H+ ++   P C   K     C   C N  Y     +D+H      +V++         D
Sbjct: 251 AHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRH------FVEEESLSALRSID 304

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
           AIKKEI+ +GP +A++ +YDDF  YKSGVYK TS+  L    H+ K+IGWG +    YWL
Sbjct: 305 AIKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGG--HAVKIIGWGED----YWL 358

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           V+N+W  +WGD G  KI  G  +C  E  + AG P
Sbjct: 359 VVNSWNKNWGDNGMFKI--GCGQCGIEDNVLAGTP 391


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 134/259 (51%), Gaps = 9/259 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR+ WP+C +I  + D  +C +   F AV A SDR CI S G  N+ LS   + 
Sbjct: 86  LPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLL 145

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCCK C +     C  G     W++    G VTGG   D +GC+      C HH      
Sbjct: 146 SCCKDCGF----GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HY 200

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C  +  P  +C  +C  P    G+ +DK R  ++Y +  +E +I KEI+  GP  A F
Sbjct: 201 PPCPRELYPTPECVQQCDTPDV--GYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIF 258

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF  Y SGVY H   A +    H+ +++GWG     PYWL+ N+W   WG+ G +K
Sbjct: 259 TMYEDFLRYSSGVYFHALGAPMSG--HAVRILGWGELGNVPYWLIANSWNEDWGEEGYMK 316

Query: 324 ILRGKYECAFEYLIAAGKP 342
            LRG  EC  E  + AG P
Sbjct: 317 FLRGYNECGIEDDVTAGLP 335


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  187 bits (474), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 104/267 (38%), Positives = 146/267 (54%), Gaps = 16/267 (5%)

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           VP  FDAR  +P C   +GHV D G C +   FA+  AF+DR CI+S+G+   PLS ++ 
Sbjct: 277 VPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHT 336

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
            SCC    +  +  C+ G     W +  ++G VTGGD+   G  T C P  +  C+HH  
Sbjct: 337 TSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAHHAK 395

Query: 200 APTLPSCENQKVPKL--KCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEIL 254
           AP  P C+   VP+   KC   C    Y      F QD H+ T  Y +   +D +K++++
Sbjct: 396 AP-FPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDD-VKRDMM 453

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            HGP +  F +Y+DF  YKSGVYKH S   +    H+ K+IGWGTENG  YW  +N+W  
Sbjct: 454 THGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGG--HAIKIIGWGTENGEEYWHAVNSWNT 511

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
           +WGD G  KI  G  +C  +  + AG+
Sbjct: 512 YWGDGGQFKIAMG--QCGIDGEMVAGE 536


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 95/267 (35%), Positives = 141/267 (52%), Gaps = 4/267 (1%)

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           P  ++ +   FDARE+WP C +I  + D   C +   FAA  + SDR CI S G  N  L
Sbjct: 22  PTENSDLSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGTINTIL 81

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SCC        + C  G+ F+ W +  K G  TGG Y  + GC+P +I+PC   
Sbjct: 82  SAQELLSCCTGV-LSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKT 140

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
               T P+C N  +P   C  +CT+   Y     +D+H       + + +  I+ +++ +
Sbjct: 141 VGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLN 200

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP   TF +YDDF  Y +G+Y H +  K + +L S +++GWG   G PYWL+ N+WG  W
Sbjct: 201 GPIETTFEVYDDFLQYTTGIYVHLTGNK-QGHL-SVRILGWGMYEGVPYWLLANSWGKEW 258

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
           G+ GT + LRG  EC  E    +G PK
Sbjct: 259 GENGTFRALRGTNECGLEANCVSGMPK 285


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 103/261 (39%), Positives = 137/261 (52%), Gaps = 16/261 (6%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           T+P  FD+R  W  C +I  + +   C +   F A    SDR CI++KG Q   +S + +
Sbjct: 85  TIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDL 144

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T
Sbjct: 145 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 193

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             SC   K P   C   C  P Y   + +DKH  T  Y V     +I+ EI+ +GP  A 
Sbjct: 194 SGSCPESKTPA--CSLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAA 250

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG  WG+ G  
Sbjct: 251 FTVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTSWGESGFF 308

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  +C  E  + AGK +
Sbjct: 309 KIFRGDDQCGIESAVVAGKAR 329


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 99/300 (33%), Positives = 158/300 (52%), Gaps = 13/300 (4%)

Query: 42  FPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
           F A+++   Y  Q  + D ++ +Q+ +P+  D      +    +P+ FDAR +WPNC +I
Sbjct: 55  FEADVTPHSYNVQHKLMDLRFVNQNRKPVVEDAS----DKGDDIPESFDARTKWPNCTSI 110

Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
            H+ D   C +    +     SDR CI SK ++   +S+    SCC  C +     C  G
Sbjct: 111 KHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCCDSCGF----GCEGG 166

Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC 220
                + +   +G VTGGDYG +TGC+P    PC HHG+      C  ++    +C  +C
Sbjct: 167 WPIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEESTP-ECVKQC 225

Query: 221 TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT 280
               Y   + +DK      Y V+++  AI++EI+  GP  ++F +YDDF +Y  G+YKHT
Sbjct: 226 -QKGYKNSYRRDKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKGIYKHT 284

Query: 281 SNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +     +  H+ K+IGWGTE   PYW++ N+W   WG++G  +++RG   C  E  + AG
Sbjct: 285 AGKARGS--HAIKIIGWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDVVAG 342


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 164/336 (48%), Gaps = 28/336 (8%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD-- 58
           M+  +  ++     R     F  A+++ I     TWTA           Y R    +D  
Sbjct: 1   MLQFICLIISLVSARN---PFITAFVNSIK---TTWTA---------TNYERWNEKSDGF 45

Query: 59  -AKYFDQ-SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
            +KYF+   D   P + K +  E    +P  F A+E+WP C +I  +PD G C +    +
Sbjct: 46  YSKYFNVIVDHSEPVEYKYH--EKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVS 103

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           A    SDR CI S     R +S E + SCC I C  D N  C  G  +  W +L   G V
Sbjct: 104 AASTMSDRLCIASGQTDKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIV 163

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTYGRGFFQD 232
           TGG Y D + C+P +  PCSH   +     CEN      +    CT   +P + R +  D
Sbjct: 164 TGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVD 223

Query: 233 KHRTTLT-YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           K R+    Y +  +++ IK EI  +GP  A F ++DDF +YKSGVY+ T+  +     H+
Sbjct: 224 KIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGK--HA 281

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
            K+IGWGTENG PYW  IN+W   WG  G  KILRG
Sbjct: 282 VKIIGWGTENGVPYWEAINSWNDGWGINGKFKILRG 317


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/258 (39%), Positives = 133/258 (51%), Gaps = 8/258 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FD+REQW NC +I  + D   C +    A+V A SDR CI++ G     LS   + 
Sbjct: 84  LPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELV 143

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C       C+ G     W +  + G VTG   G+ +GC P     C H GS+ + 
Sbjct: 144 SCCSKCAV----GCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDH-GSSDSY 198

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C         C+  C  P Y   +  DKH     Y V  NE  I++EI+ +GP  A+ 
Sbjct: 199 PMCGYVVYTPPVCNGTC-RPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASI 257

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +YDDF  YKSGVYKH +   +   + S ++IGWG ENG PYWL  N+W   WG  G  K
Sbjct: 258 FIYDDFVDYKSGVYKHLTGRLIT--IQSVRIIGWGIENGIPYWLCANSWNEEWGLNGFFK 315

Query: 324 ILRGKYECAFEYLIAAGK 341
           ILRG  EC  E  + AG+
Sbjct: 316 ILRGSNECEIEAFVNAGR 333


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/265 (38%), Positives = 146/265 (55%), Gaps = 9/265 (3%)

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           +++ +P  FD+R++WPNC +IGH+ + G C + +  AA  A SDR CI+S G +N  +S 
Sbjct: 57  FTSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSA 116

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC +C +     C  GS+F +W++  + G V+GGDY    GCQP TI PC     
Sbjct: 117 QQIISCCYLCGH----GCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNE 172

Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            P   SC    +     C  +C NP Y   F  D ++     +   +     K+I  +GP
Sbjct: 173 KPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK---YYKLSPYMAMKDIFDNGP 229

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENGTPYWLVINTWGPHWG 317
            T  F +Y D   YKSGVY++   +  + + +HS K+ GWG ENG PYWLV N++G  WG
Sbjct: 230 ITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWG 289

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
             GT KI RG   C F+  + AG P
Sbjct: 290 YNGTFKISRGNDGCFFQEKMYAGLP 314


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 110/286 (38%), Positives = 156/286 (54%), Gaps = 18/286 (6%)

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSD 123
           S  PLP   KT     +  VP  FDAR  +P C   +GHV D G C +   FA+  AF+D
Sbjct: 151 SGVPLPA--KTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFND 208

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
           R CI+S+G+   PLST++  SCC    +  +  C+ G     W +  ++G VTGGD+   
Sbjct: 209 RLCIRSQGKGVMPLSTQHTTSCCNAI-HCASFGCNGGQPGMAWRWFERKGVVTGGDFDTL 267

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL--KCHTRCTNPTYGR---GFFQDKHR 235
           G  T C P  I  C+HH  AP  P+C+    P+   KC   C    Y      F +D H+
Sbjct: 268 GKGTTCWPYEIPFCAHHAKAP-FPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHK 326

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
            + +Y +  + DA+K++++AHG  T  F +Y+DF +YKSGVYKH     L    H+ K+I
Sbjct: 327 ASSSYSL-RSRDAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGG--HAIKII 383

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           GWGTE+G  YW  +N+W  +WGD G  KI  G  +C  +  + AG+
Sbjct: 384 GWGTEDGEEYWHAVNSWNTYWGDSGHFKIEMG--QCGVDNEMVAGE 427


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/262 (38%), Positives = 144/262 (54%), Gaps = 11/262 (4%)

Query: 84  VPDRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P  FD+R+QW   C ++  V D   C +   FAA  + SDR CI + G+  R LSTE +
Sbjct: 97  LPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRICIHT-GEDVR-LSTENL 154

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C       C+ G       +  K G VTG  +GD   CQ  +  PC+HH ++  
Sbjct: 155 VSCCSSC----GDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTK 210

Query: 203 LPSCENQKVPKLKCHTRCTNPT-YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            P C+ + VP  +C  +C + +   R + +D ++   +Y V  +  AI  EI+ +GP   
Sbjct: 211 YPPCKGE-VPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEV 269

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y+DF  YKSGVY+H +  +L    H+ K+IGWG EN TPYWL++N+W   WGD+GT
Sbjct: 270 AFTVYEDFVTYKSGVYQHVTGEQLGG--HAVKMIGWGVENDTPYWLIVNSWNETWGDQGT 327

Query: 322 VKILRGKYECAFEYLIAAGKPK 343
            KILRG  EC  E  +    P+
Sbjct: 328 FKILRGSNECGIEDEVVTALPQ 349


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 131/255 (51%), Gaps = 9/255 (3%)

Query: 89  DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI 148
           D+REQWP+C +I  + D G+C +   F AV A SDR CI S G+    +S E + SCC  
Sbjct: 1   DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60

Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
           C       C  G     W F   +G  TGG +    GCQP  I  C HH +    P  + 
Sbjct: 61  C----GMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEHHTTGDRPPCSDI 116

Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
              PK  C   C    Y   +  DKH    +Y ++  E  I+ EI  +GP    F++Y D
Sbjct: 117 VDTPK--CVHLCEK-GYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSD 173

Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
           F +YKSGVY+H S   L    H+ +++GWG EN  PYWL  N+W   WGD+G  KILRG 
Sbjct: 174 FINYKSGVYQHHSGESLGG--HAIRVLGWGYENDVPYWLCANSWNTDWGDKGYFKILRGS 231

Query: 329 YECAFEYLIAAGKPK 343
            EC  E  I AG PK
Sbjct: 232 DECGIESSIVAGIPK 246


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 161/330 (48%), Gaps = 19/330 (5%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYD 77
            SD  +D +N + + TW A ++      EE +R  L  + + +   +  RP         
Sbjct: 26  LSDELVDYVNSQVDATWKAAKSERFKTLEE-IRSVLGTMREDQNVKEFRRPTISHE---- 80

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS-KGQQNRP 136
            + +  +P  FDARE WP C TI  + D   C +   FAAV A SDR CI S +   N  
Sbjct: 81  -DITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQ 139

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS   + +CC  C +              W++    G VTGG+Y D   C P    PC H
Sbjct: 140 LSATDLLACCTTCGFGCVGG----WGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRH 195

Query: 197 HGS-APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HG+     P C  +     +C + C    Y   +  DK R + +Y +  +  AI+KEI  
Sbjct: 196 HGAKGSEYPPCPEKMYSTPQCVSECQK-GYATKYEDDKIRASTSYNLYRSVTAIQKEIWM 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
            GP  AT  +Y DF +Y  GVYKHT+   L    H+ +L+GWG E +GTPYWL  N+W P
Sbjct: 255 RGPVEATMNVYTDFANYAGGVYKHTTGELLGG--HAIRLLGWGVEEDGTPYWLAANSWNP 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +ILRG   C  E  ++AG P N
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLPVN 342


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 146/299 (48%), Gaps = 13/299 (4%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           W +GR      S++ +  F         ++ RP       +D   +  +P  FDAR+ WP
Sbjct: 42  WISGRRPKRFESDDLIHMFGAKRETREQKAQRPT----LRHDGFDNMRLPKNFDARKTWP 97

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
           +C +I  + D  +C +   F AV A SDR CI S G  N+ LS   + SCCK C +    
Sbjct: 98  HCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKDCGF---- 153

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C  G     W++    G VTGG   D +GC+      C HH      P C  +  P  +
Sbjct: 154 GCRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEHHVQG-HYPPCPRELYPTPE 212

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C  +C  P    G+ +DK R  ++Y +  +E +I KEI+  GP  A F +Y+DF  Y SG
Sbjct: 213 CVQQCDTPDV--GYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG 270

Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           VY H   A +    H+ +++GWG     PYWL+ N+W   WG+ G +K LRG  EC  E
Sbjct: 271 VYFHALGAPMSG--HAVRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIE 327


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 174/361 (48%), Gaps = 45/361 (12%)

Query: 21  FSDA--YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD-QSDRPLPGDRKTYD 77
           FSD+   I+ +N + + WTAG      +S++Y+ + L  D +    ++  P    +  + 
Sbjct: 19  FSDSTKIINYVNSQKSLWTAGN---PKISKDYMLKTLTTDPETVGFRNLGPTFYSKNIFS 75

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           PE +    + FDARE+WP C +I  + D   C +   F+A  + SDR CI S G  N  L
Sbjct: 76  PE-NLDDSNFFDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVL 134

Query: 138 STEYVASCCK---ICRYDDNK--------------------------------SCSHGSV 162
           S + + SCC     C   D++                                 C+ G+V
Sbjct: 135 SAQELLSCCTGVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNV 194

Query: 163 FRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN 222
           F+ W +  K G  TGG Y  + GC+P +ISPC       T P C N  V    C  +C +
Sbjct: 195 FKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKS 254

Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
             Y     +D+H       + + +  I+ +++ +GP +AT  +YDDF  Y +G+Y H + 
Sbjct: 255 -GYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTG 313

Query: 283 AKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            K + +L S +++GWG   G PYWL+ N+WG  WG+ GT ++LRG  EC  E    +G P
Sbjct: 314 NK-QGHL-SVRILGWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSGMP 371

Query: 343 K 343
           +
Sbjct: 372 R 372


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 102/261 (39%), Positives = 137/261 (52%), Gaps = 16/261 (6%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           T+P  FD+R  W  C +I  + +   C +   F A    SDR CI++KG Q   +S + +
Sbjct: 85  TIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDL 144

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC          C  G   +   +   +G VTGGDY    GC+P  I+PC       T
Sbjct: 145 LSCCG---SSCGNGCEGGYPIQALRWWDSKGVVTGGDY-HGAGCKPYPIAPC-------T 193

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             SC   K P   C   C +  Y   + +DKH  T  Y V     +I+ EI+ +GP  A 
Sbjct: 194 SGSCPESKTPA--CSLSCQS-GYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAA 250

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DFY YKSGVYKHT+   L    H+ K+IGWGTE+G+PYWLV N+WG  WG+ G  
Sbjct: 251 FTVYEDFYKYKSGVYKHTAGKALGG--HAIKIIGWGTESGSPYWLVANSWGTSWGESGFF 308

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  +C  E  + AGK +
Sbjct: 309 KIFRGDDQCGIESAVVAGKAR 329


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 114/330 (34%), Positives = 160/330 (48%), Gaps = 19/330 (5%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYD 77
            SD  +D +N + + TW A ++      EE +R  L  + + +   +  RP         
Sbjct: 26  LSDELVDYVNSQVDATWKAAKSERFKTLEE-IRSVLGTMREDQNVKEFRRPTISHE---- 80

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS-KGQQNRP 136
            + +  +P  FDARE WP C TI  + D   C +   FAAV A SDR CI S +   N  
Sbjct: 81  -DITLELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQ 139

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS   + +CC  C +              W++    G VTGG+Y D   C P    PC H
Sbjct: 140 LSATDLLACCTTCGFGCVGG----WGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRH 195

Query: 197 HGS-APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HG+     P C  +     +C + C    Y   +  DK R + +Y +  +   I+KEI  
Sbjct: 196 HGAKGSEYPPCPEKMYSTPQCVSECQK-GYATKYEDDKIRASTSYNLYRSVTTIQKEIWM 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
            GP  AT  +Y DF +Y  GVYKHT+   L    H+ +L+GWG E +GTPYWL  N+W P
Sbjct: 255 RGPVEATMNVYTDFANYAGGVYKHTTGELLGG--HAIRLLGWGVEEDGTPYWLAANSWNP 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WG++G  +ILRG   C  E  ++AG P N
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLPVN 342


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 170/347 (48%), Gaps = 34/347 (9%)

Query: 2   IHILVFLLGCTLVRGELYKFS-DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           I +LV ++  +    E  K S D  ID+ + E NT  AG N   + +EE     L     
Sbjct: 4   ILLLVSIMLLSFCLTEQAKLSHDNTIDKSDVETNTLKAGENVGPHSAEEERLMLLGTRGV 63

Query: 61  YFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                 + L    KT DP Y     +   FDAR++WP C TIG V + G       +AA 
Sbjct: 64  EAATKSKML---YKTRDPRYIIDNQIHKEFDARKRWPQCKTIGEVHNEGNELLSWAYAAT 120

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRGSVT 176
           G F+DR CI + G  N+ LSTE + SC  I   +D      G V R   W +    G V+
Sbjct: 121 GVFADRMCIATNGNYNQLLSTEELISCSGIKERED------GYVNRVLVWEYFKTHGLVS 174

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDK 233
           GG Y    GCQPS +         PT+ + +  K+ K  C   C    YG+    +  D 
Sbjct: 175 GGKYNTNEGCQPSKV---------PTVYNSQT-KIYKRTCVEYC----YGKDTINYNHDH 220

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
            + +  Y++   +  I+KE+  +GP +  F L+DD + YKSGVY  T  +K + Y H  K
Sbjct: 221 VKVSNHYFIRIKD--IQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRY-HHAK 277

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           LIGWG ENG  YWL++N+WG  WG  G  KI RG  EC+ E  + AG
Sbjct: 278 LIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAG 324


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/325 (34%), Positives = 160/325 (49%), Gaps = 26/325 (8%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A +  IN    +W A  N   ++SE+ ++ F + D ++ D  +  +  +           
Sbjct: 39  ALVAHINSMQTSWIAEHN---DISEDEMK-FKVMDQRFADPLEEEVQDEGLVRGEVVPEP 94

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDAR+QWP+C ++  + +  +C +   F A    SDR CI+S G Q   +S E + 
Sbjct: 95  LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP- 201
           SCC   C     K C  G       +    G VTGGDY +  GC P +  PC        
Sbjct: 155 SCCGSTC----GKGCQGGYTIEAMKYWMNSGVVTGGDY-NGAGCMPYSFPPCKKSPCVEF 209

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA---IKKEILAHGP 258
           + PSC      K  C  + T   Y      DKH  T  Y +   ++A   I+ EI  +GP
Sbjct: 210 STPSC------KTTCQEKYTTADYKN----DKHFATSAYKLSTTKNAVPTIQYEIYHNGP 259

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A++ +++DFY YKSGVY H S   +    H+ K+IGWGTENG  YWLV N+WG  +G+
Sbjct: 260 VEASYRVFEDFYQYKSGVYHHVSGNLVGG--HAVKIIGWGTENGVDYWLVANSWGTSFGE 317

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  KI RG  EC  E  I AG  K
Sbjct: 318 KGFFKIRRGTNECQIESNIVAGLAK 342


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/262 (37%), Positives = 147/262 (56%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SC      D    C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLISC----CEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH    +Y V  NE AI+KEI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
           +G  +++RG+ EC+ E  + AG
Sbjct: 317 KGLFRMVRGRDECSIESHVVAG 338


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 162/324 (50%), Gaps = 35/324 (10%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           + I+QIN + + WTAG N P +  E  L    I    + D + +P     +  +P+ +  
Sbjct: 21  SLINQINSQQSAWTAGIN-PFDDIESRLGFLGI----HPDPNFKP-----EIKEPQATQN 70

Query: 84  V-PDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           V P+ FDARE WP C   IG++ + G C++   FAA    SDR CI + G+    LS E 
Sbjct: 71  VIPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPED 130

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           +  CC  C       C  G  +  WN+    G V+GGDY   TGCQP   S  +++   P
Sbjct: 131 LIDCCHYC----GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQP--YSELNYYRITP 184

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTT 260
                         C+T C N  Y   +  DKH     Y++  NE AI+ EIL+ G P  
Sbjct: 185 -------------PCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVV 231

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y DF  Y+ GVY +TS A       + K+IGWGTENG  YWL  N+WG  WG  G
Sbjct: 232 AAFDVYGDFKIYRDGVYIYTSGALFGR--TAVKIIGWGTENGWAYWLAANSWGKDWGALG 289

Query: 321 T-VKILRGKYECAFEYLIAAGKPK 343
              KI RG  EC FE  I AG+ +
Sbjct: 290 GFFKIRRGTNECGFEESIIAGQVR 313


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 107/319 (33%), Positives = 158/319 (49%), Gaps = 17/319 (5%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           I ++N   +TW AG N       +++   +     +         G +       +  +P
Sbjct: 41  IQKVNSSNSTWKAGEN------TKWINSDIAGVKAHMGVKLGQESGIKLETVSAQANGLP 94

Query: 86  DRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           + FDAR QW + C ++  V D   C +   F A  + SDR CI   GQ  R LST+ + +
Sbjct: 95  EEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIH-LGQDIR-LSTQNLLT 152

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC  C       C  G      ++    G VTG  YG+ + CQ  T +PC+HH ++   P
Sbjct: 153 CCAAC----GDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYP 208

Query: 205 SCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            C  + +P   C   C +N T+   + +D HR +  Y +  +E AI  EI  +GP     
Sbjct: 209 PCTGE-LPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVAL 267

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF  YK+GVY+H +  +L    H+ K++GWG ENGTPYW ++N+W   WGD+GT K
Sbjct: 268 TVYEDFLTYKTGVYQHVTGDELGG--HAVKMVGWGVENGTPYWTIVNSWNESWGDKGTFK 325

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ILRGK EC  E       P
Sbjct: 326 ILRGKNECGIESSCVTALP 344


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/326 (35%), Positives = 157/326 (48%), Gaps = 24/326 (7%)

Query: 23  DAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP--- 78
           ++  + IN     W AG N    N++ +Y+R+          Q    L G   T D    
Sbjct: 34  ESIANDINARNVGWKAGVNERFVNVTMDYIRK----------QMGTRLEGSPVTLDVKHV 83

Query: 79  EYSATVPDRFDAREQWPN-CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           E  A +P  FD+R QW + C ++  V D   C +   F AV A +DR CI SKG Q   +
Sbjct: 84  EVPADLPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHI 143

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S E + +CC     D    C+ G     W +   +G VTGG Y    GCQP +++ C HH
Sbjct: 144 SAEDLLTCCTFTCGD---GCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHH 200

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
            + P  P      VP   C   C    Y   +  DKH    +Y V    D I  EI+ +G
Sbjct: 201 TTGPYKPC--GDIVPTPACKRSCRQ-GYNVTYPNDKHFGASSYGVR-GVDQIATEIMTNG 256

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A F +Y DF  YKSGVY+HTS   L    H+ K+IGWG ++GT YW+V N+W   WG
Sbjct: 257 PVEAAFTVYSDFLSYKSGVYQHTSGQPLGG--HAIKIIGWGVQDGTDYWIVANSWNDSWG 314

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           + G   I +G  EC  E  + AG PK
Sbjct: 315 NDGFFWIKKGTDECGIESQVVAGLPK 340


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 159/346 (45%), Gaps = 21/346 (6%)

Query: 4   ILVFLLGCTLVRGE----LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IA 57
           I+  LL     R E        SD  +  IN +ANT      +    +   +R+ L  + 
Sbjct: 8   IMYALLCAESFRAEYIPSFESLSDEIVHYINHKANTTWKAAKYQRFKTISDVRRVLGAVP 67

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
           D   F    R L    +  +      +P+ FDARE+WP C +I  + D   C +   F A
Sbjct: 68  DPNGFGLEKRCLLSTIREQE------LPESFDAREKWPYCSSIAEIRDQSNCGSCWAFGA 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
            GA SDR CI S G+    +S E +  CC  C       C  G   + W +  + G VTG
Sbjct: 122 AGAISDRICIASGGKHQPRISPEDLVDCCADC----GMGCQGGYPAQAWEYWVRNGLVTG 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
             Y     C+P +  PC HH   P  P   +   P+  C  +C  P Y + +  DK    
Sbjct: 178 DLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--CVKKC-QPEYPKTYENDKWYGL 234

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
             Y +  +++AI ++++ +GP    F +Y DF  Y SGVY+H +   L    H+ +L+GW
Sbjct: 235 KAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGG--HAVRLVGW 292

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G E+G  YWL+ N+W   WGD G  KI RG  EC  E    AG PK
Sbjct: 293 GVEDGADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHPK 338


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/258 (37%), Positives = 139/258 (53%), Gaps = 9/258 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR +WP C ++ H+ D   C +    +   A SDR CI S G++   +S   + 
Sbjct: 2   IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C Y     C+ G   + +N+  K+G+VTGGDY   +GC+P    PC HHG    
Sbjct: 62  SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C N+     KC  +C        + +D+      Y V ++E AI++EI+ +GP    
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGA 175

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF +YK G+YKHT+        H+ K+IGWG ENG PYWL+ N+W   WG+ G  
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKENGVPYWLIANSWHNDWGENGYF 233

Query: 323 KILRGKYECAFEYLIAAG 340
           +ILRG   C  E  + AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/321 (36%), Positives = 155/321 (48%), Gaps = 22/321 (6%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           +D +N    TWTAG N     +   LR   + +     +    LP  R    P+  A +P
Sbjct: 164 VDFVNALGTTWTAGHN--KRFTYNTLRH--VKNLCGAKKGGPKLPVKRI---PKKMA-LP 215

Query: 86  DRFDARE--QWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
             FD R+  +WP C  ++ HV D G+C +   F A  A +DR CI S GQ N  LS E +
Sbjct: 216 TSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAEDL 275

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C       C  G     W++    G VTGGD+    GC P  +  C HH +   
Sbjct: 276 TSCCDSC----GMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKY 331

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P  + Q  P   C   C N      +  DKH    +Y V  ++ +I  EI  +GP  A+
Sbjct: 332 QPCGDIQPTPA--CANSCQN---NATWSSDKHFGASSYSVGTDQQSIMTEIYTNGPVEAS 386

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           + +Y DF  YKSGVY+H +   L    H+ K+IGWG +  TPYW+V N+W   WG+ G  
Sbjct: 387 YDVYADFVSYKSGVYQHVTGDYLGG--HAVKIIGWGVDGSTPYWIVANSWNNDWGNNGFF 444

Query: 323 KILRGKYECAFEYLIAAGKPK 343
            ILRG  EC  E  I AG PK
Sbjct: 445 NILRGSDECGIEDGIVAGIPK 465


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 151/290 (52%), Gaps = 15/290 (5%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           + D ++   + +P+  D      +    +P+ FDAR  WPNC ++ H+ D   C +    
Sbjct: 67  LMDRRFIKHNRKPIVEDVN----DDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAV 122

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           +   A SDR CI SKG +   +S   + SCC  C       C  G V   + F  ++G+V
Sbjct: 123 STASALSDRICIASKGAKQVYVSATDILSCCHSC----GDGCDGGYVIDAFKFFAEQGAV 178

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           TGGDYG +  C+P    PC HHG+      C E+   P+  C  +C    Y   + +D+ 
Sbjct: 179 TGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGSTPE--CVRKCQE-GYETEYHEDRV 235

Query: 235 RTTLTYWVD-DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           R    Y +   +  AI+KEI+ +GP  A F ++DDF  Y+ G+Y H + +      H+ K
Sbjct: 236 RGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGG--HAVK 293

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +IGWGTE+G PYW++ N+W   WG+ G  +++RG  +C  E  + AGK K
Sbjct: 294 IIGWGTEHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAGKFK 343


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 115/334 (34%), Positives = 160/334 (47%), Gaps = 34/334 (10%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPEY 80
           + +D+IN +  TWTA      +  ++  +   + DAK    +      D   RK Y  E 
Sbjct: 3   SLVDEINSKQTTWTA------STGQKRFKNLSLRDAKMLCGTRMRGSNDKVIRKGYAIEE 56

Query: 81  SATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
              +P  FDAR  +PNC   IGH+ D  AC +   F    AF+DR C+KS G     LS 
Sbjct: 57  LQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSA 116

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR------TGCQPSTISP 193
             + +C        +  C  G     W+++H  G  TGGDY  R       GC P    P
Sbjct: 117 GEMNACAP------SYGCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPP 170

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYWVDDNEDAI 249
           C+HH +    P C         C  +C NP Y      D+H     +   Y V++ ++AI
Sbjct: 171 CAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQYSVNNAKNAI 230

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           + +    GP +A++ +Y+DF  YKSGVYKHTS + L    H+ K+IGWG ENG  YWLV+
Sbjct: 231 RTD----GPVSASYLVYEDFLAYKSGVYKHTSGSYLGG--HAVKIIGWGEENGEAYWLVV 284

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+W   WGD G  KI  G   C  +  +  G PK
Sbjct: 285 NSWNEDWGDHGLFKIALGN--CQIDDDLLGGTPK 316


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 147/316 (46%), Gaps = 22/316 (6%)

Query: 28  QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
           ++N    TW A    P     +YL              DR LP             +P+ 
Sbjct: 26  EVNAMKTTWIANEAIPTRDYTQYLGVLF---------GDRQLPSKTIVA----RGDLPES 72

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FD  E+WP C ++  + D   C +   F A  A +DR CI SKG+    LS + + +CC 
Sbjct: 73  FDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLLTCCD 132

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
            C +     C  G +   W +    G  TGG+YG +  C   +   C HH      P  E
Sbjct: 133 SCGF----GCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGE 188

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
           +Q+ P+  C  +C    Y   + +DKH     Y+V    DAIK E++ +GP   +F +Y+
Sbjct: 189 SQETPE--CVKQCQE-GYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYE 245

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           DF  YKSG+Y+H +   L    H+ KL+GWG E+G  YW + N+W   WG+ G  +I+ G
Sbjct: 246 DFLTYKSGIYQHVAGKYLGG--HAVKLVGWGVEDGIEYWKIANSWNEDWGENGYFRIVAG 303

Query: 328 KYECAFEYLIAAGKPK 343
           K EC  E     G PK
Sbjct: 304 KGECGIEVGPIGGIPK 319


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 97/262 (37%), Positives = 146/262 (55%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SC      D    C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLISC----CEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
           +G  +++RG+ EC+ E  + AG
Sbjct: 317 KGLFRMVRGRDECSIESHVVAG 338


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 109/329 (33%), Positives = 164/329 (49%), Gaps = 34/329 (10%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-- 78
            SD  I  IN+  N       + A+ S+ +     + DA++     +  P  R+   P  
Sbjct: 30  LSDEMISFINKHPNA-----GWKADKSDRF---HSVDDARFLLGGRKEDPNLRQKRRPTV 81

Query: 79  ---EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              + +  +P  FD+R++WP C +I  + D   C +    +AVGA SDR CI+S G+Q+ 
Sbjct: 82  DHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQS- 140

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
                Y  S            C  G +  +W++   RG VTGG   + TGC+P     C 
Sbjct: 141 -----YCGS-----------GCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 184

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H        +C ++     +C   C    Y   + QDKH    +Y V   E  I+K+I+ 
Sbjct: 185 HFVKG-KYRACGDKLYKTPQCKQTC-QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 242

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           HGP  A   +Y+DF +YKSG+Y++T+   +    H+ +LIGWG ENGT YWL  NTW   
Sbjct: 243 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISG--HAVRLIGWGVENGTAYWLAANTWNED 300

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           WG++G  +I+RG+ EC+ E  IAAG  K+
Sbjct: 301 WGEKGYFRIVRGRNECSIESEIAAGLIKS 329


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 114/342 (33%), Positives = 165/342 (48%), Gaps = 22/342 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           L  ++ CT  + EL   SD YI+Q+N +   W AGRNF  + S   +++ L         
Sbjct: 8   LAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG------ 61

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           +  P       +  +    +P+ FDAR+QW  C +I  + D   C +    ++    SDR
Sbjct: 62  TINPPSEFETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDR 121

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
            CI+S  +    +S   +  CC+ C +  +  C  G    T+      G V+GG+Y    
Sbjct: 122 ICIQSDQKNQLRISAADMIECCESCTFSVD-GCHGGIPSFTFTEWKDSGFVSGGEYNSTN 180

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
           GC    +  C+        PSC+     P   C   C   +  + + +DKH     Y + 
Sbjct: 181 GCMSYPLPRCN--------PSCKTLYDAPT--CKKECDKGSPLK-YEEDKHYAKQAYRIM 229

Query: 244 DN-EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
              E  I+ EI+ +GP  A+F +Y DF HY SGVYK    +KL    H+ ++IGWG ENG
Sbjct: 230 SKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGG-HAVRIIGWGIENG 288

Query: 303 T-PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           T PYWLV N+W   WGD+G  KI RGK EC  E  I AG P+
Sbjct: 289 TYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 161/313 (51%), Gaps = 28/313 (8%)

Query: 46  LSEEYLRQFLIADAKYFDQSDRPLPGDRKTY----------------DPEYSATVPDRFD 89
           LS E L  +L  +   F+ +  P PG ++                  DPE    +P+ +D
Sbjct: 32  LSGEPLVAYLRKNQNLFEVNSTPTPGFKQKIMDIKFRNQNPNLIVKDDPEPEDDIPEEYD 91

Query: 90  AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-I 148
            R+ W NC +  ++ D   C +    +   A SDR CI +K ++   +S   + +CC   
Sbjct: 92  PRKIWSNCTSF-YIRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPT 150

Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-E 207
           C +     C  G   + W +    G V+GG+Y  +  C+P  I PC HHG+      C E
Sbjct: 151 CGF----GCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPE 206

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
               P   C  +C  P Y + +  DK   T  + +  + +AI+KE+L +GP TA+FA+Y+
Sbjct: 207 EASTPS--CKKKC-QPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYE 263

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           DF  YKSG+Y+HT+  +L  Y H+ K+IGWGTEN T YWL+ N+W   WG+ G  +I+RG
Sbjct: 264 DFSLYKSGIYRHTA-GELRGY-HAVKMIGWGTENRTDYWLIANSWHDDWGENGYFRIIRG 321

Query: 328 KYECAFEYLIAAG 340
             +C  E  +AAG
Sbjct: 322 INDCGIEENVAAG 334


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 167/349 (47%), Gaps = 35/349 (10%)

Query: 11  CTLVRGELYKFSDAYIDQINREANTWTA-------GRNFPANLSEEYLRQFLIADAKYFD 63
           C++ R +++    A ++ IN+  ++W A              +   Y    L  D  Y  
Sbjct: 2   CSICRPKVHLTGKALVEHINKVQSSWVAEYTEISESEKKSKVMDSRYANPSLDEDDSYVL 61

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
           ++ R LP            ++P  FDAR  WP C +I  V D   C +   F A    SD
Sbjct: 62  RNQRILP------------SIPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISD 109

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-GD 182
           R CI S G++   +S E + +CC     +  +        + W      G+VTGGDY GD
Sbjct: 110 RICIHSNGKEQPVISAEDILTCCGKSCGNGCQGGQGLEAMKFWT---TYGAVTGGDYKGD 166

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR--GFFQDKHR---TT 237
             GC+P + +PCS+   + T PSC+++            +  YG+  G   ++H+    T
Sbjct: 167 --GCKPYSFAPCSNCVESKTTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECT 224

Query: 238 LTYWVDDNEDA---IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
             Y +D + +A   I+ EI  +GP    + +YDDFYHYKSGVY H +        H+ K+
Sbjct: 225 SAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGG--HAVKI 282

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           IGWGTE G  YWLV N+WG  +GD+G  KI RG  EC  E  + AG  K
Sbjct: 283 IGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESNVVAGMAK 331


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 110/319 (34%), Positives = 151/319 (47%), Gaps = 21/319 (6%)

Query: 28  QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS--ATVP 85
           ++N+   +WTAG N           +F  A   +       L G  +  + + +  A +P
Sbjct: 40  EVNQAQTSWTAGVN----------SRFARATDDFIKSQMGVLEGGPQLPEKDIAVLADLP 89

Query: 86  DRFDAREQW-PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
             FD+REQW   C +   + D  AC +   F AV + +DR CI SKG     +S + + +
Sbjct: 90  TAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLMT 149

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC    +     CS G     W++    G VTGG+Y    GCQP ++  C HH S    P
Sbjct: 150 CC---LFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDHHVSG-QYP 205

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
           +C  +  P   C   C    Y   +  DKH     Y V    D I  EI+ +GP    F 
Sbjct: 206 ACSGEG-PTPACKKSC-EAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFT 263

Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
           +Y+D   YKSGVY+HT+   L    H+ K+IGWG E+G  YW V N+W   WGD G  KI
Sbjct: 264 VYEDLLTYKSGVYQHTTGQVLGG--HAIKIIGWGVESGVDYWWVANSWNNDWGDNGFFKI 321

Query: 325 LRGKYECAFEYLIAAGKPK 343
            +G  EC  E  I AG PK
Sbjct: 322 KKGVDECGIESQIVAGMPK 340


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 152/319 (47%), Gaps = 17/319 (5%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
            A++D IN   + + A  +  A    E   +  I D+KY  +     P   +  +  Y  
Sbjct: 37  QAFVDYINEHQSFYRAEYSPEA----EAFVKARIMDSKYLVE-----PKKEEVLEDVYGN 87

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
             P  FDAR  WP C +IG + D  +C +    ++  A SD  C++S       +S   +
Sbjct: 88  DPPASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDI 147

Query: 143 ASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
            SCC I C Y     C  G     + ++ + G VTGG Y  +  C+P    PC HH + P
Sbjct: 148 LSCCGISCGY----GCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDP 203

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
               C     P  KC   C    Y + + +DKH  T  Y++ +NE  I++EI  +GP  A
Sbjct: 204 YYGPCPGGLWPTPKCRKTCQR-KYNKSYQEDKHFATRAYYLPNNERNIRQEIYKNGPVVA 262

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F +Y DF +YK G+Y H    +     H+ K++GWG EN T YWL+ N+W   WG+ G 
Sbjct: 263 AFRVYQDFSYYKKGIYVHKWGGQTG--AHAVKVVGWGRENATDYWLIANSWNTDWGESGY 320

Query: 322 VKILRGKYECAFEYLIAAG 340
            +I+RG  EC  E  +  G
Sbjct: 321 FRIVRGTNECGIEAQMVGG 339


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 161/351 (45%), Gaps = 27/351 (7%)

Query: 4   ILVFLLGCT----------LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQ 53
           + +FLLG            +V   L   SD  +D IN    TW AG N      E   R+
Sbjct: 5   VALFLLGVLASVRAEEGRLMVPAYLAPLSDKMVDYINFINTTWKAGHNEGHRDLETVRRK 64

Query: 54  FLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP- 112
             +    + D     LP   +         +P +FD+R+QW +       P T   A P 
Sbjct: 65  LGV----HRDNHKYRLP---ELVHDTLEMDIPAQFDSRQQWQDWPHHPGDPGTKERADPV 117

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F AV + SDR CI S  +    L+ + V SCC  C       C+ G     W++   +
Sbjct: 118 GHFGAVESMSDRHCIHSGAKNIVHLAADDVLSCCWGC----GSGCNGGFPAAAWSYWVDK 173

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG+Y    GC P  +  C HH +  TL  C  Q  P  KC  R     Y   F  D
Sbjct: 174 GIVTGGNYDTDEGCMPYPVPSCDHHVNG-TLGPC-GQDPPTPKC-VRLCRKGYNVDFKDD 230

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           KH    +Y V  NE  I+ EI+ +GP    F +Y DF  YKSGVYK  S   L    H+ 
Sbjct: 231 KHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGG--HAI 288

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +++GWG EN  PYWLV N+W   WGD+G  KILRG  EC  E  I AG PK
Sbjct: 289 RILGWGVENDVPYWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 339


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 158/319 (49%), Gaps = 25/319 (7%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA-TV 84
           +D IN  A+T+    N+     + + R          ++ + P P + +  + E+     
Sbjct: 32  VDHINSAASTFQT-ENYAVTHEKMHTRSM-------HEKFNAPFPDEFRATEREFVLDAT 83

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P  FDAR +WP C ++  + +   C +   F+     SDR CI S G Q   +S   + +
Sbjct: 84  PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC +      + C  G  +R + +  +RG VTGGDY   TGC+P  I PC+         
Sbjct: 144 CCGM---SCGEGCDGGFPYRAFQWWARRGVVTGGDYLG-TGCKPYPIRPCNSD------- 192

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
           +C N + P   C   C  P Y   +  DK+     Y V     AI+ +I  +GP  A F 
Sbjct: 193 NCVNLQTPP--CRLSC-QPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFI 249

Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKI 324
           +Y+DF  YKSG+Y+H +        H+ KLIGWGTE GTPYWL +N+WG  WG+ GT +I
Sbjct: 250 VYEDFEKYKSGIYRHIAGRSKGG--HAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRI 307

Query: 325 LRGKYECAFEYLIAAGKPK 343
           LRG  EC  E  I AG P+
Sbjct: 308 LRGVDECGIESRIVAGLPR 326


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 97/262 (37%), Positives = 145/262 (55%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + S    C  D    C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLIS----CCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
            G  +++RG+ EC+ E  + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 95/258 (36%), Positives = 138/258 (53%), Gaps = 9/258 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR +WP C ++ H+ D   C +    +   A SDR CI S G++   +S   + 
Sbjct: 2   IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C Y     C+ G   + +N+  K+G+VTGGDY   +GC+P    PC HHG    
Sbjct: 62  SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C N+     KC  +C        + +D+      Y V ++E AI++EI+ +GP    
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGA 175

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF +YK G+YKHT+        H+ K+IGWG E G PYWL+ N+W   WG+ G  
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYF 233

Query: 323 KILRGKYECAFEYLIAAG 340
           +ILRG   C  E  + AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  180 bits (457), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 97/262 (37%), Positives = 145/262 (55%), Gaps = 8/262 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + S    C  D    C  G   + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLIS----CCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVAGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAG 340
            G  +++RG+ EC+ E  + AG
Sbjct: 317 NGLFRMVRGRDECSIESHVVAG 338


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 121/349 (34%), Positives = 163/349 (46%), Gaps = 41/349 (11%)

Query: 2   IHILVFLLGCTL-------VRGELYKFSDAYIDQINREANTWTAGRN--FPANLSEEYLR 52
           + +   LL C L       +    Y F +  I ++NRE   W AGR   F  + +EEY+ 
Sbjct: 1   MKLTALLLVCALLSINAAHIESNYYPF-EKEIYEVNRENLGWVAGRQKRFEGH-TEEYIA 58

Query: 53  QFL-IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
               +  +     SD P+  D           +PD FD+R QWP+C TIG + D   C +
Sbjct: 59  GLCGVKGSIPLPLSDLPVLED-----------IPDMFDSRTQWPDCKTIGLIEDQSNCGS 107

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
              F A  + SDR CI  K   +  +S   +  CC+ C       C  G +   WN+  +
Sbjct: 108 CWAFGATESMSDRYCIHMK--MHLLISAANLMECCRNC----GNGCEGGFLGAAWNYWKQ 161

Query: 172 RGSVTGGDYG----DRTGCQPSTISPCSHH--GSAPTLPSCENQKVPKLKCHTRCTNPTY 225
            G VTGG Y     +   CQP  +  C HH  GS P  PS    K+ K        +  Y
Sbjct: 162 EGLVTGGLYNPSATESDTCQPYPLPSCEHHINGSKPACPS----KIAKTPECVHTCHAGY 217

Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
              + QD H     Y V      I+ EI+ +GP  A F +Y DF  YKSGVYK  S  +L
Sbjct: 218 PTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQL 277

Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
               H+ K+IGWG E+G PYWL+ N+W   WGD G  KI+RG+ EC  E
Sbjct: 278 GG--HAVKMIGWGEEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 151/316 (47%), Gaps = 22/316 (6%)

Query: 28  QINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDR 87
           ++N    TW A    P     +YL    +   K   + +  + GD           +P+ 
Sbjct: 26  EVNAMKTTWLANEAIPTRDYTQYLGA--LRGGKQLPEKNIAIRGD-----------LPES 72

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FD  E+WP C ++  + D   C +   F A  A +DR CI SKG+    LS + + +CC+
Sbjct: 73  FDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCCE 132

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
            C +     C+ G     W++ H  G  TGG+YG +  C       C HH      P  E
Sbjct: 133 SCGF----GCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDHHVEGKYPPCGE 188

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            Q  P+  C  +C    Y   + +DKH     Y V  N +AIK E++ +GP    F++Y+
Sbjct: 189 TQPTPE--CVEKCQE-GYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYE 245

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           DF  YKSG+Y+H +   L    H+ KL+GWG E+G  YW + N+W   WG+ G  +I+ G
Sbjct: 246 DFMTYKSGIYQHVAGKYLGG--HAVKLVGWGVEDGVEYWKIANSWNEDWGENGYFRIIAG 303

Query: 328 KYECAFEYLIAAGKPK 343
           K EC  E    AG P+
Sbjct: 304 KNECGIESDGVAGIPE 319


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 147/265 (55%), Gaps = 8/265 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCCK C              + W++  KRG VTGG   + TGCQP     C H  
Sbjct: 145 ALDLISCCKDCGGGCKGGFPG----QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH     Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E G PYWL+ N+W   WG+
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVEKGKPYWLIANSWNEDWGE 316

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
           +G  +++RG+ EC+ E  + AG  K
Sbjct: 317 KGLFRMVRGRDECSIESHVVAGLIK 341


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 93/275 (33%), Positives = 143/275 (52%), Gaps = 9/275 (3%)

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
           DR +   +   + E +A +P+ FDAR QWP+C +I  + D   C +   FA   + SDR 
Sbjct: 75  DRRIGKPQLQENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRV 134

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +   +    S E + +CC  C +     C  G     W +    G VTGG YG +  
Sbjct: 135 CIATDANKTAEFSVEDILTCCDECGF----GCDGGFPDAAWEYFVSTGVVTGGLYGTKNA 190

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P  ISPC +H +     +C     P   C T C    Y   +  DK R   +Y + ++
Sbjct: 191 CRPYEISPCGNHPNETFYRNCTGVSTP--SCKTSC-QKGYPVSYKDDKTRGRKSYNLANS 247

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
             AI+K+IL HGP  ATF++Y+DF +YK G+Y++T         H+ +++GWG EN   Y
Sbjct: 248 VSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGG--HAVRILGWGVENNVKY 305

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           W++ N+W   WG+ G  +++RG  +C  E  ++AG
Sbjct: 306 WIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAG 340


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 7/253 (2%)

Query: 91  REQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICR 150
           R QWP C TI  + D  +C +    AA  A SDR CI S GQ    L+     SCC  C 
Sbjct: 1   RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYC- 59

Query: 151 YDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQK 210
               + C  G   + W++  + G VTGG + +RTGCQP   + C H G +     C +  
Sbjct: 60  ---GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYT 116

Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFY 270
            P   C   C    Y + + QDK     +Y V ++E  I +EI+ +GP   TFA++ DF 
Sbjct: 117 YPTPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG 175

Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
            Y+SG+Y H +   +    H+ ++IGWG ENG  YWL+ N+W   WG+ G  +++RG+ E
Sbjct: 176 VYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNE 233

Query: 331 CAFEYLIAAGKPK 343
           C  E  + AG P+
Sbjct: 234 CGIESEVVAGMPR 246


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 130/234 (55%), Gaps = 10/234 (4%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDARE WPNC TI  V D G+C +   F AV A SDR CI SKG +N   S E + 
Sbjct: 28  LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C+ G     W++   +G V+GG YG   GC P  I+PC HH +    
Sbjct: 88  SCCWTCGF----GCNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRG 143

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  E  K P  KC  +C +  Y   + QD HR    Y + ++ D I++EI  +GP    F
Sbjct: 144 PCKEGGKTP--KCVKKCED-GYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAF 200

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTWGPHW 316
            +Y+DF  Y++GVYKH +   L    H+ +++GWG +NG  PYWLV N+W   W
Sbjct: 201 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSWNTDW 252


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 152/324 (46%), Gaps = 24/324 (7%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A +D IN    +W A  N    +S+  ++ F + D ++ D       G+           
Sbjct: 37  ALVDHINTAQTSWLAEHNV---ISDSEMK-FKVMDERFADPLPEEESGEILVSGEIVPEP 92

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE WP+C +I  + +   C +   F A    SDR CI+S G Q   +S E + 
Sbjct: 93  IPDTFDARENWPDCKSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDIL 152

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C     K C  G       F    G+VTGGDY +  GC P + +PC        
Sbjct: 153 SCCGTTC----GKGCQGGYSIEAMRFWKSNGAVTGGDY-NGNGCMPYSFAPCQK------ 201

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA---IKKEILAHGPT 259
              C     P   C T C +      +  DKH  T  Y +    +    I+ EI  +GP 
Sbjct: 202 -SPCVESTTPT--CKTTCQSSYTTANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPV 258

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A++ +Y+DFY YKSGVY + S   +    H+ K+IGWGTEN   YWLV N+WG  +G+ 
Sbjct: 259 EASYKVYEDFYQYKSGVYHYVSGKLVGG--HAVKIIGWGTENDVDYWLVANSWGIKFGEG 316

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI RG  EC  E  + AG  K
Sbjct: 317 GFFKIRRGTNECQIESNVVAGVAK 340


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 135/260 (51%), Gaps = 18/260 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+R +W NC +I  + D   C +   F+     SDR CI +KG Q   +S   + 
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC     D  K       FR WN    RG VTGGD+   +GC+P   +PC         
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCI-------- 188

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            SC  +K P   C   C    Y   + +DK      Y V  N  AI+ EI+ +GP    F
Sbjct: 189 -SCPEEKTPT--CSLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF 244

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+D Y YKSGVY+HT+   L    H+ K+IGWGT+NG PYWL+ N+WG +WG+ G +K
Sbjct: 245 TMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIGWGTQNGIPYWLIANSWGANWGENGFLK 302

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           + RG  EC  E  + AG P+
Sbjct: 303 MRRGVNECGIERAVVAGMPR 322


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 40/334 (11%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPAN-------LSEEYLRQFLIADAKYFDQSDRP 68
           G +     A+++ IN  + TW AG N   N       LS+E ++ F +       + ++P
Sbjct: 72  GNVLTSQAAFVEAINNRSTTWKAGVNPQRNDQYRTGVLSDESMK-FQLPLGFVLKKDEQP 130

Query: 69  LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
           LP                 FDAR++W  C ++  V + G C + +  AAV   +DR C+ 
Sbjct: 131 LPMS---------------FDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVH 175

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           S+G+         V SCC  C +     C  G     W++  + G  +GG +G   GCQ 
Sbjct: 176 SEGKAQFNFGAYDVLSCCHRCGF----GCDGGVPSAVWHYWVENGITSGGAFGSHEGCQS 231

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
                C   G +   P C            R   P Y   + +DKH   + Y V  +E+ 
Sbjct: 232 YPFDVCKKSGDSNDTPRC-----------LRFCQPGYNVTYPEDKHYGRVAYTVPKDEER 280

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
           I  E+   GP  ATF +Y DF  YKSGVY+HT   ++    HS K++GWG EN   YWL 
Sbjct: 281 IMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGT--HSVKVMGWGVENDVKYWLC 338

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            N+WG  WGD G  KI+RG+   +FE  + AG P
Sbjct: 339 ANSWGAQWGDGGFFKIVRGEDHLSFETNVVAGLP 372


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 114/333 (34%), Positives = 160/333 (48%), Gaps = 31/333 (9%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
           + +D+IN + NTWTA      +  +E  +   + DAK        D +D+ +    K Y 
Sbjct: 83  SLVDEINAKQNTWTA------SAEQEKFKTSSLRDAKMLCGTLTRDSNDKVV---EKVYA 133

Query: 78  PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            E    +P  FDAR  +P C   IGHV D  AC     F    AF+DR CIKS G   + 
Sbjct: 134 IEELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKL 193

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT------GCQPST 190
           LS   + +C    +   +  C  G  +  W+++H  G  TGGDY  R       GC P  
Sbjct: 194 LSAGEMNACAPSLK---DPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYD 250

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
             PC+H    P  P+C       L+C ++  +      +F D++    +     + D  K
Sbjct: 251 FPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVV--YFSDRYFMVESVPYHFSADDAK 308

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
             I   GP +ATF +Y+DF  YKSGVYKHTS + L    H+ K+IGWG + G  YWLV+N
Sbjct: 309 NAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGSLLG--AHAVKIIGWGEDGGEAYWLVVN 366

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +W   WGD G  KI  G  +C  +  +  G PK
Sbjct: 367 SWNEGWGDHGLFKIALG--DCGIDNELLGGTPK 397


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 169/354 (47%), Gaps = 36/354 (10%)

Query: 1   MIHILVF--LLGCTLVRGELYKFS-DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA 57
           M +IL F  ++  +    E  K S D  I   + + NT  +  NF  N  EE     L+ 
Sbjct: 1   MANILFFTSIMLLSFYLTEQTKSSHDNMIANSDIKTNTLKSVENFGPNSGEEENIMMLLG 60

Query: 58  D--AKYFDQSDRPLPGDRKTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPH 113
               +   +S +P     K  +P Y     +   FDAR++WP C TIG V + G      
Sbjct: 61  TRGVEAATKSKKPY----KIRNPRYVIDNQNHKEFDARKRWPQCKTIGEVYNEGNALLSW 116

Query: 114 IFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVF--RTWNFLHK 171
            +A  G F+DR CI + G  N+ LSTE + SC  I      K+ ++G V     W +   
Sbjct: 117 AYATTGVFADRMCIATNGSYNKHLSTEELISCSGI------KASANGWVRDGLAWEYFKT 170

Query: 172 RGSVTGGD-YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
            G V+GG  Y    GCQPS I P  +      LP+  N++         C +  YG    
Sbjct: 171 HGLVSGGSIYNTNDGCQPSKIPPVCN------LPTKINKRT--------CVDYCYGNDTI 216

Query: 231 QDKH-RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
           +  H    + Y+       I+KE+  +GP TA   LYDD + +KSGVY  T NAK    L
Sbjct: 217 KYNHDHVKVRYYYHVKPKDIQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVR-L 275

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
              KLIGWG ENG  YWL++N+WG  WG  G +KI RGKY CA E  + A  PK
Sbjct: 276 QYVKLIGWGVENGVDYWLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 329


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 111/305 (36%), Positives = 143/305 (46%), Gaps = 23/305 (7%)

Query: 51  LRQFLIADA----KYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPNCGTIGHV 103
            R F+ A A    +YF    R    +R++           +P  FDAR +WPNC TIG +
Sbjct: 73  FRSFMGARAYDPWRYFMSVKRRQVNERRSLSSPSGFYSSSIPAEFDARLRWPNCPTIGEI 132

Query: 104 PDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVF 163
            + G+CA+    A     SDR CI S  +    LS   + SCCK+C     K C  G   
Sbjct: 133 FEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCKLC----GKGCKGGFPG 188

Query: 164 RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPK-----LKCHT 218
             W    K G VTGG Y    GCQ     PC      P        K PK     L+C  
Sbjct: 189 GAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQ----PRTKGSIKNKCPKTDNTLLECRE 244

Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
            C   +Y + + QD +     Y + ++  AI+ EI+ +GP  A   +Y+DF HYK GVY+
Sbjct: 245 TCRT-SYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLRIYEDFLHYKFGVYR 303

Query: 279 HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIA 338
           H     LE   H+ K+ GWGTE GTPYWL  N W   WG+ G  KILRG      E  + 
Sbjct: 304 HVHGQGLE--YHAVKIFGWGTEGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIEDHVM 361

Query: 339 AGKPK 343
           AG PK
Sbjct: 362 AGIPK 366


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/275 (38%), Positives = 141/275 (51%), Gaps = 27/275 (9%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           D +    +P +FDAR++W  C TIG V D G C +    +   AFSDR C+ + G  N+ 
Sbjct: 18  DDDNYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQL 77

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS E +  CC  C       CS G   R W    K G VTGG+Y    GC+P  + PC +
Sbjct: 78  LSAEEITFCCHTC----GDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPN 133

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTT-----LTYWVDDNEDAI 249
                   +C  Q + K   + RCT   YG     F + HR T     LTY        I
Sbjct: 134 DDQGNN--TCSGQPMEK---NHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY------RGI 182

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWL 307
           +K+++ +GP  A+F +YDDF  YKSG+Y  + NA   +YL  HS KLIGWG E G  YWL
Sbjct: 183 QKDVINYGPIEASFDVYDDFPSYKSGIYVKSENA---SYLGGHSVKLIGWGEEYGVLYWL 239

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ++N+W   WGD+G  KI RG  EC  +     G P
Sbjct: 240 MVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVP 274


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 135/260 (51%), Gaps = 18/260 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+R +W NC +I  + D   C +   F+     SDR CI +KG Q   +S   + 
Sbjct: 81  IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC     D  K       FR WN    RG VTGGD+   +GC+P   +PC         
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCI-------- 188

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            SC  +K P   C   C    Y   + +DK      Y V  N  AI+ EI+ +GP    F
Sbjct: 189 -SCPEEKTPT--CSLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF 244

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+D Y YKSGVY+HT+   L    H+ K+IGWGT+NG PYWL+ N+WG +WG+ G +K
Sbjct: 245 TMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIGWGTQNGIPYWLIANSWGANWGENGFLK 302

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           + RG  EC  E  + AG P+
Sbjct: 303 MRRGVNECGIERAVVAGMPR 322


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 12/285 (4%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           I D KY  Q    +  +    DP+    +P  +D R+ W NC T  ++ D   C +    
Sbjct: 63  IMDIKYKHQKLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           +   A SDR CI SK ++   +S   + +CC   R      C  G     W +    G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           +GG+Y  +  C+P  I PC HHG+      C     P   C  +C  P   + +  DK  
Sbjct: 175 SGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRY 232

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V  +  AI+ EIL +GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           GWG EN T +WL+ N+W   WG++G  +I+RG  +C  E  IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 114/355 (32%), Positives = 175/355 (49%), Gaps = 42/355 (11%)

Query: 5   LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
           ++FL+   L+   L +       D  ID+     +T   G N  P ++ EE+L   +++ 
Sbjct: 4   VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60

Query: 59  AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            +  + + +        +R+ +  E    +   FDAR++WP+C TIG V + G       
Sbjct: 61  TRGVEATSKSKMLHKTRNRRCFSVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSV--FRTWNFLHKR 172
           +   G F+DR CI + G  N+ LSTE + SC  I      K    GSV  +  W +L   
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI------KEDEFGSVNDYYVWEYLKNH 174

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTYGRGF 229
           G V+GG Y    GCQPS I P    G+ PT        + +  C  RC       Y +  
Sbjct: 175 GLVSGGKYNTNNGCQPSKIPPI---GNLPT-------GLYENTCEKRCYGNNTINYNQDH 224

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSGVYKHTSNAKLENY 288
            + K+   + Y      + I++E+  +GP +  F ++D DF+ YKSGVY+ T+N++   +
Sbjct: 225 VKIKNHYDIEY------EDIQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQW 278

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            ++ KLIGWG ENG  YWL++N WG  WG  G  KI RG  EC  E  + AG+P+
Sbjct: 279 QYA-KLIGWGVENGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 92/258 (35%), Positives = 140/258 (54%), Gaps = 9/258 (3%)

Query: 85  PDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS-TEYV 142
           P++FDAR+ WP C   IGHV D   C +    +A    SDR C++S G+    +S T+ +
Sbjct: 85  PEKFDARDAWPYCREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDIL 144

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           A C + C       CS G  F+ W ++ K G  TGGDY  +  C+P    PC +H +   
Sbjct: 145 ACCGEFC----GDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVY 200

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C     P  +C   C    Y + + +DK     +YW+ ++E  I+ +I+ +GP  A 
Sbjct: 201 YGVCPKGSWPTPRCEKFCQR-GYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAA 259

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF  YK G+YKH     ++   H+ K+IGWG +NGT YWL+ N+W   WG+ G  
Sbjct: 260 FDVYEDFKLYKRGIYKHKEG--IQTGGHAVKIIGWGKDNGTDYWLIANSWSKDWGESGFF 317

Query: 323 KILRGKYECAFEYLIAAG 340
           +++RG+ +C  E +I AG
Sbjct: 318 RMVRGENDCEIEDMITAG 335


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 8/264 (3%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           DP+    +P  +D R+ W NC T  ++ D   C +    +   A SDR CI SK ++   
Sbjct: 80  DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVN 138

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S   + +CC   R      C  G     W +    G V+GG+Y  +  C+P  I PC H
Sbjct: 139 ISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGH 195

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           HG+      C     P   C  +C  P   + +  DK      Y V  +  AI+ EIL +
Sbjct: 196 HGNDTYYGECRGT-APTPPCKRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+IGWG EN T +WL+ N+W   W
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMIGWGNENNTDFWLIANSWHNDW 311

Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
           G++G  +I+RG  +C  E  IAAG
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAG 335


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 115/328 (35%), Positives = 157/328 (47%), Gaps = 28/328 (8%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
             ++ ID  + E N+  AG N   N +EE   Q L+   +    +   +    +      
Sbjct: 24  LHNSIIDPSDMETNSLKAGENVLPNSAEEE-HQMLLETREVEAATKSKIMYKTRHPRSAI 82

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              + + FDAR+ WP C TIG V D G       +A  G  +DR CI + G  N+ LSTE
Sbjct: 83  DNQIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTE 142

Query: 141 YVASCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            +  C  I      K+   G+V     W +L   G V+GG Y    GCQPS I P    G
Sbjct: 143 ELIFCGGI------KTKQSGAVRGDDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI---G 193

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEILA 255
           + PT        +    C  RC    YG     ++ D  + +  Y +  NED I+KE+  
Sbjct: 194 NIPT-------HLYNHTCEERC----YGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQT 241

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP +  F +YDDF+ YKSGVY  T  + L    H  KLIGWG ENG  YWL++N+WG  
Sbjct: 242 YGPVSVKFRVYDDFFLYKSGVYVKTEKS-LYVRRHFAKLIGWGVENGVDYWLLVNSWGNE 300

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG  G  KI RG  E   E  + AG+P+
Sbjct: 301 WGQNGLFKIKRGTNEVHVEDYVYAGEPE 328


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 137/252 (54%), Gaps = 11/252 (4%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    FSDR CI S G
Sbjct: 15  RKTVDISYKIDIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNG 74

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
           Q    LS + + SC      ++   C  GS F+ W     +G VTGG++    GCQP  I
Sbjct: 75  QFTDNLSAQNLLSCGD----EEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKI 130

Query: 192 SPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
            PC+H+G+   L +C + +  ++  C  +C N  Y   +  D H+T++ Y     N   I
Sbjct: 131 RPCNHYGNG-NLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLV 308
           ++EI+ +GP TA   +Y++F  YK G+YK T+  +L  Y H  KLIGWG + +GT YWL 
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTA-GELIGYHHV-KLIGWGVDGDGTEYWLA 247

Query: 309 INTWGPHWGDRG 320
           +N+W  +WG  G
Sbjct: 248 MNSWNSNWGTNG 259


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 98/259 (37%), Positives = 131/259 (50%), Gaps = 9/259 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR  WP+C +I  + D  +C +   F AV A SDR CI SKG  N+ LS   + 
Sbjct: 86  LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 145

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C       C  G     W+     G VTGG     TGC+      C H G     
Sbjct: 146 SCCTEC----GCGCRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKG-QY 200

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C +Q  P  +C  RC   T    + +DK R  ++Y V   E A+ KEI+  GP  A  
Sbjct: 201 PPCPHQLYPTPECIKRCD--TKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAIL 258

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+D   YKSGVY H     L    H  +++GWG E+G PYWLV N+W   WG++G ++
Sbjct: 259 HVYEDLLDYKSGVYFHVWGGHLGE--HGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMR 316

Query: 324 ILRGKYECAFEYLIAAGKP 342
           +LR + EC     + AG P
Sbjct: 317 VLRWRNECGIVDQVTAGLP 335


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 177/353 (50%), Gaps = 38/353 (10%)

Query: 5   LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
           ++FL+   L+   L +       D  ID+     +T   G N  P ++ EE+L   +++ 
Sbjct: 4   VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60

Query: 59  AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            +  + + +        +R+ +  E    +   FDAR++WP+C TIG V + G       
Sbjct: 61  TRGVEATSKSKMLHKTRNRRCFRVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           +   G F+DR CI + G  N+ LSTE + SC  I + D+  S +   V   W +L   G 
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI-KEDEFGSVNDDYV---WEYLKNHGL 176

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRC---TNPTYGRGFFQ 231
           V+GG Y    GCQPS I P    G+ PT        + +  C  RC       Y +   +
Sbjct: 177 VSGGKYNTNNGCQPSKIPPI---GNLPT-------GLYENTCEKRCYGNNTINYNQDHVK 226

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSGVYKHTSNAKLENYLH 290
            K+   + Y      + I++E+  +GP +  F ++D DF+ YKSGVY+ T+N++   + +
Sbjct: 227 IKNHYDIEY------EDIQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQY 280

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           + KLIGWG ENG  YWL++N+WG  WG  G  KI RG  EC  E  + AG+P+
Sbjct: 281 A-KLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 12/285 (4%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           I D KY  Q    +  +    DP+    +P  +D R+ W NC T  ++ D   C +    
Sbjct: 63  IMDIKYNHQRLNLMVKE----DPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAV 117

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           +   A SDR CI SK ++   +S   + +CC   R      C  G     W +    G V
Sbjct: 118 STAAAISDRICIASKAEKQVNISATDIMTCC---RPQCGDGCEGGWPIEAWKYFIYDGVV 174

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           +GG+Y  +  C+P  I PC HHG+      C     P   C   C  P   + +  DK  
Sbjct: 175 SGGEYLTKGVCRPYPIHPCGHHGNDTYYGECRGT-APTPPCKKEC-RPGVRKVYRIDKRY 232

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V  +  AI+ EIL +GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+I
Sbjct: 233 GKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKMI 290

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           GWG EN T +WL+ N+W   WG++G  +I+RG  +C  E  IAAG
Sbjct: 291 GWGNENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 155/322 (48%), Gaps = 18/322 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           F+DA++ ++   A +W    NF +N+     R       K   +S        K YD  Y
Sbjct: 34  FNDAFLRRVLARARSWKPDTNFRSNIHYHTFRSL-----KGIGESRTGFKVPIKHYDYVY 88

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FD+R++WPNC ++  + + G C +    AA    SDR CI + G +N  ++ E
Sbjct: 89  DIDIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAE 148

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            +  CC  C          G+ F+ W      G V+GG Y    GC+P    PC +    
Sbjct: 149 DLMGCCADCGNGCEGGFLDGTSFQYWV---DAGLVSGGAYNSTEGCKPYPFKPCLY---- 201

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C  ++ PK K H  C +    R + +DK   ++ Y V  +E  I+ EI+ +GP  
Sbjct: 202 -PFTDCHREESPKCKHH--CQHGVDKR-YARDKVFGSVAYSVPRDERVIRYEIMTNGPVE 257

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+D + YKSGVY+H     +    H+ ++IGWG E G PYWL+ N++G  WGD G
Sbjct: 258 GGFDVYEDVFLYKSGVYRHVYGEHVGK--HAVRIIGWGREGGIPYWLISNSYGEDWGDHG 315

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             KI+RG      E  +  G P
Sbjct: 316 YFKIVRGINHLGIESKVITGLP 337


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 98/245 (40%), Positives = 134/245 (54%), Gaps = 11/245 (4%)

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D  AC +   F  V AF+ R CIKS G+ N+ LS   + +CC I  +  +  CS G+   
Sbjct: 1   DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60

Query: 165 TWNFLHKRGSVTGGDYGDRT------GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHT 218
           +W FLH  G V+GG +          GC P +   C+HH        C  +      C +
Sbjct: 61  SWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120

Query: 219 RCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVY 277
            C N  YG  F +D+H T +L      +  +IKKEI+ +GPT+A F++Y+DF  YKSGVY
Sbjct: 121 SCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSYKSGVY 180

Query: 278 KHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           KHTS   L    H+ ++IGWGTE G  YWLV+N+W   WGD GT KI++G  +C  +  I
Sbjct: 181 KHTSGGFLGG--HAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDTI 236

Query: 338 AAGKP 342
            AG P
Sbjct: 237 LAGTP 241


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 129/230 (56%), Gaps = 10/230 (4%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDARE WPNC TI  V D G+C +   F AV A SDR CI SKG +N   S E + 
Sbjct: 24  LPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLV 83

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C+ G     W++   +G V+GG YG + GC P  I+PC HH +    
Sbjct: 84  SCCWTCGF----GCNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRG 139

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  E  K P   C  +C +  Y   + QD HR    Y + ++ D I++EI  +GP    F
Sbjct: 140 PCKEGGKTP--ACVKKCED-GYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF 196

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTW 312
            +Y+DF  Y++GVYKH +   L    H+ +++GWG +NG  PYWLV N+W
Sbjct: 197 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSW 244


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 115/328 (35%), Positives = 156/328 (47%), Gaps = 28/328 (8%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
             ++ ID  + E N+  AG N   N +EE   Q L+   +    +   +    +      
Sbjct: 24  LHNSIIDPSDMETNSLKAGENVLPNSAEEE-HQMLLETREVEAATKSKIMYKTRHPRSAI 82

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              + + FDAR+ WP C TIG V D G       +A  G  +DR CI + G  N+ LSTE
Sbjct: 83  DNQIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTE 142

Query: 141 YVASCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            +  C  I      K+   G+V     W +L   G V+GG Y    GCQPS I P    G
Sbjct: 143 ELIFCGGI------KTKQSGAVRGDDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI---G 193

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRG---FFQDKHRTTLTYWVDDNEDAIKKEILA 255
           + PT        +    C  RC    YG     ++ D  + +  Y +  NED I+KE+  
Sbjct: 194 NIPT-------HLYNHTCEERC----YGNNTIHYYHDHVKVSHYYNIKSNED-IQKEVQT 241

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP +  F +YDDF+ YKSGVY  T  + L    H  KLIGWG ENG  YWL++N WG  
Sbjct: 242 YGPVSVKFRVYDDFFLYKSGVYVKTEKS-LYVRRHFAKLIGWGVENGVDYWLLVNFWGNE 300

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG  G  KI RG  E   E  + AG+P+
Sbjct: 301 WGQNGLFKIKRGTNEVHVEDYVYAGEPE 328


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 93/239 (38%), Positives = 131/239 (54%), Gaps = 10/239 (4%)

Query: 106 TGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT 165
           +G+C A    AAV A SDR CI SKG++   LS + + SCCK C +     C  G     
Sbjct: 14  SGSCWA---VAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGF----GCFGGEPMAA 66

Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTY 225
           W +   RG VTG +Y + +GC+P    PC HH +      C++   P  KC  +C +  Y
Sbjct: 67  WKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKC-DKNY 125

Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
           G+ +  DK+     Y V+ N ++I+KEI+  GP  A+F +Y DF +Y  G+YKH + +  
Sbjct: 126 GKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMG 185

Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
               H+ K++GWG + G PYWL  N+W   WG+ G  +ILRG  EC  E  I AG PK 
Sbjct: 186 GG--HAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIPKQ 242


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 103/269 (38%), Positives = 136/269 (50%), Gaps = 16/269 (5%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE+WP+C TI  + +   C +   F A    SDR CI+S G Q   +S E + 
Sbjct: 30  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 89

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C Y     C  G       F    G+VTGGDYG   GC P + +PC+ +    T
Sbjct: 90  SCCGTTCGY----GCKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPEST 144

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGF------FQDKHRTTLTYWVDDNEDA--IKKEIL 254
            PSC+       K      +  YG         FQ        Y V   +    I+ EI 
Sbjct: 145 TPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIY 204

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP  A++ +Y+DFYHYKSGVY +TS   +    H+ K+IGWG ENG  YWL+ N+WG 
Sbjct: 205 HYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGG--HAVKIIGWGVENGVDYWLIANSWGT 262

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            +G++G  KI RG  EC  E  + AG  K
Sbjct: 263 SFGEKGFFKIRRGTNECQIEGNVVAGIAK 291


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 151/318 (47%), Gaps = 32/318 (10%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           I+QIN + ++WTA  N P +  E  L    I     F          R          +P
Sbjct: 24  INQINSQQSSWTARIN-PFDDIESRLGFLGIHPDPNFQLEVLEWEEPR--------TVIP 74

Query: 86  DRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
             FDARE WP C   IG++ + G C +   FAA    SDR C+ + G      S E + +
Sbjct: 75  ATFDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLIN 134

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC+ C     K C  G  +  W +    G V+GGDY    GCQP + S   + G +P   
Sbjct: 135 CCETC----GKKCKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSN-FNDGVSP--- 186

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTTATF 263
                     +C   C N  Y   +  D+H    TY++  N   I++EIL  G P  A F
Sbjct: 187 ----------ECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGF 236

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV- 322
            +Y+DF  Y+ GVY HTS A L +  H+ K+IGWGTENG  YWLV N+WG  WG  G V 
Sbjct: 237 DVYEDFKLYREGVYVHTSGALLGS--HAVKIIGWGTENGWAYWLVANSWGKDWGALGGVF 294

Query: 323 KILRGKYECAFEYLIAAG 340
           KI RG  EC  E  I  G
Sbjct: 295 KIRRGTNECKIEQSIITG 312


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 157/353 (44%), Gaps = 38/353 (10%)

Query: 1   MIHILVFLLGCTLVRGELY---------KFSDAYIDQINREANTWTAGRNF--PANLSEE 49
            I I+   LG   V G+ Y         + + +   QI     TW AG N   PA     
Sbjct: 4   FILIVAAALGSPAVLGQYYNTFSYNGQYRSTGSIASQIRNLTRTWVAGNNTLPPA----A 59

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
           Y +  L      +D+            +P+    +P+ FDAR++W  C ++  + + G C
Sbjct: 60  YFKGVL------YDRLGETRLAPAILVNPQ-DIQLPESFDARQKWSQCPSLNVIRNQGCC 112

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +    +A  A +DR CIKSKG++        + +CC  C       C  G +   W F 
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHAC----GDGCKGGYLGPAWQFW 168

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
            ++G  +GG Y  R GC P  I  C   G     P          KC  RC +       
Sbjct: 169 VEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTP----------KCSKRCQSGYNVTDV 218

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
           +QD+    + Y + ++E  I +EI  +GP  A F  Y D + YKSGVY+H          
Sbjct: 219 WQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGG-- 276

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           H+ KL+GWG ENG  YWLV N+WG  WGD G  KI+RG+  C  E  + AG P
Sbjct: 277 HAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLP 329


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 162/317 (51%), Gaps = 19/317 (5%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           A +D +N   + +    + P N  E++++   I D KY  ++    P  RK  +   +  
Sbjct: 36  ALVDYVNSHQSLFKTEYS-PTN--EQFVKA-RIMDIKYMTEASHKYP--RKGIN--LNVE 87

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+RFDARE+WP+C +IG + D  AC +    +A    SDR CI++ G   + LS+  + 
Sbjct: 88  LPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADIL 147

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC-SHHGSAPT 202
           +CC     D    C  G   + + +L   G  +GG+Y ++  C+P    PC  ++G  P 
Sbjct: 148 ACCG---EDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYGPCPK 204

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             + +  K  K+ C  R   P      F       L     DNE  I++EI  +GP  A 
Sbjct: 205 EGAFDTPKCRKI-CQFRYPVPYEEDKVFGKNSHILL----QDNEARIRQEIFINGPVGAN 259

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +++DF HYK G+YK T    +   +H+ KLIGWGTENGT YWLV N++   WG+ GT 
Sbjct: 260 FYVFEDFIHYKEGIYKQTYGKWIG--VHAIKLIGWGTENGTDYWLVANSYNYDWGENGTF 317

Query: 323 KILRGKYECAFEYLIAA 339
           +ILRG   C  E  + A
Sbjct: 318 RILRGTNHCLIESQVIA 334


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 157/353 (44%), Gaps = 38/353 (10%)

Query: 1   MIHILVFLLGCTLVRGELY---------KFSDAYIDQINREANTWTAGRNF--PANLSEE 49
            I I+   LG   V G+ Y         + + +   QI     TW AG N   PA     
Sbjct: 4   FILIVAAALGSPAVLGQYYNTFSYNGQYRSTGSIASQIRNLTRTWVAGNNTLPPA----A 59

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
           Y +  L      +D+            +P+    +P+ FDAR++W  C ++  + + G C
Sbjct: 60  YFKGVL------YDRLGETRLAPAILVNPQ-DIQLPESFDARQKWSQCPSLNVIRNQGCC 112

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +    +A  A +DR CIKSKG++        + +CC  C       C  G +   W F 
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHAC----GDGCKGGYLGPAWQFW 168

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF 229
            ++G  +GG Y  R GC P  I  C   G     P          KC  RC +       
Sbjct: 169 VEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTP----------KCSKRCQSGYNVTDV 218

Query: 230 FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
           +QD+    + Y + ++E  I +EI  +GP  A F  Y D + YKSGVY+H          
Sbjct: 219 WQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGG-- 276

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           H+ KL+GWG ENG  YWLV N+WG  WGD G  KI+RG+  C  E  + AG P
Sbjct: 277 HAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLP 329


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 107/282 (37%), Positives = 141/282 (50%), Gaps = 25/282 (8%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           RK Y  E    +P  FDAR  +PNC   IGH+ D  AC +   F    AF+DR CIKS G
Sbjct: 10  RKGYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHG 69

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---G 185
                LS   + +C        +  C+ G     W+++H +G  TGGDY    D T   G
Sbjct: 70  TFTELLSAGEMNACAP------SHGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDG 123

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYW 241
           C P    PC+HH +    P C         C  +C NP Y      D+H     +   Y 
Sbjct: 124 CWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYS 183

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           V+D ++AI+ +    GP +A+F +Y+DF  YKSGVYKHTS   L    H+ K+IGWG E+
Sbjct: 184 VNDAKNAIRTD----GPVSASFTVYEDFLAYKSGVYKHTSGEYLGG--HAVKIIGWGEES 237

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G  YWLV+N+W   WGD G  KI  G   C  +  +  G PK
Sbjct: 238 GQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTPK 277


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/333 (36%), Positives = 162/333 (48%), Gaps = 44/333 (13%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           + I+QIN + + WTAG N P +  E  L    I    + D + +P     +  +P+ +  
Sbjct: 21  SLINQINSQQSAWTAGIN-PFDDIESRLGFLGI----HPDPNFKP-----EIKEPQATQN 70

Query: 84  V-PDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           V P+ FDARE WP C   IG++ + G C++   FAA    SDR CI + G+    LS E 
Sbjct: 71  VIPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPED 130

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           +  CC  C       C  G  +  WN+    G V+GGDY   TGCQP   S  +++   P
Sbjct: 131 LIDCCHYC----GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQP--YSELNYYRITP 184

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG-PTT 260
                         C+T C N  Y   +  DKH     Y++  NE AI+ EIL+ G P  
Sbjct: 185 -------------PCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVV 231

Query: 261 ATFALYDDFYHYK---------SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
           A F +Y DF  Y+          GVY +TS A       + K+IGWGTENG  YWL  N+
Sbjct: 232 AAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGR--TAVKIIGWGTENGWAYWLAANS 289

Query: 312 WGPHWGDRGT-VKILRGKYECAFEYLIAAGKPK 343
           WG  WG  G   KI RG  EC FE  I AG+ +
Sbjct: 290 WGKDWGALGGFFKIRRGTNECGFEESIIAGQVR 322


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 149/322 (46%), Gaps = 18/322 (5%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           F+D ++ ++   A TW    NF +N+     R       K   +S        + Y+  Y
Sbjct: 29  FNDDFLRRVLARARTWKPDTNFQSNVHFHAFRSL-----KGIGESRTGFKVPIRRYEYVY 83

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
              +P+ FDAR  WPNC ++  + + G C +    AA    SDR CI S G  N  L+ E
Sbjct: 84  DVDIPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAE 143

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            +  CC  C    N     G+ F+ W      G V+GG Y    GC+P    PC +    
Sbjct: 144 DLMGCCVDCGNGCNGGFLDGTSFQYWV---DAGLVSGGAYNSTDGCKPYPFKPCEY---- 196

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                C  +  PK   H R       R + +DK    + Y V  +E AI+ EI+ +GP  
Sbjct: 197 -PFNDCHVEISPKCTHHCR---DGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVE 252

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y+D   YKSGVY+H    ++    H+ ++IGWG + G PYWL+ N++G  WGD G
Sbjct: 253 AGFDVYEDVLLYKSGVYRHVYGEQIGK--HAVRIIGWGRDGGIPYWLIANSYGDDWGDHG 310

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             K +RG      E  I  G P
Sbjct: 311 YFKFVRGSNHLGIESKIITGLP 332


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 114/323 (35%), Positives = 164/323 (50%), Gaps = 34/323 (10%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD--RKTYDPEYS 81
           A +D +N   + +        ++SEE+++   + + KY      P P D  R T      
Sbjct: 33  ALVDYVNSAQSLFITEH---VDVSEEFMKS-RVMNVKY----ASPPPSDEIRATEVNTVL 84

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           AT+P+ FDAR +WP C +I  + +   C +   F A    SDR CI +KG +   +S   
Sbjct: 85  ATIPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMD 144

Query: 142 VASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-GDRTGCQPSTISPCSHHGS 199
           +  CC + C Y     C  G   +   +    G VTGGDY GD  GC+P     C+  G 
Sbjct: 145 MVDCCGEYCGY----GCDGGYSIQALRWWVFDGVVTGGDYQGD--GCKPYQF--CNSAGC 196

Query: 200 APTL-PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
              + P C       L C ++     Y   + +DK+  T  Y+V    +AI+ +I+ +GP
Sbjct: 197 PDAVTPEC------ALSCQSK-----YNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGP 245

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A+F +Y+DFY YKSGVYK+ +   L    H+ K+IGWGTENGT YWL+ N+WG  WG+
Sbjct: 246 VEASFKVYEDFYKYKSGVYKYIAGKMLGG--HAIKIIGWGTENGTAYWLIANSWGTKWGE 303

Query: 319 RGTVKILRGKYECAFEYLIAAGK 341
            G  KI RG  EC  E  + AGK
Sbjct: 304 NGFFKIRRGVNECGIENNVVAGK 326


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 94/233 (40%), Positives = 127/233 (54%), Gaps = 10/233 (4%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P+ FDARE+WPNC TI  V D G+C +   F AV A SDR CI S G +N   S E
Sbjct: 23  STDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAE 82

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C +     C+ G     WN+   +G V+GG YG   GC P  I+PC HH + 
Sbjct: 83  NLVSCCWTCGF----GCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNG 138

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P  E  K P   C  +C    Y   + QD H     Y + ++ D I++EI  +GP  
Sbjct: 139 TRGPCKEGGKTP--TCVKKCEE-GYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVE 195

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-TPYWLVINTW 312
             F +Y+DF  Y++GVYKH +   L    H+ +++GWG +NG  PYWLV N+W
Sbjct: 196 GAFTVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNGEIPYWLVANSW 246


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 106/326 (32%), Positives = 160/326 (49%), Gaps = 23/326 (7%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           +A+ + +N+  + +TA +  P  L+   +R   + ++++ D  +  +    K  D ++S 
Sbjct: 36  EAFAEFLNKRQSFFTA-KYTPNALNILKMR---VMESRFLDNEEGEM---LKEEDMDFSE 88

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FDAR++WP C +IG + D   C +    ++    SDR C++S G     LS   +
Sbjct: 89  EIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDI 148

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            +CC  C       C  G   R W +    G  TGG YG +  C+P    PC       +
Sbjct: 149 LACCPNC----GAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDE----S 200

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C     P  KC   C    Y + +  DK+     Y +  NE  IK EI+ +GP TA+
Sbjct: 201 YGKCPKDSFPTPKCRKICQY-KYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTAS 259

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--NGT--PYWLVINTWGPHWGD 318
           F +Y DF  Y+ GVY  +   +L    H+ K+IGWGTE  NGT  PYWL+ N+WG  WG+
Sbjct: 260 FRIYPDFGFYEKGVYVTSGGRELGG--HAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGE 317

Query: 319 -RGTVKILRGKYECAFEYLIAAGKPK 343
             G  +ILRG+  C  E  + AG  K
Sbjct: 318 NNGYFRILRGQNHCQIEQKVIAGMIK 343


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 114/320 (35%), Positives = 154/320 (48%), Gaps = 20/320 (6%)

Query: 24  AYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE-- 79
           + +D+IN    +WTA ++ P    +S + L      D  +    D    G+ +   P   
Sbjct: 83  SMVDKINSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEG--GENRLLGPTNP 140

Query: 80  YSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
              T+P  FDAR+++ +C   IGHV + G C      AAVG F+DR CIKS G+    LS
Sbjct: 141 VLTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILS 200

Query: 139 TEYVASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTI 191
             Y+ SCC        +  C  GSV    NF+   G VTGG+Y      G+  GC P   
Sbjct: 201 LGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPYPF 260

Query: 192 SPCSH-HGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
             C+H  G     P C + + +P   C T C N  YG    +D HR      +    + I
Sbjct: 261 PKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKI 318

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
           K+EI  +GP  A   LY+DF  YKSGVY H +   L    H+ KLIGWG E+G  YWL +
Sbjct: 319 KQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLA--AHTLKLIGWGVESGQEYWLAV 376

Query: 310 NTWGPHWGDRGTVKILRGKY 329
           N W   WGD G +K+    Y
Sbjct: 377 NAWNEEWGDHGMIKLASSVY 396


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 116/321 (36%), Positives = 160/321 (49%), Gaps = 22/321 (6%)

Query: 29  INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
           +N +   W A         E+  R   + D K+    ++    D      E    +P+ F
Sbjct: 47  VNEQQQLWKA-ETSRMTFQEKMAR---VKDIKFIRSHEQSTENDNSQVFEE----IPNSF 98

Query: 89  DAREQWPNCGTIGHVPDTGAC-AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC- 146
           DAR++WP+C  IG V D   C +A H+ AA  A SDR CI S G  N PLS +   SCC 
Sbjct: 99  DARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIA-SDRTCIFSNGTFNWPLSAQDPLSCCV 157

Query: 147 ---KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH-HGSAPT 202
               IC   D   C          +    G  TGG+Y D+ GC+P TI PC   + +  T
Sbjct: 158 GLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTT 215

Query: 203 LPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
              C     P   C  RCT N T+   + QDKH     Y V      I+ EI+ +GP  A
Sbjct: 216 SVPCPGYHTPV--CEERCTSNITWPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIA 273

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
           +F +YDDF+ YKSG+Y HT+  + E  + + K+IGWG +NG PYWL ++ WG  +G+ G 
Sbjct: 274 SFIIYDDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYWLCVHQWGTDFGENGF 331

Query: 322 VKILRGKYECAFEYLIAAGKP 342
           V+ILRG  E   E+ + A +P
Sbjct: 332 VRILRGVNEVNIEHQVLAAQP 352


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 95/231 (41%), Positives = 127/231 (54%), Gaps = 8/231 (3%)

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
           H F AV + SDR CI SK + +  LS   + SCC  C +     C  G     W++    
Sbjct: 44  HAFGAVESMSDRICIHSKNKISVELSAINLLSCCTRCGF----GCRGGIPGMAWDYWKYE 99

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG     TGCQP     C+HH S+ + P CE+   P  +CH  C +  YG+ + +D
Sbjct: 100 GIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQD-DYGKPYKKD 158

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           K     +Y V   E +I KEIL +GP    F +Y+DF +YKSGVYKH + + L    H+ 
Sbjct: 159 KFYGKSSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGG--HAI 216

Query: 293 KLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ++IGWG  +N  PYWL  N+W   WGD+G  KILRG  EC  E ++ AG P
Sbjct: 217 RIIGWGIQQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLP 267


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE+WP C ++  + D G C +    +A  A +DR C++SKG++     +  + 
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C     + C  G++   W F  ++G  +GG    R GC P  I  C   G     
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
              E++  PK  C  +C +       +QD+H   + Y + ++E  I +EI  +GP  A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
             Y D + YKSG+Y+H          H+ KL+GWG ENG  YWLV N+WG  WG+ G  K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348

Query: 324 ILRGKYECAFEYLIAAGKP 342
           I+RG+  C  E  I AG P
Sbjct: 349 IVRGENHCGIEENIHAGLP 367


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 111/352 (31%), Positives = 171/352 (48%), Gaps = 43/352 (12%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           + H+L +      + G+      + ++ +N     W A       +SEE ++ F + D+K
Sbjct: 8   IAHLLQYTFSQQTLSGK------SLVNHVNTIQTLWKAEY---FEISEEEMK-FKVMDSK 57

Query: 61  YFDQSDRPLPGDRKTYDPEYS-----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           +        P ++ + +P  S     +  P  FDAR+ WPNC +I  + D   C +   F
Sbjct: 58  F------AFPEEQISSEPNNSLPGSLSRAPTSFDARDYWPNCKSIKMIRDQAYCGSCWAF 111

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            A    SDR CI+S G     +S E + +CC      ++  C  G V     F   +G V
Sbjct: 112 GAAEVISDRICIQSNGTDQPIISPEDILTCCT-----NSHGCQGGFVLEAMKFWKSKGVV 166

Query: 176 TGGDY-GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           TGGD+ GD  GC P +   CS   +A T P C+N+      C  + T   Y     +DK+
Sbjct: 167 TGGDFQGD--GCIPYSYGSCSDCHTAQTTPKCKNE------CQVKYTKNEYK----EDKY 214

Query: 235 RTTLTYWVDDNEDA--IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
             +  Y +  +     I+ EIL +GP  AT+ +Y+DFY+YKSGVY++ S   +    H+ 
Sbjct: 215 YGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGG--HAV 272

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           K+IGWG E    YWL+ N+WG  +G+ G  K+ RG  EC  E  + AG  K+
Sbjct: 273 KIIGWGVEENVNYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAKS 324


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/260 (35%), Positives = 131/260 (50%), Gaps = 27/260 (10%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FD+RE+WP C  I  + D   C +    +A    +DR CI SKGQ+   +S E + 
Sbjct: 280 LPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQIL 339

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +C              G +   +N+  K G  TGG YGD++ CQP +I+PCS      + 
Sbjct: 340 AC--------------GMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTAST 385

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           PSC      K  C      P     F+  +H     Y V  N+  I  EI  HGP  A F
Sbjct: 386 PSC------KYDCQADYDIPISDDKFYASEH-----YHVSSNQYEIMNEIYTHGPVVAGF 434

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y+DF +Y SG+Y+ T+   +    H+ ++IGWG ENG PYWL+ N+W   +G++G  +
Sbjct: 435 IVYEDFTYYISGIYQQTTYVAMGG--HAIRIIGWGEENGIPYWLIANSWNTTFGEKGFFR 492

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           I RG  EC  E  +  G PK
Sbjct: 493 IRRGTNECRIESEVYTGIPK 512



 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 56/121 (46%), Gaps = 11/121 (9%)

Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
           C  G +   + +  + G VTGG YG++  C P +ISPC+        P C+         
Sbjct: 70  CRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTMCRPYMLAPKCQ--------- 120

Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
             R    +Y     +DK+     Y+V+ +E  I +EI   GP  A F +Y DF +Y SG 
Sbjct: 121 --RTCQASYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQ 178

Query: 277 Y 277
           +
Sbjct: 179 F 179


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 136/253 (53%), Gaps = 11/253 (4%)

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           +RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S 
Sbjct: 14  NRKTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 73

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           GQ    LS + + SC       +   C  GS F+ W     +G VTGG++    GCQP  
Sbjct: 74  GQFTDNLSAQNLMSCGD----GEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYK 129

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDA 248
             PC H+G +  L +C + +  ++  C  +C N  Y   +  D H+T++ Y     N   
Sbjct: 130 NRPCDHYGDSR-LTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQ 188

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWL 307
           I++EI+ +GP TA   +Y++F  YK G+YK T+  +L  Y H  KLIGWG + +GT YWL
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTT-GELIGYHHV-KLIGWGVDGDGTEYWL 246

Query: 308 VINTWGPHWGDRG 320
            +N+W  +WG+ G
Sbjct: 247 AMNSWNSNWGNDG 259


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE+WP C ++  + D G C +    +A  A +DR C++SKG++     +  + 
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C     + C  G++   W F  ++G  +GG    R GC P  I  C   G     
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
              E++  PK  C  +C +       +QD+H   + Y + ++E  I +EI  +GP  A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
             Y D + YKSG+Y+H          H+ KL+GWG ENG  YWLV N+WG  WG+ G  K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ++RG+  C  E  I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/261 (39%), Positives = 134/261 (51%), Gaps = 25/261 (9%)

Query: 87  RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
            FDAR++WP C TIG V + G       +A  G F+DR CI + G  N+ LSTE + SC 
Sbjct: 35  EFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISCS 94

Query: 147 KICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGD-YGDRTGCQPSTISPCSHHGSAPTL 203
            I      K+ ++G V     W +    G V+GG  Y    GCQPS I P  +      L
Sbjct: 95  GI------KASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN------L 142

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKH-RTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           P+  N++         C +  YG    +  H    + Y+       I+KE+  +GP TA 
Sbjct: 143 PTKINKRT--------CVDYCYGNDTIKYNHDHVKVRYYYHVKPKDIQKEVQTYGPVTAA 194

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
             LYDD + +KSGVY  T NAK    L   KLIGWG ENG  YWL++N+WG  WG  G +
Sbjct: 195 LNLYDDIFLHKSGVYTLTKNAKYVR-LQYVKLIGWGVENGVDYWLLVNSWGNEWGQNGLL 253

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RGKY CA E  + A  PK
Sbjct: 254 KIKRGKYGCAVESFVYAAVPK 274


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE+WP C ++  + D G C +    +A  A +DR C++SKG++     +  + 
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C     + C  G++   W F  ++G  +GG    R GC P  I  C   G     
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
              E++  PK  C  +C +       +QD+H   + Y + ++E  I +EI  +GP  A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAF 290

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
             Y D + YKSG+Y+H          H+ KL+GWG ENG  YWLV N+WG  WG+ G  K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ++RG+  C  E  I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/259 (35%), Positives = 134/259 (51%), Gaps = 16/259 (6%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDARE+WP C ++  + D G C +    +A  A +DR C++SKG++     +  + 
Sbjct: 125 LPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLL 184

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C     + C  G++   W F  ++G  +GG    R GC P  I  C   G     
Sbjct: 185 SCCHSC----GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGECRIPG----- 235

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
              E++  PK  C  +C +       +QD+H   + Y + ++E  I +EI  +GP  A F
Sbjct: 236 ---EDEDTPK--CSNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFINGPVQAAF 290

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
             Y D + YKSG+Y+H          H+ KL+GWG ENG  YWLV N+WG  WG+ G  K
Sbjct: 291 HTYLDLHAYKSGIYRHVWGPLSGG--HAVKLLGWGVENGVKYWLVANSWGREWGENGFFK 348

Query: 324 ILRGKYECAFEYLIAAGKP 342
           ++RG+  C  E  I AG P
Sbjct: 349 MVRGENHCGIEENIHAGLP 367


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/314 (32%), Positives = 159/314 (50%), Gaps = 29/314 (9%)

Query: 46  LSEEYLRQFLIADAKYFDQSDRPLPGDRKTY----------------DPEYSATVPDRFD 89
           L+ E L  +L  +   F+ +  P P   +                  DPE +  +P+ +D
Sbjct: 32  LTGEPLVAYLRKNQNLFEVNSEPTPNFEQKIMDIKFKNQKLNFVVKNDPEPNEDIPEEYD 91

Query: 90  AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-I 148
            RE++  C T  ++ D   C +    +   A SDR CI + G++   +S+  + +CC   
Sbjct: 92  PREKF-KCSTF-YIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQ 149

Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
           C +     C  G   R W +    G V+GG+Y  +  C+P  I PC HHG+      C  
Sbjct: 150 CGF----GCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPR 205

Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
           +      C  +C  P Y + F  DK +  + Y V+  E+AI++EIL HGP  A+FA+Y+D
Sbjct: 206 EAATP-PCKKKC-QPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYED 263

Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGDRGTVKILR 326
           F  YK+GVYKHT+ A L  Y H+ K++GWG ++ T   YWL+ N+W   WG+ G  + +R
Sbjct: 264 FSLYKTGVYKHTAGA-LRGY-HAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIR 321

Query: 327 GKYECAFEYLIAAG 340
           G  +C  E  +AAG
Sbjct: 322 GINDCEIEDTVAAG 335


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 130/263 (49%), Gaps = 19/263 (7%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P   DAR++WP C  IG V D   C +    ++    +DR CI+S   +   LS E
Sbjct: 81  SVDLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEE 140

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCCKIC Y     C  G   + + +   RG  TGG YG   GC+P +I   S   + 
Sbjct: 141 ELVSCCKICGY----GCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSEDEAE 196

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             L            C  +C N  Y     QD+H     YWV+ NE+ I +E+  +GP  
Sbjct: 197 TPL------------CTRQCINE-YPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVV 243

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +Y+DF +Y  GVY+H     L    H+ KLIGWG EN   YWL+ N+W   WG+ G
Sbjct: 244 VAFNVYEDFMYYIKGVYEHRFGKFLGG--HAVKLIGWGIENSKKYWLISNSWNTTWGENG 301

Query: 321 TVKILRGKYECAFEYLIAAGKPK 343
             KI+RGK  CA E  + AG  +
Sbjct: 302 FFKIIRGKNCCAIESYVVAGMAR 324


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 95/229 (41%), Positives = 122/229 (53%), Gaps = 8/229 (3%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F A  A SDR CI S  + +  LS E + SCC+ C       C+ G     W+F  K G 
Sbjct: 26  FGASEAMSDRICIHSNAKISVELSAEDLLSCCESC----GMGCNGGYPSAAWDFWTKDGL 81

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y    GC+P TI PC HH +  + PSC  +     +C  RC    Y   + QDKH
Sbjct: 82  VSGGLYDSHIGCRPYTIPPCEHHVNG-SRPSCSGEGGETPQCVYRC-EAGYTPSYKQDKH 139

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y V  +ED IK EI  +GP    F +Y+DF  YK+GVY+H + + L    H+ K+
Sbjct: 140 YGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGG--HAIKI 197

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG ENG PYWL  N+W   WG+ G  KILRG   C  E  I AG P 
Sbjct: 198 LGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 151/329 (45%), Gaps = 30/329 (9%)

Query: 23  DAYIDQINR-------EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           +A ++ +N+       E ++ T+  +    +SEEYL Q              P     + 
Sbjct: 41  NALVEYVNKRQQFFQTEISSLTSSDHKARLMSEEYLTQ--------------PNLNRNEL 86

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
                   +P+ FDARE+W  C +I  + D   C +    +A    SDR CI S G+ N 
Sbjct: 87  MTGLLDVEIPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINV 146

Query: 136 PLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
            LS   + SCC   C     + C  G     W +    G  TGG Y ++  C+P    PC
Sbjct: 147 GLSATDILSCCGTTC----GRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPC 202

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +      C  +  P  +C   C    Y   +  DK      Y + +NE AI++EI+
Sbjct: 203 GHHRNEIYYGECPKEIFPTPQCTQSC-QAGYASDYEDDKIYGKSAYALPNNEKAIQREIM 261

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWG 313
            +GP  A F +Y+DF  Y+SG+Y HT+  +     H+ KLIGWG  ++G  YWL  N+W 
Sbjct: 262 TNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGG--HAVKLIGWGVDDDGNKYWLAANSWN 319

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WG+ G  +I+RG   C  E  + AG P
Sbjct: 320 SDWGENGYFRIVRGVDHCGIESAVVAGMP 348


>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
          Length = 255

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/248 (37%), Positives = 133/248 (53%), Gaps = 11/248 (4%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           RK  D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S G
Sbjct: 15  RKIVDNNYETVIPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 74

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
           Q    LS + + SC      ++   C  GS F+ W     +G VTGG+Y    GCQP   
Sbjct: 75  QFTDNLSAQNLMSCGN----EEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKN 130

Query: 192 SPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
            PC H+G + +L +C + +  ++  C  +C N  Y   +  D H+T++ Y     N   I
Sbjct: 131 RPCDHYGDS-SLTNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLV 308
           ++EI+ +GP TA   +Y++F  YK G+YK T+  +L  Y H  KLIGWG  E+GT YWL 
Sbjct: 190 QQEIMTYGPVTALMYVYENFMGYKKGIYKSTA-GELIGY-HHVKLIGWGVDEDGTEYWLA 247

Query: 309 INTWGPHW 316
           +N+W  +W
Sbjct: 248 MNSWNSNW 255


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 115/341 (33%), Positives = 162/341 (47%), Gaps = 20/341 (5%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKY 61
           +  +V   G  +V   +   S+  I+ IN    TW AGRNF    S     Q     +  
Sbjct: 8   VLFVVAAQGRLMVPSSVEPLSEEMINFINSINTTWKAGRNFDEKRSHSDCVQGGDGASVL 67

Query: 62  FDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              S         +Y+ +   T P+ F  RE W +C +I  + D  AC +   FAA  + 
Sbjct: 68  TATSTS---SHFTSYEEDSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESI 124

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI + G+    +S E + +CC  C +  +  C   SV      L  R  V      
Sbjct: 125 SDRICIHTNGKVQVNISAEDLLACCHTCGHGCDGRCHCSSV----AILQGRRLVPE-PVR 179

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GCQP ++ PC        +P+C + + P  KC   C    Y + + +DKH     Y 
Sbjct: 180 TEDGCQPYSLPPC--------VPNCTHPE-PTPKCQHVCRK-GYEKSYEEDKHFAKNVYR 229

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           +    DAIK +I  +GP  + F +Y DF  YKSGVY+      +   +H+ K++GWGTE+
Sbjct: 230 LLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMG--VHAIKILGWGTED 287

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G PYWLV N+W   WGD+G  KILRGK EC  E +I AG P
Sbjct: 288 GVPYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIP 328


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 122/358 (34%), Positives = 160/358 (44%), Gaps = 43/358 (12%)

Query: 1   MIHILVFLLGCTLVRG----ELYKFS----DAYIDQINREANTWTAGRNFPANLSEEYLR 52
           MI IL F+   +L  G    E+ K S     A +D +N   ++W A          EY  
Sbjct: 1   MIRILNFVALASLSYGFVVQEVPKRSVLSGQALVDHVNAVQDSWKA----------EY-- 48

Query: 53  QFLIADAKYFDQSDRPLPGDRKTY---DPEYSATV--PDRFDAREQWPNCGTIGHVPDTG 107
             +   AK  D     +P   K+    D E+   +  P  FD+R QWPNC +I  + D  
Sbjct: 49  SSISMKAKTMDVRFAEVPESEKSEKSDDLEFETLIQLPTAFDSRVQWPNCNSIKLIRDQT 108

Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTW 166
            C +   FAA    SDR CI+S G Q   +S E + SCC   C    N  C  G      
Sbjct: 109 YCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSC----NNGCQGGYTIEAM 164

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
            +    G VTGGDY    GC P +  PCS        PSC+          T C      
Sbjct: 165 KYWMNSGVVTGGDY-QGAGCIPYSFRPCSTCKEPKDAPSCK----------TTCQASYKA 213

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
           +  ++    T+    V +    I+ EI  +GP    + +YDDFYHYKSGVY H    K  
Sbjct: 214 KSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKPS 273

Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
              H+ K+IGWGTE    YWLV N+W   +G+ G  KI RG  EC  E  + AG PK+
Sbjct: 274 G--HAVKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPKS 329


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 155/327 (47%), Gaps = 60/327 (18%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP- 78
            SD  I  IN   N  W A         E+  R   + DA++   + R  P  R+T  P 
Sbjct: 29  LSDDIISYINEHPNAGWRA---------EKSNRFHSLDDARFQLGARREEPDLRRTRRPT 79

Query: 79  ----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
               +++  +P  FD+R++WP C +I  + D   C +   F AV A S+R CI+S G+QN
Sbjct: 80  VDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSERSCIQSGGKQN 139

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             LS                              +   G VTG    + TGC+P     C
Sbjct: 140 VELSA-----------------------------VDLEGIVTGSSKENNTGCEPYPFPKC 170

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H  +    P C ++     +C T C    Y   + QDKHR            AI+KEI+
Sbjct: 171 EHF-TKGQYPPCGSKIYKTPRCKTTCQK-RYKTSYAQDKHR------------AIQKEIM 216

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP  A+F +Y+DF +YKSG+YKH +   L    H+ ++IGWG EN TPYWL+ N+W  
Sbjct: 217 KYGPVEASFTVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKTPYWLIANSWNE 274

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGK 341
            WG+ G  +I+RG+ EC+ E  + AG+
Sbjct: 275 DWGENGYFRIVRGRDECSIESEVTAGR 301


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 107/326 (32%), Positives = 151/326 (46%), Gaps = 29/326 (8%)

Query: 19  YKFSDAYIDQINREANTWTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP-GDRKTY 76
           Y+ + +  +++   A TWT G N  P NL            AK  D     LP G     
Sbjct: 33  YEATISIAEKVRPLATTWTPGANPLPPNLYR--------TGAKREDLEKHRLPLGILVVK 84

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           D      +P+RFDAR++WP C ++  + + G C +    +A   F+DR CI S+ +    
Sbjct: 85  D---HIVLPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFS 141

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
                + SCC  C       C  G++   W F  +RG  +GG Y  R GC P  +  C  
Sbjct: 142 FGAYDLLSCCHSC----GDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHS 197

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
                  P          KC  +C +         D+    + Y V  +E+ IK+EI  +
Sbjct: 198 ADEDADTP----------KCTRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRN 247

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A+F +Y DF  YK+GVY+H          H+ K+IGWG ENGT YWL  N+WG  W
Sbjct: 248 GPVQASFDVYLDFKAYKTGVYRHVFGPMEGG--HAVKMIGWGVENGTKYWLCSNSWGEDW 305

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           G+RG  KI+RG+  C  E  + AG P
Sbjct: 306 GERGFFKIVRGENHCGIESDVHAGLP 331


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 159/345 (46%), Gaps = 41/345 (11%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYDPEY 80
           + +D+IN + NTWTA      +  ++  +   + DAK    + +    D   RK Y  E 
Sbjct: 85  SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLKRGSNDKVIRKGYAIEE 138

Query: 81  SATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
              +P  FDAR  +PNC   I H+ D   C +   F    AF+DR CIKS G     LS 
Sbjct: 139 LQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSA 198

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---GCQPSTISP 193
             + +C        +  C  G     W+++H +G  TGGDY    D T   GC P    P
Sbjct: 199 GEMNACAP------SFGCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPP 252

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTYWVDDNEDAI 249
           C+HH +    P C         C  +C NP Y      D+H         Y V+D ++AI
Sbjct: 253 CAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAI 312

Query: 250 KKE-----------ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           + +            +     +A+F +Y+DF  Y+SGVYKHTS  +L    H+ K+IGWG
Sbjct: 313 RTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGG--HAVKIIGWG 370

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            E G  YWLV+N+W   WGD G  KI  G   C  +  +  G PK
Sbjct: 371 EETGQAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTPK 413


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 55/350 (15%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS--DRPLPGDRKTYDPEYSAT 83
           +D IN+  N+W A  +  + L      +F + D K+ + S  D PL     T    Y   
Sbjct: 28  VDHINKIQNSWRAEYSPISELE----MKFKVMDLKFSEISPKDEPL-----TVQGVY--- 75

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VP  FDAR+ WPNC +I  + +   C A   F A    SDR CI+S G     +S E + 
Sbjct: 76  VPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISVEDIL 135

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC        + C  G       F    G VTGGDY + TGCQP T  PCS   ++ + 
Sbjct: 136 SCCG---SSCGEGCKGGYPLEGLKFWMNSGVVTGGDY-NGTGCQPYTFPPCSSCEASKST 191

Query: 204 PSCENQKVPKLKCHTRCTNPTY-----------------------------GRGFFQDKH 234
           PSC+       KC T     TY                             G+  ++   
Sbjct: 192 PSCQK------KCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLST 245

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
            T+      D    I+ EI  +GP   ++ +++DFY YKSGVY + S  KL    H+ K+
Sbjct: 246 TTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVS-GKLTG-AHAVKI 303

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           IGWGTEN   YWLV N+WG  +G++G  KI RG  EC  E  + AG  KN
Sbjct: 304 IGWGTENKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGLAKN 353


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 96/285 (33%), Positives = 137/285 (48%), Gaps = 13/285 (4%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           W +GR+     S+  +  F         ++ RP        D      +P  FDAR +WP
Sbjct: 42  WISGRHSKGFESDHLIHTFGAKMETAEQKAQRPTVKHVGFDD----TRLPKNFDARSKWP 97

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
           +C ++  + D  +C +   F AV A SDR CI S G  N+ LS   + SCCK C +    
Sbjct: 98  HCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLLSCCKDCGF---- 153

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C  G     W++    G VTGG   D +GC+      C HH      P C  Q  P  +
Sbjct: 154 GCRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQG-HYPPCPRQIYPTPE 212

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C   C  P    G+ +DK R  ++Y +  +E +I KEI+  GP  A F +Y+DF  YKS 
Sbjct: 213 CVQDCDTPEL--GYLEDKTRANISYNIYASEISIMKEIMLRGPVEAVFTVYEDFLQYKSR 270

Query: 276 VYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           VY H   A +    H+ +++GWG E   PYWL+ N+W   WG++G
Sbjct: 271 VYFHAWGAPMSG--HAIRILGWGEEGDVPYWLIANSWNEDWGEKG 313


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 90/256 (35%), Positives = 131/256 (51%), Gaps = 9/256 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR +WP+C +I  + D  +C +   F AV + SDR CI S G  N+ LS   + 
Sbjct: 51  LPKSFDARTKWPHCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLL 110

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC+ C       C  G     W+F    G VTGG   + +GC+      C H       
Sbjct: 111 SCCEDC----GLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKG-RY 165

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P C     P  +C  +C  P     + +DK R  ++Y V  ++ +I KEI+ +GP  A+F
Sbjct: 166 PPCPRHIYPTPECIKQCDEPEVN--YEKDKTRANISYNVYPSDISIMKEIMLNGPVEASF 223

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y DF  Y  GVY H     +    H+ +++GWG ++G PYWL+ N+W   WG++G V+
Sbjct: 224 GIYADFLEYNGGVYFHCWGGPISR--HAIRILGWGEDDGVPYWLIANSWNEDWGEKGYVR 281

Query: 324 ILRGKYECAFEYLIAA 339
            LRG  EC  E  + A
Sbjct: 282 FLRGHNECGIEEEVTA 297



 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 102/314 (32%), Positives = 135/314 (42%), Gaps = 62/314 (19%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR  WP+C +I  + D  +C +   F AV A SDR CI SKG  N+ LS   + 
Sbjct: 639 LPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLV 698

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C       C  G     W+F    G VTGG     TGC+      C H G     
Sbjct: 699 SCCTEC----GCGCRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRGKG-QY 753

Query: 204 PSCENQKVPKLKCHTRCTNPTYG------RGF-------FQDKH---------------- 234
           P C +Q  P  +C  RC            RGF         D+H                
Sbjct: 754 PPCPHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTL 813

Query: 235 ------------------------RTT--LTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
                                   R+T  ++Y V   E A+ KEI+  GP  A   +Y+D
Sbjct: 814 HLTCLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYED 873

Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
              YKSGVY H     L    H  +++GWG E+G PYWLV N+W   WG++G +++LR +
Sbjct: 874 LLDYKSGVYFHVWGGHLGE--HGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWR 931

Query: 329 YECAFEYLIAAGKP 342
            EC     + AG P
Sbjct: 932 NECGIVDQVTAGLP 945


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 107/284 (37%), Positives = 141/284 (49%), Gaps = 16/284 (5%)

Query: 67  RPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRR 125
            PLP   K +         D+FDARE +P C   IGHV D G C +   FA+  A +DR 
Sbjct: 222 EPLP--VKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRF 279

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RT 184
           CIKS G+    LS ++  SCC +  +  +  CS G     W +    G VTGGDY +  T
Sbjct: 280 CIKSGGRHREALSPQHTTSCCDLL-HCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHT 338

Query: 185 G--CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKHRTTLT 239
           G  C P  I  C HH   P  P CE       KC   C    Y    + F  D H  T  
Sbjct: 339 GKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+   D IK+E++ +G  T  F +Y+DF  YK GVY H +   +    H+ K+IG+G 
Sbjct: 398 YSVE-GRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG--HAVKVIGFGN 454

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           E+G  YWL +N+W  +WGD+GT KI  G  E   +     G+PK
Sbjct: 455 EDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 496


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 107/284 (37%), Positives = 141/284 (49%), Gaps = 16/284 (5%)

Query: 67  RPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRR 125
            PLP   K +         D+FDARE +P C   IGHV D G C +   FA+  A +DR 
Sbjct: 222 EPLP--VKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSCWAFASTEALNDRF 279

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RT 184
           CIKS G+    LS ++  SCC +  +  +  CS G     W +    G VTGGDY +  T
Sbjct: 280 CIKSGGRHREALSPQHTTSCCDLL-HCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHT 338

Query: 185 G--CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG---RGFFQDKHRTTLT 239
           G  C P  I  C HH   P  P CE       KC   C    Y    + F  D H  T  
Sbjct: 339 GKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+   D IK+E++ +G  T  F +Y+DF  YK GVY H +   +    H+ K+IG+G 
Sbjct: 398 YSVE-GRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG--HAVKVIGFGN 454

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           E+G  YWL +N+W  +WGD+GT KI  G  E   +     G+PK
Sbjct: 455 EDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 496


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 112/336 (33%), Positives = 157/336 (46%), Gaps = 20/336 (5%)

Query: 14  VRGELYKFSDAYIDQ-INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD 72
           +  E  K S + +   +N++   W A         E+  R   I   K  D+       D
Sbjct: 28  ISSEAIKLSGSDLTSYVNKKQKLWKA-ETSRMTFQEKMARAKSIKFIKSNDEVSEKTGND 86

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
               D      +P  FD+R++WP+C  IG V D   C +     AV   SDR CI S G 
Sbjct: 87  NVLVD------IPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNGT 140

Query: 133 QNRPLSTEYVASCC----KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
            N PLS +   SCC     IC   D   C          +    G  TGG+Y D+ GC+P
Sbjct: 141 FNWPLSAQDPLSCCVGLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKP 198

Query: 189 STISPCS-HHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNE 246
            +I PC   + +  T   C     P   C   CT N T+   + QDKH     Y V    
Sbjct: 199 YSIYPCDKKYANGTTSVPCPGYHTP--TCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKM 256

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
             I+ EI+ +GP  A+F +YDDF+ YK+G+Y HT+  + E  + + K+IGWG +NG PYW
Sbjct: 257 TDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYW 314

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L ++ WG  +G+ G V+ LRG  E   E+ + A  P
Sbjct: 315 LCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 93/251 (37%), Positives = 129/251 (51%), Gaps = 9/251 (3%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           RKT D  Y   +P  FDAR+ + +C   IG V D G CA+    A    F+DR CI S G
Sbjct: 17  RKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNG 76

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
           +    LS + + SC      D+   C  GS ++ W F   +G VTGG Y    GCQP   
Sbjct: 77  KFTDNLSAQNLMSCGD----DEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 132

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAIK 250
            PC H+G +        ++   + C  +C N  Y   +  D ++T++ Y     N   I+
Sbjct: 133 RPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQ 192

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
           +EI+ +GP TA   +Y++F  YK GVYK T+  +L  Y H  KLIGWG  E G  YWL +
Sbjct: 193 QEIMTYGPVTAFMYVYENFMGYKEGVYKSTA-GELIGYHHV-KLIGWGVDEAGIEYWLAM 250

Query: 310 NTWGPHWGDRG 320
           N+W  +WG  G
Sbjct: 251 NSWNSNWGTNG 261


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 102/270 (37%), Positives = 138/270 (51%), Gaps = 12/270 (4%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E    +P  FD+R+QWP C  IG V D   C +     AV   SDR CI S G  N PLS
Sbjct: 86  EVLINIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLS 145

Query: 139 TEYVASCC----KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
            +   SCC     IC   D   C          +    G  TGG+Y D+ GC+P +I PC
Sbjct: 146 AQDPLSCCVGLMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPC 203

Query: 195 S-HHGSAPTLPSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             ++ +  T   C     P   C   CT N T+   + QDKH     Y V      I+ E
Sbjct: 204 DKNYPNGTTSVPCPGYHTP--PCEDHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTE 261

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I+ +GP  A+F +Y+DF+ YKSG+Y HT+  + E  + + K+IGWG +NG PYWL ++ W
Sbjct: 262 IMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDNGVPYWLCVHQW 319

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  +G+ G V+ILRG  E   E+ + A  P
Sbjct: 320 GTDFGENGFVRILRGVNEVNIEHQVLAALP 349


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 155/307 (50%), Gaps = 28/307 (9%)

Query: 60  KYFDQSDRPLPGDRKTYDPEYSAT---VPDRFDAREQWPNCGTI-GHVPDTGACAAPHIF 115
           +Y  +    +PG R+      S++   +P  FDARE +P C +I G V D   C +   F
Sbjct: 253 RYTKEIAPAVPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCWAF 312

Query: 116 AAVGAFSDRRCIKSKGQQNRP-------------LSTEYVASCCKICRYDDNKSCSHGSV 162
           A+  AF+DRRCI   G+++               LS E   +CC       +  C+ G  
Sbjct: 313 ASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQP 372

Query: 163 FRTWNFLHKRGSVTGGDYGD---RTGCQPSTISPCSHH--GSAPTLPSCENQKVPKLKCH 217
              W +  K G VTGGDY D    T C+P    PC+HH    A   P+C + + P  +C 
Sbjct: 373 GSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECL 432

Query: 218 TRCTNPTYGRGFF-QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
           + C+   +  G + +DK      Y +   E+ I+++++ +G  TA F+++ DF  Y  GV
Sbjct: 433 SECSETNFSGGSYGEDKKMAREAYSLAGIEN-IQRDMMKYGSVTAAFSVFSDFLTYSGGV 491

Query: 277 YKHTSNAKLENYLHSGKLIGWGTE--NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           Y H S + +    H+ K+IGWGT+  +G  YWL+ N+W P WG+ G  +ILRG  EC  E
Sbjct: 492 YTHESGSFMGG--HAVKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRILRGVNECGIE 549

Query: 335 YLIAAGK 341
             I AG+
Sbjct: 550 GQIVAGE 556


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 101/312 (32%), Positives = 151/312 (48%), Gaps = 18/312 (5%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF-DQSDRPLPGDRKTYDPEYSA 82
           A++D IN   + + A  +  A    E   +  I D+K+  +Q    +  D    DP    
Sbjct: 37  AFVDYINEHQSFYRAEYSPEA----EAFVKARIMDSKFLAEQKKEEVLADVYGDDP---- 88

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
             PD FDAR QWP C +IG + D  AC +    ++  A SD  C++S       +S   +
Sbjct: 89  --PDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDI 146

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC +   D    C  G     + ++ + G VTGG Y  R  C+P +  PC  H   P 
Sbjct: 147 LSCCGL---DCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPY 203

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C     P  KC  + +   Y + + +DKH  T +Y + +NE +I++EI  +GP  A 
Sbjct: 204 YGPCPGGLWPTPKCR-KSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAA 262

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+D Y    G+Y H     ++   H+ K+IGWG ENGT YWL+ N+W   WG+ G  
Sbjct: 263 FKVYED-YSSTGGIYVHKWG--IQTGAHADKVIGWGRENGTDYWLIANSWNTDWGEDGYY 319

Query: 323 KILRGKYECAFE 334
           +I+R    C  E
Sbjct: 320 RIVRETDNCEIE 331


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 150/320 (46%), Gaps = 16/320 (5%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           AY+D +N+  + + A     + L E+Y +   +  +++  + ++    +    D + +  
Sbjct: 38  AYVDYVNQHQSFYKAEY---SPLVEQYAKA--VMRSEFMTKPNQ----NYVVKDVDLNIN 88

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDARE+WPNC +I  + D   C +    +A    SDR CI+S G      S   + 
Sbjct: 89  LPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDIL 148

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C       C  G  F  + F    G  TGG + +   C+P    PC  H +    
Sbjct: 149 SCCWNC----GMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYF 204

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C  +  P  KC   C    Y   +  DK      Y + +NE  I +EI  +GP   +F
Sbjct: 205 GPCPKELWPTPKCRKMC-QLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSF 263

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
           +++ DF  YK GVY   SN   +N  H+ K+IGWG ++G  YWL+ N+W   WGD G V+
Sbjct: 264 SVFADFAIYKKGVY--VSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWNNDWGDEGYVR 321

Query: 324 ILRGKYECAFEYLIAAGKPK 343
            LRG   C  E  +  G  K
Sbjct: 322 FLRGDNHCGIESRVVTGTMK 341


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/321 (30%), Positives = 152/321 (47%), Gaps = 22/321 (6%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
            A++D IN     + A   +  N   E   +  I D+K+  +     P   +     +  
Sbjct: 35  QAFVDYINEHQPFYRA--EYSPNA--EAFVKARIMDSKFLVE-----PKKEEVLTEVFGD 85

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
             PD FDAR  WP C +IG + D  AC +    ++  A SD+ C++S       +S   +
Sbjct: 86  DPPDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDI 145

Query: 143 ASCCKICRYDDNKSCSHGSV---FRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
            SCC I       SC +G        + ++ +   VTGG Y  +  C+P    PC +H +
Sbjct: 146 LSCCGI-------SCGYGCEVLPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTN 198

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
                 C     P  KC   C    Y + + +DK+  T +Y++  NE +I++EI  +GP 
Sbjct: 199 ERYYGPCPRGLWPTPKCRKACQR-KYNKSYNEDKYFATRSYYLPSNERSIREEIYKNGPV 257

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A F +Y DF +Y+ G+Y H    +     H+ K++GWG ENGT YWL+ N+W   WG+ 
Sbjct: 258 VAAFKVYQDFSYYRGGIYVHKWGGQTG--AHAVKVVGWGRENGTDYWLIANSWNTDWGEN 315

Query: 320 GTVKILRGKYECAFEYLIAAG 340
           G  +I RG  EC  E  + +G
Sbjct: 316 GYFRIARGSNECGIEGQMVSG 336


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 96/266 (36%), Positives = 128/266 (48%), Gaps = 15/266 (5%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P+ FDAR  W NC +I H+ + G CAA    +   A +DR CI S+G      S + + S
Sbjct: 97  PESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQKLVS 156

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC+ C       CS G     W ++ K+G VTGGDYG   GCQP  + PC+   +A    
Sbjct: 157 CCEDC----GNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPS 212

Query: 205 S-------CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           S       C        KC   C N  +   +  D  +    +  D    + +K +  HG
Sbjct: 213 SVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGC--SARKNLRKHG 270

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P   T  +Y+DF  YKSGVY H +   L   L S ++IGWG E G  +WL+ N+WG  WG
Sbjct: 271 PYVVTMRVYEDFLAYKSGVYHHVTGDYLG--LLSVRMIGWGLEGGQAFWLLANSWGTSWG 328

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           D+G  KI R   EC  E    AG P 
Sbjct: 329 DKGFFKIRRFVNECWIENFRYAGVPN 354


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 88/236 (37%), Positives = 123/236 (52%), Gaps = 7/236 (2%)

Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWN 167
           +C +     AV A +DR CI SKG Q   +S + + SCC  C +     C     +  W+
Sbjct: 4   SCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGF----GCDGRDPYAAWS 59

Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR 227
           +    G VTG +Y  ++GC+P    PC HH        C     P   C  +C +  Y  
Sbjct: 60  YWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQD-GYSI 118

Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
            +  DKH     Y V  +  +I+KEI+ +GP    F +Y+DF HY SG+YKHT+   L  
Sbjct: 119 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 178

Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             H+ K++GWGTENGT YW+  N+W   WG+ G  +ILRG  EC  E  + AG+PK
Sbjct: 179 --HAVKMLGWGTENGTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEPK 232


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 100/266 (37%), Positives = 133/266 (50%), Gaps = 11/266 (4%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E    +P+ FDARE W +C +I  + D   C +   F A  A SDR CI +KG+    +S
Sbjct: 20  EIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNIS 79

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + +CC  C       C  G     W++    G VTGG YG   GCQP    PC HH 
Sbjct: 80  AQDLLTCCHQC----GMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEHHT 135

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P LP+C + K P  KC   C    Y + + +DK+     Y +  +E  IK EI  +GP
Sbjct: 136 KGP-LPNCTDTK-PTPKCLQVCRK-GYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGP 192

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             A F++Y DF  YKSGVY+  S    E +    + +GW  +  +  WLV N+W   WGD
Sbjct: 193 VEADFSVYTDFLAYKSGVYQRHS---YELWEARHQNLGWALKRRS-VWLVANSWNQDWGD 248

Query: 319 RGTVKILRGKYECAFEYLIAAGKPKN 344
           +G  KI RG  EC  E  I AG PK 
Sbjct: 249 KGYFKIRRGNNECGIENDINAGIPKE 274


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 160/352 (45%), Gaps = 35/352 (9%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ +IN +A   WTA  +     +  S E 
Sbjct: 11  LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 69

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P    + D E    +P+ FDA E WP C TI  + D   C 
Sbjct: 70  VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC +     C  G     W +  
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C N      KC+T C          
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K++   +Y V   E  +  E++ +GP   T  +Y DF  YKSGVYKH S   L    H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 285

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KL+GWGT+ G PYW + N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 126/244 (51%), Gaps = 10/244 (4%)

Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSH 159
           G  P +  C     F AV A SDR CI +    +  +S E + +CC  +C       C+ 
Sbjct: 3   GAGPLSIPCRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC----GDGCNG 58

Query: 160 GSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR 219
           G     WNF  ++G V+GG Y    GC+P +I PC HH +    P       PK  C   
Sbjct: 59  GYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CSKI 116

Query: 220 CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH 279
           C  P Y   + QDKH    +Y V ++E  I  EI  +GP    F++Y DF  YKSGVY+H
Sbjct: 117 C-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH 175

Query: 280 TSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  + A
Sbjct: 176 VTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 233

Query: 340 GKPK 343
           G P+
Sbjct: 234 GIPR 237


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ +IN +A   WTA  +        S E 
Sbjct: 11  LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVTGKSLEE 69

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P    + D E    +P+ FDA E WP C TI  + D   C 
Sbjct: 70  VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC +     C  G     W +  
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C N      KC+T C          
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K++   +Y V   E  +  E++ +GP   T  +Y DF  YKSGVYKH S   L    H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 285

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KL+GWGT+ G PYW + N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 160/345 (46%), Gaps = 23/345 (6%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAK 60
           I+ L++L G  L +  +   SD  I  IN   +  W A +        +    F     +
Sbjct: 8   IYFLIYLNGYNLKQFNI--LSDELIQYINNYPSAGWKASKQNRFKSISDVYNTFGYYGIR 65

Query: 61  YFDQSDRPLPGDRKTYDPE-YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
           +F +      G   T   E  +  +PD FD+REQW +C +I  + D   C +    A+  
Sbjct: 66  HFRK------GILSTISHEDENIQLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAA 119

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           + SDR CI++ G     LS   + SC K     +   C  G    +W++  K G VTG  
Sbjct: 120 SISDRTCIQTNGTMKVQLSAIELISCSK-----NKLGCQIGFSEFSWDYWLKNGLVTG-- 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
             D TGC P     C H  S+ + P C         C   C +  Y   +  DKH   + 
Sbjct: 173 --DPTGCLPYPFPKCDHR-SSNSYPKCGYITYTAPPCTKTCRS-GYPIPYKADKHYGRVI 228

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +  NE  I+KEI+ +GP  A   ++ DF +YKSGVY+H +   +   +HS ++IGWG 
Sbjct: 229 YSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVT--IHSVRIIGWGI 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           EN  PYWL  N+W   WG  G  KILRG  EC  E  + AGK  N
Sbjct: 287 ENDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDN 331


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 160/351 (45%), Gaps = 33/351 (9%)

Query: 1   MIHILVFLLGCTLVR-----GELYKFSDAYIDQINREAN-TWTAGRN---FPANLSEEYL 51
           ++ + V LL  T+        ++     +++ + N +A   WTA  +        S E +
Sbjct: 11  LVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLEEV 70

Query: 52  RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
           R+ +   +     S   +P  R     E    +P+ FDA E+WP C TIG + D   C +
Sbjct: 71  RKLMGVTS----MSTEAVP-PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGS 125

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
               AAV A SDR C  S G  +R +ST  + SCC IC +     C  G     W +   
Sbjct: 126 CWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGF----GCYGGIPAMAWLWWVW 180

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
            G  T         CQP    PCSHHG++   P C N      KC+T C N         
Sbjct: 181 VGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCDNVE----MEL 229

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
            K++   +Y +   E  +  E++ +GP      +Y DF  YKSGVYKH S   L    H+
Sbjct: 230 VKYKGVSSYSIK-GERELDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGG--HA 286

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            KL+GWG ++G PYW + N+W   WGD+G   I RG  EC  E    AGKP
Sbjct: 287 VKLVGWGVKDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 160/351 (45%), Gaps = 33/351 (9%)

Query: 1   MIHILVFLLGCTLVR-----GELYKFSDAYIDQINREAN-TWTAGRN---FPANLSEEYL 51
           ++ + V LL  T+        ++     +++ + N +A   WTA  +        S E +
Sbjct: 11  LVAVFVVLLATTVSALYAKPSDIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLEEV 70

Query: 52  RQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
           R+ +   +     S   +P  R     E    +P+ FDA E+WP C TIG + D   C +
Sbjct: 71  RKLMGVTS----MSTEAVP-PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGS 125

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
               AAV A SDR C  S G  +R +ST  + SCC IC +     C  G     W +   
Sbjct: 126 CWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGF----GCYGGIPAMAWLWWVW 180

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
            G  T         CQP    PCSHHG++   P C N      KC+T C N         
Sbjct: 181 VGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKCNTTCDNVE----MEL 229

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
            K++   +Y +   E  +  E++ +GP      +Y DF  YKSGVYKH S   L    H+
Sbjct: 230 VKYKGVSSYSIK-GERELMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGG--HA 286

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            KL+GWG ++G PYW + N+W   WGD+G   I RG  EC  E    AGKP
Sbjct: 287 VKLVGWGVKDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/265 (39%), Positives = 133/265 (50%), Gaps = 12/265 (4%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +PD FD+R +WPNC TI  + D G+C A   FAA  A SDR CI S   ++   S  
Sbjct: 48  SENLPDEFDSRVRWPNCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSAL 107

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC  C     K C        W+   K G V+GG YG + GCQP  + PC HH + 
Sbjct: 108 NLLSCCDSCE----KGCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYHLPPCEHHRAG 163

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV-DDNEDAIKKEILAHGPT 259
           P     +    P      R   P Y   +  D H     Y +   NE  I+ EI  +GP 
Sbjct: 164 PRRNCTKYGPTPSC---ARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPV 220

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--NGTPYWLVINTWGPHWG 317
            AT A Y+DFY Y+SG+Y H     + +  H+ K+IGWGT+    TPYWLV N++   WG
Sbjct: 221 EATMAAYEDFYTYESGIYHHIEGTFVCD--HAVKIIGWGTDKKTNTPYWLVANSFNTDWG 278

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           + G  KI RG  EC  E  I AG P
Sbjct: 279 EYGFFKIKRGVNECGIENKITAGIP 303


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 153/307 (49%), Gaps = 21/307 (6%)

Query: 42  FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           F A  S E +   RQFL+   ++ ++S   +  LP    T + +    +P+ FD+RE+W 
Sbjct: 53  FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPIANITSNDD----IPESFDSREKWK 107

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
           +C ++  +PD   C +    +A    SDR CI S+G++   LS   + +CC K C Y   
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
             C  G   R W +    G VTGG Y ++  C+P     C  H       +C +      
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAH-KGKAFNNCPSHPYATP 222

Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
            C   C    YG+ +  DK +    YW+ ++E  I+ EI+  GP  ATF +Y+DF HY+ 
Sbjct: 223 ACKPYCQY-GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEG 281

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGKYECAF 333
           GVY HT+ A      HS K+IGWG + G  YWL+ N+W   WG D G  +++RG   C  
Sbjct: 282 GVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDI 339

Query: 334 EYLIAAG 340
           E  + AG
Sbjct: 340 EGGVLAG 346


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/320 (33%), Positives = 153/320 (47%), Gaps = 18/320 (5%)

Query: 29  INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
           +N     W A         E+  R   + D K+    +  + GD +  + +    +P  F
Sbjct: 45  VNNHQKLWKA-ETSRMTFQEKMAR---VKDIKFIKSHEDQMVGDSE--NNQVLLDIPTYF 98

Query: 89  DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-- 146
           D+R++WP C  IG V D   C +     AV   SDR CI S G  N PLS +   SCC  
Sbjct: 99  DSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPLSCCVG 158

Query: 147 --KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS-HHGSAPTL 203
              IC   D   C          +    G  TGG+Y D+ GC+P +I PC   + +  T 
Sbjct: 159 LMSIC--GDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTS 216

Query: 204 PSCENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             C     P   C   CT N T+   + QDKH     Y V      I+ EI+ +GP  A+
Sbjct: 217 VPCPGYHTP--TCEEHCTSNITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIAS 274

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +YDDF+ YKSG+Y HT+  + E  + + K+IGWG ++G PYWL ++ WG  +G+ G V
Sbjct: 275 FVIYDDFWDYKSGIYVHTAGDQ-EGGMDT-KIIGWGVDSGVPYWLCVHQWGTDFGENGFV 332

Query: 323 KILRGKYECAFEYLIAAGKP 342
           + LRG  E   E+ + A  P
Sbjct: 333 RFLRGVNEVNIEHQVLAALP 352


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ +IN +A   WTA  +     +  S E 
Sbjct: 16  LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 74

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P  R     E    +P+ FDA E WP C TI  + D   C 
Sbjct: 75  VRKLM----GVTDMSTEAVP-PRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 129

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC +     C  G     W +  
Sbjct: 130 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 184

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C N      KC+T C          
Sbjct: 185 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 234

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K++   +Y V   E  +  E++ +GP   T  +Y DF  YKSGVYKH S   L    H
Sbjct: 235 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGG--H 290

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KL+GWGT+ G PYW + N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 291 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 342


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 152/307 (49%), Gaps = 21/307 (6%)

Query: 42  FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           F A  S E +   RQFL+   ++ ++S   +  LP    T + +    +P+ FD+RE+W 
Sbjct: 53  FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPVANITSNDD----IPESFDSREKWK 107

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
           +C ++  +PD   C +    +A    SDR CI S+G++   LS   + +CC K C Y   
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
             C  G   R W +    G VTGG Y ++  C+P     C  H       +C +      
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAH-KGKAFNNCPSHPYATP 222

Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
            C   C    YG+ +  DK +    YW+ ++E  I+ EI+  GP  ATF +Y+DF HY  
Sbjct: 223 ACKPYCQY-GYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNG 281

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGKYECAF 333
           GVY HT+ A      HS K+IGWG + G  YWL+ N+W   WG D G  +++RG   C  
Sbjct: 282 GVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDI 339

Query: 334 EYLIAAG 340
           E  + AG
Sbjct: 340 EGGVLAG 346


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 157/345 (45%), Gaps = 45/345 (13%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
           L+G       L    +  I  +N   N  WTAG N + AN + E  +  L          
Sbjct: 28  LVGAAKAEHSLGIIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHIL---------G 78

Query: 66  DRPLPGDRKTYDP----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            +P P       P      SA +P  FDAR QW +C TIG++ D G C A   FAAV + 
Sbjct: 79  VKPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESL 138

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--G 178
            DR CI      +  LS   + +CC  +C       C+ G     W +  + G VT    
Sbjct: 139 QDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEECD 192

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y D+TGCQ        H G  P  P+         KCH +C      + + ++KH +  
Sbjct: 193 PYFDQTGCQ--------HPGCEPAYPT--------PKCHRKCK--VENQVWKKNKHFSVN 234

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y V  N   I  E+  +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWG
Sbjct: 235 AYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWG 292

Query: 299 TEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           T + G  YWL+ N W   WGD G  KI+RGK EC  E  + AG P
Sbjct: 293 TSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMP 337


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 105/290 (36%), Positives = 139/290 (47%), Gaps = 17/290 (5%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S   I  IN EANT W AG         +  R          +Q +    G   T +  
Sbjct: 36  LSKELIHFINYEANTTWKAGPTRRFKTVSDIRRMLGALPDPNGEQLETLCTGYELTLN-- 93

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR++W +C +I  + D  +C +   F AV A SDR CI+SKG+    LS 
Sbjct: 94  ---ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSA 150

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SCC  C       C+ G     W +   +G VTG  Y    GCQP    PC HH  
Sbjct: 151 ENLVSCCSSC----GMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTL 206

Query: 200 APTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            P LP C+ + + P  K   R     Y   +  DK    + Y V  N++AI KE++ HGP
Sbjct: 207 GP-LPVCDGDVETPPCK---RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGP 262

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
               F +Y DF +YKSGVY+H S A L    H+ +L+GWG EN  PYWL+
Sbjct: 263 VEVDFEVYADFPNYKSGVYQHVSGALLGG--HAVRLLGWGEENNVPYWLI 310


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 82/188 (43%), Positives = 107/188 (56%), Gaps = 3/188 (1%)

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
           SC  G   + W +  K G VTGG Y  + GC+P +I+PC    +  T P C     P  K
Sbjct: 13  SCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPK 72

Query: 216 CHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
           C   CT N TY  G+ QDKH     Y V    + I+ EILAHGP    F +Y+DFY Y +
Sbjct: 73  CVEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTT 132

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           GVY HT+   L    H+ K++GWG +NGTPYWLV N+W  +WG++G  +I+RG  EC  E
Sbjct: 133 GVYVHTAGKSLGG--HAVKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIE 190

Query: 335 YLIAAGKP 342
           +   AG P
Sbjct: 191 HSAVAGLP 198


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 169/373 (45%), Gaps = 76/373 (20%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
           + +D+IN + NTWTA      +  ++  +   + DAK          +D+ +   +K Y 
Sbjct: 480 SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLMRGSNDKAI---KKGYA 530

Query: 78  PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            E    +P  FDAR  +PNC   IGH+ D  AC +   F    AF+DR CIKS G     
Sbjct: 531 IEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTEL 590

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRT---GCQPST 190
           LS   + +C        +  C+ G     W+++H +G  TGGDY    D T   GC P  
Sbjct: 591 LSAGEMNACAP------SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYD 644

Query: 191 ISPCSHHGSAPTLP-----SCENQKVPKL----------------KCHTRCTNPTYGRGF 229
             PC+HH +    P     SC  +  P                   C  +C NP Y    
Sbjct: 645 FPPCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTL 704

Query: 230 FQDKH----RTTLTYWVDDNEDAIKKEILAHGPT---------------TATFALYDDFY 270
             D+H     +   Y V+D ++AI+ +    GP                +A+F++Y+DF 
Sbjct: 705 RDDRHFMLESSPYQYSVNDAKNAIRTD----GPVGPIYFCDPNVNFDQVSASFSVYEDFL 760

Query: 271 HYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
            YKSGVYKHTS   L    H+ K+IGWG E+G  YW+V+N+W   WGD G  KI  G   
Sbjct: 761 AYKSGVYKHTSGEYLGG--HAVKIIGWGEESGQAYWIVVNSWNEDWGDHGLFKIALGN-- 816

Query: 331 CAFEYLIAAGKPK 343
           C  +  +  G PK
Sbjct: 817 CGIDDNLLGGTPK 829


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/268 (33%), Positives = 134/268 (50%), Gaps = 14/268 (5%)

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGAC 109
           Y  +  I D  +  ++  P+ GD      +    +P+ FDAR  WPNC ++ H+ D   C
Sbjct: 64  YDIEHRIMDLSFIGENREPIVGDEN----DEGDDIPESFDARTHWPNCSSLTHIRDQANC 119

Query: 110 AAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFL 169
            +    +   A SDR CI + G +   +S   + +CC  C Y     C  G     W ++
Sbjct: 120 GSCWAVSTAAALSDRICISTNGTKQVNISATDILTCCYKCGY----GCQGGWPIEAWEYV 175

Query: 170 HKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRG 228
            + G+VTGG    ++ C+     PC HHG+      C  + + P  KC T CT P Y   
Sbjct: 176 AREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRARTP--KCRTSCT-PGYKNS 232

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
           +  DK R    Y + ++  AI++EI+ +GP  A F +Y DF +YK G+YKHT+     + 
Sbjct: 233 YSDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGS- 291

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHW 316
            H+ K+IGWG E   PYW+V N+W   W
Sbjct: 292 -HAVKVIGWGEEGDVPYWIVKNSWHNDW 318


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 153/328 (46%), Gaps = 17/328 (5%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           L+K +       N   +TW AG NF  +L         +       + D  L     TYD
Sbjct: 24  LFKVNQIIQLVNNIPKHTWKAGINFHPSLLTNVSHLMGVVPWNKLSEKDILL-----TYD 78

Query: 78  PEYSA-TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
                 ++P+ +D  + W  C ++  + D   C +    +   AFSDR CI S    N+ 
Sbjct: 79  VSIDLESLPESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLCITSNMGVNKV 138

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS EY+ SCC     +      H    + W ++ K G  TGG+YG   GCQP +I PC  
Sbjct: 139 LSGEYINSCCNGKCGNGCNG-GHPE--KAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPR 195

Query: 197 HGSAPTLPSCENQKVPKLKCHT-RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           + ++    S EN+  P+  C+  +CTN  Y      D +     Y V    + I  E+  
Sbjct: 196 NANSC---SKENEDTPQ--CYKDQCTNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFK 250

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A   +YDDF  YK G+Y++T+     +  H+ K++GWG ++G  YWL  NTWG  
Sbjct: 251 NGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGD--HAVKIMGWGEDDGIDYWLCANTWGNS 308

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG  G  KI RG+ EC  E  I  G PK
Sbjct: 309 WGMGGMFKIRRGRNECGIENRITGGLPK 336


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 156/343 (45%), Gaps = 32/343 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+A+   
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   
Sbjct: 67  GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C     QQ R +S  ++ SCCK C Y     C  G     W +    G  +   
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLLSCCKDCGY----GCDGGYPDAAWRYYVSHGLAS--- 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C HHG     P C        KC+T CT+    +     K+R   +
Sbjct: 178 ----SYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIKYRGNHS 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V   ED  K+E+  +GP    F +Y DF+ YK+GVY+H S   L    H+ +++GWG 
Sbjct: 230 YEVHGEED-YKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGG--HAVRIVGWGK 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            NGTPYW + N+W   WG  G   ILRGK EC  E+   AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSP 329


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 108/332 (32%), Positives = 160/332 (48%), Gaps = 47/332 (14%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF------DQSDRPLPGDRKTYD 77
           + +D+IN + NTWTA      +  ++  +   + DAK          +D+ +   +K Y 
Sbjct: 85  SLVDEINSKQNTWTA------STGQKRFKNLSLRDAKMLCGTLMRGSNDKAV---KKGYA 135

Query: 78  PEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
            E    +P  FDAR  +PNC   IGH+ D  AC +   F    AF+DR CIKS G     
Sbjct: 136 IEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAFTEL 195

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS   + +C           C  G  +  W+++H +G  TG       G +P  +S    
Sbjct: 196 LSAGEMNACTLFF------GCGGGDPYSAWSWVHDKGIATG------EGSRPKRVS---- 239

Query: 197 HGSAPTLPSCENQKV-PKLKCHTRCTNPTYGRGFFQDKHRTTLT----YWVDDNEDAIKK 251
              +  +P    Q + P   C  +C NP Y      D+H    +    Y V+D ++AI+ 
Sbjct: 240 --ESEAIPVIAYQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRT 297

Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
           +    GP +A+F +Y+DF  YKSGVYKHTS + L    H+ K+IGWG ++G  YWL +N+
Sbjct: 298 D----GPVSASFTVYEDFLAYKSGVYKHTSGSYLGG--HAVKIIGWGEKSGQAYWLAVNS 351

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           W   WGD+G  KI  G   C  +  +  G PK
Sbjct: 352 WNEDWGDKGLFKIALGN--CGIDDDLLGGTPK 381


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 103/273 (37%), Positives = 137/273 (50%), Gaps = 24/273 (8%)

Query: 74  KTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           KT +P+Y     D   FDAR++WP C TIG V + G  A    +AA G  +DR CI + G
Sbjct: 60  KTRNPKYVIDNRDYKEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNG 119

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
             N+ LSTE + SC  I   + N + +  S+   W +L   G V+GG Y    GCQP   
Sbjct: 120 GYNKLLSTEELISCSGI--KETNGNVNERSI---WEYLKSHGVVSGGKYNSNDGCQPFKF 174

Query: 192 SPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH-RTTLTYWVDDNEDAIK 250
            P      A  L   +         HT C +  YG       H    +  +       I+
Sbjct: 175 PPI-----ANILTHLQ---------HT-CDDHCYGNTSINYNHDHVRVRNYYTIRTGYIQ 219

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
           KE+  +GP    F + DDF  YKSGVY  + NAK+    ++ KLIGWG ENG  YWLVIN
Sbjct: 220 KEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNAKVIRTQYA-KLIGWGVENGVDYWLVIN 278

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +WG  WG +G  KI RG  +C  E ++ AG P+
Sbjct: 279 SWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPE 311


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 9/245 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+RE W NC +I ++ D   C +    +A    SDR C++SKG+  + +S   + 
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
           +CC     +  + C+ G   + W ++ + G VTGG Y ++  C+P  + PC +HG    +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P   + + P   C   C    YG+ + +DK      Y +D++E AI++E++ +GP  A 
Sbjct: 212 CPRDHSFRTPA--CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F  Y+DF  Y  G+Y HT     +   H+ K++GWG ENGT YW V N+W   WG+ G  
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGENGYF 326

Query: 323 KILRG 327
           +ILRG
Sbjct: 327 RILRG 331


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 159/352 (45%), Gaps = 35/352 (9%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREA-NTWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ +IN +A   WTA  +     +  S E 
Sbjct: 11  LVAVFAVLLA-TTVSGLYAKPSDFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLEE 69

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P    + D E    +P+ FDA E WP C TI  + D   C 
Sbjct: 70  VRKLM----GVTDMSTEAVPPRNFSVD-EMQQDLPEFFDAAEHWPMCVTISEIRDQSNCG 124

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC +     C  G     W +  
Sbjct: 125 SCWAIAAVEAISDRYCTLG-GVPDRRISTSNLLSCCFICGF----GCYGGIPTMAWLWWV 179

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C N      KC+T C          
Sbjct: 180 WVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEMDL--- 229

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K++   +Y V   E  +  E++ +GP   T  +Y DF  YKSG YKH S   L    H
Sbjct: 230 -VKYKGGTSYSVK-GEKELMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGG--H 285

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KL+GWGT+ G PYW + N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 286 AVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 94/222 (42%), Positives = 117/222 (52%), Gaps = 9/222 (4%)

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           SDR CI +KG+    +S E + +CC  C       C+ G     W F    G VTGG YG
Sbjct: 1   SDRICIHTKGKVQVNISAEDLLTCCDSC----GSGCNGGYPSAAWQFYKDEGIVTGGLYG 56

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GCQP    PC HH   P LP+C   K P  +C   C    Y + + +DKH     Y 
Sbjct: 57  TEDGCQPYYFPPCEHHTVGP-LPNCTGIK-PTPECAKTCRE-GYEKSYTRDKHFGKKVYS 113

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           +  +E  IK EI  +GP  A F +Y DF  YKSGVY+  S   L    H+ +++GWGTE+
Sbjct: 114 ISSDETQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGG--HAIRILGWGTED 171

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G PYWLV N+W   WGD+G  KI RG  EC  E  I AG PK
Sbjct: 172 GVPYWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAGIPK 213


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 84/245 (34%), Positives = 134/245 (54%), Gaps = 9/245 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+RE W NC +I ++ D   C +    +A    SDR C++SKG+  + +S   + 
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
           +CC     +  + C+ G   + W ++ + G VTGG Y ++  C+P  + PC +HG    +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P   + + P   C   C    YG+ + +DK      Y +D++E AI++E++ +GP  A 
Sbjct: 212 CPRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F  Y+DF  Y  G+Y HT     +   H+ K++GWG ENGT YW V N+W   WG+ G  
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYF 326

Query: 323 KILRG 327
           +ILRG
Sbjct: 327 RILRG 331


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 118/345 (34%), Positives = 156/345 (45%), Gaps = 45/345 (13%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
           L+G       L    +  I  +N   N  WTAG N + AN + E  +  L          
Sbjct: 28  LVGAAKAEHSLGIIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHIL---------G 78

Query: 66  DRPLPGDRKTYDP----EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            +P P       P      SA +P  FDAR QW +C TIG++ D G C A   FAAV + 
Sbjct: 79  VKPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESL 138

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--G 178
            DR CI      +  LS   + +CC  +C       C+ G     W +  + G VT    
Sbjct: 139 QDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEECD 192

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
            Y D+TGCQ        H G  P  P+         KCH +C      + + ++KH +  
Sbjct: 193 PYFDQTGCQ--------HPGCEPAYPT--------PKCHRKCK--VENQVWKKNKHSSVN 234

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
            Y V  N   I  E+  +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWG
Sbjct: 235 AYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWG 292

Query: 299 TEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           T + G  YWL+ N W   WG  G  KI+RGK EC  E  + AG P
Sbjct: 293 TSDAGEDYWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMP 337


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 103/275 (37%), Positives = 138/275 (50%), Gaps = 26/275 (9%)

Query: 74  KTYDPEYSATVPD--RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           KT +P+Y     D   FDAR++WP C TIG V + G  A    +A  G  +DR CI + G
Sbjct: 53  KTRNPKYVIDNRDYKEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNG 112

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
             N+ LSTE + SC  I + ++    S  S+   W +L   G V+GG Y    GCQP   
Sbjct: 113 GYNKLLSTEELISCSGI-KENNGSVPSERSI---WEYLKSHGVVSGGKYNSNDGCQPFKF 168

Query: 192 SPCSHHGSAPTLPSCENQKVPK-LKCHTRCTNPTYGRGF--FQDKHRTTLTYWVDDNEDA 248
            P ++              +PK L  HT C +  YG     +   H     Y+     D 
Sbjct: 169 PPIAN--------------IPKHLHKHT-CDDHCYGNSTINYNHDHVRVRNYYTIRTRD- 212

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
           I+KE+  +GP    F + DDF+ YKSGVY  +  AK     ++ KLIGWG ENG  YWLV
Sbjct: 213 IQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKSDKAKGIRTQYA-KLIGWGVENGVDYWLV 271

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           IN+WG  WG +G  KI  G  +C  E  + AG P+
Sbjct: 272 INSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 9/221 (4%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+R QWP C TI  + D G+C +   F AV A SDR CI SKG+ N  +S E + 
Sbjct: 13  LPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLL 72

Query: 144 SCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC + C +     C+ G     WNF  + G V+GG +    GC+P TI PC HH +  +
Sbjct: 73  SCCGMECGF----GCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNG-S 127

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            PSC  ++    KC  +C    Y   +F+DKH  + +Y V  NE  I+ EI  +GP    
Sbjct: 128 RPSCTGEEGDTPKCVMQC-EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGA 186

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           F +Y+DF  YKSGVYKH +   +    H+ +++GWG E+GT
Sbjct: 187 FTVYEDFLQYKSGVYKHVTGDAVGG--HAIRILGWGVESGT 225


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 151/317 (47%), Gaps = 42/317 (13%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVP 85
           I +IN    +W AG N   ++        L  D  Y  Q          T   + + ++P
Sbjct: 29  IQEINSRQTSWKAGTN-SLDIKSRLGFLGLHPDPDYKIQ----------TKHHKIAKSIP 77

Query: 86  DRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           + FDARE+WP C   IG + D G C +   FA+    +DR CI +KG+     S E + +
Sbjct: 78  ESFDAREKWPECKDVIGKIRDQGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLT 137

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC+ CR +    C  G   + W++    G V+GGDY    GCQP + +   +  ++    
Sbjct: 138 CCEDCRLE----CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVAS---- 189

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
                     KC   C N  Y   +  DKH     Y ++ N   I+ EIL +GP  ATF 
Sbjct: 190 ----------KCVKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFN 239

Query: 265 LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT-VK 323
           +++D  +YKSG+            L +  ++ WGTE G PYWL+ N+WG  WGD G  +K
Sbjct: 240 VFEDIIYYKSGI-----------QLSNVSILRWGTEEGVPYWLIANSWGTWWGDLGGFIK 288

Query: 324 ILRGKYECAFEYLIAAG 340
           I RG  ECA E  +AAG
Sbjct: 289 IKRGTNECAIEQEMAAG 305


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/262 (35%), Positives = 133/262 (50%), Gaps = 23/262 (8%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL-STEY 141
           ++P+ FD+R++WPNC ++  + D G C + ++ +   A +DR CI S GQ+     +T+Y
Sbjct: 80  SLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFGATDY 139

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           +A CC  C       C  G V +TW +    G  + G Y    GC            S P
Sbjct: 140 LA-CCTDCF-----KCDGGYVGKTWQYWVDSGLTSEGPYKSGQGCN-----------SYP 182

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
               C N  +P     +R     Y   + QD       Y V  NE+AI  EI  +GP   
Sbjct: 183 FGSYCVNDPLPTC---SRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVV 239

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGT 321
            F ++ DFY YKSGVY+H + A  E + H+ ++IGWG ENG  YWLV N+WG  WGD+G 
Sbjct: 240 QFEVFADFYQYKSGVYRHVTGAT-EGW-HAVRVIGWGVENGVKYWLVANSWGVRWGDKGF 297

Query: 322 VKILRGKYECAFEYLIAAGKPK 343
            K +RG+     E  + AG PK
Sbjct: 298 FKFVRGENHLGIEDFVYAGLPK 319


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 120/229 (52%), Gaps = 9/229 (3%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F AV A SDR CI S G+    +S E +  CC  C       CS G     W +    G 
Sbjct: 2   FGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCDKC----GSGCSGGVSAAAWQYWKDAGL 57

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           V+GG Y    GC+P +++PC  H S  +LP C    +P  KC  +C    Y R +  DK+
Sbjct: 58  VSGGLYNTTDGCKPYSLAPC-EHSSQGSLPECVG-TLPTPKCKRQCRE-GYERSYDDDKY 114

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y ++ +E  I+ EI  +GP  A F  Y DF  YKSGVY+H S   +    H+ ++
Sbjct: 115 FAKNVYSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGR--HAIRI 172

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG+E+  PYWL+ N+W   WGD G  K+LRG  EC  E  + AG PK
Sbjct: 173 LGWGSEDNNPYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIPK 221


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/343 (32%), Positives = 152/343 (44%), Gaps = 32/343 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+A+   
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   
Sbjct: 67  GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C     QQ R +S  ++ SCCK C Y     C  G     W +    G  +   
Sbjct: 126 AISDRYCTVGGVQQLR-ISAAHLLSCCKDCGY----GCDGGYPGTAWEYYVSHGLAS--- 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C HHG     P C        KC+T CT+       ++  H   L 
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPLIKYRGNHSYGL- 232

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
               D ED  K+E+  +GP    F +Y DF  YK+GVY+H S   L    H+ +++GWG 
Sbjct: 233 ----DGEDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGG--HAVRIVGWGK 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            NGTPYW + N+W   WG  G   ILRGK EC  E    AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLP 329


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/245 (33%), Positives = 134/245 (54%), Gaps = 9/245 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+RE W +C +I ++ D   C +    +A    SDR C++SKG+  + +S   + 
Sbjct: 95  IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
           +CC     +  + C+ G   + W ++ + G VTGG Y ++  C+P  + PC +HG    +
Sbjct: 155 ACCG---SECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P   + + P   C   C    YG+ + +DK      Y +D++E AI++E++ +GP  A 
Sbjct: 212 CPRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F  Y+DF  Y  G+Y HT     +   H+ K++GWG ENGT YW V N+W   WG+ G  
Sbjct: 269 FITYEDFSFYTKGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGENGYF 326

Query: 323 KILRG 327
           +ILRG
Sbjct: 327 RILRG 331


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/223 (38%), Positives = 125/223 (56%), Gaps = 8/223 (3%)

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            +DR CI+S GQQ+  LS   + SCC+ C       C  G   + W++   +G VTGG  
Sbjct: 1   MTDRICIQSGGQQSAELSALDLISCCEDC----GDGCQGGFPGQAWDYWVTQGIVTGGSK 56

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
            + TGCQP     C HH +    P+C  +     +C   C    Y   + QDKH    +Y
Sbjct: 57  ENHTGCQPYPFPKCEHH-TKGKYPACGTKIYKTPQCKQTC-QKGYKTPYEQDKHYGDESY 114

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V  NE AI+KEI+ +GP  A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG E
Sbjct: 115 NVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVE 172

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             TPYWL+ N+W   WG++G  +I+RG+ EC+ E  + AG  K
Sbjct: 173 KRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGLIK 215


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/219 (39%), Positives = 120/219 (54%), Gaps = 9/219 (4%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDARE WPNC TI  V D G+C +   F AV A SDR CI SKG +N   S E + 
Sbjct: 28  LPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLV 87

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C +     C+ G     WN+   +G V+GG YG   GC P  ++PC HH +    
Sbjct: 88  SCCWTCGF----GCNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRG 143

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  E  K P  KC  +C +  Y   + QD H     Y + ++ D I++EI  +GP    F
Sbjct: 144 PCKEGGKTP--KCVKKCED-GYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAF 200

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            +Y+DF  Y++GVYKH +   L    H+ +++GWG +NG
Sbjct: 201 TVYEDFIAYRAGVYKHVAGKALGG--HAIRILGWGVQNG 237


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/340 (31%), Positives = 155/340 (45%), Gaps = 25/340 (7%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIADAKYF 62
           LV L    L+  +    +  ++D+IN+     W A  N    N++    R+   A    F
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA----F 69

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            +    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   A S
Sbjct: 70  RRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAIS 128

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C     QQ R +S  ++ SCC+ C       C  G+    W +    G  +      
Sbjct: 129 DRYCTVGGVQQLR-ISAAHLMSCCEDC----GDGCKGGAPDSAWEYYVSHGLAS------ 177

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
            + CQP     C HHG     P C        KC+T CT+    +     K+R   +Y +
Sbjct: 178 -SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNNSYML 232

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            + ED  K+E+  +GP    F +Y DF  YK+GVY+H S   L    H+ +++GWG  NG
Sbjct: 233 LNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGG--HAVRIVGWGKLNG 290

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYW + N+W   WG  G   ILRG  EC  E    AG P
Sbjct: 291 TPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLP 330


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/328 (32%), Positives = 154/328 (46%), Gaps = 25/328 (7%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           G++     A++  IN     W AG N    L  +   Q+      Y + +   LP     
Sbjct: 75  GDVVNSQAAFVAAINNRTRGWKAGVN---PLRHD---QYRTGALLYEEAARAKLPQGIVL 128

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              E     P+ FDAR++W  C ++G + + G CA+ +  AAV   +DR CI S+G+   
Sbjct: 129 KLQE--EPFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQF 186

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
                 V SCC  C +     C  G     W++  + G  +GG Y    GCQ      C 
Sbjct: 187 SFGAYDVLSCCHRCGF----GCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCK 242

Query: 196 -HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
                AP +          L C  +C  P Y   + +DKH   + Y V  +ED I  E+ 
Sbjct: 243 PQEIFAPHV---------DLICLRQC-QPGYNTTYLEDKHFGRVAYSVPRDEDRILYELF 292

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
             GP  A+F +Y DF  YKSGVY+HT   ++ +  HS K++GWG ENGT +WL  N+WG 
Sbjct: 293 YFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGD--HSVKIVGWGVENGTKFWLCANSWGA 350

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG+ G  KI+RG+   + E  + AG P
Sbjct: 351 EWGENGFFKIIRGEDHLSVESNVVAGLP 378


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 120/229 (52%), Gaps = 9/229 (3%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F AV A SDR CI + G   + +S   + SCC  C +     C  G     W+F    G 
Sbjct: 2   FGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGF----GCQGGFPPTAWDFWQTEGI 57

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG   + TGC+      CSHHGS    P C ++      C  +C  P     +  DK 
Sbjct: 58  VTGGSKENPTGCRSYPFPRCSHHGSK-KYPPCSHRIYDTPNCVQKCDTPD--TDYATDKT 114

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
           R  +TY V   ++AI KEI+ +GP  A F +Y+DF  YKSGVY H+    L    H+ ++
Sbjct: 115 RANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGG--HAIRI 172

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG ENG  YWL+ N+W   WG+ G  K+LRGK EC  E  + AG P+
Sbjct: 173 LGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPE 221


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 111/347 (31%), Positives = 157/347 (45%), Gaps = 17/347 (4%)

Query: 1   MIHILVFLLGCTLVRGE---LYKFSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLI 56
           +I +L  +  C L   E   +       I+ +NR     W AG N     S++  + F  
Sbjct: 6   VILLLNIICNCELNAVENEHIEPLFGKLIEYVNRNPKFGWKAGTNHRFRSSKDIEKMF-- 63

Query: 57  ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
              KY +  +      +       +  +P  FDAR  W NC TI  + D   C A    A
Sbjct: 64  --RKYIEIENIQTKHIKTISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIA 121

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
            V + SDR CI+S G+ +  LS     SC        +  C HGS      +    G VT
Sbjct: 122 TVDSISDRICIRSNGRISVQLSARDAISC------GFSPGCFHGSEVEVLVYWITYGIVT 175

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
           GG Y D++GCQP  +  CS+H  +  L  C N      +C   C +  Y + +  DK   
Sbjct: 176 GGSYEDQSGCQPYPLPKCSYHPESRFL-DCNNNTFEFPQCTNECQD-GYNKTYDDDKFYG 233

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
              Y V   ++ I+KEIL +GP  A+ ++  DF  YKSGVY  T  ++   ++ + ++IG
Sbjct: 234 ERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWI-TLRIIG 292

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG E   PYWL  N+W   WGD G VKI RG      E  + A  PK
Sbjct: 293 WGYEGKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPK 339


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/231 (37%), Positives = 123/231 (53%), Gaps = 10/231 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           AAV A SDR CI SKG++   LS + + SCCK C +     C  G     W +    G V
Sbjct: 168 AAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGF----GCFGGEPMAAWKYWVLSGIV 223

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TG DY + +GC+P    PC HH +      C++   P  KC  +C +  Y + +  DK+ 
Sbjct: 224 TGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQC-DKNYKKPYKADKYY 282

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V+++ + I+KEI+  GP  A+F +Y DF HY  G+YKH + +      H+ K++
Sbjct: 283 GEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGG--HAVKIL 340

Query: 296 GWGTENGTPYWLVINTWGPHWGD---RGTVKILRGKYECAFEYLIAAGKPK 343
           GWG + G  YWL  N+W   WG+    G  +ILRG  EC  E  I AG P+
Sbjct: 341 GWGIDQGVSYWLAANSWNTDWGEDVFSGYFRILRGVDECGIESGIVAGIPR 391


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/312 (33%), Positives = 156/312 (50%), Gaps = 31/312 (9%)

Query: 42  FPANLSEEYL---RQFLIADAKYFDQS---DRPLPGDRKTYDPEYSATVPDRFDAREQWP 95
           F A  S E +   RQFL+   ++ ++S   +  LP    T + +    +P+ FD+RE+W 
Sbjct: 53  FKAKYSPEVVKKRRQFLL-KPQFIERSYNQENVLPIANITSNDD----IPESFDSREKWK 107

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDN 154
           +C ++  +PD   C +    +A    SDR CI S+G++   LS   + +CC K C Y   
Sbjct: 108 DCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGY--- 164

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC-SHHGSA----PTLPSCENQ 209
             C  G   R W +    G VTGG Y ++  C+P     C +H G A    P+ P     
Sbjct: 165 -GCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPA 223

Query: 210 KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDF 269
           + P  +         YG+ +  DK +    YW+ ++E  I+ EI+  GP  ATF +Y+DF
Sbjct: 224 RKPYCQY-------GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDF 276

Query: 270 YHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG-DRGTVKILRGK 328
            HY  GVY HT+ A      HS K+IGWG + G  YWL+ N+W   WG D G  +++RG 
Sbjct: 277 EHYNGGVYIHTAGAMEGG--HSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334

Query: 329 YECAFEYLIAAG 340
             C  E  + AG
Sbjct: 335 NNCDIEGGVLAG 346


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/340 (32%), Positives = 153/340 (45%), Gaps = 25/340 (7%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIADAKYF 62
           LV L    L+  +    +  ++D+IN+     W A  N    N++    R+   A    F
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA----F 69

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
            +    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   A S
Sbjct: 70  RRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAIS 128

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR C     QQ R +S  ++ SCCK C       C  G     W +    G  +      
Sbjct: 129 DRHCTVGGVQQLR-ISAAHLLSCCKDC----GDGCDGGYPDSAWEYYVSHGLAS------ 177

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
            + CQP     C HHG     P C        KC+T CT+    +     K+R   +Y +
Sbjct: 178 -SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIKYRGNDSYVL 232

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
              ED  K+E+  +GP    F +Y DF  YK+GVY+H S   L    H+ +++GWG  NG
Sbjct: 233 LHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGKLNG 290

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           TPYW + N+W   WG  G   ILRG  EC  E    AG P
Sbjct: 291 TPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLP 330


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 95/288 (32%), Positives = 146/288 (50%), Gaps = 14/288 (4%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           FSD  I  IN E+  +W A  +   N  ++  +   + +    D++ +     R+T    
Sbjct: 3   FSDELIHYINEESGASWKAAPSTRFNNIDQVKQNLGVLEETPEDRNTQ-----RQTVRYS 57

Query: 80  YSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
            S   +P+ FDAR++WPNC +I  + D  +C++    ++  A +DR CI S GQ+   LS
Sbjct: 58  VSENDLPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLS 117

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC  C Y     C+ G    +W++  + G VTGG   + TGC P     CSH  
Sbjct: 118 AIDIVSCCAYCGY----GCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGV 173

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P LP C     P  KC  +C +  Y + + QDK +   +Y V + E  I  EI+ +GP
Sbjct: 174 VTPGLPPCPRDIYPTPKCEKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGP 232

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
               F +++DF  YKSG+Y +T+   +    H+ ++IGWG ENG  YW
Sbjct: 233 VDGIFYMFEDFLVYKSGIYHYTTGRLVGG--HAIRVIGWGVENGVNYW 278


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 158/328 (48%), Gaps = 49/328 (14%)

Query: 28  QINREANT-----WTAGRN--FPANLSEEY-----LRQFLIADAKYFDQSDRPLPGDRKT 75
           ++ RE N+     W AG N  F     E++      RQ  ++D  Y D S  P+      
Sbjct: 20  KLVREVNSRNDVNWVAGINPHFADATIEDFRRLNGARQTPLSDRVYMDVSTVPV------ 73

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
                 A +PD FD+R  WPNC  IG + D G C +    ++     DR CIKS+G+Q  
Sbjct: 74  ------ANLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTP 127

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS +++ SC   C       C+ G +   + F+   G + G D      C P  +  C 
Sbjct: 128 ELSPQHLTSCTPGC-----SGCNGGWMSTAFGFMQSNG-ILGED------CIPYQMGKCK 175

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G + T P+    K  K KC+   T  T       +      +Y V  NE  I+KEI  
Sbjct: 176 HPGCS-TWPT---PKCNKTKCYPNDTKST-------ELWHAASSYSVRSNEADIQKEIYE 224

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP TA+FA+Y+D   Y+SGVY+H +    E  LH+ K++GWG  +G  YW ++N+W   
Sbjct: 225 NGPVTASFAVYEDLSVYQSGVYQHVTGG-FEG-LHAIKVVGWGILDGVKYWTIVNSWAED 282

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WG  G + I RG  EC  E  + AG+PK
Sbjct: 283 WGFDGLLLIRRGVDECGIESDVVAGQPK 310


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 103/321 (32%), Positives = 152/321 (47%), Gaps = 26/321 (8%)

Query: 28  QINREANTWTAGRN-FPANLSEEYLRQFL---IADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           Q N    +W  GRN +  N S   +++ L   +      ++++ P+P D    +   +  
Sbjct: 232 QANGNTFSWKFGRNAYFKNKSIGEIKKLLGYRMLPKTVKERNEMPMPEDLLNLE---NFN 288

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            P  FD+R+ WP C   I  + D   C +    ++    SDR CI + GQ    LS   +
Sbjct: 289 YPVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAEL 348

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C Y     C+ G   RT+ +    G  TGG YG    C+P  I PCS+      
Sbjct: 349 LSCCTSCGY----GCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSN------ 398

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C   + PK  C   C + TY     +D+H  +  Y     E ++ K+I  +GP  A 
Sbjct: 399 ---CSETRTPK--CSKSCIS-TYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAG 452

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
            ++Y+DF HYK GVY   S   L    H+ ++IGWG ++  PYWLV N+W   +G+ G  
Sbjct: 453 MSVYEDFLHYKEGVYTQESGIFLGG--HAVRIIGWGEQDNIPYWLVANSWNTTFGEDGLF 510

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  EC  E  ++AG+ K
Sbjct: 511 KIRRGFDECGIESYVSAGRAK 531


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 92/243 (37%), Positives = 129/243 (53%), Gaps = 21/243 (8%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P +FDAR++W  C TIG V D G C +    +   AF+DR C+ + G  N+ LS E + 
Sbjct: 28  IPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEIT 87

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
            CC  C       C+ G   R W      G VTGG+Y    GC+P  + PC +       
Sbjct: 88  FCCHKC----GNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKN- 142

Query: 204 PSCENQKV-PKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            +C  Q + P  KC  +C    YG     F +D   T   Y++      I+K+++ +GP 
Sbjct: 143 -TCSGQPMEPNHKCSKKC----YGDEDIDFNKDHRYTRDDYYL--TYRGIQKDVINYGPI 195

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWG 317
            A+F +YDDF +YKSG+Y  + NA   +YL  HS KLIGWG E G  YWL++N+W   WG
Sbjct: 196 EASFDVYDDFPNYKSGIYVKSENA---SYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWG 252

Query: 318 DRG 320
           D+G
Sbjct: 253 DKG 255


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/348 (32%), Positives = 166/348 (47%), Gaps = 35/348 (10%)

Query: 2   IHILVFLLGCT--LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           + +LVF LG +  L   + +  SD YI QIN + +TW AGRNF   + E  L + L +  
Sbjct: 4   VLMLVFALGLSSALPSNKPHPLSDEYIAQINSKQSTWKAGRNFA--IDEYELFKSLASGV 61

Query: 60  KYFDQSDRPLPGDRKTYDP---EYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIF 115
           K         P   KT      E +  +P+ FD+R  WP C   IG + D   C +   F
Sbjct: 62  KK--------PQGLKTAQKLVREITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAF 113

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           AAV A SDR CI S   +   +S++ + +C           C+ G     W+     G V
Sbjct: 114 AAVEAMSDRICIHSNATKKLLVSSQDLLTC------GTAGGCNGGWPAVAWSDW-TNGIV 166

Query: 176 TGGDYGD-RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           TGG YG    GC+   +  C  H +      C N  V    C  +C  P+    +++ + 
Sbjct: 167 TGGLYGALEQGCKSYFLEGCDDHPN-----KCRNY-VSTPACVEQCDEPSL---YYKAQE 217

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               T +    E+ I+ EI+ +GP  AT  +Y DF  Y+SG+Y+ T++       H+ K+
Sbjct: 218 TYGQTPYEIQGEEQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGG--HAVKI 275

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +GWG E+G  YWLV N+W   WG+ G  +I+RG+ E   E  I A  P
Sbjct: 276 LGWGVEDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/258 (34%), Positives = 131/258 (50%), Gaps = 9/258 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+   +R +WP C ++  + D   C +    +   A SDR CI S G++   +S   + 
Sbjct: 2   IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61

Query: 144 SCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SCC   C Y     C+ G   + +N+  K+G+VTGGDY   +GC+P    PC HHG    
Sbjct: 62  SCCGNQCGY----GCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTY 117

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
              C N+     KC  +C        + +D+      Y   + E A ++EI+ +GP    
Sbjct: 118 YGECPNEATTP-KCVRKCQKSYKKS-YKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGA 175

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y+DF +YK G+YKHT+        H+ K+IGWG E G PYWL+ N+W   WG+ G  
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGG--HAIKIIGWGKEGGVPYWLIANSWHNDWGENGYF 233

Query: 323 KILRGKYECAFEYLIAAG 340
           +IL G   C  E  + AG
Sbjct: 234 RILCGSNHCGIEENVVAG 251


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 9/245 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+R  W NC +I ++ D   C +    +A    SDR C++SKG+  + +S   + 
Sbjct: 95  IPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP-T 202
           +CC     +  + C+ G   + W ++ + G VTGG Y ++  C+P  + PC +HG    +
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWS 211

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P   + + P   C   C    YG+ + +DK      Y +D++E AI++E++ +GP  A 
Sbjct: 212 CPRDHSFRTPA--CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAA 268

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
              Y+DF  Y+ G+Y HT     +   H+ K++GWG ENGT YW V N+W   WG+ G  
Sbjct: 269 SITYEDFSFYRRGIYVHTRGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYF 326

Query: 323 KILRG 327
           +ILRG
Sbjct: 327 RILRG 331


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 160/354 (45%), Gaps = 39/354 (11%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREAN-TWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ ++N +A   WTA  N        S   
Sbjct: 11  LVAVFALLLA-TTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASANNGYLVTGKSLGE 69

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P  R     E    +P+ FDA E WP C TI  + D   C 
Sbjct: 70  VRKLM----GVTDMSTEAVP-PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCG 124

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC       C  G     W +  
Sbjct: 125 SCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFIC----GLGCHGGIPTVAWLWWV 179

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C +      KC+T C          
Sbjct: 180 WVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCER----NEMD 228

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL- 289
             K++ + +Y V   E  +  E++ +GP   T  +Y DF  YKSGVYKH     L ++L 
Sbjct: 229 LVKYKGSTSYSVK-GEKELMIELMTNGPLELTMQVYSDFVGYKSGVYKHV----LGDFLG 283

Query: 290 -HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            H+ KL+GWGT++G PYW V N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 284 GHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/344 (29%), Positives = 162/344 (47%), Gaps = 32/344 (9%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           +L+ +    LV  E    +  ++D +NR     WTA       + +  ++   +++AK  
Sbjct: 13  VLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTA-------VYDGRMQNTTVSEAKRL 65

Query: 63  DQSDRP----LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
           +++ R     LP    T + E  A +P+ FDA E+WPNC TI  + D  +C +    AA 
Sbjct: 66  NRATRKPVSVLPRVNFT-EEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            + +DR C    G +   +S   + +CC  C Y     C  G     W +    G  +G 
Sbjct: 125 TSMTDRYCTI-HGVRGLRISAADLLACCGDCGY----GCLGGDPDMAWAYFSSEGIASGR 179

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
                  CQP     CSH+ ++ T P C    +    C+  CT+ T  +     K+R   
Sbjct: 180 -------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISK----KKYRGLK 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y +   ED  ++E+   GP  A F ++ D + YK GVYKH   A +    H+ +++GWG
Sbjct: 229 SYSLSGEED-FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIG--AHAVRIVGWG 285

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ++G PYW + N+W   WGDRG   +LRG  EC  E   +AG P
Sbjct: 286 NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVP 329


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 125/223 (56%), Gaps = 8/223 (3%)

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            +DR CI+S G Q+  LS   + SCC+ C     + C  G     W++   +G VTGG  
Sbjct: 1   MTDRICIQSGGGQSAELSALDLISCCEDC----GQGCQGGFPGVAWDYWVTQGIVTGGSK 56

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
            + TGCQP     C HH +    P+C  +     +C  +C    Y   + QDKH    +Y
Sbjct: 57  ENHTGCQPYPFPKCEHH-TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYKQDKHYGDESY 114

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V  NE AI+KEI+ +GP  A F +Y+DF +YKSG+Y+H + + +    H+ ++IGWG +
Sbjct: 115 NVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGG--HAIRIIGWGVK 172

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             TPYWL+ N+W   WG++G  +I+RG+ EC+ E  + AG  K
Sbjct: 173 KRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 215


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 110/334 (32%), Positives = 148/334 (44%), Gaps = 37/334 (11%)

Query: 15  RGELYKFSDAYIDQINR-EANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           + E     D+ + Q+N  E   W A  N  F      ++ R   +   +  D    P+  
Sbjct: 34  KAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILT 93

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
             K  +      +P  FDAR  WPNC TIG + D G C +   F AV + SDR CI    
Sbjct: 94  HPKLLE------LPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG- 146

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPS 189
             N  LS   + +CC     D    C  G   + W +  ++G VT     Y D  G    
Sbjct: 147 -LNISLSANDLLACCGFLCGD---GCDGGYPLQAWKYFVRKGVVTDECDPYFDNEG---- 198

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
               CSH G  P  P+         KCH +C        + + KH     Y +  +  +I
Sbjct: 199 ----CSHPGCEPAYPT--------PKCHRKCVKQNL--LWSKSKHFGVNAYMISSDPHSI 244

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLV 308
             E+  +GP   +F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT E+G  YWL+
Sbjct: 245 MTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGG--HAVKLIGWGTSEDGEDYWLL 302

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            N W   WGD G  KI RG  EC  E  + AG P
Sbjct: 303 ANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLP 336


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 103/344 (29%), Positives = 161/344 (46%), Gaps = 32/344 (9%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           +L+ +    LV  E    +  ++D +NR     WTA       + +  ++   +++AK  
Sbjct: 13  VLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTA-------VYDGRMQNTTVSEAKRL 65

Query: 63  DQSDRP----LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
           +++ R     LP    T + E  A +P+ FDA E+WPNC TI  + D  +C +    AA 
Sbjct: 66  NRATRKPVSVLPRVNFT-EEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            + +DR C    G +   +S   + +CC  C Y     C  G     W +    G  +G 
Sbjct: 125 TSMTDRYCTI-HGVRGLRISAADLLACCGDCGY----GCLGGDPDMAWAYFSSEGIASGR 179

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
                  CQP     CSH+ ++ T P C    +    C+  CT+ T  +     K+R   
Sbjct: 180 -------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISK----KKYRGLK 228

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y     ED  ++E+   GP  A F ++ D + YK GVYKH   A +    H+ +++GWG
Sbjct: 229 SYSFSGEED-FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIG--AHAVRIVGWG 285

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ++G PYW + N+W   WGDRG   +LRG  EC  E   +AG P
Sbjct: 286 NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVP 329


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 105/338 (31%), Positives = 146/338 (43%), Gaps = 60/338 (17%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           +  LG    R   +  SD  ++ +N++  TW AG NF  N+   YL++       +    
Sbjct: 11  LLALGDARSRPSFHPLSDELVNYVNKQNTTWQAGHNF-YNVDVSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P+ FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRI 121

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
           CI +    +  +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    G
Sbjct: 122 CIHTNAHVSVEVSAEDLLTCCGIMCGD---GCNGGYPAGAWNFWTRKGLVSGGLYDSHVG 178

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
           C+P +I PC HH +    P       PK    ++   P Y   + QDKH    +Y V ++
Sbjct: 179 CRPYSIPPCEHHVNGSRPPCTGEGDTPKC---SKICEPGYSPTYKQDKHYGYNSYSVSNS 235

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI                                              +NGTPY
Sbjct: 236 EKDIMAEIY---------------------------------------------KNGTPY 250

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 251 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 288


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 142/289 (49%), Gaps = 9/289 (3%)

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
           I D  +   ++  +    +  D + + ++P+ FDARE+WP C +IG + D  A       
Sbjct: 66  IMDLSFMVDAEVMMEEMDQQEDIDLAVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAV 125

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           ++    +DR CI+S G +   +S   + SCC + C       C+ G   + +N+  ++G 
Sbjct: 126 SSAEVMTDRICIQSNGTKQVYVSETDILSCCGQRC----GSGCTSGVPRQAFNYAIRKGV 181

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
            +GG YG +  C+P    PC +H   P    C +   P   C   C +  Y   +  D+ 
Sbjct: 182 CSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQS-DYTVPYNDDRI 240

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
             + T  V   E+ IK+EI  +GP  AT+ +Y+DF +YK+G+Y   +        H+ K+
Sbjct: 241 FGSKTI-VLTGEEKIKREIFNNGPLVATYTVYEDFAYYKNGIY--MTGLGRATGAHAVKI 297

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           IGWG ENG  YWL+ N+W   WG+ G  ++LRG   C  E     G  K
Sbjct: 298 IGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATGGTFK 346


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 154/343 (44%), Gaps = 32/343 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+A+   
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   
Sbjct: 67  GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C     QQ R +S  ++ SCC+ C Y     C  G    +W +    G  +   
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLMSCCEDCGY----GCDGGYPGTSWEYYVSHGLAS--- 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C HHG     P C        KC+T CT+    +     K+R   +
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNHS 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V   ED  K+E+  +GP    F +Y DF  YK+GVY+H S   L    H+ +++GWG 
Sbjct: 230 YEVH-GEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGK 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            NGTPYW + N+W   WG  G +  LRG  EC  E    AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSP 329


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 116/352 (32%), Positives = 157/352 (44%), Gaps = 35/352 (9%)

Query: 1   MIHILVFLLGCTLVRGELYKFSD------AYIDQINREAN-TWTAGRN---FPANLSEEY 50
           ++ +   LL  T V G   K SD      +++ ++N +A   WTA  +        S   
Sbjct: 11  LVAVFALLLA-TTVSGLYAKPSDFPLLGKSFVAEVNSKAKGQWTASADNGYLVTGKSLGE 69

Query: 51  LRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA 110
           +R+ +       D S   +P  R     E    +P+ FDA E WP C TI  + D   C 
Sbjct: 70  VRKLM----GVTDMSTEAVP-PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCG 124

Query: 111 APHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLH 170
           +    AAV A SDR C    G  +R +ST  + SCC IC       C  G     W +  
Sbjct: 125 SCWAIAAVEAISDRYCTFG-GVPDRRMSTSNLLSCCFIC----GLGCHGGIPTVAWLWWV 179

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFF 230
             G  T         CQP    PCSHHG++   P C +      KC+T C          
Sbjct: 180 WVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERSEMDL--- 229

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K++ + +Y V   E  +  E++ +GP   T  +Y DF  YKSGVYKH     L    H
Sbjct: 230 -VKYKGSTSYSVK-GEKELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGG--H 285

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + KL+GWGT++G PYW V N+W   WGD+G   I RG  EC  E    AG P
Sbjct: 286 AVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 98/289 (33%), Positives = 142/289 (49%), Gaps = 16/289 (5%)

Query: 21  FSDAYIDQINREAN-TWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRP-LPGDRKTYD 77
           FSD  I  +N E+  +W A R+   +N+    L    +++      + RP +  D    D
Sbjct: 3   FSDELIRFVNEESGASWKAARSTRFSNVDHFKLDLGALSETPEERNALRPTIKHDISKND 62

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P+ FDAR QWP C TI  + D  +C +    AA  A SDR CI S GQ    L
Sbjct: 63  ------LPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRL 116

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           +     SCC  C     + C  G   + W++  + G VTGG + +RTGCQP   + C H 
Sbjct: 117 AAADPLSCCTYC----GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHV 172

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G +     C +   PK  C   C    Y + + QDK     +Y V ++E  I +EI+ +G
Sbjct: 173 GDSRKYSRCPHYTYPKPPCARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNG 231

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
           P   TFA++ DF  Y+SG+Y H +   +    H+ ++IGWG ENG  YW
Sbjct: 232 PVEVTFAIFQDFGVYRSGIYHHVAGKFIGR--HAVRMIGWGVENGVNYW 278


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 93/258 (36%), Positives = 121/258 (46%), Gaps = 25/258 (9%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FDAR  WP C +I H+ D   C +   F AV A SDR CI S G     LS E + 
Sbjct: 15  IPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDML 74

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC +        C+ G     W F    G  T   Y       P    PC HH +    
Sbjct: 75  SCCLV---QCGMGCNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHY 124

Query: 204 PSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
             C  +Q  PK      C   +  +  +  K   +++         I+ EI+ +GP  A 
Sbjct: 125 KPCGPSQPTPK------CVRASEKKPRYHGKSVYSVS------PAKIQAEIMTNGPVEAA 172

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y DF  Y+SGVY+H S  +L    H+ K++GWG E G  YWLV N+W   WGD+GT 
Sbjct: 173 FTVYQDFLAYQSGVYRHVSGPELGG--HAIKIMGWGVEAGNKYWLVANSWNEDWGDKGTF 230

Query: 323 KILRGKYECAFEYLIAAG 340
           KI RG  EC  E  + AG
Sbjct: 231 KIARGDDECGIESSVVAG 248


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D IN+     W A  N         ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYN-------GKMQNITFSEAKRLT 67

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FDA E WP+C TI  + D   C A    +   
Sbjct: 68  GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S  ++ SCCK C       C  G     W +  + G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+      C        KC+  CT+    +     K+R   T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +   E+  K+E+  +GP  A F +Y D + YKSGVY+H     L     + K++GWG 
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            NGTPYW + N+W   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 156/345 (45%), Gaps = 26/345 (7%)

Query: 1   MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPA-NLSEEYLRQFLIA 57
           ++   +  LG + +R  +    +  ++D+IN+     W A  N    N++    R+   A
Sbjct: 9   LLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFAEARRLTGA 68

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
               F +    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    + 
Sbjct: 69  ----FRRKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVST 123

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             A SDR C     QQ R +S  ++ SCCK C       C  G     W +    G  + 
Sbjct: 124 ASAISDRHCTVGGVQQLR-ISAAHLLSCCKDC----GDGCDGGYPDAAWRYYVSHGLAS- 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
                 + CQP     C HHG     P C        KC+T CT+    +     ++R  
Sbjct: 178 ------SYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTD----KAIPLIEYRGN 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
            +Y +   ED  K+E+  +GP    F ++ DF  YK+GVY+H S   L    H+ +++GW
Sbjct: 228 DSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGG--HAVRIVGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  NGTPYW + N+W   WG  G    LRG  EC  E+   AG P
Sbjct: 286 GKLNGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLP 330


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 101/325 (31%), Positives = 153/325 (47%), Gaps = 37/325 (11%)

Query: 24  AYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           A ++++N+    WTA  +   A L+ + +++ + A  +     D P+   R   + E  A
Sbjct: 37  AEVNKLNK--GIWTARYDTKMARLTRQGVKRLMGAKLR-----DAPVLPRRHFTEEELRA 89

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P+ FDA   WP+C TI  + D  +C +    AA  A SDR C+ + G ++  +S   +
Sbjct: 90  PLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-TGGVRDLGISAGDL 148

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C       C  G     W +  + G V+  DY     CQP    PC H G    
Sbjct: 149 LSCCTSC----GDGCDGGYPDEAWLYFTESGLVS--DY-----CQPYPFPPCKHSGGRSK 197

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN-----EDAIKKEILAHG 257
            PSC +      KC+  CT          DK    + Y+  ++     E+  K+E+   G
Sbjct: 198 NPSCHDMHFHTPKCNATCT----------DKRIPVVRYFASESYSLQGEEDYKRELYLRG 247

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P    F +Y+DF  Y+SGVYKH S   +    H+ +++GWG  NG PYW + N+W   WG
Sbjct: 248 PFEVAFTVYEDFLAYESGVYKHVSGGPVGG--HAVRVVGWGERNGVPYWKIANSWNTDWG 305

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           + G +   RGK EC  E   +AG P
Sbjct: 306 ENGYLYFYRGKDECGIESQGSAGTP 330


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 110/221 (49%), Gaps = 9/221 (4%)

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR C  S G ++   S E + SCC IC       C+ G     W +    G V+GG+Y 
Sbjct: 1   TDRVCTYSNGTKHFHFSAEDLLSCCPICGL----GCNGGMPTLAWEYWKHMGLVSGGNYN 56

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
              GC P  I PC HH     LP   + K PK  C   C N  Y   + +DK      Y 
Sbjct: 57  SSQGCSPYVIPPCEHHVPGNRLPCNGDTKTPK--CSKTCEN-GYNVLYKKDKRYGKHVYA 113

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           V   ED IK E+  +GP  A F +Y D   YKSGVYKH     L    H+ K+IGWG EN
Sbjct: 114 VRGGEDHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGG--HAIKIIGWGVEN 171

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  YWL+ N+W   WG+ G  KILRG+  C  E  I AG+P
Sbjct: 172 GNKYWLIANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D IN+     W A  N         ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYN-------GKMQNITFSEAKRLT 67

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FDA E WP+C TI  + D   C A    +   
Sbjct: 68  GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S  ++ SCCK C       C  G     W +  + G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+      C        KC+  CT+    +     K+R   T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +   E+  K+E+  +GP  A F +Y D + YKSGVY+H     L     + K++GWG 
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            NGTPYW + N+W   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 149/328 (45%), Gaps = 37/328 (11%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQF-LIADAKYFDQSDRPLPGDRKTYDP 78
             D+ + ++N  A   W A   F   LS   + QF  +   K   + D  L G      P
Sbjct: 40  LQDSIVKRVNENAEAGWKAA--FNPQLSNFTVSQFKRLLGVKPAREGD--LEGIPVLTHP 95

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
                +P  FDAR+ WP C TIG + D G C +   F AV + SDR CI      +  LS
Sbjct: 96  RLK-ELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHY--NLSISLS 152

Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
              + +CC  +C       C  G     W +  + G VT     Y D TGC        S
Sbjct: 153 VNDLLACCSFLC----GSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC--------S 200

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         KCH +C        + + KH     Y V  +  +I  E+  
Sbjct: 201 HPGCEPLYPT--------PKCHRKCVKGNVL--WRKSKHYGVNAYRVSHDPQSIMAEVYK 250

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP   +F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT E G  YWL++N+W  
Sbjct: 251 NGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGG--HAVKLIGWGTSEQGEDYWLIVNSWNR 308

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG+ G  KI RG  EC  E+ + AG P
Sbjct: 309 GWGEDGYFKIRRGTNECGIEHSVVAGLP 336


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 109/345 (31%), Positives = 157/345 (45%), Gaps = 36/345 (10%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK--- 60
           LV L    L+  +    +  ++D+IN+     W A       + +  ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLT 67

Query: 61  -YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
             F +    LP  R T + +    +P+ FDA E WP+C TI  + D  AC A    A   
Sbjct: 68  GAFSRKTSSLPPVRFT-EEQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVATAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S   + +CCK C       C  G     W +    G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAADLMACCKDC----GGGCEGGYPDAAWEYYVSHGITS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+    P C   K    +C+  CT+    +     K+R   +
Sbjct: 179 ----SQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQCNATCTD----KSVPLIKYRGNHS 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
           Y V   ED  K+E+  +GP    F ++ DF  YKSGVY+H +     N+L   + +++GW
Sbjct: 231 YEVRGEED-YKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAG----NFLGGKAVRIVGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  NGTPYW V N+W   WG  G   ILRG  EC  E+L  AG P
Sbjct: 286 GKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 114/344 (33%), Positives = 151/344 (43%), Gaps = 43/344 (12%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
           L+G       L    +  I+ IN   N  WTAG+N        Y   + IA  K+     
Sbjct: 25  LVGAASGDNSLGIIQNDIIETINNHPNAGWTAGQN-------SYFANYTIAQFKHILGVK 77

Query: 67  RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              PG       KTY    S  +P  FDAR +W  C TIG + D G C +   F AV   
Sbjct: 78  PTPPGLLRGVPTKTY--SRSTDLPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECL 135

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
            DR CI      N  LS   + +CC     D    C  G     W +L + G VT     
Sbjct: 136 QDRFCIHL--NMNISLSVNDLVACCGFMCGD---GCDGGYPISAWQYLVENGVVTDECDP 190

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y D+ GC+        H G  P  P+   +K  K++             + + KH +   
Sbjct: 191 YFDQVGCK--------HPGCEPAYPTPACEKKCKVQNQV----------WQEKKHFSINA 232

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+ +   I  E+  +GP    F +Y+DF HYKSGVY+H +   +    H+ KLIGWGT
Sbjct: 233 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGG--HAVKLIGWGT 290

Query: 300 E-NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             +G  YWL+ N W   WGD G  KI+RGK EC  E  + AG P
Sbjct: 291 SADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 334


>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
          Length = 422

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 159/356 (44%), Gaps = 67/356 (18%)

Query: 22  SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
           SD Y+ ++ R+ N     TW A  N          F    ++  + +++    K+F+   
Sbjct: 64  SDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 122

Query: 67  RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
             +    +  D   S+ +P  FDAR++WPNC +I +VP+ G C +    AA G  SDR C
Sbjct: 123 -AMKRHLEELDNYKSSDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 181

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
           I S G     LS E +  CC +C      +C  G   +   +   +G VTGG    R GC
Sbjct: 182 IHSNGTFKALLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 232

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
           +P +          P  P+   +   K  C  RC N  Y + + +DKH  T  Y      
Sbjct: 233 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQRYEEDKHFATFAYSLYPRS 289

Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
                                        V +  + IKKEIL +GPTT  F + ++F HY
Sbjct: 290 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 349

Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
            SGV++       ++   Y H  +LIGWG +E+GT YWL +N++G HWGD G  KI
Sbjct: 350 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI 405


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/343 (31%), Positives = 153/343 (44%), Gaps = 32/343 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+A+   
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEARRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC +    +   
Sbjct: 67  GARIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C     QQ R +S  ++ SCC+ C       C  G    +W +    G  +   
Sbjct: 126 AISDRHCTVGGVQQLR-ISAAHLMSCCEDC----GDGCDGGYPGTSWEYYVSHGLAS--- 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C HHG     P C        KC+T CT+    +     K+R   +
Sbjct: 178 ----SYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTD----KAIPLIKYRGNHS 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V   ED  K+E+  +GP    F +Y DF  YK+GVY+H S   L    H+ +++GWG 
Sbjct: 230 YEVH-GEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG--HAVRIVGWGK 286

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            NGTPYW + N+W   WG  G +  LRG  EC  E    AG P
Sbjct: 287 LNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSP 329


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/344 (33%), Positives = 150/344 (43%), Gaps = 43/344 (12%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
           L+G       L       I+ IN+  N  WTAG N        YL  + I   K+     
Sbjct: 21  LVGAASGDHSLRIIQKDIIETINKHPNAGWTAGHN-------AYLANYTIEQFKHILGVK 73

Query: 67  RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              PG       KTY    S  +P +FDAR +W  C TIG + D G C +   F AV   
Sbjct: 74  PTPPGLLAGVPTKTYSK--SEELPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECL 131

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
            DR CI      N  LS   + +CC     D    C  G   + W +  + G VT     
Sbjct: 132 QDRFCIHQ--NINISLSANDLVACCGFMCGD---GCDGGYPIKAWQYFVQSGVVTEECDP 186

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y D+ GC+     P      A   P CE       KC  +       + + + KH +   
Sbjct: 187 YFDQVGCKHPGCEP------AYDTPKCEK------KCKVQ------NQVWEEKKHFSINA 228

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+ +   I  E+  +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT
Sbjct: 229 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGG--HAVKLIGWGT 286

Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            + G  YWL+ N W   WGD G  KI+RGK EC  E  + AG P
Sbjct: 287 SDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMP 330


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 113/344 (32%), Positives = 149/344 (43%), Gaps = 43/344 (12%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
           L+G       L    +  I+ IN+  N  WTAG N        Y   + I   K+     
Sbjct: 24  LVGAARGDNSLRIIQNDIIETINKHPNAGWTAGHN-------PYFANYTITQFKHI-LGV 75

Query: 67  RPLP----GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
           +P P        T     S  +P  FDAR QW  C TIG + D G C +   F AV    
Sbjct: 76  KPTPPALLAGVPTKSYSRSMKLPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQ 135

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
           DR CI      N  LS   + +CC  +C       C+ G     W +  ++G VT     
Sbjct: 136 DRFCIHL--NMNISLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRKGVVTDECDP 189

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y D+ GC+        H G  P        + PK +   +  N  +     + KH +   
Sbjct: 190 YFDQVGCK--------HPGCEPAY------RTPKCEKKCKVQNEVWK----EQKHFSVDA 231

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V  N   I  E+  +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT
Sbjct: 232 YRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGT 289

Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            + G  YWL+ N W   WGD G  KI+RGK EC  E  + AG P
Sbjct: 290 SDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMP 333


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/242 (37%), Positives = 123/242 (50%), Gaps = 30/242 (12%)

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG-------S 161
           C +   F+     SDR CI +KG Q   +S   + +CC        +SC  G        
Sbjct: 61  CGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACC-------GRSCGDGCEGGYPIQ 113

Query: 162 VFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT 221
            FR WN    RG VTGGD+   +GC+P   +PC+ +        C  +K P   C   C 
Sbjct: 114 AFRWWN---SRGVVTGGDF-RGSGCRPYPFAPCNSY-------KCPEEKTPT--CSLSC- 159

Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
              Y   + +DK      Y V  N  AI+ EI+ +GP    F +Y+D Y YKSGVY+HT+
Sbjct: 160 QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTA 219

Query: 282 NAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
              L    H+ K+IGWGT+NG PYWL+ N+WG  WG+ G +K+ RG  EC  E  + AG 
Sbjct: 220 GRLLGG--HAIKIIGWGTQNGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGM 277

Query: 342 PK 343
           PK
Sbjct: 278 PK 279



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/65 (46%), Positives = 46/65 (70%), Gaps = 2/65 (3%)

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
           + +GP  A+F +Y+DFY YK GVY++T+   +   +H+ K++GWGTE+GT YWL+ N+WG
Sbjct: 1   MTNGPVEASFTVYEDFYIYKKGVYQYTAGQVVG--VHAIKIMGWGTEHGTDYWLIANSWG 58

Query: 314 PHWGD 318
              G 
Sbjct: 59  AQCGS 63


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 111/336 (33%), Positives = 148/336 (44%), Gaps = 41/336 (12%)

Query: 15  RGELYKFSDAYIDQINR-EANTWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           + E     D+ + Q+N  E   W A  N  F      ++ R   +   +  D    P+  
Sbjct: 34  KAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILT 93

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
             K  +      +P  FDAR  W NC TIG + D G C +   F AV + SDR CI    
Sbjct: 94  HPKLLE------LPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG- 146

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPS 189
             N  LS   + +CC     D    C  G   + W +  ++G VT     Y D  GC   
Sbjct: 147 -LNISLSANDLYACCGFLCGD---GCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC--- 199

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNED 247
                SH G  P  P+         KCH +C   N  + R     KH     Y +  +  
Sbjct: 200 -----SHPGCEPAYPT--------PKCHRKCVKQNLLWSR----SKHFGVNAYMISSDPH 242

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYW 306
           +I  E+  +GP   +F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT E+G  YW
Sbjct: 243 SIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGG--HAVKLIGWGTSEDGEDYW 300

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           L+ N W   WGD G  KI RG  EC  E  + AG P
Sbjct: 301 LLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLP 336


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 153/343 (44%), Gaps = 31/343 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+AK   
Sbjct: 14  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FDA E WP+C TI  + D  AC A    +   
Sbjct: 67  GAWIQKSSTLPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSACRASWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C    G+Q R +S   + SCCK C       C  G     W +  + G  +   
Sbjct: 126 AISDRYCTVGGGKQLR-ISAADLLSCCKQC----GDGCKGGFPGFAWLYYVEYGIAS--- 177

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               +GCQP     C H G+      C   K    KC+  CT+    +     K+R   T
Sbjct: 178 ----SGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNAT 229

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +   E+  K+E+  +GP  A F +Y D + YKSGVY++     L     + +++GWG 
Sbjct: 230 YLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVRIVGWGK 287

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            NGTPYW V N+W   WG  G + ILRG  EC  E+L   G P
Sbjct: 288 LNGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 148/328 (45%), Gaps = 37/328 (11%)

Query: 21  FSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKTYD 77
             D  I  IN+  N  WTA RN + AN +    +  L +    +   +D P+    KTY 
Sbjct: 42  IQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVPV----KTY- 96

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
              S  +P  FDAR  W  C TIG + D G C +   F AV    DR CI      N  L
Sbjct: 97  -PRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHF--NMNISL 153

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
           S   + +CC     D    C  G     W +  + G VT     Y D+ GC+        
Sbjct: 154 SVNDLVACCGFMCGD---GCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK-------- 202

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+     V + KC  +       + + + KH +   Y V+ +   I  E+  
Sbjct: 203 HPGCEPAYPT----PVCEKKCKVQ------NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQ 252

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W  
Sbjct: 253 NGPVEVAFTVYEDFAHYKSGVYKHITGGMMGG--HAVKLIGWGTTDAGEDYWLLANQWNR 310

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WGD G  KI+RG  EC  E  + AG P
Sbjct: 311 GWGDDGYFKIIRGTNECGIEEDVVAGMP 338


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 107/344 (31%), Positives = 153/344 (44%), Gaps = 31/344 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D IN+     W A  N         ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYN-------GKMQNITFSEAKRLT 67

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FDA E WP+C TI  + D   C A    +   
Sbjct: 68  GARIQKSSALPPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S  ++ SCCK C       C  G     W +  + G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+      C        KC+  CT+    +     K+R   T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KSVPLIKYRGNAT 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +   E+  K+E+  +GP  A F +Y D + YKSGVY++     L     + K++GWG 
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGG--TAVKVVGWGK 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            NGTPYW V N+W   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 289 LNGTPYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 332


>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
 gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
          Length = 421

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 96/282 (34%), Positives = 132/282 (46%), Gaps = 50/282 (17%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S+ VP  FDAR++WPNC +I +VP+ G C +    AA G  SDR CI S G     LS E
Sbjct: 135 SSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEE 194

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            +  CC +C      +C  G   +   +   +G VTGG    R GC+P +          
Sbjct: 195 DIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGCRPYSFDLSC---GV 242

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW------------------- 241
           P  P+   +   K  C  RC N  Y + + +DKH  T  Y                    
Sbjct: 243 PCSPATFFEAEEKRTCMKRCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVP 302

Query: 242 ---------------VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
                          V +  D IKKEIL +GPTT  F + ++F HY SGV++       +
Sbjct: 303 TIIGHFNDKKTEKLNVTEYRDIIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPYPTDGFD 362

Query: 287 N---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
           +   Y H  +LIGWG +++GT YWL +N++G HWGD G  KI
Sbjct: 363 DRIVYWHVVRLIGWGESDDGTHYWLAVNSFGNHWGDNGLFKI 404


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/332 (31%), Positives = 146/332 (43%), Gaps = 23/332 (6%)

Query: 13  LVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG 71
           LV  +    S A++D++NR     W A   +   +    LR+    +      ++  +  
Sbjct: 1   LVAEDAPVLSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILP 58

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
            R+  + E  A +P  FD+ E WPNC TI  + D  AC +    AA  A SDR C    G
Sbjct: 59  KRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-G 117

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
            Q+  +S   + +CC  C       C+ G   R W +    G V+  DY     CQP   
Sbjct: 118 VQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPF 166

Query: 192 SPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
             CSHH  +    P C        KC+  C +PT         +  + T +    ED   
Sbjct: 167 PHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYM 221

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
           +E+   GP    F +Y+DF  Y SGVY H S   L    H+ +L+GWGT NG PYW + N
Sbjct: 222 RELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIAN 279

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +W   WG  G   I RG  EC  E   +AG P
Sbjct: 280 SWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 311


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 91/275 (33%), Positives = 136/275 (49%), Gaps = 14/275 (5%)

Query: 47  SEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDT 106
           +EE +   +  D   + ++ R L   +K  +   +  +P+ FD+R  W NC +I +V D 
Sbjct: 60  AEERMAHLMKTD---YIRNARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRDQ 116

Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRT 165
             C +    +A    SDR C+++KG+    LS   + SCC ++C       C  G     
Sbjct: 117 SRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMC----GDGCEGGYDHLA 172

Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCS-HHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
           W ++ + G VTGG Y  +  C+P    PC  HHG     P   +   P   C   C    
Sbjct: 173 WEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTP--ACKPYC-QFG 229

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
           YG+ + +DK     TY +D++E  I++E++ +GP  A F  Y+DF  YK G+Y H     
Sbjct: 230 YGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGR- 288

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            E   H+ KLIGWG ENGT YW V N+W   WG +
Sbjct: 289 -ERGAHAVKLIGWGVENGTKYWTVANSWHDDWGGK 322


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 94/260 (36%), Positives = 127/260 (48%), Gaps = 9/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR  W NC TI  + D   C A    A V + SDR CI+S G+ +  LS     
Sbjct: 28  IPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAI 87

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        +  C HGS      +    G VTGG Y D++GCQP  +  CS+H  +  L
Sbjct: 88  SC------GFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPESRFL 141

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C N      +C   C +  Y + +  DK      Y V   ++ I+KEIL +GP  A+ 
Sbjct: 142 -DCNNNTFEFPQCTNECQD-GYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASI 199

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
           ++  DF  YKSGVY  T  ++   ++ + ++IGWG E   PYWL  N+W   WG  G VK
Sbjct: 200 SVNTDFLVYKSGVYLPTPRSRNLGWI-TLRIIGWGYEGKIPYWLCANSWNEEWGANGYVK 258

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           I RG      E  + A  PK
Sbjct: 259 IQRGVQAGYIESYVRAPIPK 278


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 98/266 (36%), Positives = 129/266 (48%), Gaps = 30/266 (11%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  WP+C TIG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 80  SMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--MNLSLSVN 137

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  GS    W +  + G VT     Y D  GC        SH 
Sbjct: 138 DLLACCGWMC----GAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGC--------SHP 185

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G  P  P+         KC  +C +    + + + KH +   Y +D +  +I  E+ ++G
Sbjct: 186 GCEPGFPT--------PKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNG 235

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHW 316
           P    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT E+G  YWL+ N W   W
Sbjct: 236 PVEVAFTVYEDFAHYKSGVYKHITGDAMGG--HAVKLIGWGTSEDGEDYWLLANQWNRGW 293

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           GD G  KI RG  EC  E  + AG P
Sbjct: 294 GDDGYFKIKRGTNECGIEGAVVAGLP 319


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 103/330 (31%), Positives = 153/330 (46%), Gaps = 25/330 (7%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R  L   S   ++ IN+   TW AG NF  N+   Y+++               L G + 
Sbjct: 19  RPHLKPLSSDMVNYINKLNTTWKAGHNF-NNVDYSYVQKLC----------GTMLKGPKL 67

Query: 75  TYDPEYSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
               +YS    +P  FD+REQWPNC T+  + D G+C +   F A  A SDR CI S G+
Sbjct: 68  PVLVQYSGDMKLPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGK 127

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
            +  +S+E + +CC  C       C+ G     W+F    G V+GG Y    GC+P TI 
Sbjct: 128 VSVEISSEDLLTCCDSC----GMGCNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIP 183

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
           PC HH +  T P C  +     +C  +C +  Y   +  DKH    +Y V  +E+ I+ E
Sbjct: 184 PCEHHVNG-TRPPCTGEGGDTPQCILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSE 241

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I  +GP    F +Y+DF  YK+GVY+H + + +  +     +  W  E       + ++ 
Sbjct: 242 IYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGH----AIKSWLGEEVCSLLALCHS- 296

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
              WGD  ++    G   C  E  I AG P
Sbjct: 297 DTDWGDMVSLSS-AGSDHCGIESEIVAGIP 325


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 110/324 (33%), Positives = 144/324 (44%), Gaps = 39/324 (12%)

Query: 26  IDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR---KTYDPEYS 81
           I  +N   N  WTAG N        YL  + I   K+        PG R   +T     S
Sbjct: 2   IQTVNNHPNAGWTAGHN-------PYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRS 54

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
             +P  FDAR +W  C TIG + D G C +   F AV    DR CI      N  LS   
Sbjct: 55  EQLPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHH--NMNITLSAND 112

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGS 199
           + +CC     D    C  G     W +  + G VT     Y D+ GC+        H G 
Sbjct: 113 LVACCGFMCGD---GCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK--------HPGC 161

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P  P+     V + KC  +       + + + KH +   Y V+ +   I  E+  +GP 
Sbjct: 162 EPAYPT----PVCEKKCKVQ------NQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPV 211

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGD 318
              F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W   WGD
Sbjct: 212 EVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGD 269

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  KI+RGK EC  E  + AG P
Sbjct: 270 DGYFKIIRGKNECGIEEDVTAGMP 293


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 110/331 (33%), Positives = 147/331 (44%), Gaps = 43/331 (12%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
             D+ + ++N      W A  N        +   + +A  KY     +P P +     P 
Sbjct: 41  LQDSILKKVNGNPKAGWKATMN-------HHFSNYTVAQFKYL-LGVKPTPKEELRGIPV 92

Query: 80  YS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            S      +P+ FDAR  WP C TIG + D G C +   F AV + SDR CI      N 
Sbjct: 93  ISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYG--MNI 150

Query: 136 PLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
            LS   + +CC  +C       C+ G     W +    G VT     Y D  GC      
Sbjct: 151 SLSVNDLLACCGFLC----GSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGC------ 200

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             SH G  P  P+         KC  +C N    + + + KH     Y +D + ++I  E
Sbjct: 201 --SHPGCEPGYPT--------PKCARKCVNKN--QLWKKSKHYGVKPYRIDSDPESIMAE 248

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINT 311
           I  +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT E+G  YWL+ N 
Sbjct: 249 IYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGG--HAVKLIGWGTSEDGEAYWLLANQ 306

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           W   WGD G  KI RG  EC  E  + AG P
Sbjct: 307 WNRGWGDDGYFKIRRGTNECGIEGDVVAGLP 337


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 105/333 (31%), Positives = 145/333 (43%), Gaps = 23/333 (6%)

Query: 12  TLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
            LV  +    S A++D++NR     W A   +   +    LR+    +      ++  + 
Sbjct: 1   ALVAEDAPVLSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASIL 58

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
             R+  + E  A +P  FD+ E WPNC TI  + D  AC +    AA  A SDR C    
Sbjct: 59  PKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG- 117

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G Q+  +S   + +CC  C       C+ G   R W +    G V+  DY     CQP  
Sbjct: 118 GVQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYP 166

Query: 191 ISPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
              CSHH  +    P C        KC   C +PT         +  + T +    ED  
Sbjct: 167 FPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT-----IPVVNYRSWTSYALQGEDDY 221

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
            +E+   GP    F +Y+DF  Y SGVY H S   L    H+ +L+GWGT NG PYW + 
Sbjct: 222 MRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIA 279

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           N+W   WG  G   I RG  EC  E   +AG P
Sbjct: 280 NSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 312


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 143/324 (44%), Gaps = 23/324 (7%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S A++D++NR     W A   +   +    LR+    +      ++  +   R+  + E
Sbjct: 32  LSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 89

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
             A +P  FD+ E WPNC TI  + D  AC +    AA  A SDR C    G Q+  +S 
Sbjct: 90  ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISA 148

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + +CC  C       C+ G   R W +    G V+  DY     CQP     CSHH  
Sbjct: 149 GDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSK 197

Query: 200 APT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P C        KC+  C +PT         +  + T +    ED   +E+   GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYMRELFFRGP 252

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y+DF  Y SGVY H S   L    H+ +L+GWGT NG PYW + N+W   WG 
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIANSWNTEWGM 310

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G   I RG  EC  E   +AG P
Sbjct: 311 DGYFLIRRGSSECGIEDGGSAGIP 334


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  154 bits (389), Expect = 6e-35,   Method: Composition-based stats.
 Identities = 93/267 (34%), Positives = 128/267 (47%), Gaps = 29/267 (10%)

Query: 66  DRPLPGDRKTYDP-EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           DRP    +K Y    ++  +P  FDA +QWP C TIG + +   C +   F A+ + SDR
Sbjct: 55  DRP----KKIYKTLPHNVNLPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDR 110

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
            CI     ++  LS + + +C      + +  C  G  +  + ++ K G VT       +
Sbjct: 111 FCIHK--NESVQLSFQDLITC-----DNQDNGCEGGDPYTAYKYVQKNGVVT-------S 156

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
            CQP TI  C      P    C N  V    C  +C N +    F QD H     Y V  
Sbjct: 157 NCQPYTIPTCP-----PAQQPCMN-FVNTPPCSAKCANSSVN--FQQDLHHLKTVYAVKP 208

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP 304
           N  AI+ EI+ +GP  A F +Y+DF  YKSGVY H S   L    H  K++G+G  NGTP
Sbjct: 209 NVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGG--HCIKIVGFGVSNGTP 266

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYEC 331
           YW+  N+W   WG+ G   I  GK EC
Sbjct: 267 YWICNNSWTTSWGNNGIFWIEAGKNEC 293


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 147/314 (46%), Gaps = 25/314 (7%)

Query: 29  INREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRF 88
           +    N+WTAG         + L  + +       +S R  PG       +    +P++F
Sbjct: 52  VRNRTNSWTAG------APRQPLSSYRVGVNMEELESKRLKPG---ILILKEDIDLPEQF 102

Query: 89  DAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI 148
           DAR++WP C ++  + + G C +    +A  AF+DR CI S         +  + SCC  
Sbjct: 103 DARDKWPQCPSLREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCCHS 162

Query: 149 CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCEN 208
           C       C  G +   W++  ++G  +GG Y  + GC       C     +P     E+
Sbjct: 163 C----GDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCH----SPD----ED 210

Query: 209 QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDD 268
              PK  C  +C +    +   +D+    + Y V  +E  I +EI  +GP  A F +Y D
Sbjct: 211 DDAPK--CSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLD 268

Query: 269 FYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
           F  YKSGVY+H +   LE   H+ K++GWG ENGT YWL  N+WG  WGD G  KI+RG+
Sbjct: 269 FKTYKSGVYRHVT-GPLEGG-HAIKILGWGVENGTKYWLCSNSWGEDWGDHGFFKIVRGE 326

Query: 329 YECAFEYLIAAGKP 342
                E  + AG P
Sbjct: 327 NHLGIETDVHAGLP 340


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 143/324 (44%), Gaps = 23/324 (7%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            S A++D++NR     W A   +   +    LR+    +      ++  +   R+  + E
Sbjct: 32  LSKAFVDRVNRLNRGIWKA--KYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 89

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
             A +P  FD+ E WPNC TI  + D  AC +    AA  A SDR C    G Q+  +S 
Sbjct: 90  ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISA 148

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + +CC  C       C+ G   R W +    G V+  DY     CQP     CSHH  
Sbjct: 149 GDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVS--DY-----CQPYPFPHCSHHSK 197

Query: 200 APT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P C        KC+  C +PT         +  + T +    ED   +E+   GP
Sbjct: 198 SKNGYPPCSQFNFDTPKCNYTCDDPT-----IPVVNYRSWTSYALQGEDDYMRELFFRGP 252

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
               F +Y+DF  Y SGVY H S   L    H+ +L+GWGT NG PYW + N+W   WG 
Sbjct: 253 FEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYWKIANSWNTEWGM 310

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G   I RG  EC  E   +AG P
Sbjct: 311 DGYFLIRRGSSECGIEDGGSAGIP 334


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 7/220 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +PD FD+R QWPNC TI  + D G+C +   F AV + SDR C+ S G+QN  +S E + 
Sbjct: 13  LPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLL 72

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC    ++    C+ G     W +  ++G V+GG YG   GC+P TI PC HH +  + 
Sbjct: 73  SCCG---FECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEHHVNG-SR 128

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           PSC  +     KC  +C +  Y   + +DK      Y V  + ++I +EI   GP    F
Sbjct: 129 PSCSGEGGDTPKCVQKC-DSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAF 187

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
            +Y+DF  YKSGVY+H +   +    H+ K++GWG EN T
Sbjct: 188 TVYEDFLLYKSGVYQHHTGEAVGG--HAIKILGWGIENNT 225


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 109/349 (31%), Positives = 159/349 (45%), Gaps = 33/349 (9%)

Query: 2   IHILVFLLGCTLVRG----ELYKFSDAYIDQINR-EANTWTAGR-NFPANLSEEYLRQFL 55
           I + +FLL  T V      +    +D +++ +N      WTAGR +   +L+     + L
Sbjct: 9   IALFLFLLYATAVHALHVDDAPILTDEFLEHVNSLNGGKWTAGRTSRTKHLTRREASRLL 68

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIF 115
                +   +    P  R+  + E    + D+FDA E WPNC TI  + D  +C +    
Sbjct: 69  ---GTFLGNTSILAP--RQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAV 123

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           AA  A SDR C    G ++  +S   + SCC +C Y     C+ G     W F    G V
Sbjct: 124 AAASAMSDRYCTLG-GVRDLRISAGDLMSCCDVCGY----GCNGGFPEVAWVFYVVHGLV 178

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           +  +Y     CQP     C+HH ++  L  C  + K PK  C++ CT        ++  H
Sbjct: 179 S--EY-----CQPYPFPSCAHHVNSSDLAPCSGDYKTPK--CNSTCTEKKIPLIRYRGNH 229

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
              L+      E+  K+E+L +GP    F +Y DF  Y  GVYKH +   L    H+ +L
Sbjct: 230 SYVLS-----GEEHFKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGG--HAVRL 282

Query: 295 IGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +GWG  NG PYW + N+W   WG  G   I RG  EC  E    AG P+
Sbjct: 283 VGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTPR 331


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 152/344 (44%), Gaps = 31/344 (9%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D IN+     W A  N         ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYN-------GKMQNITFSEAKRLT 67

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    L   R T + +    +P+ FDA E WP+C TI  + D   C A    +   
Sbjct: 68  GARIQKSSGLQPARFT-EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S  ++ SCCK C       C  G     W +  + G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAAHLLSCCKDC----GDGCKGGFPGFAWRYYVEYGITS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+      C        KC+  CT+    +     K+R   T
Sbjct: 179 ----SSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTD----KAIPLIKYRGNAT 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y +   E+  K+E+  +GP  A F +Y D + YKSGVY+H     L     + K++GWG 
Sbjct: 231 YLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGWGK 288

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            NGTPYW + N+W   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 289 LNGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 137/282 (48%), Gaps = 16/282 (5%)

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
           +Q+  P+  D    D +  A +P+ +D R  W NC +   + D   C +    +   A S
Sbjct: 72  NQNLNPVVND----DNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAIS 127

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           DR CI +KG++    S   + +CC   C       C  G     W F    G V+GG Y 
Sbjct: 128 DRICIATKGKKQVYASDTDILTCCGARC----GLGCRGGWPIEAWKFFEYDGVVSGGPYL 183

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR---TTL 238
            +  C P  + PC  HG+     +C     P   C  +C  P + RG ++   R      
Sbjct: 184 GKGCCSPYPLHPCGRHGNDTFYGNCVGM-APTPPCKRKC-QPGF-RGMYRVDKRYGEPGR 240

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           TY +  +E  I+++I   G   A FA+Y+DF HY+SG+YKHT+      Y H+ K+IGWG
Sbjct: 241 TYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGY-HAVKMIGWG 299

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +NGT YWL+ N+W   WG+ G  +++RG   C  E  + AG
Sbjct: 300 KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIEEQVDAG 341


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 111/327 (33%), Positives = 146/327 (44%), Gaps = 45/327 (13%)

Query: 26  IDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP-----GDRKTYDP 78
           I  +N   N  WTAG N + AN + E  +  L           +P P     G R    P
Sbjct: 41  IQTVNNHPNAGWTAGHNPYLANYTIEQFKHML---------GVKPTPPGLLAGVRTKTHP 91

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
             S  +P  FDAR +W  C TIG + D G C +   F AV    DR CI      N  LS
Sbjct: 92  R-SEQLPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHN--MNISLS 148

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
              + +CC     D    C  G     W +  + G VT     Y D+ GC+        H
Sbjct: 149 ANDLVACCGFMCGD---GCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCK--------H 197

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G  P  P+     V + KC  +       + + + KH +   Y V+ +   I  E+  +
Sbjct: 198 PGCEPAYPT----PVCEKKCKVQ------NQVWQEKKHFSIDAYQVNSDPHDIMAEVYKN 247

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPH 315
           GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W   
Sbjct: 248 GPVEVAFTVYEDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSDAGEDYWLLANQWNRG 305

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WGD G  KI+RGK EC  E  + AG P
Sbjct: 306 WGDDGYFKIIRGKNECGIEEDVTAGMP 332


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 146/327 (44%), Gaps = 36/327 (11%)

Query: 21  FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
             +  +++INR  N  W AG N  F  +   ++ R   +         + P+    K  +
Sbjct: 36  LKEPIVEEINRHPNAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGMN 95

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P +FDARE WP C ++  + D G C +   F AV A SDR CI  K   N  L
Sbjct: 96  ------LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHK--VNVTL 147

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
           S   + +CC     D    C  G     W +    G VT     Y D  GCQ        
Sbjct: 148 SENDLVACCGFMCGD---GCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ-------- 196

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         +C  +C +     G    K  +   Y +      I  E+  
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQKWG--NSKRFSATAYRISSKPYDIMAEVYT 246

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP   +F++Y+DF HYKSGVYK+T    +    H+ KL+GWGTE+GT YWLV N+W   
Sbjct: 247 NGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGG--HAVKLVGWGTEDGTDYWLVANSWNTA 304

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  KI RG  EC  E  + AG P
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMP 331


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 146/327 (44%), Gaps = 36/327 (11%)

Query: 21  FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
             +  +++INR  N  W AG N  F  +   ++ R   +         + P+    K  +
Sbjct: 36  LKEPIVEEINRHPNAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGIN 95

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P +FDARE WP C ++  + D G C +   F AV A SDR CI  K   N  L
Sbjct: 96  ------LPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHK--VNVTL 147

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
           S   + +CC     D    C  G     W +    G VT     Y D  GCQ        
Sbjct: 148 SENDLVACCGFMCGD---GCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ-------- 196

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         +C  +C +     G    K  +   Y +      I  E+  
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQKWG--NSKRFSATAYRISSKPYDIMAEVYT 246

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP   +F++Y+DF HYKSGVYK+T    +    H+ KL+GWGTE+GT YWLV N+W   
Sbjct: 247 NGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGG--HAVKLVGWGTEDGTDYWLVANSWNTA 304

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  KI RG  EC  E  + AG P
Sbjct: 305 WGEDGYFKIARGSNECGIEGDVVAGMP 331


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 156/345 (45%), Gaps = 36/345 (10%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK--- 60
           LV L    L+  +    +  ++D+IN+     W A       + +  ++    ++AK   
Sbjct: 15  LVALGASALLAKDAPVLTKTFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLT 67

Query: 61  -YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
             F +    LP  R T + +    +P+ FDA E WP+C TI  + D  AC A    A   
Sbjct: 68  GAFSRKTSTLPPARFT-EEQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAVATAS 126

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C   KG+Q R +S   + +CCK C       C  G     W +    G  +   
Sbjct: 127 AISDRYCTVGKGKQLR-ISAADLMACCKDC----GGGCEGGYPDAAWEYYVSHGIAS--- 178

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               + CQP     C H G+      C   K    +C+  CT+ T        K+R   +
Sbjct: 179 ----SQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNATCTDKT----IPLIKYRGNHS 230

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
           Y V   ED  K+E+  +GP    F ++ DF  YK+GVY+H +     N+L   + +++GW
Sbjct: 231 YEVRGEED-YKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAG----NFLGGKAVRIVGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  NGTPYW V N+W   WG  G   ILRG  EC  E+L  AG P
Sbjct: 286 GKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330


>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
          Length = 426

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 159/356 (44%), Gaps = 67/356 (18%)

Query: 22  SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
           SD Y+ ++ R+ N     TW A  N          F    ++  + +++    K+F+   
Sbjct: 68  SDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 126

Query: 67  RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
             +    +  +   S+++P  FDAR++WPNC +I +VP+ G C +    AA G  SDR C
Sbjct: 127 -AMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 185

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
           I S G     LS E +  CC +C      +C  G   +   +   +G VTGG    R GC
Sbjct: 186 IHSNGTFKSLLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 236

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
           +P +          P  P+   +   K  C  RC N  Y + + +DKH  T  Y      
Sbjct: 237 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSLYPRS 293

Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
                                        V +  + IKKEIL +GPTT  F + ++F HY
Sbjct: 294 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 353

Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRGTVKI 324
            SGV++       ++   Y H  +LIGWG +++G  YWL +N++G HWGD G  KI
Sbjct: 354 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI 409


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/274 (35%), Positives = 130/274 (47%), Gaps = 24/274 (8%)

Query: 81  SATVPDRFDAREQW----PNCGTIGHVP----DTGACAAPHIFAAVGAFSDRRCIKS--- 129
           S  +PD       W     +C  +G VP      G   AP   A  G+ S    + S   
Sbjct: 8   SCLLPDLCGQGWGWRLFPASCAYLGSVPWRVWGLGGLLAPLAAAGGGSTSGLGHLGSTQW 67

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
            G+    LS  ++  C          SC+ G     WNF  ++G V+GG Y    GC+P 
Sbjct: 68  SGELVVLLSEVFITGCLF--------SCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY 119

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
           +I PC HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I
Sbjct: 120 SIPPCEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDI 176

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVI 309
             EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV 
Sbjct: 177 MAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVA 234

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 235 NSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 268


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/345 (30%), Positives = 153/345 (44%), Gaps = 40/345 (11%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY---- 76
            + A  DQ+ R         N  +NL+ + L  +L      F   D   P   K      
Sbjct: 13  LTSAAEDQVARP-------NNVESNLTGDPLVVYLNTIQGLFHLKDSQSPDTEKKLMSAK 65

Query: 77  -----------DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
                      D   + ++P  FD R  W  C ++  + D   C +    +A    SDR 
Sbjct: 66  YKHTVDICGREDRSLALSIPPSFDVRSLWHVC-SLNLIRDQAKCGSCWAVSAAETMSDRI 124

Query: 126 CIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           C++S       +S   + SCC + C Y     C+ G     W      G+ TGG   D+ 
Sbjct: 125 CVQSNCSIKACISDTDILSCCGLYCGY----GCNGGFPIEAWRHFTVAGNCTGGKTIDKY 180

Query: 185 GCQP-STISPCSHHGSAPTLPSCENQ--------KVPKLKCHTRCTNPTYGRGFFQDKHR 235
           GC+P     P   H        C N              +C  RC    Y + +  D++ 
Sbjct: 181 GCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADTPRCKRRCL-LGYPKSYPSDRYY 239

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y V  +  AI++EI+ +GP  A+FA+Y+DF HYKSG+YKHT+  +L  Y H+ K+I
Sbjct: 240 GKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSGIYKHTA-GELRGY-HAVKII 297

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           GWG EN T +WL+ N+W   WG++G  +I+RGK EC  E  + AG
Sbjct: 298 GWGKENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIETDVVAG 342


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/223 (38%), Positives = 112/223 (50%), Gaps = 7/223 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDARE+WP C +I  +PD  +C +    A VGA SDR CI S G     LS   + 
Sbjct: 63  LPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLV 122

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SCC  C       C  GS    W++  + G VTGG   + TGC P     C H GS   L
Sbjct: 123 SCCSYC----GNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQL 178

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C     P   C+  C    Y + + +DK     +Y VD +E  I +EI+ +GP  A F
Sbjct: 179 NPCPGYIYPTPSCYPYC-QAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGF 237

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
            +Y DF  YKSG+Y H S        H+ ++IGWG ENG  YW
Sbjct: 238 IVYTDFAVYKSGIYHHVSGRYAGK--HAIRIIGWGVENGVNYW 278


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/292 (30%), Positives = 139/292 (47%), Gaps = 21/292 (7%)

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
           Q  + D ++ +Q+ +P+  +    D +    +P+ FDAR  W NC ++ H+ D   C + 
Sbjct: 67  QHKLMDLRFVNQNRKPVVENADDEDDD----IPESFDARTHWANCTSLRHIRDQANCGSC 122

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
              +   A SDR CI SKG+    +S+  + SCCK+C Y     C  G     +++  ++
Sbjct: 123 WAVSTASALSDRICIASKGETQLHISSIDIVSCCKLCGY----GCDGGWPIEAFDYFSRQ 178

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPT----LPSCENQKVPKLKCHTRCTNPTYGRG 228
           G+VTG +   + GC+P    P   +G+          C++ K           N T   G
Sbjct: 179 GAVTG-ETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNHTRRTG 237

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
               + R T         D        +GP  A F +Y+DF +YK G+Y H   A     
Sbjct: 238 LTARRLRITEFCQSHSEGDH------GNGPVVAVFTVYEDFSYYKKGIYVHI--AGKARG 289

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            H+ K+IGWG ENG PYWL+ N+W   WG++G  +I+RG  EC  E  + AG
Sbjct: 290 AHAIKIIGWGVENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/344 (32%), Positives = 148/344 (43%), Gaps = 41/344 (11%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSD 66
           L+G       L    +  I  +N   N  WTAG N        YL  + I   K+     
Sbjct: 22  LVGAARGDHSLPIIQEDIIRTVNSHPNAGWTAGHN-------PYLANYTIEQFKHILGVK 74

Query: 67  RPLPG-----DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
              PG       KTY     A +P  FDAR +W  C TIG + D G C A   F AV   
Sbjct: 75  PTPPGLLAGVPTKTYSRSEKAELPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECL 134

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
            DR CI      N  LS   + +CC     D    C  G     W +  + G VT     
Sbjct: 135 QDRFCIHHS--VNVSLSVNDLVACCGFLCGD---GCDGGYPIFAWQYFVENGVVTDECDP 189

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           + D+ GCQ        H G  P  P+     V + KC  +       + + + KH +   
Sbjct: 190 FFDQVGCQ--------HPGCEPAYPT----PVCEKKCKVQ------NQVWEEKKHFSIDA 231

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+ +   I  E+  +GP   +F +Y+DF HYKSGVYK  +   +    H+ KLIGWGT
Sbjct: 232 YQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGG--HAAKLIGWGT 289

Query: 300 EN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            + G  YWL+ N W   WGD G  KI+RG  EC  E  + AG P
Sbjct: 290 SDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMP 333


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/261 (35%), Positives = 128/261 (49%), Gaps = 36/261 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR +W  C     + D G C +   FA+    SDR CI+++G  N  LS+E + 
Sbjct: 43  IPKSFDARMEWSTCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLL 102

Query: 144 SCCKICRYDDNKSCSHGS-VFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
           SC K  R      CS G  +   W ++ K+G V          C+P T       G+   
Sbjct: 103 SCDKAGR-----GCSDGGRLSEAWRYMQKKGVVA-------NRCKPYT------SGATGF 144

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           +P          +C ++CT   +    F   +  T++      E+ IK EI+ +GP  A 
Sbjct: 145 IP----------ECMSKCTGEGHAYQKFYGLYLYTVS-----GENQIKVEIMTNGPVEAA 189

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F +Y D  HYKSGVY HTS  KL    H+ K++GWG E+   YWLV N+WGP WGD+G  
Sbjct: 190 FTVYSDIVHYKSGVYHHTSGGKLGG--HAVKVLGWGVEDEEEYWLVANSWGPDWGDQGFF 247

Query: 323 KILRGKYECAFEYLIAAGKPK 343
           KI RG  EC  E  +  G  +
Sbjct: 248 KIKRGSDECGIESRVLTGTAR 268


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 99/266 (37%), Positives = 128/266 (48%), Gaps = 30/266 (11%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 97  SLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DMNVSLSVN 154

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
            + +CC +        C+ G+ F  W +L   G VT     Y D+ GC        SH G
Sbjct: 155 DILACCGLLC---GAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC--------SHPG 203

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-DKHRTTLTYWVDDNEDAIKKEILAHG 257
             PT       + PK  C  +C N   G   ++  KH +   Y V+ +   I  E+  +G
Sbjct: 204 CEPTY------RTPK--CVKKCVN---GNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNG 252

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHW 316
           P    F +Y+DF HYKSGVYKH +   L    H+ KL+GWGT + G  YWL+ N W  +W
Sbjct: 253 PVEVAFTVYEDFAHYKSGVYKHITGFALGG--HAVKLVGWGTSHEGEDYWLLANQWNTNW 310

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           GD G  KI RG  EC  E  + AG P
Sbjct: 311 GDDGYFKIKRGTNECGIENAVTAGLP 336


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 99/266 (37%), Positives = 128/266 (48%), Gaps = 30/266 (11%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 92  SLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DMNVSLSVN 149

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
            + +CC +        C+ G+ F  W +L   G VT     Y D+ GC        SH G
Sbjct: 150 DILACCGLLC---GAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC--------SHPG 198

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-DKHRTTLTYWVDDNEDAIKKEILAHG 257
             PT       + PK  C  +C N   G   ++  KH +   Y V+ +   I  E+  +G
Sbjct: 199 CEPTY------RTPK--CVKKCVN---GNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNG 247

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHW 316
           P    F +Y+DF HYKSGVYKH +   L    H+ KL+GWGT + G  YWL+ N W  +W
Sbjct: 248 PVEVAFTVYEDFAHYKSGVYKHITGFALGG--HAVKLVGWGTSHEGEDYWLLANQWNTNW 305

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           GD G  KI RG  EC  E  + AG P
Sbjct: 306 GDDGYFKIKRGTNECGIENAVTAGLP 331


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 96/260 (36%), Positives = 130/260 (50%), Gaps = 20/260 (7%)

Query: 83  TVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  FDAR+++  C   IGHV D G C            +DR CIKS G+    LS  Y
Sbjct: 32  NLPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGY 91

Query: 142 VASCCKI---CRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTIS 192
           V SCC     C +   K C+ G +    +FL   G VTG D+       +  GC P    
Sbjct: 92  VTSCCNPAHGCLH--AKGCNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQ 149

Query: 193 PCSHHGSAPT-LPSCEN---QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
            C+H  +  T  P C++   Q VP   C T CTN  Y +   +D HR      V ++  +
Sbjct: 150 KCNHVPTEGTGYPKCKDVVQQPVPP--CRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQS 207

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLV 308
           IK+EI  +GP  + F +Y DF +YKSGVY  T+  K  + LH  K+IGWG ++   YWL 
Sbjct: 208 IKQEIFDNGPVFSAFEMYKDFRYYKSGVYVPTT--KEVDCLHVIKIIGWGADSVREYWLA 265

Query: 309 INTWGPHWGDRGTVKILRGK 328
           +N W   WGD G +K+  GK
Sbjct: 266 MNAWNEEWGDHGLIKMAFGK 285


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 152/326 (46%), Gaps = 34/326 (10%)

Query: 24  AYIDQINRE-ANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
           + +D+IN     TW AG N   A  + E+L++   A     ++ +   P   +      +
Sbjct: 42  SLVDKINAHPGATWKAGLNDRFAKHTVEHLKKMCGAKMTPANEVE---PSIERVTHKHKN 98

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
             +P  FDAR+ W +C TIG + D G C +   F AV + +DR CI     ++  LS   
Sbjct: 99  LDLPTEFDARKHWSHCSTIGDILDQGHCGSCWAFGAVESLTDRFCIHL--NESVSLSEND 156

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGS 199
           + +CC    ++    C  G   R W +  + G VT     Y D+ GC         H G 
Sbjct: 157 LLACCG---FECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGC--------GHPGC 205

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            PT  +         KC  RC +      +   KH     Y V    + +  E+  +GP 
Sbjct: 206 YPTYDT--------PKCFKRCVDDEL---WVSSKHLGVSAYEVSMEPEELMAELFTNGPI 254

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGD 318
              F +++DF HYK+GVYKH     +    H+ KL+GWGT ++G  YW ++N+W  +WG+
Sbjct: 255 EVAFDVFEDFAHYKTGVYKHLYGGYIGG--HAVKLVGWGTTDDGVDYWSMVNSWNTNWGE 312

Query: 319 RGTVKILRGKYECAFEYLIAAGKPKN 344
            GT +ILRGK EC  E    AG P N
Sbjct: 313 DGTFRILRGKDECGIESNAVAGLPSN 338


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 152/348 (43%), Gaps = 50/348 (14%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
           L G       L       I  +N+  N  WTAG N + AN + E  +  L          
Sbjct: 25  LAGTAKAEHSLGIIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHIL---------G 75

Query: 66  DRPLP-----GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
            +P P     G      PE    +P  FDAR QW +C TIG++ D G C A   FAAV A
Sbjct: 76  VKPTPPGLLAGVPIKIHPEMD--LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEA 133

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG-- 177
             DR CI      +  LS   + +CC  +C       C+ G     W +  + G VT   
Sbjct: 134 LQDRFCIHL--NMSVSLSVNDLLACCGFLC----GSGCNGGYPISAWRYFRRSGVVTEEC 187

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
             Y D+TGCQ        H G  P  P+         KC  +C      + + ++KH + 
Sbjct: 188 DPYFDQTGCQ--------HPGCEPAYPT--------PKCQRKCK--VENQAWKENKHFSV 229

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYD--DFYHYKSGVYKHTSNAKLENYLHSGKLI 295
             Y V  N   I  E+  +GP    F      DF HYKSGVYKH +   +    H+ KLI
Sbjct: 230 NAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG--HAVKLI 287

Query: 296 GWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWGT + G  YWL+ N W   WGD G  KI+RG+ EC  E  + AG P
Sbjct: 288 GWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMP 335


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 106/338 (31%), Positives = 144/338 (42%), Gaps = 83/338 (24%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R   +  SD  ++ +N+   TW AG NF  N+   YL++       +    
Sbjct: 11  LLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLC---GTFLGG- 65

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P P  R  +  +    +P  FDAREQWP C TI  + D G+C +   F AV A SDR 
Sbjct: 66  --PKPPQRVMFTEDLK--LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDR- 120

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG 185
                                 IC                   +H  GS           
Sbjct: 121 ----------------------IC-------------------IHVNGS----------- 128

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
            +P    PC+  G  P             KC   C  P Y   + QDKH    +Y V ++
Sbjct: 129 -RP----PCTGEGDTP-------------KCSKIC-EPGYSPTYKQDKHYGYNSYSVSNS 169

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPY 305
           E  I  EI  +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPY
Sbjct: 170 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPY 227

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WLV N+W   WGD G  KILRG+  C  E  + AG P+
Sbjct: 228 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 265


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 97/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  W  C +IG + D G C +   F AV + SDR CIK     N  LS
Sbjct: 101 DISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNVSLS 158

Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
              + +CC  +C     + C+ G     W +    G VT     Y D TGC        S
Sbjct: 159 VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 206

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         KC  +C +    + + + KH     Y V  + D I  E+  
Sbjct: 207 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 256

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT ++G  YWL+ N W  
Sbjct: 257 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 314

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WGD G  KI RG  EC  E+ + AG P +
Sbjct: 315 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 344


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 111/208 (53%), Gaps = 10/208 (4%)

Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
           +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC 
Sbjct: 5   VSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 60

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI  
Sbjct: 61  HHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 117

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   
Sbjct: 118 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTD 175

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD G  KILRG+  C  E  + AG P+
Sbjct: 176 WGDNGFFKILRGQDHCGIESEVVAGIPR 203


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/327 (33%), Positives = 145/327 (44%), Gaps = 36/327 (11%)

Query: 21  FSDAYIDQINREANT-WTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
             +  +++INR     W AG N  F  +   ++ R   +         + P+    +TY 
Sbjct: 36  LKEPIVEEINRHPKAGWKAGMNSRFSNHTVGQFKRLLGVLPTPRNLLENVPV----RTYP 91

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P +FDAR+ WP C ++  + D G C +   F AV A SDR CI  K   N  L
Sbjct: 92  K--GLNLPKQFDARKAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYK--VNVTL 147

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
           S   + +CC   R  D   C  G     W +    G VT     Y D  GCQ        
Sbjct: 148 SENDLVACCGF-RCGDG--CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ-------- 196

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         +C  +C +     G    K  +   Y +      I  E+  
Sbjct: 197 HPGCEPLYPT--------PQCVKQCKDENQNWG--NSKRFSATAYRITSKPYDIMAEVYT 246

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
            GP    F +Y+DF HYKSGVYK+ +   L    H+ KLIGWGTENGT YWLV N+W   
Sbjct: 247 KGPVEVDFLVYEDFAHYKSGVYKYITGDFLGG--HAVKLIGWGTENGTDYWLVANSWNTA 304

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  KI RG  EC+ E  + AG P
Sbjct: 305 WGEDGYFKIARGSNECSIEEDVVAGMP 331


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 99/267 (37%), Positives = 127/267 (47%), Gaps = 28/267 (10%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  WP C +IG++ D G C +   F AV + SDR CI+     N  LS  
Sbjct: 103 SLKLPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFG--MNISLSVN 160

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
            + +CC   R  D   C  G     W +    G VT     Y D TGC        SH G
Sbjct: 161 DLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC--------SHPG 209

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P  P+         KC  +C +    + + Q KH +  TY V  N   I  E+  +GP
Sbjct: 210 CEPAYPT--------PKCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGP 259

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
              +F +Y+DF HYKSGVYKH + + +    H+ KLIGWG T+ G  YWL+ N W   WG
Sbjct: 260 VEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTTDEGEDYWLLANQWNRSWG 317

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPKN 344
           D G   I RG  EC  E    AG P +
Sbjct: 318 DDGYFMIRRGTNECGIEDEPVAGLPSS 344


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 111/208 (53%), Gaps = 10/208 (4%)

Query: 137 LSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
           +S E + +CC  +C       C+ G     WNF  ++G V+GG Y    GC+P +I PC 
Sbjct: 3   VSAEDLLTCCGSMC----GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 58

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI  
Sbjct: 59  HHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 115

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   
Sbjct: 116 NGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTD 173

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD G  KILRG+  C  E  + AG P+
Sbjct: 174 WGDNGFFKILRGQDHCGIESEVVAGIPR 201


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 96/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  W  C ++G + D G C +   F AV + SDR CIK     N  LS
Sbjct: 99  DISLKLPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNISLS 156

Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
              + +CC  +C     + C+ G     W +    G VT     Y D TGC        S
Sbjct: 157 VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 204

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         KC  +C +    + + + KH     Y V  + D I  E+  
Sbjct: 205 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT ++G  YWL+ N W  
Sbjct: 255 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 312

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WGD G  KI RG  EC  E+ + AG P +
Sbjct: 313 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 342


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 102/325 (31%), Positives = 155/325 (47%), Gaps = 27/325 (8%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +D +++ +NR     WTAGR    + ++   R+        F ++   LP  R+  + E
Sbjct: 32  LTDEFLELVNRLNGGKWTAGRT---SRTKYLTRRGASRLLGTFLRNTSILP-PRQFSEEE 87

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               + DRFDA E WP C TI  + D  +C +    AA  A SDR C    G ++  +S 
Sbjct: 88  LRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISA 146

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + SCC +C Y     C+ G     W +    G V+  +Y     CQP     C+HH +
Sbjct: 147 GDLMSCCDVCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVN 195

Query: 200 APTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  L  C  +   P   C++ CT+    +     K+R   +Y +   E++ K+E+L +GP
Sbjct: 196 SSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYRGNTSY-ILSGEESFKRELLLNGP 248

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
              +F++Y DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W   WG 
Sbjct: 249 FEVSFSVYADFVAYTGGVYKHVTGVFLGG--HAVRIVGWGELNGEPYWKIANSWNHEWGM 306

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G   I RG  EC  E    AG P+
Sbjct: 307 NGYFLIARGVDECGIEGSGVAGIPR 331


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 102/325 (31%), Positives = 155/325 (47%), Gaps = 27/325 (8%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +D +++ +NR     WTAGR    + ++   R+        F ++   LP  R+  + E
Sbjct: 32  LTDEFLELVNRLNGGKWTAGRT---SRTKHLTRRGASRLLGTFLRNTSILP-PRQFSEEE 87

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               + DRFDA E WP C TI  + D  +C +    AA  A SDR C    G ++  +S 
Sbjct: 88  LREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISA 146

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + SCC +C Y     C+ G     W +    G V+  +Y     CQP     C+HH +
Sbjct: 147 GDLMSCCDVCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVN 195

Query: 200 APTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  L  C  +   P   C++ CT+    +     K+R   +Y +   E++ K+E+L +GP
Sbjct: 196 SSDLSPCSGEYDTPT--CNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGP 248

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
              +F++Y DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W   WG 
Sbjct: 249 FEVSFSVYADFLAYTGGVYKHVAGTFLGG--HAVRIVGWGELNGEPYWKIANSWNREWGM 306

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G   I RG  EC  E    AG P+
Sbjct: 307 NGYFLIARGVDECGIEGSGVAGTPR 331


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 95/310 (30%), Positives = 145/310 (46%), Gaps = 56/310 (18%)

Query: 72  DRKTYDPEYSATVP--DRFDAREQWPNCGTIGHVPDTGACAAPHI--------------- 114
           +++ Y    S ++P  + FDARE+WP C  IG + D   C+   +               
Sbjct: 46  NQQNYTDAKSESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIIL 105

Query: 115 -------------------FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
                               ++    +DR CI  KG+Q   LS E + SCC  C Y    
Sbjct: 106 LFDFSSSSSHWLFISTFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGY---- 161

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C+ G     + + ++ G  TGG YG ++GC+P +I+P +   +A   P C+      LK
Sbjct: 162 GCNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAPPTSSSTAAQTPLCQ------LK 215

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK---KEILAHGPTTATFALYDDFYHY 272
           C +      Y R   +D++     Y +  +   +K   +EI+ HGP  A   +++ F +Y
Sbjct: 216 CIS-----DYKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYY 270

Query: 273 KSGVYK-HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
           KSGVY  +  N      LH+ KLIGWG +   PYWLV+N+W   +G++G  KI RG  EC
Sbjct: 271 KSGVYSANKRNDDPSLGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRRGTNEC 330

Query: 332 AFEYL-IAAG 340
             E L + AG
Sbjct: 331 GIENLHVTAG 340


>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
 gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
          Length = 410

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 156/356 (43%), Gaps = 67/356 (18%)

Query: 22  SDAYIDQINREAN-----TWTAGRN----------FPANLSEEYLRQFLIADAKYFDQSD 66
           +D Y+ ++ R+ N     TW A  N          F    ++  + +++    K+F+   
Sbjct: 52  NDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFFESD- 110

Query: 67  RPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRC 126
             +    +  +   S+ +P  FDAR++WPNC +I +VP+ G C +    AA G  SDR C
Sbjct: 111 -AMKRHLEELENYKSSDLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRAC 169

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
           I S G     LS E +  CC +C      +C  G   +   +   +G VTGG    R GC
Sbjct: 170 IHSNGTFKALLSEEDIIGCCSVC-----GNCYGGDPLKALTYWVNQGLVTGG----RDGC 220

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW----- 241
           +P +          P  P+   +   K  C  RC N  Y + + +DKH  T  Y      
Sbjct: 221 RPYSFDLSC---GVPCSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSMYPRS 277

Query: 242 -----------------------------VDDNEDAIKKEILAHGPTTATFALYDDFYHY 272
                                        V +  + IKKEIL +GPTT  F + ++F HY
Sbjct: 278 MTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHY 337

Query: 273 KSGVYKHTSNAKLEN---YLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGTVKI 324
            SGV++       ++   Y H  +LIGWG   +G  YWL IN++G HWGD G  KI
Sbjct: 338 SSGVFRPFPLDGFDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI 393


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 97/270 (35%), Positives = 129/270 (47%), Gaps = 30/270 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  W  C +IG + D G C +   F AV + SDR CIK     N  LS
Sbjct: 32  DISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKY--NMNVSLS 89

Query: 139 TEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
              + +CC  +C     + C+ G     W +    G VT     Y D TGC        S
Sbjct: 90  VNDLLACCGFLC----GQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC--------S 137

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         KC  +C +    + + + KH     Y V  + D I  E+  
Sbjct: 138 HPGCEPAYPT--------PKCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEVYK 187

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT ++G  YWL+ N W  
Sbjct: 188 NGPVEVAFTVYEDFAHYKSGVYKHITGTNIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 245

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WGD G  KI RG  EC  E+ + AG P +
Sbjct: 246 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 275


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 101/327 (30%), Positives = 146/327 (44%), Gaps = 31/327 (9%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD----QSDRPLPGDRKT 75
            +  ++D+IN+     W A  N         ++    ++AK       Q  R LP  R T
Sbjct: 30  LTQTFVDRINQLNGGMWKAVYN-------GKMQNITFSEAKRLTGARIQKSRTLPPARFT 82

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            + +    +P+ FDA E WP+C TI  + D   C A    +   A SDR C    G+Q R
Sbjct: 83  -EEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGGGKQLR 141

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            +S   + +CCK C       C  G     W +  + G  +       + CQP     C 
Sbjct: 142 -ISAADLMACCKQC----GDGCKGGFPGFAWLYYVEYGITS-------SQCQPYPFPHCE 189

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G+      C   K    KC+  CT+    +     K+R   TY +   E+  K+E+  
Sbjct: 190 HRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYF 245

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A F +Y D + YKSGVY++     L     + +++GWG  NGTPYW V N+W   
Sbjct: 246 NGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVRIVGWGKLNGTPYWKVANSWDTD 303

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG  G + ILRG  EC  E+L   G P
Sbjct: 304 WGMNGYMLILRGNNECNIEHLGFTGFP 330


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 78/178 (43%), Positives = 99/178 (55%), Gaps = 5/178 (2%)

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
            WNF  K+G V+GG Y    GC+P +I PC HH +    P       PK  C+  C  P 
Sbjct: 31  AWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--CNKTC-EPG 87

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
           Y   + +DKH    +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   
Sbjct: 88  YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEI 147

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +    H+ +++GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 148 MGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 203


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 141/320 (44%), Gaps = 40/320 (12%)

Query: 30  NREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP----EYSATVP 85
           N  +  WTA RN        Y   + IA  K+     +P P +  +  P      S  +P
Sbjct: 43  NHPSAGWTASRN-------PYFSNYTIAQFKHI-LGVKPAPQNALSNVPVKTYSRSLELP 94

Query: 86  DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
             FDAR  W  C TIG++ D G C +   F AV    DR CI      +  LS   + +C
Sbjct: 95  KEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHL--NMSILLSVNDLLAC 152

Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTL 203
           C     D    C  G     W +  + G VT     Y D  GC+        H G  P  
Sbjct: 153 CGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK--------HPGCEPAY 201

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P+         KC  +C      + + + KH +   Y ++ +   I  E+  +GP    F
Sbjct: 202 PT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAF 251

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTV 322
            +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W   WGD G  
Sbjct: 252 TVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYF 309

Query: 323 KILRGKYECAFEYLIAAGKP 342
           KI+RGK EC  E  + AG P
Sbjct: 310 KIIRGKNECGIEEGVVAGMP 329


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 90/254 (35%), Positives = 120/254 (47%), Gaps = 24/254 (9%)

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P  FDAR+++ +C G IGHV D  AC      ++ G  +DR CIKS G     LS  Y 
Sbjct: 37  LPSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIKSGGTFRDILSVGYF 96

Query: 143 ASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTG------GDYGDRTGCQPSTISPCS 195
            SCC         K C  G++    NFL   G VTG      G      GC P     C 
Sbjct: 97  TSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCK 156

Query: 196 HHG-SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
           H G S+P              C T+CTN  Y     QD HR      +      IK+EI 
Sbjct: 157 HAGYSSPA-------------CQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEIF 203

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP     ++Y+D   YK+GVY H + +     +H+ K+IGWG E+G  YWL +N+W  
Sbjct: 204 TNGPVIGMLSIYEDIRVYKAGVYVHQTGS--FQGIHTLKIIGWGVESGQDYWLAVNSWNE 261

Query: 315 HWGDRGTVKILRGK 328
            WGD G +K+  G+
Sbjct: 262 EWGDHGMIKLAVGR 275


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 100/273 (36%), Positives = 131/273 (47%), Gaps = 30/273 (10%)

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           ++DP  S  +P  FDAR  WP C +IG++ D G C +   F AV + SDR CI+     N
Sbjct: 96  SHDP--SLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFG--MN 151

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
             LS   + +CC   R  D   C  G     W +    G VT     Y D TGC      
Sbjct: 152 ISLSVNDLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             SH G  P  P+         KC  +C +    + + + KH +  TY V  N   I  E
Sbjct: 203 --SHPGCEPAYPT--------PKCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAE 250

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
           +  +GP   +F +Y+DF HYKSGVYKH + + +    H+ KLIGWGT + G  YWL+ N 
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSSEGEDYWLMANQ 308

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           W   WGD G   I RG  EC  E    AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/327 (30%), Positives = 148/327 (45%), Gaps = 30/327 (9%)

Query: 21  FSDAYIDQINREAN-TWTAGRN---FPANLSEEYLRQFL-IADAKYFDQSDRPLPGDRKT 75
            S+ ++ +IN +A   WTA  +     +  S+E LR+ + + +      S R    +   
Sbjct: 36  LSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGVLNMSTAALSPRIFSAE--- 92

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
              E +  +P  FD+ ++WP C TI  + D   C +    AAV A SDR C  + G  + 
Sbjct: 93  ---ELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVA-GITDL 148

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            +ST ++ SCC +C       C  G     W +    G  +         CQP    PC 
Sbjct: 149 RVSTGHLLSCCFVC----GMGCQGGIPTMAWLWWVWVGLTS-------EVCQPYPFPPCG 197

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH      P+C +       C++ C +          KH+   +Y +   E     E++ 
Sbjct: 198 HHTDGGKYPACPSTIYDTPTCNSTCADSHTAL----TKHKGEKSYSLR-GEREYMIELMT 252

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP    F +Y DF  YKSGVY HT+  +L    H+ KL+GWG +NGTPYW + N+W   
Sbjct: 253 YGPFEVAFDVYADFVSYKSGVYSHTTGERLGG--HAVKLVGWGVQNGTPYWKIANSWNSD 310

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WGD G   I RG  EC  E    AG P
Sbjct: 311 WGDNGYFLIRRGTDECGIESTGVAGLP 337


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/328 (32%), Positives = 150/328 (45%), Gaps = 44/328 (13%)

Query: 24  AYIDQINREANT-WTAGRNFPANLSEEYLRQF-----LIADAKYFDQSDRPLPGDRKTYD 77
           + +D +N + N  W AG  F        +R F     ++  +    Q  RPL    +T D
Sbjct: 41  SIVDIVNNDPNAGWKAG--FNERFINHTVRDFKRLCGVLPKSSEEVQPLRPLRSHPRTLD 98

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
                 +P  FDARE WP C +I ++ D G C +   F AV A +DR CI +   +N  L
Sbjct: 99  ------LPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILN--NENVSL 150

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
           S   + +CC  C +     C  G  +  W +  + G VT     Y D  GC+        
Sbjct: 151 SENDLVACCSSCGF----GCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKH------- 199

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
                   P CE +    + C  +C +    R     KH T  TY V+ +   I+ EI  
Sbjct: 200 --------PGCEPEYDTPV-CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYK 247

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGP 314
           +GP   ++ +Y+DF HYKSGVYKH     L    H+ K IGWG T++G  YW+V N+W  
Sbjct: 248 NGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGG--HAVKFIGWGTTDDGKDYWIVANSWNR 305

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG+ G  +I RG  EC  E    AG P
Sbjct: 306 SWGEDGFFQISRGSNECGIESEPVAGIP 333


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 163/348 (46%), Gaps = 31/348 (8%)

Query: 2   IHILVFLL----GCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLI 56
           I + +FLL    G +    +    +D +++ +NR     WTAGR    + ++   R+   
Sbjct: 9   IALFLFLLYATAGHSFHAEDAPILTDEFLEHVNRLNGGKWTAGRT---SRTKHLTRRGAS 65

Query: 57  ADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFA 116
                F ++   LP  R+  + E    + DRFDA E WP C T+  + D  +C +    A
Sbjct: 66  RMLGTFLRNTSILP-PRQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVA 124

Query: 117 AVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT 176
           A  A SDR C    G ++  +S   + SCC +C +     C+ G     W +    G V+
Sbjct: 125 AASAISDRYCTLG-GVRDLRISAGDLMSCCDVCGF----GCNGGYPEVAWEYYAVHGIVS 179

Query: 177 GGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDKHR 235
             +Y     CQP     C+HH ++  L  C  +   P   C++ CT+    +     K+R
Sbjct: 180 --EY-----CQPYPFPSCAHHVNSSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYR 226

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V   E+  K+E++ +GP   +F++Y DF  Y  GVYKH +   L    H+ +++
Sbjct: 227 GNTSY-VLSGEEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGG--HAVRIV 283

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GWG  NG PYW + N+W   WG  G   I RG  EC  E    AG P+
Sbjct: 284 GWGELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTPR 331


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/269 (36%), Positives = 132/269 (49%), Gaps = 32/269 (11%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           D +    +P  FDAR++W +   I  + D G CA+   F+ VG  SDR  I+S G+    
Sbjct: 172 DIKMKKKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMT 229

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS +++ SC         + CS G + R W F+ KRG V+   Y   +G Q      C  
Sbjct: 230 LSPQHLLSC----NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKK-GVCMM 284

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G  P+             C T       GR    + H +T  Y +  NE  I+ EI+ +
Sbjct: 285 PGKLPS------------DCPT-------GRERNNELHHSTPPYRIAANEREIQVEIMEN 325

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK--LENY----LHSGKLIGWGTENGTPYWLVIN 310
           GP  A+F + +DF+ Y SGVY+HT  A    E Y     HS KL+GWG ENG  YWL  N
Sbjct: 326 GPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENGIKYWLGAN 385

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           +WG  WG+ G  KILRG+ EC  E  + A
Sbjct: 386 SWGTKWGEDGYFKILRGENECNIESYVVA 414


>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 422

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/333 (33%), Positives = 152/333 (45%), Gaps = 35/333 (10%)

Query: 24  AYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE-- 79
           + +D+IN    +WTA ++ P    +S + L      D  +    D    G+ +   P   
Sbjct: 83  SMVDKINSMQQSWTASKDQPPFKGMSIKDLPAGCSNDTMFSSTLDEG--GENRLLGPTNP 140

Query: 80  YSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
              T+P  FDAR+++ +C   IGHV + G C      AAVG F+DR CIKS G+    LS
Sbjct: 141 VLTTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILS 200

Query: 139 TEYVASCCKICR-YDDNKSCSHGSVFRTWNFLHKRGSVTG-----------GDY------ 180
             Y+ SCC        +  C  GSV    NF+   G VTG           G+Y      
Sbjct: 201 LGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGRNFRFESFKLSGEYKPPEEL 260

Query: 181 GDRTGCQPSTISPCSH-HGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
           G+  GC P     C+H  G     P C + + +P   C T C N  YG    +D HR   
Sbjct: 261 GNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQKDTHRAKS 318

Query: 239 TYWVDDNEDAIKKEILAHGP---TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +    + IK+EI  +GP     A   LY+DF   +  VY H +   L    H+ KLI
Sbjct: 319 WGRLPIGPEKIKQEIFDNGPLRXXAAMMTLYEDF-DLQVCVYVHKTGQMLA--AHTLKLI 375

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
           GWG E+G  YWL +N W   WGD G +K+  GK
Sbjct: 376 GWGVESGQEYWLAVNAWNEEWGDHGMIKLAVGK 408


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/331 (32%), Positives = 149/331 (45%), Gaps = 44/331 (13%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQF-----LIADAKYFDQSDRPLPGDRK 74
              + +D +N + N  W AG  F        +R F     ++  +    Q  RPL    +
Sbjct: 27  LQKSIVDIVNNDPNAGWKAG--FNERFINHTVRDFKRLCGVLPKSSEEVQPLRPLRSHPR 84

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           T D      +P  FDARE WP C +I  + D G C +   F AV A +DR CI +   +N
Sbjct: 85  TLD------LPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILN--NEN 136

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
             LS   + +CC  C +     C  G  +  W +  + G VT     Y D  GC+     
Sbjct: 137 VSLSENDLVACCSSCGF----GCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKH---- 188

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
                      P CE +    + C  +C +    R     KH T  TY V+ +   I+ E
Sbjct: 189 -----------PGCEPEYDTPV-CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAE 233

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINT 311
           I  +GP   ++ +Y+DF HYKSGVYKH     L    H+ K IGWG T++G  YW+V N+
Sbjct: 234 IYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGG--HAVKFIGWGTTDDGKDYWIVANS 291

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           W   WG+ G  +I RG  EC  E    AG P
Sbjct: 292 WNRSWGEDGFFQISRGSNECGIESEPVAGIP 322


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 141/320 (44%), Gaps = 40/320 (12%)

Query: 30  NREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP----EYSATVP 85
           N  +  WTA RN        Y   + IA  K+     +P P +  +  P      S  +P
Sbjct: 43  NHPSAGWTASRN-------PYFSNYTIAQFKHI-LGVKPAPQNALSNVPVKTYSRSLELP 94

Query: 86  DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
             FDAR  W  C TIG++ + G C +   F AV    DR CI      +  LS   + +C
Sbjct: 95  KEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHL--NMSILLSVNDLLAC 152

Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTL 203
           C     D    C  G     W +  + G VT     Y D  GC+        H G  P  
Sbjct: 153 CGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK--------HPGCEPAY 201

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P+         KC  +C      + + + KH +   Y ++ +   I  E+  +GP    F
Sbjct: 202 PT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAF 251

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTV 322
            +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W   WGD G  
Sbjct: 252 TVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYF 309

Query: 323 KILRGKYECAFEYLIAAGKP 342
           KI+RGK EC  E  + AG P
Sbjct: 310 KIIRGKNECGIEEGVVAGMP 329


>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 280

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 81/227 (35%), Positives = 113/227 (49%), Gaps = 9/227 (3%)

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           Y+  +P  FDAR++WPNC +IGH+ + G C + +  +   A +DR CI S   +N  +S 
Sbjct: 59  YTNGLPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSA 118

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC +C Y     C  GS F +W+F  + G V+GGDY    GCQP  I PC     
Sbjct: 119 QQIISCCYLCGY----GCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINE 174

Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
                SC    +     C  +C NP Y   F  D ++     +         KEI  +GP
Sbjct: 175 KSPRHSCTTYNREETPACEIKCNNPNYYSSFKTDIYKGK---YYQVYPFMAMKEIFDNGP 231

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG-KLIGWGTENGTP 304
            T  F +Y D   YKSGVY++      + +   G K+IGWG ENG P
Sbjct: 232 ITTQFYMYRDLIDYKSGVYQYDEGFYGDFFTVQGXKIIGWGEENGDP 278


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/187 (41%), Positives = 102/187 (54%), Gaps = 5/187 (2%)

Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
           C+ G     WNF  ++G V+GG Y    GC+P +I PC HH +    P       PK  C
Sbjct: 6   CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPK--C 63

Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
              C  P Y   + QDKH    +Y V ++E  I  EI  +GP    F++Y DF  YKSGV
Sbjct: 64  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKSGV 122

Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
           Y+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  
Sbjct: 123 YQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESE 180

Query: 337 IAAGKPK 343
           + AG P+
Sbjct: 181 VVAGIPR 187


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/273 (36%), Positives = 131/273 (47%), Gaps = 30/273 (10%)

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           ++DP  S  +P  FDAR  WP C +IG + D G C +   F AV + SDR CI+     N
Sbjct: 96  SHDP--SLKLPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MN 151

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
             LS   + +CC   R  D   C  G     W +    G VT     Y D TGC      
Sbjct: 152 ISLSVNDLLACCGF-RCGD--GCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             SH G  P  P+         +C  +C +    + + + KH +  TY V+ +   I  E
Sbjct: 203 --SHPGCEPAYPT--------PRCLRKCVSDN--KLWSESKHYSVSTYTVNSSPQDIMAE 250

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
           +  +GP   +F +Y+DF HYKSGVYKH + + +    H+ KLIGWGT N G  YWL+ N 
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSNEGEDYWLMANQ 308

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           W   WGD G   I RG  EC  E    AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 95/265 (35%), Positives = 131/265 (49%), Gaps = 35/265 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR  WP C +I  + D G C +   F AV + +DR CI      N  LS   + 
Sbjct: 96  LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYG--TNVTLSVNDLL 153

Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
           +CC  +C     + C  G     W +  + G VT     Y D+TGC        SH G  
Sbjct: 154 ACCGFLC----GEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC--------SHPGCE 201

Query: 201 PT--LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           P    P+CE + V K               + + KH +   Y V+ ++ +I  E+  +GP
Sbjct: 202 PAYPTPACEKKCVKK------------NLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGP 249

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWG 317
              +F +Y+DF HYKSGVYKH + +++    H+ KLIGWGT E+G  YWL+ N W   WG
Sbjct: 250 AEVSFTVYEDFAHYKSGVYKHVTGSEMGG--HAVKLIGWGTSEDGEDYWLLANQWNRSWG 307

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           D G  KI+RG  EC  E  + AG P
Sbjct: 308 DDGYFKIIRGTNECGIED-VTAGMP 331


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 107/207 (51%), Gaps = 10/207 (4%)

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S   + +CC+ C       C+ G     W      G VTGG Y  + GCQP  I+ C H
Sbjct: 12  VSANELLACCESC----GDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDH 67

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
           H      P   + K P+  C  +C    Y   F  DKH    +Y V    D I +E++  
Sbjct: 68  HVVGKLKPCKGDGKTPR--CEKKC-EAGYNVTFKDDKHYGQRSYSVSSVND-IMEELVTR 123

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP  A F +Y DF  Y SGVY+HT+ + L    H+ K++G+G ENG  YWLV N+W P W
Sbjct: 124 GPVEAAFTVYSDFLQYHSGVYRHTTGSALGG--HAVKILGYGVENGDKYWLVANSWNPDW 181

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKPK 343
           GD+G  KILRG  EC  E  I AG+PK
Sbjct: 182 GDQGFFKILRGVDECGIEGQIVAGEPK 208


>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
          Length = 244

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 115/232 (49%), Gaps = 7/232 (3%)

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           DRKT D  Y   VP  FDAR  + +C   IG V D G CA+    A    F+DR CI + 
Sbjct: 13  DRKTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATG 72

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G+    LS + + SC    ++     C  GS F+ W F    G VTGG++    GCQP  
Sbjct: 73  GKFTDNLSAQNLMSCGDSEKF---VGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYK 129

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-DNEDAI 249
             PC H+G +        ++     C  +C N  Y   +  D H+T++ Y     N   I
Sbjct: 130 NRPCDHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQI 189

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
           ++EI+ +GP TA   +Y++F  YK G+YK T    L  Y H  KLIGWG ++
Sbjct: 190 QQEIMTYGPVTALMYVYENFMGYKEGIYKSTV-GDLVGYHHV-KLIGWGVDD 239


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 75/233 (32%), Positives = 124/233 (53%), Gaps = 8/233 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+RE W NC +I ++ D     +    +A    SDR C++SKG+  + +S   + 
Sbjct: 95  IPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC     +  + C+ G   + W ++ + G VTGG Y ++  C+P  + PC   G   + 
Sbjct: 155 ACCG---RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSC 211

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P   + + P   C   C    YG+ + +DK      Y +D++E AI++E++ +GP  A F
Sbjct: 212 PRDHSFRTPA--CKKYC-QYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF 268

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
             Y+DF  Y+ G+Y H+     +   H+ K++GWG ENGT YW V N+W   W
Sbjct: 269 TTYEDFSFYRKGIYVHSYGR--QRGAHAVKVVGWGVENGTKYWNVANSWSTDW 319


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 99/273 (36%), Positives = 130/273 (47%), Gaps = 30/273 (10%)

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           ++DP  S  +P  FDAR  WP C +IG++   G C +   F AV + SDR CI+     N
Sbjct: 96  SHDP--SLKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFG--MN 151

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTIS 192
             LS   + +CC   R  D   C  G     W +    G VT     Y D TGC      
Sbjct: 152 ISLSVNDLLACCGF-RCGDG--CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC------ 202

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             SH G  P  P+         KC  +C +    + + + KH +  TY V  N   I  E
Sbjct: 203 --SHPGCEPAYPT--------PKCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAE 250

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINT 311
           +  +GP   +F +Y+DF HYKSGVYKH + + +    H+ KLIGWGT + G  YWL+ N 
Sbjct: 251 VYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTSSEGEDYWLMANQ 308

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           W   WGD G   I RG  EC  E    AG P +
Sbjct: 309 WNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSS 341


>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
          Length = 274

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 137/284 (48%), Gaps = 13/284 (4%)

Query: 2   IHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADAK 60
           I +L  +L       + Y   ++YI+ IN  A TWTAG NF P+   +++++       +
Sbjct: 1   IILLSVVLFSVYQTEQAYFLEESYIEMINDVATTWTAGVNFDPSTPEKDFIKMLGSKGVE 60

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               +   +       +   +  +P  FDAR +W +C TIG V D G C +    A   A
Sbjct: 61  AAKNASAHMFKTHDVANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSA 120

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           F+DR C+ + G  N  LS E +  CC  C +     C+ G   + W +    G VTGG+Y
Sbjct: 121 FADRLCVATNGDFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKYFSSHGIVTGGNY 176

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTL 238
               GC+P  + PC       +  SC  + + K   + RCT   YG     + + HR T 
Sbjct: 177 KSGEGCEPYRVPPCPQDEEGKS--SCAGKPIEK---NHRCTRMCYGNQDLDYNEDHRFTR 231

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
            Y+      +I+K+++ +GP  A+F +YDDF  YKSGVY+ T N
Sbjct: 232 DYYY-LTYGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPN 274


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/308 (32%), Positives = 137/308 (44%), Gaps = 34/308 (11%)

Query: 38  AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
            G N P   + ++     +    +   +D P+    KTY    S  +P  FDAR  W  C
Sbjct: 107 GGLNNPPVQTAQFKHILGVKPTPHSVLNDVPV----KTY--PRSLMLPKEFDARSAWSQC 160

Query: 98  GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
            TIG + D G C +   F AV    DR CI      N  LS   + +CC     D    C
Sbjct: 161 NTIGTILDQGHCGSCWAFGAVECLQDRFCIHF--NMNISLSVNDLVACCGFMCGD---GC 215

Query: 158 SHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
             G     W +  + G VT     Y D+ GC+        H G  P  P+   +K    K
Sbjct: 216 DGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK--------HPGCEPAYPTPVCEK----K 263

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C  +       + + + KH +   Y V+ +   I  E+  +GP    F +Y+DF HYKSG
Sbjct: 264 CKVQ------NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSG 317

Query: 276 VYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           VYKH +   +    H+ KLIGWGT + G  YWL+ N W   WGD G  KI+RG  EC  E
Sbjct: 318 VYKHITGGMMGG--HAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIE 375

Query: 335 YLIAAGKP 342
             + AG P
Sbjct: 376 EDVVAGMP 383


>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
          Length = 246

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/256 (35%), Positives = 129/256 (50%), Gaps = 17/256 (6%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP 78
           Y   ++YID IN  A TWTAG NF  +  EE+  + L   +K  + + +    + KT D 
Sbjct: 2   YFLEESYIDMINEVATTWTAGVNFDPSTPEEHFVKML--GSKGVESAKQASAHEFKTNDV 59

Query: 79  EYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            Y      +P  FDAR++W +C TIG V D G C +   F    AF+DR C+ + G  N 
Sbjct: 60  AYDNYYGYIPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNE 119

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            LS E +A CC  C +     C  G   + W +    G VTGG+Y    GC+P  + PC 
Sbjct: 120 LLSPEEIAFCCHTCGF----GCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQ 175

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEI 253
           HH       SC ++ + K   + RCT   YG     + D HR T  Y+      +I+K++
Sbjct: 176 HHHQGNN--SCSDKPMEK---NHRCTRMCYGDQDLDYNDDHRFTRDYYY-LTYGSIQKDV 229

Query: 254 LAHGPTTATFALYDDF 269
           + +GP  A+F +YDDF
Sbjct: 230 MNYGPIEASFDVYDDF 245


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/205 (37%), Positives = 113/205 (55%), Gaps = 6/205 (2%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C       C  G     W++  KRG VTGG   + TGCQP     C HH 
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C  +C    Y   + QDKH    +Y V  NE AI+KEI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQKCQK-GYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNGP 258

Query: 259 TTATFALYDDFYHYKSGVYKHTSNA 283
             A F +Y+DF +YKSG+Y+H + +
Sbjct: 259 VEAAFDVYEDFLNYKSGIYRHVTGS 283


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 85/260 (32%), Positives = 132/260 (50%), Gaps = 8/260 (3%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S G Q+  LS  
Sbjct: 87  NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC      D    C  G   + W+    R S       + TGCQP     C H  + 
Sbjct: 147 DLISC----CEDCGGGCKGGFPGQAWDMGKTRDSHWRFRKKNHTGCQPYPFPKCEHL-TK 201

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P+C  +     +C   C    Y   F QDK     +  V +NE   +++I+ +GP  
Sbjct: 202 GKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVE 260

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y+DF + KSG+ +H + + +    H  ++IGWG E G PYWL+ N+W   WG+ G
Sbjct: 261 AAFDVYEDFLNSKSGISRHVTGSIVGG--HPIRIIGWGVEKGNPYWLIANSWNEDWGENG 318

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +++RG+ EC+ E  + AG
Sbjct: 319 LFRMVRGRDECSIESHVVAG 338


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 150/327 (45%), Gaps = 36/327 (11%)

Query: 22  SDAYIDQINRE-ANTWTAGRNFP-ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
             A +D++N     TWTAG N   A  + E+L++   A       +++  P         
Sbjct: 42  QQALVDKVNAHPGATWTAGFNERFAKHTIEHLKKMCGA---ILTPANKLEPSIETISHKH 98

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P  FDAR+QW +C TIG +   G C +   F AV + +DR CI     ++  LS 
Sbjct: 99  KKLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHL--NESVSLSE 156

Query: 140 EYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
             + +CC   C Y     C  G   R W +    G VT     Y D+ GC        +H
Sbjct: 157 NDLLACCGFECGY----GCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC--------AH 204

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G  PT  +         KC  +C +  +   + Q KH     Y +    + +  E+  +
Sbjct: 205 PGCYPTYET--------PKCEKQCVDDEF---WVQSKHLGVNAYEMSMEPEDLMAELYTN 253

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPH 315
           GP    F +Y+DF HYK+GVYKH     +    H+ KLIGWGT ++G  YW ++N+W  +
Sbjct: 254 GPVEVAFEVYEDFAHYKTGVYKHLFGGFMGG--HAVKLIGWGTTDDGVDYWTIVNSWNTN 311

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +I+RG  EC  E    AG P
Sbjct: 312 WGEDGLFRIVRGNDECGIESNAVAGLP 338


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/260 (34%), Positives = 126/260 (48%), Gaps = 12/260 (4%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FDAR+QWP C ++  +   G C +    +   A +DR CI SKG++        +
Sbjct: 61  VLPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDL 120

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C          G +   W++  K+G  +GG YG   GC P  + P     S   
Sbjct: 121 LSCCYECGGGCTGGGIPGPI---WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGD 177

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P   N       C TRC          +D+    + Y +  +E  I ++I  +GP  A 
Sbjct: 178 YPDEPN-------CSTRCNAGYNVTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAV 230

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
           F  Y+D  +Y  GVY+H S  +L+   H+ KLIGWG E+GT YWLV N+WG  WGD G  
Sbjct: 231 FQWYEDIVNYSGGVYRHQS-GRLKGG-HAVKLIGWGVEDGTKYWLVANSWGRVWGDDGFF 288

Query: 323 KILRGKYECAFEYLIAAGKP 342
           K++RG+  C  E  + AG P
Sbjct: 289 KMVRGENHCGIEENVHAGLP 308


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/271 (36%), Positives = 127/271 (46%), Gaps = 32/271 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  WP C +IG + D G C +   F AV + SDR CI+     N  LS
Sbjct: 99  DQSLKLPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFG--MNITLS 156

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSH 196
              + +CC   R  D   C  G     W +    G VT     Y D+TGC        SH
Sbjct: 157 VNDLLACCGF-RCGD--GCDGGYPISAWQYFSYSGVVTEECDPYFDQTGC--------SH 205

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEIL 254
            G  P   +             +C     GR   + + KH +  TY V+ N   I  EI 
Sbjct: 206 PGCEPAYNT------------PQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIY 253

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWG 313
            +GP   +F +Y+DF HYKSGVYKH + + +    H+ KLIGWG T++G  YWL+ N W 
Sbjct: 254 KNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGG--HAVKLIGWGTTDDGEDYWLLANQWN 311

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
             WGD G   I RG  EC  E    AG P +
Sbjct: 312 RSWGDDGYFMIRRGTNECGIEDEPVAGLPSS 342


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/264 (32%), Positives = 131/264 (49%), Gaps = 25/264 (9%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FDAR++WP C ++  +   G+C + +  +     +DR CI S G++     +   
Sbjct: 46  ALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDRYCIHSGGERQFYFGSTGY 105

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SCC  C       C  G V +T+++  K G  +GG Y    GC+P            P 
Sbjct: 106 LSCCTDCY-----KCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKP-----------YPF 149

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY---WVDDNEDAIKKEILAHGPT 259
             + ++  +  LKC  +C    Y   + QD      +Y   W D+N  A+K EI  +GP 
Sbjct: 150 GGATQDVNIV-LKCDRQC-QAGYPLTYSQDLKHGASSYILPWGDEN--AMKAEIYQNGPI 205

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
             +F +Y DF+ Y+SGVY+H + A   +  H+ ++IGWG ENG  YWL  N+W   WG+ 
Sbjct: 206 VTSFDVYGDFFQYRSGVYRHVTGAYKGS--HAVRVIGWGVENGVKYWLCANSWNERWGEN 263

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KI+RG+     E +  AG PK
Sbjct: 264 GFFKIVRGENHVGVEDISYAGLPK 287


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/265 (35%), Positives = 130/265 (49%), Gaps = 35/265 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR  WP C +I  + D G C +   F AV + +DR CI      N  LS   + 
Sbjct: 96  LPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYG--TNVTLSVNDLL 153

Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
           +CC  +C     + C  G     W +  + G VT     Y D+TGC        SH G  
Sbjct: 154 ACCGFLC----GEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC--------SHPGCE 201

Query: 201 PT--LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           P    P+CE + V K               + + KH +   Y V+ ++ +I  E+  +GP
Sbjct: 202 PAYPTPACEKKCVKK------------NLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGP 249

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWG 317
              +F +Y+DF HYKSGVYKH + +++    H+ KLIGWGT E+G  YWL+ N W   WG
Sbjct: 250 AEVSFTVYEDFAHYKSGVYKHVTGSEMGG--HAVKLIGWGTSEDGEDYWLLANQWNRSWG 307

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
             G  KI+RG  EC  E  + AG P
Sbjct: 308 GDGYFKIIRGTNECGIED-VTAGTP 331


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 96/265 (36%), Positives = 125/265 (47%), Gaps = 28/265 (10%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDARE WP C +IG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 98  SLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHF--DMNITLSVN 155

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
            + +CC     D    C  G     W +  + G VT     Y D TG        CSH G
Sbjct: 156 DLLACCGFMCGD---GCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG--------CSHPG 204

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P  P+         +C   C +    + + + KH     Y V  + + I  E+  +GP
Sbjct: 205 CEPAYPT--------PRCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGP 254

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
              +F +Y+DF HYKSGVYKH +   +    H+ KLIGWG T++G  YWL+ N W   WG
Sbjct: 255 VEVSFTVYEDFAHYKSGVYKHITGDVMGG--HAVKLIGWGTTDDGEDYWLLANQWNRGWG 312

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           D G  KI RG  EC  E  + AG P
Sbjct: 313 DDGYFKIRRGTNECGIEEDVVAGLP 337


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 96/265 (36%), Positives = 125/265 (47%), Gaps = 28/265 (10%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDARE WP C +IG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 99  SLKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHF--DMNITLSVN 156

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHG 198
            + +CC     D    C  G     W +  + G VT     Y D TG        CSH G
Sbjct: 157 DLLACCGFMCGD---GCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG--------CSHPG 205

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P  P+         +C   C +    + + + KH     Y V  + + I  E+  +GP
Sbjct: 206 CEPAYPT--------PRCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGP 255

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
              +F +Y+DF HYKSGVYKH +   +    H+ KLIGWG T++G  YWL+ N W   WG
Sbjct: 256 VEVSFTVYEDFAHYKSGVYKHITGDVMGG--HAVKLIGWGTTDDGEDYWLLANQWNRGWG 313

Query: 318 DRGTVKILRGKYECAFEYLIAAGKP 342
           D G  KI RG  EC  E  + AG P
Sbjct: 314 DDGYFKIRRGTNECGIEEDVVAGLP 338


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 81/188 (43%), Positives = 100/188 (53%), Gaps = 5/188 (2%)

Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
           C+ G     W F      VTGG YG   GCQP    PC HH   P LP+C   K P  +C
Sbjct: 7   CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHTVGP-LPNCTGIK-PTPEC 64

Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
              C    Y + + +DKH     Y +  +E  IK EI  +GP  A F++Y DF  YKSGV
Sbjct: 65  AKTCRE-GYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKSGV 123

Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
           Y+  S   L    H+ +++GWGTE+G PYWLV N+W   WGD+G  KI RG  EC  E  
Sbjct: 124 YQRHSEEMLGG--HAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIEDD 181

Query: 337 IAAGKPKN 344
           I AG PK 
Sbjct: 182 INAGIPKE 189


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/266 (36%), Positives = 125/266 (46%), Gaps = 30/266 (11%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 97  SLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHF--DVNISLSVN 154

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  G     W +L   G VT     Y D+ GC        SH 
Sbjct: 155 DLLACCGFLC----GSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC--------SHP 202

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G  P        + PK  C  +C +    + + + KH +   Y V  +   I  E+  +G
Sbjct: 203 GCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEVYKNG 252

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHW 316
           P    F +Y+DF HYKSGVYKH +  +L    H+ KLIGWGT E+G  YWL+ N W   W
Sbjct: 253 PVEVAFTVYEDFAHYKSGVYKHITGYELGG--HAVKLIGWGTTEDGEDYWLLANQWNREW 310

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           GD G  KI RG  EC  E  + AG P
Sbjct: 311 GDDGYFKIRRGTNECGIEEDVTAGLP 336


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 92/271 (33%), Positives = 135/271 (49%), Gaps = 25/271 (9%)

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           GD    D  + A VP  F++ +QW NC  I  + +   C +   F AV + SDR CI  K
Sbjct: 58  GDVPVVDYAFQA-VPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIH-K 115

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
           G+ +  LS + + +C +      +  C  G  +    F+ K+G V+         C P T
Sbjct: 116 GE-DVLLSFQDLVTCDQ-----SDNGCQGGDAYTAMKFIQKKGIVS-------NDCLPYT 162

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
           I  C     AP    C N  V   +C  +C+N +Y   + QD H     Y ++   +AI+
Sbjct: 163 IPTC-----APAQQPCLN-FVDTPQCVEKCSNASYT--YAQDLHFIDGVYSMNPTVNAIQ 214

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
           +EI+ +GP  A F +Y+DF  YKSGVY+HT+   L    H  K+IGWGT+N   YW+  N
Sbjct: 215 QEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGG--HCVKMIGWGTQNNELYWICNN 272

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           +W  +WG++G   I  G  EC  E  + A K
Sbjct: 273 SWTTYWGNQGVFWIKAGVNECGIESDVVAAK 303


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/338 (32%), Positives = 145/338 (42%), Gaps = 43/338 (12%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYF 62
           ++  LG   V    +      IDQIN     WTAG N P  A  + E ++  L       
Sbjct: 6   VIAFLGLVAVASAEFILQQEMIDQINNANVGWTAGVN-PRFAGKTREDIKGLLGTKLLPK 64

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
               R  P      D      +P  FDAR QWP   +I  + D   C +   F A  A S
Sbjct: 65  GTKLREFPVVDTIVD-----AIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR  I S    N  LS + + SC        +  C  G     W+++   G VT      
Sbjct: 118 DRLAIASNNSINVVLSPQDLVSCDST-----DYGCDGGYPINAWHYMQSLGVVT------ 166

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
              C P T    S +G + T      +K P       C   T+        ++    Y V
Sbjct: 167 -DTCYPYT----SGNGDSGTC-QITGKKTPA------CATATF--------YKAKTAYQV 206

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            +N  AI+ EILA+GP  A F++YDDF+ Y SGVY H S A   +  H+ K++GWG +  
Sbjct: 207 ANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGAL--DGGHAVKIVGWGVDGT 264

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           TPYW+V N+WG  WG  G   I RG  EC  E  I AG
Sbjct: 265 TPYWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAG 302


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 81/214 (37%), Positives = 115/214 (53%), Gaps = 8/214 (3%)

Query: 90  AREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKIC 149
           +REQWP+C TI  + D G+C +   F AV A SDR CI S+G+ N  +S E + SCCK+ 
Sbjct: 1   SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKL- 59

Query: 150 RYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ 209
             +    C+ G     W F    G V+GG Y    GC+P +ISPC HH +  + P C  +
Sbjct: 60  --ECGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEHHVNG-SRPKCSGE 116

Query: 210 KVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDF 269
            +   +C  RC    Y   + +DKH    +Y +  +   I  EI  +GP  A   ++ DF
Sbjct: 117 -IETPRCSRRC-EAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDF 174

Query: 270 YHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
             YKSGVY+H +   +    H+ K++GWG ENGT
Sbjct: 175 LLYKSGVYQHKTGGSIGG--HAIKILGWGEENGT 206


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 102/350 (29%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1   MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++   +  LG + +R  +    +  ++D+IN+     W A  N         ++    ++
Sbjct: 9   LLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61

Query: 59  AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           AK       Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC A   
Sbjct: 62  AKRLTGAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKR 172
            +   A SDR C    G+Q R        S   +              F    W +  + 
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEY 173

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G  +       + CQP     C HHG+      C N K    +C+T CT+ T        
Sbjct: 174 GIAS-------SYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL----I 222

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           K+R    Y +   E+  K+E+  +GP  A   +Y D + YKSGVY++   + +   + + 
Sbjct: 223 KYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMG--VTAV 280

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           K++GWG  NGTPYW V NTW   WG  G + ILRG  EC  E+L  AG P
Sbjct: 281 KVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV +  DR CI      N  LS  
Sbjct: 100 SLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHF--DMNISLSVN 157

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  G+    W +L   G VT     Y D+ GC        SH 
Sbjct: 158 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 205

Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           G  P        + PK  C  +C   N  + R     KH +   Y V  +   I  E+  
Sbjct: 206 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMAEVYK 253

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +++DF HYKSGVYKH + + L    H+ KLIGWGT + G  YWL+ N W  
Sbjct: 254 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 311

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +WGD G  KI RG  EC  E  + AG P
Sbjct: 312 NWGDDGYFKIKRGTNECGIEDDVTAGLP 339


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/257 (36%), Positives = 129/257 (50%), Gaps = 18/257 (7%)

Query: 83  TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           T+P  F+A+ ++ +C   IGH+ D   C      A+VG F+DR CI+S G+    LS  Y
Sbjct: 38  TLPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAY 97

Query: 142 VASCCK---ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTIS 192
           + SCC     C   D   C  GSV     F+   G VTGG+Y      G+  GC P    
Sbjct: 98  LTSCCNHANGCPKSD--GCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFP 155

Query: 193 PCSH-HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKK 251
            C+H  G     P C   KV +L   + C      R    D HR      +  + + IK+
Sbjct: 156 KCNHVPGMKVKYPRC-GSKVGRLAAPSHCDGLHCRRA--GDVHRAKSWGRLPISPEKIKQ 212

Query: 252 EILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINT 311
           EI  +GP  A   +++DF  YKSGVY++ + A +    H+ KLIGWG E G  YWL +N+
Sbjct: 213 EIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVG--AHTLKLIGWGVEAGQEYWLAVNS 270

Query: 312 WGPHWGDRGTVKILRGK 328
           W   WGD+G +K+  GK
Sbjct: 271 WNEEWGDQGKIKLAVGK 287


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 99/284 (34%), Positives = 131/284 (46%), Gaps = 34/284 (11%)

Query: 67  RPLPGDRKTYDPEYS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
           +P+P       P  S      +P  FDAR  W  C TIG + D G C +   F AV + S
Sbjct: 80  KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
           DR CI      N  LS   + +CC  +C       C  G     W +L   G VT     
Sbjct: 140 DRFCIHF--DVNISLSVNDLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDP 193

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y D+ GC        SH G  P        + PK  C  +C +    + + + KH +   
Sbjct: 194 YFDQIGC--------SHPGCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVSA 235

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y V+ +   I  E+  +GP    F +Y+DF +YKSGVYKH +  +L    H+ KLIGWGT
Sbjct: 236 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGG--HAVKLIGWGT 293

Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            ++G  YWL+ N W   WGD G  KI RG  EC  E  + AG P
Sbjct: 294 TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLP 337


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV +  DR CI      N  LS  
Sbjct: 98  SLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHF--DMNISLSVN 155

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  G+    W +L   G VT     Y D+ GC        SH 
Sbjct: 156 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 203

Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           G  P        + PK  C  +C   N  + R     KH +   Y V  +   I  E+  
Sbjct: 204 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMAEVYK 251

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +++DF HYKSGVYKH + + L    H+ KLIGWGT + G  YWL+ N W  
Sbjct: 252 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 309

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +WGD G  KI RG  EC  E  + AG P
Sbjct: 310 NWGDDGYFKIKRGTNECGIEDDVTAGLP 337


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 111/221 (50%), Gaps = 12/221 (5%)

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C +    +A   FSDR CI + G   R LS E + +CC  C       C  GS    W F
Sbjct: 1   CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRC----GNGCDGGSPEAAWYF 56

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR-CTNPTYGR 227
             + G VTGGDY    GCQP +I P           +C +  +    C  R CTN  Y +
Sbjct: 57  FMRHGIVTGGDYESGDGCQPYSIYP-----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTK 111

Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
           G+  D H     Y +  +E+ I  +I  +GP  A F +Y DF +YKSGVY +T   ++E 
Sbjct: 112 GYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYT-RGQIEG 170

Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
             H+ K++GWG ++ T YWL  N+W   WG+ G  +ILRG 
Sbjct: 171 -GHAIKILGWGVDDNTKYWLCANSWSRSWGENGLFRILRGN 210


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/350 (28%), Positives = 153/350 (43%), Gaps = 36/350 (10%)

Query: 1   MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++   +  LG + +R  +    +  ++D+IN+     W A  N         ++    ++
Sbjct: 9   LLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61

Query: 59  AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           AK       Q +  LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC A   
Sbjct: 62  AKRLTGAWIQKNSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKR 172
            +   A SDR C    G+Q R        S   +              F    W +  + 
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEY 173

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G  +       + CQP     C H G+      C N K    +C+T CT+ T        
Sbjct: 174 GIAS-------SYCQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL----I 222

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           K+R    Y +   E+  K+E+  +GP  A   +Y D + YKSGVY++   + +   + + 
Sbjct: 223 KYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMG--VTAV 280

Query: 293 KLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           K++GWG  NGTPYW V NTW   WG  G + ILRG  EC  E+L  AG P
Sbjct: 281 KVVGWGKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 97/268 (36%), Positives = 124/268 (46%), Gaps = 34/268 (12%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV +  DR C  S    N  LS  
Sbjct: 100 SLKLPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFC--SHFDMNISLSVN 157

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  G+    W +L   G VT     Y D+ GC        SH 
Sbjct: 158 DLLACCGFLC----GAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC--------SHP 205

Query: 198 GSAPTLPSCENQKVPKLKCHTRCT--NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           G  P        + PK  C  +C   N  + R     KH +   Y V  +   I  E+  
Sbjct: 206 GCEPAY------QTPK--CVRKCVKGNQIWKR----SKHYSVKAYRVKSDPQDIMTEVYK 253

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +++DF HYKSGVYKH + + L    H+ KLIGWGT + G  YWL+ N W  
Sbjct: 254 NGPVEVAFTVFEDFAHYKSGVYKHITGSALGG--HAVKLIGWGTSDEGEDYWLLANQWNT 311

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +WGD G  KI RG  EC  E  + AG P
Sbjct: 312 NWGDDGYFKIKRGTNECGIEDDVTAGLP 339


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 107/198 (54%), Gaps = 10/198 (5%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           ++  A SDR CI S G +   LS + + +CC  C Y     C  G   + W +    G V
Sbjct: 6   SSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGY----GCEGGWPMKAWQYFXLEGVV 61

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           TGG+Y  +  C+P    PC  HG  P    C ++ K PK  C   C    Y + + +DKH
Sbjct: 62  TGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAKTPK--CQKTCQR-GYLKPYKEDKH 118

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y + +N  AI+++I+ +GP  A F +Y+DF HYKSG+YKHT+        H+ K+
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGG--HAVKI 176

Query: 295 IGWGTENGTPYWLVINTW 312
           IGWG E GTPYWL+ N+W
Sbjct: 177 IGWGKEXGTPYWLIANSW 194


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 153/323 (47%), Gaps = 36/323 (11%)

Query: 26  IDQINREAN-TWTAGRN--FPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           +D++N     TW AG N  F  + + E+L++  I  AK    ++     +R T+  +   
Sbjct: 38  VDKVNAHPRATWKAGFNDRFEGH-TIEHLKK--ICGAKMTPANELEPSIERVTHKHK-KL 93

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  FDAR+ W +C TIG + D G C +   F A  + +DR CI     ++  LS   +
Sbjct: 94  VLPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTDRFCIHM--NESVSLSENDL 151

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
            +CC    ++    C  G   R W +  + G VT     Y D+ GC         H G  
Sbjct: 152 LACCG---FECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC--------GHPGCY 200

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT       + PK  C   C +      + + KH +   Y V    + +  E+  +GP  
Sbjct: 201 PTY------RTPK--CVKHCVDDEL---WVKSKHLSVNAYEVSKEPEDLMAELYTNGPIE 249

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
            +F +++DF HYK+GVYKH     +    H+ KLIGWGT ++G  YW ++N+W  +WG+ 
Sbjct: 250 VSFEVFEDFAHYKTGVYKHVYGRYIGG--HAVKLIGWGTTDDGVDYWTIVNSWNTNWGEH 307

Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
           G  +I RG  EC  E    AG P
Sbjct: 308 GLFRIARGGNECGIESYAVAGLP 330


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 90/262 (34%), Positives = 127/262 (48%), Gaps = 29/262 (11%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P +FDAR+ WP+C +   + D G C +   FAAV A SDR CI    Q N  LS   + 
Sbjct: 95  LPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSDRFCIHF--QVNATLSENDLV 152

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAP 201
           +CC    +     C+ G     W +  +RG VT     Y D  G        C+H G  P
Sbjct: 153 ACCG---FRCGSGCNGGFPLSAWRYFSRRGVVTDECDPYFDNDG--------CNHPGCEP 201

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
           + P+         +C   C +    + +   KH +   Y +  +   I  E+  +GP   
Sbjct: 202 SYPT--------PRCVKNCKD---NQRWSHSKHYSANAYRIKSDPYNIMAEVFNNGPVEV 250

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWGDRG 320
           +F++Y+DF HY++GVYKH     L    H+ KLIGWG T++G  YWL+ N+W   WG+ G
Sbjct: 251 SFSVYEDFAHYETGVYKHVQGRYLGG--HAVKLIGWGTTDDGIDYWLIANSWNTAWGEGG 308

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             KI RG  EC  E    AG P
Sbjct: 309 YFKIARGVNECGIERDPVAGMP 330


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 89/266 (33%), Positives = 130/266 (48%), Gaps = 10/266 (3%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            SD  I  IN++ N  W A R      S  + +  +       DQ     P     +  +
Sbjct: 32  LSDELITFINKQPNIEWKADRTTRFT-SIHHAKSMMGVLLNRVDQHKLHHP---IIHHND 87

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P  FD+R+ W NC +I  + D  +C +   F AV + SDR CI SKG+ +  LS 
Sbjct: 88  INIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSA 147

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
             + SCC  C +     C+ G     W++    G VTGG     TGCQP     C HH +
Sbjct: 148 VNLLSCCSRCGF----GCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHST 203

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           +    SCE +     +C+  C  P Y   +  DK+    +Y+V  +E +I KEIL +GP 
Sbjct: 204 SINHSSCEVKYYSTPECYQTC-QPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPV 262

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKL 285
            ATF +YDDF +YK+GVYK+ + + L
Sbjct: 263 EATFYVYDDFLNYKTGVYKYVTGSLL 288


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/267 (37%), Positives = 128/267 (47%), Gaps = 28/267 (10%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           DP   A  P  FD+R  W NC TIG++ +   C +   F AV +  DR CI  KG   + 
Sbjct: 73  DPNIKA--PASFDSRTAWSNCTTIGYIENQARCGSCWAFGAVESAQDRICIH-KGLDVQL 129

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
              + V      C   D+  C  G     WNFL K+G VT         C+P TI  C  
Sbjct: 130 SFLDLVT-----CDQSDD-GCEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTC-- 174

Query: 197 HGSAPTLPSCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
               P    C N  V    C  +C +N T    + QDKH+    Y ++  E AI +EI  
Sbjct: 175 ---PPAQQPCLN-FVNTPNCVKQCESNSTLI--YSQDKHKMAKIYSINSVE-AIMQEIST 227

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A F++Y+DF  YKSGVY+HT+   L    H  K+ G+GT NG  YW V N+W   
Sbjct: 228 NGPVEACFSVYEDFLGYKSGVYQHTTGKFLGG--HCVKIFGYGTLNGVNYWSVANSWTTS 285

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WGD G   I RG  EC  E  + AG P
Sbjct: 286 WGDNGIFLIKRGSDECGIEDEVVAGIP 312


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 97/268 (36%), Positives = 131/268 (48%), Gaps = 32/268 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  W +C +I  +   G C +   F AV + SDR CIK     N  LS
Sbjct: 98  DLSLKLPKEFDARTAWSHCTSIRRI--LGHCGSCWAFGAVESLSDRFCIKY--NLNVSLS 153

Query: 139 TEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
              V +CC + C +     C+ G     W +    G VT     Y D TGC        S
Sbjct: 154 ANDVIACCGLLCGF----GCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC--------S 201

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  PT P+ + ++    KC +R  N  +G    + KH     Y ++ +   I  E+  
Sbjct: 202 HPGCEPTYPTPKCER----KCVSR--NQLWG----ESKHYGVGAYRINPDPQDIMAEVYK 251

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYK+ +  K+    H+ KLIGWGT ++G  YWL+ N W  
Sbjct: 252 NGPVEVAFTVYEDFAHYKSGVYKYITGTKIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 309

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WGD G  KI RG  EC  E  + AG P
Sbjct: 310 SWGDDGYFKIRRGTNECGIEQSVVAGLP 337


>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 355

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 111/245 (45%), Gaps = 27/245 (11%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR +WPNC +I  +P+ G C +          +DR CI+S G   R  S     
Sbjct: 20  IPTSFDARTRWPNCPSIALIPNQGCCNSSAFQIPAAVITDRACIRSNGTSTRTYSAYDAL 79

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC  C +     C+ G   + WN+    G V+         C P ++SP     + P L
Sbjct: 80  ACCTDCPFSQLFKCAGGDPLKVWNYWATTGLVS-------DSCMPFSLSPLCLGFNCPLL 132

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                              P Y      D+ +      V    DAI+ EI+ +GP  A+F
Sbjct: 133 -----------------CAPGYAGSIVGDRKKGLKVVTVAPYVDAIQSEIILNGPVEASF 175

Query: 264 ALYDDFYHYK-SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTV 322
            LY DF H K S VY   S   L     S K+IGWG ENGT YWL+ +T+G  WG++GT 
Sbjct: 176 DLYLDFVHLKQSQVYNSRSGPNLGR--QSVKIIGWGVENGTEYWLITSTFGIGWGNQGTA 233

Query: 323 KILRG 327
             LRG
Sbjct: 234 MFLRG 238


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 85/217 (39%), Positives = 109/217 (50%), Gaps = 13/217 (5%)

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           F+DR CIKS G+    LS  Y+ SCC +      +  C  GSV    NF+   G VTGG+
Sbjct: 2   FNDRVCIKSGGKTTDILSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGE 61

Query: 180 Y------GDRTGCQPSTISPCSH-HGSAPTLPSCENQK-VPKLKCHTRCTNPTYGRGFFQ 231
           Y      G+  GC P     C+H  G     P C   + +P   C T C N  YG    +
Sbjct: 62  YKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA--CATTCPNKAYGTSMQK 119

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           D HR      +    + IK+EI  +GP  A   LY+DF +YKSGVY H +   L    H+
Sbjct: 120 DTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLA--AHT 177

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
            KLIGWG E+G  YWL +N W   WGD G +K+  GK
Sbjct: 178 LKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKLAVGK 214


>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
          Length = 301

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 91/277 (32%), Positives = 133/277 (48%), Gaps = 16/277 (5%)

Query: 1   MIHILVFLLGCTLVRG--ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++  +V L    L  G  +L+  SD +I++IN +  TW AGRNF  N    ++R+ L   
Sbjct: 4   VLLCIVVLASVALSYGGVKLHPLSDEFINEINSKQTTWKAGRNFDVNTPISHVRRLLGVL 63

Query: 59  AKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAAPHIFAA 117
            K  +    P+    KT+     A +P+ FDARE WP C +I G + D  +C +   F A
Sbjct: 64  PKKANAPKLPV----KTHAVNLDA-IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGA 118

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           V A SDR CI S       +S E +  CC    YD    C+ G     W++    G VTG
Sbjct: 119 VEAMSDRICIHSDASVKVRISAEDLNDCC----YDCGDGCNGGWPDLAWSYWSSTGIVTG 174

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
           G YG   GC+  +I PC HH      P  + Q+ P  K   +  + T    +  D  R +
Sbjct: 175 GLYGVDEGCKAYSIKPCDHHVDGNLGPCGDIQRTPACK---KSCDSTSDLEYKSDLRRGS 231

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
             Y +  +E  I+ EI+ +GP  A + +Y DF  YK+
Sbjct: 232 -AYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 133/321 (41%), Gaps = 66/321 (20%)

Query: 16  GELYK-----FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYF--DQSDRP 68
            ELY+        + +D+IN E N WTA  +      +E      + DAK       +  
Sbjct: 65  AELYEDTRPAIMQSLVDEINSEQNLWTASTD------QERFYGHSLGDAKKLCGTLLEEA 118

Query: 69  LPGDRKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCI 127
              + K Y P   A +P+ FDAR+ +  C   IGHV                        
Sbjct: 119 EGLEEKVYPPGELADIPNSFDARDAFKECKDVIGHV------------------------ 154

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
                            CC          C+ G     W+FL+  G  T G      GC 
Sbjct: 155 -----------------CCD--------GCTKGRPDAAWSFLNVYGIATEGSMSAADGCW 189

Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT-LTYWVDDNE 246
           P     C HH        C  +      C  RC N  YG    +D+H T   + +     
Sbjct: 190 PYNFPKCGHHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT 249

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
           D IKKEI+ +GPT+A F++YDDF  Y+SGVYKHTS   +    H  ++IGWGT+ G  YW
Sbjct: 250 DNIKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGE--HGVEIIGWGTKQGVDYW 307

Query: 307 LVINTWGPHWGDRGTVKILRG 327
           LV+N+W   WG  GT KI +G
Sbjct: 308 LVMNSWNEGWGVHGTFKIAQG 328


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 76/187 (40%), Positives = 100/187 (53%), Gaps = 5/187 (2%)

Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
           C+ G     W +    G VTGG +    GCQP  I  C HH +    P C+ +  P  +C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230

Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV 276
             +C   +Y   + QDKH       + +N +A + EI+ +GP  A F +Y+DF  YKSGV
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289

Query: 277 YKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYL 336
           Y+HT+   L    H+ K++GWG E GT YWLV N+W   WGD G  KILRG  EC  E  
Sbjct: 290 YQHTTGGVLGG--HAIKILGWGVEEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIESD 347

Query: 337 IAAGKPK 343
           I  G PK
Sbjct: 348 INFGIPK 354


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 146/329 (44%), Gaps = 36/329 (10%)

Query: 21  FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAK----YFDQSDRPLPGDRKT 75
            +  ++D+IN+     W A       + +  ++    ++AK     F +    LP  R T
Sbjct: 31  LTQKFVDRINQLNGGMWKA-------VYDGKMQNLTFSEAKRLTGAFSRKTSTLPPVRFT 83

Query: 76  YDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            + +    +P+ FDA E+WP+C TI  +PD  AC A    A   A SDR C    G+Q R
Sbjct: 84  -EEQLRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLR 142

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
             + + +A C       +            W +    G  +       + CQP     C 
Sbjct: 143 ISAADLMACCTGCGGGCEGGYPDAA-----WEYYVSNGITS-------SQCQPYPFPRCE 190

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G+    P C         C+  CT+    +     K+R   +Y V   ED  K+E+  
Sbjct: 191 HRGAQGKKPPCSKYNFDTPTCNATCTD----KSVPLIKYRGNHSYEVRGEED-YKRELYF 245

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWG 313
           +GP    F ++ DF  YKSGVY+H +     N+L   + +++GWG  NGTPYW V N+W 
Sbjct: 246 NGPFVVRFQVHSDFLAYKSGVYQHVAG----NFLGGKAVRIVGWGKMNGTPYWKVANSWD 301

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WG  G   ILRG  EC  E+L  AG P
Sbjct: 302 TDWGMNGYFLILRGNNECNIEHLGFAGTP 330


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 145/328 (44%), Gaps = 25/328 (7%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
             D+  D +N+   TW A         +E  +   + D K    +    P  +    P  
Sbjct: 67  LMDSLADALNQGQKTWVASSK------QERFKGASVFDVKALCGTILNGP-SKLPKKPAS 119

Query: 81  SATV----PDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            +TV    PDRFDARE + NC T IGHV      + P + A +        I     +  
Sbjct: 120 ESTVLSNLPDRFDAREHFKNCATVIGHV------SPPVVAAGLLRRLKHSAIVCASARVD 173

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            L T Y      +      K  +   V    N +   G  T     D +GC P     CS
Sbjct: 174 SL-TWYHFLLATLRHVAQKKKVAFHLVAMAVNLIAHGGGSTFAPELD-SGCWPYNFPECS 231

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH     +  C+    P   C T C N  +   F  D+H T    +  D  D IK+EI+ 
Sbjct: 232 HHVDTKGMEPCKGNS-PSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKREIID 290

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
           +GP  A F +Y+DF +YKSGVYKH + ++L    H+ K+IGWG +    YWLV+N+W  +
Sbjct: 291 NGPVAAAFTVYEDFPYYKSGVYKHVNGSELGG--HAVKIIGWGIDQNEQYWLVMNSWNVN 348

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKPK 343
           WGD+G  KI  G  EC  +  + AG PK
Sbjct: 349 WGDQGIFKIAIG--ECGIDSEVTAGIPK 374


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 93/269 (34%), Positives = 129/269 (47%), Gaps = 27/269 (10%)

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
           K+YDP     +P  F+A+  WPNC TI  + +   C +   F A  + +DR CI +   +
Sbjct: 70  KSYDP-LGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNN--E 126

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  LS   + +C      + +  C  G  F  WN+L K+G+V+         C P TI  
Sbjct: 127 NVQLSFMDMVTC-----DETDNGCEGGDAFSAWNWLRKQGAVS-------EECLPYTIPT 174

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C      P    C N  V    C   C + +    + QDKH+    Y  D +E AI +EI
Sbjct: 175 C-----PPAQQPCLN-FVNTPSCTKECQSNS-SLIYSQDKHKMAKIYSFDSDE-AIMQEI 226

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
           + +GP  A F +++DF  YKSGVY HT+   L    H  KL+G+GT NG  Y+   N W 
Sbjct: 227 VTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGG--HCVKLVGFGTLNGVDYYAANNQWT 284

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WGD GT  I RG  +C     + AG P
Sbjct: 285 TSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 95/266 (35%), Positives = 123/266 (46%), Gaps = 30/266 (11%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  FDAR  W  C TIG + D G C +   F AV + SDR CI      N  LS  
Sbjct: 98  SLKLPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHL--DVNVSLSVN 155

Query: 141 YVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHH 197
            + +CC  +C       C  G     W +L   G VT     Y D+ G        CSH 
Sbjct: 156 DLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIG--------CSHP 203

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G  P        + P  KC  +C      + + + K+ +   Y V  +   I  E+  +G
Sbjct: 204 GCEPAY------QTP--KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEVYKNG 253

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHW 316
           P    F +Y+DF HYKSGVYKH + ++L    H+ KLIGWG T+ G  YWL+ N W   W
Sbjct: 254 PVEVAFTVYEDFAHYKSGVYKHITGSQLGG--HAVKLIGWGTTDEGEDYWLIANQWNRSW 311

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           GD G   I RG  EC  E  + AG P
Sbjct: 312 GDDGYFMIRRGTNECGIEEDVTAGLP 337


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/349 (28%), Positives = 152/349 (43%), Gaps = 34/349 (9%)

Query: 1   MIHILVFLLGCTLVRG-ELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIAD 58
           ++   +  LG + +R  +    +  ++D+IN+     W A  N         ++    ++
Sbjct: 9   LLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFSE 61

Query: 59  AKYFD----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
           AK       Q +  LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC A   
Sbjct: 62  AKRLTGAWIQKNSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +   A SDR C    G+Q R        S   +              F  + +L+    
Sbjct: 121 VSTASAISDRYCTVGGGKQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWLYYV-- 171

Query: 175 VTGGDYG-DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
               +YG   +GCQP     C H G+      C   K    KC+  CT+    +     K
Sbjct: 172 ----EYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVK 223

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           +R   TY +   E+  K+E+  +GP  A F +Y D + YKSGVY++     L     + +
Sbjct: 224 YRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGG--QAVR 281

Query: 294 LIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           ++GWG  NGTPYW V N+W   WG  G + IL G  EC  E+L   G P
Sbjct: 282 IVGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFP 330


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 145/346 (41%), Gaps = 35/346 (10%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+AK   
Sbjct: 14  LVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC A    +   
Sbjct: 67  GAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKRGSVTG 177
             SDR C     QQ R        S   +              F    W +  + G  + 
Sbjct: 126 VISDRYCTVGGVQQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEYGIAS- 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
                 + CQP     C H G+      C        KC+  CT+    +     K+R  
Sbjct: 178 ------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGN 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
            TY +   E+  K+E+  +GP  A F +Y D + YKSGVY+H     L     + K++GW
Sbjct: 228 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG--TAVKVVGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G  NGTPYW V NTW   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 286 GKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 92/278 (33%), Positives = 121/278 (43%), Gaps = 24/278 (8%)

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
           GD   Y P   A  P+ FD+  +WP C   IG + D   C     FA   A SDR+CI +
Sbjct: 12  GDVVDYVPRGGA-APEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIAT 70

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTG---- 185
            G    PLS +       +C   +   C  G +   W ++ K G+VTGG Y + TG    
Sbjct: 71  GGAVAVPLSAQ------DVCFNANVDGCDGGQIITPWTYVAKAGAVTGGQY-NGTGPFGA 123

Query: 186 --CQPSTISPCSHHGSAPTLP-------SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
             C       C HHG     P        C ++K P+       T       F  DKH  
Sbjct: 124 GLCADWFAPHCHHHGPRGDDPYPAEGDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTF 183

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
                    E AI   I   GP    F +Y+DF +Y  G+Y H +  +     H+ K +G
Sbjct: 184 AGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGG--HAVKFVG 241

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           WG ENGT YW V N+W P+WG+ G  +ILRG  E   E
Sbjct: 242 WGVENGTKYWKVANSWNPYWGEAGYFRILRGSNEGGIE 279


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 156/352 (44%), Gaps = 53/352 (15%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDA------YIDQINREANTWTAGRNFPA--NLSEEYLR 52
           M+ I  FL+   LV G+    S         +D+IN     W A   +P   NL+ E  +
Sbjct: 1   MLAIAAFLV--LLVSGDGIPISKEKVISRDLVDKINTLNVGWEATL-YPQFENLTFESAK 57

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAP 112
             L +   + + S  P        +   +  +P+ FDAR+QWP  G+I  + + G C + 
Sbjct: 58  SMLGSRGAWPEGSLPP------EIEVRVAENIPENFDARKQWP--GSIHPIRNQGQCGSC 109

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             F A    SDR  I SK Q    LS + +  C       DN  CS G     WN++ K 
Sbjct: 110 WAFGASEVLSDRFAIASKNQIYVTLSAQQLVDCDL-----DNSGCSGGWPINAWNYMVKT 164

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR-CT-NPTYGRGFF 230
           G +T   YG                      P    Q   +L  +T  C   P     F+
Sbjct: 165 GLLTEQCYG----------------------PYYAKQYTCRLTANTTDCPWQPGVKARFY 202

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
             K    L      N +AI+ +I+ +GP  A F ++ DFY Y+SG+Y H +  +L    H
Sbjct: 203 HAKSAYKLP---AKNVEAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGG--H 257

Query: 291 SGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           + K++GWGTE+   YWL  N+WG +WG +G  KI RG  EC  E  +AAG P
Sbjct: 258 AIKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 113/229 (49%), Gaps = 13/229 (5%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           A+  + SDR CI++ G     LS   + SC K     +   C  G    +W++  K G V
Sbjct: 33  ASAASISDRTCIQTNGTMKVQLSAIELISCSK-----NKLGCQIGFSEFSWDYWLKNGLV 87

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TG    D TGC P     C H  S+ + P C         C   C +  Y   +  DKH 
Sbjct: 88  TG----DPTGCLPYPFPKCDHR-SSNSYPKCGYITYTAPPCTKTCRS-GYPIPYKADKHY 141

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
             + Y +  NE  I+KEI+ +GP  A   ++ DF +YKSGVY+H +   +   +HS ++I
Sbjct: 142 GRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVT--IHSVRII 199

Query: 296 GWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
           GWG EN  PYWL  N+W   WG  G  KILRG  EC  E  + AGK  N
Sbjct: 200 GWGIENDIPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDN 248


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 99/177 (55%), Gaps = 6/177 (3%)

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNP 223
            W +    G VTGG+Y  +  C+P    PC  HG  P    C +  K PK  C   C   
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAKTPK--CQKTCQR- 57

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            Y + + +DKH     Y + +N  AI+++I+ +GP  A F +Y+DF HYKSG+YKHT+  
Sbjct: 58  GYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGR 117

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
                 H+ K+IGWG E GTPYWL+ N+W   WG++G  +++RG   C  E ++ AG
Sbjct: 118 MTGG--HAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAG 172


>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
          Length = 215

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 19/224 (8%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P +FDAR++W  C TIG V D G CA+    +   AF+DR C+ + G  N+ LS E + 
Sbjct: 6   IPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLSAEEIT 65

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
            CC  C       C  G   R W    K G VTGG+Y    GC+P  + PC +       
Sbjct: 66  FCCHTC----GNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN- 120

Query: 204 PSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
            +C  Q    ++ + RCT   YG     F QD   T   Y++      I+K+++ +GP  
Sbjct: 121 -TCSGQ---PMESNHRCTRMCYGNQDLDFDQDHRYTRDHYYL--TYRGIQKDVINYGPIE 174

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENG 302
           A+F +YDDF  YKSG+Y  + NA   +YL  HS KLIGWG E G
Sbjct: 175 ASFDVYDDFPSYKSGIYVKSENA---SYLGGHSVKLIGWGEEYG 215


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 74/198 (37%), Positives = 107/198 (54%), Gaps = 6/198 (3%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P +FD+R++WP+C +I  + D   C +   F AV A +DR CI+S GQQ+  LS
Sbjct: 85  DLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELS 144

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
              + SCC+ C       C  G     W++  KRG VTGG   + TGCQP     C HH 
Sbjct: 145 ALDLISCCEDC----GDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH- 199

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +    P+C  +     +C   C    Y   + QDKH    +Y V  NE AI++EI+ +GP
Sbjct: 200 TKGKYPACGTKIYKTPQCKQTCQK-GYKTPYEQDKHYGDESYNVISNEKAIQREIMMYGP 258

Query: 259 TTATFALYDDFYHYKSGV 276
             A F +Y+DF +YKSG+
Sbjct: 259 VEAAFDVYEDFLNYKSGI 276


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/265 (34%), Positives = 124/265 (46%), Gaps = 30/265 (11%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR  WP C TIG + D G C +   F AV + SDR CI      N  LS   + 
Sbjct: 101 LPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLL 158

Query: 144 SCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSA 200
           +CC  +C       C  G     W +    G VT     Y D TG        CSH G  
Sbjct: 159 ACCGFLC----GSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATG--------CSHPGCE 206

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P  P+         KC  +CT+    + + + K      Y +  +   I  E+  +GP  
Sbjct: 207 PGYPT--------PKCVRKCTDEN--QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVE 256

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWGDR 319
             F +Y+DF HY+SGVY++T+   +    H+ KLIGWG T++G  YW++ N W  +WGD 
Sbjct: 257 VAFTVYEDFAHYESGVYRYTTGDVMGG--HAVKLIGWGTTDDGEDYWILANQWNRNWGDD 314

Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
           G   I RG  EC  E  + AG P +
Sbjct: 315 GYFMIRRGVNECGIEEGVVAGLPSS 339


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 101/346 (29%), Positives = 145/346 (41%), Gaps = 35/346 (10%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADAKYFD 63
           LV L    L+  +    +  ++D+IN+     W A  N         ++    A+AK   
Sbjct: 14  LVTLGVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYN-------GKMQNITFAEAKRLT 66

Query: 64  ----QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
               Q    LP  R T + +    +P+ FD+ E+WPNC TI  + D  AC A    +   
Sbjct: 67  GAWIQKTSSLPPVRFT-EEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTAS 125

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR--TWNFLHKRGSVTG 177
             SDR C     QQ R        S   +              F    W +  + G  + 
Sbjct: 126 VISDRYCTVGGVQQLR-------ISAAHLLSCCKQCGGGCKGGFPGFAWRYYVEYGIAS- 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTT 237
                 + CQP     C H G+      C        KC+  CT+    +     K+R  
Sbjct: 178 ------SYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGN 227

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGW 297
            TY +   E+  K+E+  +GP  A F +Y D + YKSGVY++     L     + +++GW
Sbjct: 228 ATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGG--QAVRIVGW 285

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           G  NGTPYW V NTW   WG  G + ILRG  EC  E+L  AG P+
Sbjct: 286 GKLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 476

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 143/323 (44%), Gaps = 54/323 (16%)

Query: 41  NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTI 100
           NFP + +   +R++L   +++F+           T  P  + ++P  FDAR +W  C ++
Sbjct: 152 NFPFDKNSTAIREYLNRLSEFFNSEKMKQHLRELTEFP--ADSLPSEFDARRKWSYCSSL 209

Query: 101 GHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHG 160
            +VP+ G C A +  AAVG  SDR CI S G      S E V  CC +C      +C  G
Sbjct: 210 HNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEEDVLGCCAVC-----GNCYGG 264

Query: 161 SVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS-PCSHHGSAPTLPSCENQKVPKLKCHTR 219
              +   +    G VTGG    R GC+P ++   C    S    P  E ++    KC+ +
Sbjct: 265 DPLKALVYWVDEGLVTGG----RDGCRPYSVDLSCGVPCSPAVYPLAEYRR----KCYRQ 316

Query: 220 CTNPTYGRGFFQDKHRTTLTY------------------------WVDDNED-------- 247
           C +  +   +  DKH  ++ Y                        ++++  D        
Sbjct: 317 CQDIYFQYNYESDKHYGSMAYSMFPRTMSLDNKGSERVKLPTVIGYLNETSDEPLTDKEI 376

Query: 248 --AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN---YLHSGKLIGWGTENG 302
              I KE+   GP T  F + ++F HY SGV+     A   +   Y H  +LIGWG  +G
Sbjct: 377 RQIIMKELYLWGPMTMAFPVTEEFLHYSSGVFSPFPAANFSDRIVYWHVARLIGWGKYDG 436

Query: 303 -TPYWLVINTWGPHWGDRGTVKI 324
              YWL +N++G HWGD G  +I
Sbjct: 437 DNHYWLAVNSFGRHWGDDGVFRI 459


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/347 (30%), Positives = 141/347 (40%), Gaps = 58/347 (16%)

Query: 21  FSDAYIDQINREANT-WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
             ++ I ++N   +  W A  N         L  F +   KY     +P P       P 
Sbjct: 41  LQESIIKKVNENPDAGWEAAMN-------PQLSNFTVGQFKYL-LGAKPTPKKELMGVPM 92

Query: 80  YS----ATVPDRFDAREQWPNCGTIGHV-----------------PDTGACAAPHIFAAV 118
            S      +P  FDAR  WP+C TIG +                    G C +   F AV
Sbjct: 93  ISHPKTLKLPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAV 152

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG- 177
            + SDR CI      N  LS   + +CC     D    C  G     W +    G VT  
Sbjct: 153 ESLSDRFCIHFG--MNISLSVNDLLACCGFLCGD---GCDGGYPMYAWRYFVHHGVVTEE 207

Query: 178 -GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
              Y D  GC        SH G  P  P+         KC  +C +    + + Q KH +
Sbjct: 208 CDPYFDNIGC--------SHPGCEPGFPT--------PKCVRKCIDKN--QLWRQSKHYS 249

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
              Y +  +   +  E+  +GP   +F +Y+DF HYKSGVYKH +   +    H+ KLIG
Sbjct: 250 VNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGG--HAVKLIG 307

Query: 297 WGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WGT +NG  YWL+ N W   WGD G  KI RG  EC  E    AG P
Sbjct: 308 WGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLP 354


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 137/319 (42%), Gaps = 46/319 (14%)

Query: 26  IDQINREANTWTAGRNFP--ANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           ID+IN           +P  ANLS    R  L   +      D P        D E    
Sbjct: 78  IDKINANETLGWKATEYPRFANLSISEARDSLFGLSLLSTDPDTP------RLDIEPRVD 131

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR QW  C  I  V D   C A   F+A    + R CI + G+ N  LS EY  
Sbjct: 132 LPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLAHRLCIATNGKTNVVLSPEYQV 189

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
            C  +     NK+C  G +   W+FL + G+               T+  C  + S    
Sbjct: 190 QCDTM-----NKACQGGYLKYAWSFLERTGT---------------TVDSCIPYASGRAT 229

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            S          C  +C   T     ++ K+   ++       + IK  I+++G   + F
Sbjct: 230 FSSGT-------CPAKCKVSTQSMTMYKAKNSRYIS-----GVNNIKAAIMSYGSVQSGF 277

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y DF  Y+SGVYKH S   L    H+  LIGWG E+GT YWL +N+WG +WG  G  K
Sbjct: 278 TIYRDFMSYRSGVYKHVSTTTLGG--HAVALIGWGVESGTNYWLAVNSWGSNWGMSGYFK 335

Query: 324 ILRGKYECAFEYLIAAGKP 342
           I +G  EC  E  + AG+P
Sbjct: 336 IAQG--ECGIENQVYAGEP 352


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 95/274 (34%), Positives = 126/274 (45%), Gaps = 34/274 (12%)

Query: 67  RPLPGDRKTYDPEYS----ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
           +P+P       P  S      +P  FDAR  W  C TIG + D G C +   F AV + S
Sbjct: 80  KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GD 179
           DR CI      N  LS   + +CC  +C       C  G     W +L   G VT     
Sbjct: 140 DRFCIHF--DVNISLSVNDLLACCGFLC----GSGCDGGYPLYAWRYLAHHGVVTEECDP 193

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           Y D+ GC        SH G  P        + PK  C  +C +    + + + KH +   
Sbjct: 194 YFDQIGC--------SHPGCEPAY------RTPK--CVKKCVSGN--QVWKKSKHYSVSA 235

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG- 298
           Y V+ +   I  E+  +GP    F +Y+DF +YKSGVYKH +  +L    H+ KLIGWG 
Sbjct: 236 YRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGG--HAVKLIGWGT 293

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECA 332
           T++G  YWL+ N W   WGD G  KI RG  EC 
Sbjct: 294 TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECG 327


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 101/174 (58%), Gaps = 4/174 (2%)

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
           ++L KRG VTGG   + TGCQP     C H  +    P+C  +     +C  +C    Y 
Sbjct: 8   DYLVKRGIVTGGSKENHTGCQPYPFPKCEHL-TKGKYPACGTKIYKTPQCKQKCQK-GYK 65

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
             + QDK+     Y V  N  AI+KEI+ +GP  A F +Y+DF +YKSG+Y+H + + + 
Sbjct: 66  TPYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVG 125

Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
              H+ ++IGWG E  TPYWL+ N+W   WG++G  +I+RG+ EC+ E  + AG
Sbjct: 126 G--HAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 177


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 97/270 (35%), Positives = 127/270 (47%), Gaps = 30/270 (11%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + S  +P  FDAR  W  C +I  + D G C +   F AV + SDR CIK     N  LS
Sbjct: 98  DLSLKLPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCIKY--NLNVSLS 155

Query: 139 T-EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCS 195
             + VA C  +C    N     G+    W +    G VT     Y D TGC        S
Sbjct: 156 ANDVVACCGLLCGLGCNGGFPMGA----WLYFKYHGVVTEECDPYFDNTGC--------S 203

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H G  P  P+         KC  +C +     G  + KH     Y ++ +   I  E+  
Sbjct: 204 HPGCEPGYPT--------PKCVRKCVSENQLWG--ESKHYGVSAYRINHDPQDIMAEVYK 253

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
           +GP    F +Y+DF HYKSGVYKH +  K+    H+ KLIGWGT ++G  YWL+ N W  
Sbjct: 254 NGPVEVAFTVYEDFAHYKSGVYKHITGTKIGG--HAVKLIGWGTSDDGEDYWLLANQWNR 311

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPKN 344
            WGD G  KI RG  EC  E+ + AG P +
Sbjct: 312 SWGDDGYFKIRRGTNECGIEHGVVAGLPSD 341


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 80/269 (29%), Positives = 129/269 (47%), Gaps = 19/269 (7%)

Query: 57  ADAKYFDQSDRPLPGD----RKTYDPEYSATVPDRFDAREQWPNCGTI-GHVPDTGACAA 111
           A A +F + + PL       R   D + S  +P+ FDA E+WP C  +  ++ D   C +
Sbjct: 41  AGAYHFGRINDPLRKSTLKKRTEADYDLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGS 100

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHK 171
               ++ G  SDR C+ + G+    +S    ASC           C+ G     +    +
Sbjct: 101 CWAVSSAGVMSDRICVATNGKVKVSISGIATASCV------GGDGCNGGLEEVAFEKFIE 154

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK---CHTRCTNPTYGRG 228
            G  TG +     GCQP     C+HH ++   P C++  VP+ K   C   C    Y R 
Sbjct: 155 NGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDS--VPEYKADTCSHECQK-DYDRK 211

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
           + +D +     Y   D E  I++EI+ +GP   +F +Y+ F +Y  G+Y+ T   +++ Y
Sbjct: 212 YEEDLYYGKEQYGFSD-EAPIQREIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGY 270

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWG 317
            H+ +++GWG ENGT YW + N+W   WG
Sbjct: 271 -HAVRVVGWGVENGTKYWKIANSWNEQWG 298


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 12/205 (5%)

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWN 167
           C +   F AV A SDR CI +    +  +S E + +CC  +C       C+ G     WN
Sbjct: 1   CGSCWAFGAVEAISDRICIHT--NVSVEVSAEDLLTCCGSMC----GDGCNGGYPAEAWN 54

Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR 227
           F  ++G V+GG Y    GC+P +I PC HH +    P       PK    ++   P Y  
Sbjct: 55  FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKC---SKICEPGYSP 111

Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
            + QDKH    +Y V ++E  I  EI  +GP    F++Y DF  YKSGVY+H +   +  
Sbjct: 112 TYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG 171

Query: 288 YLHSGKLIGWGTENGTPYWLVINTW 312
             H+ +++GWG ENGTPYWLV N+W
Sbjct: 172 --HAIRILGWGVENGTPYWLVANSW 194


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 86/242 (35%), Positives = 121/242 (50%), Gaps = 28/242 (11%)

Query: 87  RFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC 146
            FD+R++WPNC  +  + D G C + + FA+    SDR CI S G  N  LS + + +C 
Sbjct: 5   EFDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCS 62

Query: 147 KICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC 206
                  +  C+ G     ++++HK G V+         C P      + H   P    C
Sbjct: 63  WY-----SFGCNGGIPGLVFDYIHKDGLVS-------DACFPYLSYDGNTHVKCPDF--C 108

Query: 207 ENQKVPKLKCHTRCTNPTYGRG-FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
            N K    K      +  Y  G F +DK +  L          I+KEIL HGP  A F +
Sbjct: 109 YNNKTKSFKSDKHFADKVYHVGEFLEDKAKRVL---------EIQKEILTHGPVNADFMV 159

Query: 266 YDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKIL 325
           Y DF  YKSGVY+H + +  E  +H+ K+IGWGTENG  YWL+ N+WG  +G +G  KI+
Sbjct: 160 YSDFTVYKSGVYRHQTGS-FEG-IHAVKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIV 217

Query: 326 RG 327
           RG
Sbjct: 218 RG 219


>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 332

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 103/344 (29%), Positives = 151/344 (43%), Gaps = 48/344 (13%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG-RNFPANLSEEYLRQFLIADA 59
           M+ ++ +LL    V       S   + +I      WTAG  +    LSE+ LR       
Sbjct: 27  MLSVITYLLAGLGVALSKPLLSRRELQEIRALQPPWTAGISDRLVGLSEDDLRAM----- 81

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
             F +  +P     +    E S  +PD FD RE++P C  I  V D G C A   F+A G
Sbjct: 82  --FPRHGQPTRPSAECPRAEPSGPIPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATG 137

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF DRRC++       P S +Y  SC      D +  C+ G+ F  W FL + G+ T  +
Sbjct: 138 AFGDRRCMQWLDPVGVPYSQQYTVSC-----DDLDLGCAGGTSFNVWTFLTEHGTTTL-E 191

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               T       SPC      P L  C++    +L     C +                 
Sbjct: 192 CVRYTDADKDLSSPC------PAL--CDDGSEIQLVKADGCLD----------------- 226

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
                N  AI + +   GP  A  ++Y DF +Y+ GVYKH    ++ +  H+ ++IG+GT
Sbjct: 227 --YSGNVTAIMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISS--HAVEIIGYGT 282

Query: 300 ---ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
              E   PYW+V N+ GP+WG+ G   I+RG  EC  E  + +G
Sbjct: 283 TDDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSG 326


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 134/288 (46%), Gaps = 50/288 (17%)

Query: 79  EYSATVPDRFDAREQWPNCGTI-----GHVPDT---------------GACAAPHIFAAV 118
           + S  +P  FDAR  W +C +I     G++ +                G C +   F AV
Sbjct: 98  DLSLKLPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAV 157

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
            + SDR CIK     N  LS   V +CC + C +     C+ G     W +    G VT 
Sbjct: 158 ESLSDRFCIKY--NLNVSLSANDVIACCGLLCGF----GCNGGFPMGAWLYFKYHGVVTQ 211

Query: 178 --GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
               Y D TGC        SH G  PT P+ + ++    KC +R  N  +G    + KH 
Sbjct: 212 ECDPYFDNTGC--------SHPGCEPTYPTPKCER----KCVSR--NQLWG----ESKHY 253

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
               Y ++ +   I  E+  +GP    F +Y+DF HYKSGVYK+ +  K+    H+ KLI
Sbjct: 254 GVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGG--HAVKLI 311

Query: 296 GWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           GWGT ++G  YWL+ N W   WGD G  KI RG  EC  E  + AG P
Sbjct: 312 GWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 359


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 71/177 (40%), Positives = 95/177 (53%), Gaps = 6/177 (3%)

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
           N+   +G V+GG YG   GC P  I+PC HH +    P  E  K P   C  +C    Y 
Sbjct: 10  NYCKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPT--CVKKCEE-GYK 66

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
             + QD H     Y + ++ D I++EI  +GP    F +Y+DF  Y++GVYKH +   L 
Sbjct: 67  VPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALG 126

Query: 287 NYLHSGKLIGWGTENG-TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
              H+ +++GWG +NG  PYWLV N+W   WG  G  KILRG  EC  E  I AG P
Sbjct: 127 G--HAIRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 94/172 (54%), Gaps = 4/172 (2%)

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
           + S +GG +G   GC P  I+PC HH +  T P+C  ++    KC   C   +Y   + Q
Sbjct: 1   KASSSGGPFGSNQGCHPYKIAPCEHHVNG-TRPACNGEEGKTPKCIKHC-QASYTVAYEQ 58

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
           DK     +Y V  +   I+KEI+ +GP    F +Y+D   YK GVY+H +   L    H+
Sbjct: 59  DKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGG--HA 116

Query: 292 GKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            +++GWG EN  PYWL+ N+W   WG+ G  KILRG   C  E  I+AG PK
Sbjct: 117 IRILGWGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 92/264 (34%), Positives = 122/264 (46%), Gaps = 32/264 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR +WP  G I    D G C A    +     SDR  I SKG     LS +++ 
Sbjct: 190 LPNSFDARNKWP--GWISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC K       + C  G + R W F+ K G V   DY     C P T +P          
Sbjct: 248 SCNK-----GQRGCQGGHLSRAWTFIRKFGLVD--DY-----CYPWTGTP---------- 285

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C+  K P     +    P+ G     + +R    Y + D +D I +EI+  GP  AT 
Sbjct: 286 TKCKIPKRPNFDALSSICPPSLGSNLRSELYRVGPAYKIQDEKD-IMEEIMQSGPVQATM 344

Query: 264 ALYDDFYHYKSGVY-KHTSNAKLENY-LHSGKLIGWGTEN---GTP--YWLVINTWGPHW 316
            +Y DF+ YKSGVY K  +  +  N+  HS K++GWG E    G P  YWL  N+WG  W
Sbjct: 345 KVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIKYWLAANSWGQQW 404

Query: 317 GDRGTVKILRGKYECAFEYLIAAG 340
           G+ G  KI RG  EC  E  + A 
Sbjct: 405 GENGFFKIRRGTNECEIEEFVLAA 428


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 6/199 (3%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           F AV A SDR CI SKG+    LS   + SCC+ C +     C+ G     W F  K G 
Sbjct: 5   FGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGF----GCNGGDPLSAWKFWVKEGI 60

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTG ++    GC+P     C HH +      C++   P  KC   C      R + +DK+
Sbjct: 61  VTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKY 120

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y V ++ +AI+KEI+ +GP    F +Y+DF +Y  G+Y H   A      H+ K+
Sbjct: 121 FGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGG--HAVKM 178

Query: 295 IGWGTENGTPYWLVINTWG 313
           IGWG +NG PYW  + T G
Sbjct: 179 IGWGIDNGVPYWXHLPTHG 197


>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 328

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/248 (32%), Positives = 115/248 (46%), Gaps = 24/248 (9%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +   FDAR++WP C TIG   + G  A    +AA G  +DR CI + G  N+ +STE + 
Sbjct: 84  IHKEFDARKRWPQCKTIGEFRNEGNFALSWAYAAAGVLADRMCIATNGSYNQLISTEELI 143

Query: 144 SCCKICRYDDNKSCSHGSVF--RTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           SC  +          HG V     W +L   G V+GG Y    GCQPS I P   +    
Sbjct: 144 SCSGVS------GGYHGIVSEREVWEYLKSHGLVSGGKYNTSDGCQPSKIPPIEEY---- 193

Query: 202 TLPSCENQKVPKLKCHTRC-TNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                E  ++    C+  C  N T     + D H     Y+    ED I++E+  +GP +
Sbjct: 194 ----MEYSEIKNYTCNDHCYGNKTIN---YNDDHVKVSNYYQVQYED-IQEEVQNYGPVS 245

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F + DD +     +       K + Y+   KLIGWG ENG  YWL++++WG   G  G
Sbjct: 246 VEFYIRDDIFTPFLSINPRFQRRKYKGYV---KLIGWGVENGEDYWLLVDSWGYERGQNG 302

Query: 321 TVKILRGK 328
             K+ R K
Sbjct: 303 VFKVERFK 310


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 86/268 (32%), Positives = 120/268 (44%), Gaps = 36/268 (13%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P  F+A E+W     I  VPD G C A  + +     SDR  I+S+G++   LS +
Sbjct: 184 SNDLPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC +       + C  G +   W ++HK G +    Y                    
Sbjct: 242 NILSCTR-----RQQGCDGGHLDAAWRYMHKNGVLDANCY-------------------- 276

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYG----RGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
              P  + +   K++ H   +   YG     G  +D   T    +    E  I  EI   
Sbjct: 277 ---PYIQQRDTCKVQRHRGRSLKAYGCQPAHGVNRDNFYTVGPAYSLSREADIMAEIYHS 333

Query: 257 GPTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGP 314
           GP  AT  +Y DF+ Y SGVY+HT+ N       HS KL+GWG E NG  YW+  N+WGP
Sbjct: 334 GPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGP 393

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKP 342
            WG+RG  +ILRG  EC  E  + A  P
Sbjct: 394 WWGERGYFRILRGSNECGIEEYVLASWP 421


>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 254

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 107/203 (52%), Gaps = 12/203 (5%)

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           ++  +P  FD+R++WPNC +IGH+ + G C + +  AA  A SDR CI S   +N  +S 
Sbjct: 59  FTNGLPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSA 118

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC +C Y     C  GS+F +W+F  + G V+GG+Y    GCQP TI PC     
Sbjct: 119 QQIISCCYLCGY----GCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINE 174

Query: 200 APTLPSC---ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            P   SC     ++ P   C  +C NP Y   F  D +R     +   +     KEI  +
Sbjct: 175 KPPGHSCTTFNREETP--TCEKKCNNPNYYTSFRADIYRGK---YYKVSPYMAMKEIFDN 229

Query: 257 GPTTATFALYDDFYHYKSGVYKH 279
           GP T  F +Y D   YKSGVY++
Sbjct: 230 GPITTQFYMYRDLVDYKSGVYQY 252


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/202 (37%), Positives = 102/202 (50%), Gaps = 8/202 (3%)

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C +   F AV A SDR CI + G+ N  +S E + +CC I   D    C+ G     WNF
Sbjct: 1   CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGD---GCNGGYPSGAWNF 57

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
             K+G V+GG Y    GC P TI PC HH +    P       P  +C+  C    Y   
Sbjct: 58  WTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTP--RCNKSC-EAGYSPS 114

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
           + +DKH    +Y V ++   I  EI  +GP    F ++ DF  YKSGVYKH +   +   
Sbjct: 115 YKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGG- 173

Query: 289 LHSGKLIGWGTENGTPYWLVIN 310
            H+ +++GWG ENG PYWL  N
Sbjct: 174 -HAIRILGWGVENGVPYWLAAN 194


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 84/264 (31%), Positives = 120/264 (45%), Gaps = 35/264 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A E+WP+   I  VPD G C +  + +     SDR  I+SKG++   LS + + 
Sbjct: 186 LPSSFNAVERWPS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQNIL 243

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W FLHK+G V    Y                       
Sbjct: 244 SCTR-----RQQGCDGGHLDAAWRFLHKKGVVDDSCY----------------------- 275

Query: 204 PSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P  + +   K++ ++R       R      +D   T    +  + E  I  EI   GP  
Sbjct: 276 PYTQQRDTCKIRHNSRSLKANGCRPSPNVDRDSFYTVGPAYTLNREGDIMAEIYHSGPVQ 335

Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
           AT  +Y DF+ Y  G+Y+ T+ N       HS KL+GWG E NG  YW+  N+WGP WG+
Sbjct: 336 ATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGE 395

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
           RG  +ILRG  EC  E  + A  P
Sbjct: 396 RGYFRILRGSNECGIEEYVLASWP 419


>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
          Length = 230

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 78/245 (31%), Positives = 126/245 (51%), Gaps = 19/245 (7%)

Query: 38  AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
           A +NFP N  +E + + L+   +    S  P+  + + Y    ++ VP+ FD+R +W  C
Sbjct: 1   AKQNFPENTPKEQIVR-LLGSKRLLGVSKSPIKENDELYMD--NSEVPEFFDSRLEWDYC 57

Query: 98  GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
            TIGHV + G C +       GAF+DR C+ + G+ N  +S E +  CC  C +     C
Sbjct: 58  ETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCCHTCGF----GC 113

Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC----SHHGSAPTLPSCENQKVPK 213
           + G   + W +  + G VTGGDY    GCQP  + PC      H S    P+  N K  K
Sbjct: 114 NGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSK 173

Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYK 273
            KC+   T       + ++ ++T   Y++ +    ++K+ + +GP  A+F +YDDF +Y+
Sbjct: 174 -KCYGDDT-----IDYKKNHYKTKDAYYLKNT--TMQKDTMVYGPIEASFDVYDDFMNYE 225

Query: 274 SGVYK 278
           SGVY+
Sbjct: 226 SGVYQ 230


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 122/261 (46%), Gaps = 41/261 (15%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P +FDAREQWP C  I  + +   C +   F+A    +DR CIKS G+ N  LS +++ S
Sbjct: 126 PTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVS 183

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C        N  C+ G    TW FL   G+V+         C P      S  G+ P   
Sbjct: 184 CS-----GQNNGCNGGFFDATWRFLVSVGTVS-------EACVPYV----SFGGAVPA-- 225

Query: 205 SCENQKVPKLKCHTR-CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                      C+ + C  P     F++      L   +D     I  ++ A+GP     
Sbjct: 226 -----------CNVKSCGVPGQKSPFYRAGSARKLEGMLD-----IMADLKANGPIQVAM 269

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGDRGT 321
            +Y DFY YKSGVY H S   +    H+ K++GWG ++ +  PYW+  N+WG  WG +G 
Sbjct: 270 GVYRDFYSYKSGVYHHVSGRYVGG--HAVKIVGWGYDSASKLPYWICANSWGEDWGIKGY 327

Query: 322 VKILRGKYECAFEYLIAAGKP 342
             ILRG+ EC    ++ +GKP
Sbjct: 328 FWILRGRGECGIGKMVWSGKP 348


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/270 (32%), Positives = 127/270 (47%), Gaps = 26/270 (9%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  +P  FDA ++WP  G I    D G CA    F+     SDR  I SKG     LS +
Sbjct: 54  NVVLPRNFDAAQKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQ 111

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC         + C+ G + R W+FL +RG V+   Y      Q S   PC  +   
Sbjct: 112 NLLSC----NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYP--LASQNSIAEPCRMY--- 162

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
            + P    ++     C     N  +   +  D +++T  Y +  NE  I KEI+ +GP  
Sbjct: 163 -SRPMGRGKRQATGPC---PNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQ 218

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NG--TPYWLVI 309
           A   +++DF+ YK G+Y+HT  SN K   +     HS K+ GWG E   NG    +W   
Sbjct: 219 ALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRRVKFWRAA 278

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           N+WGP WG+ G+ +ILRG  EC  E  +  
Sbjct: 279 NSWGPTWGEGGSFRILRGCNECDIESFVVG 308


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 87/263 (33%), Positives = 121/263 (46%), Gaps = 46/263 (17%)

Query: 83  TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           ++P+ FDARE+WP C   IG + + G C +   FA+    +DR CI SKG+     S E 
Sbjct: 75  SIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPEN 134

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + +C      D    C  G +   W++    G  +GGDY    GCQP             
Sbjct: 135 LLTC----CKDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQP------------- 177

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ--DKHRTTLTYWVDDNEDAIKKEILAHGPT 259
                                  Y    FQ  +       Y ++ N   I+ EIL +GP 
Sbjct: 178 -----------------------YSESSFQYAEASECVKFYTLETNVAQIQMEILTNGPV 214

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A + +++DF  +KSGVY + S   +    HS K+IGWGTE G PYWL+ N+WG  WG+ 
Sbjct: 215 MAYYNVFEDFACHKSGVYYYKSGKFVGR--HSVKVIGWGTEEGIPYWLIANSWGSEWGEL 272

Query: 320 GT-VKILRGKYECAFEYLIAAGK 341
           G   K+ RG  EC  E  + AGK
Sbjct: 273 GGFFKMRRGTNECWIEQEMTAGK 295


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 95/278 (34%), Positives = 127/278 (45%), Gaps = 45/278 (16%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDPE   ++P  FDAR +W     I  V D G C A    +     SDR  + SKG 
Sbjct: 189 RRVYDPE---SLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGT 243

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
            +  LS +++ SC K       + C  G + R W F+ K G V    Y   G    C+  
Sbjct: 244 DSVLLSAQHLLSCNK----KGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGVYEQCKLQ 299

Query: 190 TISPCSHHG-SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
             +     G  AP  P         L+       P Y  G                NE  
Sbjct: 300 KRTNLEAAGCRAPANP---------LRKELYKVGPAYRLG----------------NETD 334

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWG----TENGT 303
           I +EIL  GP  AT  +Y DF+ Y+SG+Y HT  A+L E+  HS ++IGWG    T++G 
Sbjct: 335 IMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSGL 394

Query: 304 P--YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           P  YWLV+N+WG  WG+ G  +I RG  EC  E  + A
Sbjct: 395 PIKYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 107/203 (52%), Gaps = 14/203 (6%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            +   A SDR CI SKG     +S + + SCC  C       C  G     W +    G 
Sbjct: 5   VSTAAAMSDRICIASKGATQVLISAQDIVSCCTWC----GAGCEGGWPIEAWKYGVTEGV 60

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNPTYGRGFFQDK 233
           VTGG++G +  C+   I PC +HG+ P    C +  + P   C  RC  P Y   +  DK
Sbjct: 61  VTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPP--CKKRC-RPGYKNSYMMDK 117

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
              T  Y + ++  AI+++I+ +GP  A F +Y+DF +YKSG+Y+HT+        H+ K
Sbjct: 118 RYGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGG--HAVK 175

Query: 294 LIGWG---TENGT-PYWLVINTW 312
           +IGWG   TENGT PYW++ N+W
Sbjct: 176 VIGWGEEXTENGTIPYWIIANSW 198


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/338 (31%), Positives = 153/338 (45%), Gaps = 56/338 (16%)

Query: 26  IDQINRE-ANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQ---SDRPLPGDRKTYDPE 79
           I++IN + ++TW AG  RN       E  R  L+  AK   Q   S+  +   +   + +
Sbjct: 8   INEINSDPSSTWKAGVNRNLAGKTVAEMKR--LLGFAKKEGQVRYSEEQMTTIKHYNEAK 65

Query: 80  YSAT----------------VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
            SA                 +P  FD+R+QW  C  I  + +   C +   F+A  + SD
Sbjct: 66  ASAVKSVGVEEASKQFKTLGLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSD 123

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
           R CI S G+ +  LS + + SC     Y+D   C  G++   W ++  +G V        
Sbjct: 124 RFCIASNGKVDVILSPQDMVSC----DYND-MGCDGGNLDNAWWWMKNKGIVP------- 171

Query: 184 TGCQPSTISPCSHHGSAPTLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
             C P      S  G+ P  PS C    +P        +   Y + F    H +   +W 
Sbjct: 172 DSCMPYV----SGGGNVPACPSNCNGTNIP------ISSQLYYAKSF---SHISPWMFW- 217

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
            +    I++EI  +GP    F++Y DF +YKSGVY H + + L    H+ K+IGWG E G
Sbjct: 218 -ERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGG--HAIKIIGWGVEGG 274

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
             YWLV N+W   WG  GT KILRG  EC  E  + AG
Sbjct: 275 VDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAG 312


>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 260

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/267 (31%), Positives = 125/267 (46%), Gaps = 15/267 (5%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA- 59
            + +L  +L       + Y    +YID IN  A+TW AG NF  N S+E + + L +   
Sbjct: 4   FLILLSIVLFSVYQTEQAYFLQKSYIDTINEVASTWKAGVNFDPNTSQEDIVKLLGSTGV 63

Query: 60  -KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
                 S      D   Y+  Y  T P  FDAR++W +C TIG V D G C +   F   
Sbjct: 64  ESAMKASANEFKMDDVAYNKLYGYT-PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTS 122

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ + G  N  LS E +  CC  C +     C+ G   + W +    G VTGG
Sbjct: 123 SAFADRLCVATDGDFNELLSAEEITFCCHTCGF----GCNGGDPIKAWKYFSTHGLVTGG 178

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRT 236
           +Y    GC+P  + PC          +C  +  P+ K H RCT   YG     +++ HR 
Sbjct: 179 NYKSGEGCEPYRVPPCPRDDKGKN--TCAGK--PREKNH-RCTRMCYGNQDLDYREDHRY 233

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATF 263
           T  ++      +I+K+++ +GP  ATF
Sbjct: 234 TRDFYY-LTYGSIQKDVMTYGPIEATF 259


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 124/267 (46%), Gaps = 35/267 (13%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +A +P  F+A E+W +   I  VPD G C +  + +     SDR  I+SKG++   LS +
Sbjct: 186 TAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC +       + C  G +   W +LHK+G V          C P T          
Sbjct: 244 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ESCYPYT---------- 281

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                 +++   K++ ++R       R      +D   T    +  + E  I  EI   G
Sbjct: 282 ------QHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVGPAYTLNKESDIMAEIYHSG 335

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +Y DF+ Y SGVY+ T+ N       HS KL+GWG E NG  YW+  N+WGP 
Sbjct: 336 PVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPW 395

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+RG  +ILRG  EC  E  + A  P
Sbjct: 396 WGERGYFRILRGSNECGIEDYVLASWP 422


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 124/267 (46%), Gaps = 35/267 (13%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +A +P  F+A E+W +   I  VPD G C +  + +     SDR  I+SKG++   LS +
Sbjct: 186 TAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC +       + C  G +   W +LHK+G V          C P T          
Sbjct: 244 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ESCYPYT---------- 281

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                 +++   K++ ++R       R      +D   T    +  + E  I  EI   G
Sbjct: 282 ------QHRDTCKIRHNSRSLKANGCRPSANVDRDSFYTVGPAYTLNKESDIMAEIYHSG 335

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +Y DF+ Y SGVY+ T+ N       HS KL+GWG E NG  YW+  N+WGP 
Sbjct: 336 PVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPW 395

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+RG  +ILRG  EC  E  + A  P
Sbjct: 396 WGERGYFRILRGSNECGIEDYVLASWP 422


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 150/326 (46%), Gaps = 58/326 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++ ++ +N + ++TW A          EY R+ +          +  LP +    D E
Sbjct: 10  IAESIVETVNNDPSSTWVA---------IEYPREVITLAKMRAMLGEEVLPLE----DVE 56

Query: 80  YSA--TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           Y     VP+ FDAREQWP  G I  V D  +C +    AA  A  +R  IK  G+    L
Sbjct: 57  YVEPNNVPENFDAREQWP--GKIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGM--L 112

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SC K      +  C+ GS   +  +L   G  T         C P      S +
Sbjct: 113 SVQDLVSCDK-----GDSGCNGGSGPLSSKWLVSNGVTT-------EECLPYV----SGN 156

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G  P              C  +C+N   G    + K+    TY V +    I++E++ +G
Sbjct: 157 GRVPA-------------CAAKCSN---GSQIIRYKYEKAETYTVQN----IQEELMKNG 196

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P    F +Y DF +YKSGVY+H S  +     H+  LIGWG E+G PYWL+ N+WGP WG
Sbjct: 197 PVYFRFTVYSDFMNYKSGVYQHKSGYQEGG--HAVLLIGWGVEDGVPYWLLQNSWGPAWG 254

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           ++G  KI+RGK EC  E    AG  K
Sbjct: 255 EKGHFKIIRGKNECGCEQGFYAGPVK 280


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/341 (30%), Positives = 147/341 (43%), Gaps = 49/341 (14%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA-KYFDQSDRP--L 69
           L+R E+       ID +N     W A RN+       +L    + D  KY   + +P  +
Sbjct: 153 LIRKEV-------IDHVNSHNPGWQA-RNY------TFLWGMTLKDGIKYRLGTFKPQGM 198

Query: 70  PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKS 129
             +  +   +    +PD FDARE+WP+   I  V D G C A + F+     +DR  I S
Sbjct: 199 IEEMSSLKVDADEVMPDEFDAREEWPS--FIHPVQDQGNCGASYAFSTSTVAADRLSIHS 256

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
            G+    LS +Y+ SC         K C  G V R W  L + G+V+         C P 
Sbjct: 257 GGELKDMLSAQYLISCTTD---HHQKGCEGGHVDRAWWQLRRVGTVS-------KDCYPY 306

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
           T    +  G       C   K    K +  C     G+G     ++ +  Y +   E  I
Sbjct: 307 TSGDTNDPGK------CLMSKYKLPKKNIECP---VGQGITSKLYQASPPYRIAAKEREI 357

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK-------LIGWGT--- 299
             EI+ +GP  A   + DDFY Y+ GVYKH+   K  NY H GK       +IGWGT   
Sbjct: 358 MNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYT 417

Query: 300 -ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            ++   YWL  NTWG HWG+ G  +I RG  E   E  +  
Sbjct: 418 GDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/198 (36%), Positives = 102/198 (51%), Gaps = 8/198 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLS-TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           +A    SDR C+++ G++   LS T+ +A C   C Y     C+ G   R W +    G 
Sbjct: 6   SAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGY----GCNGGYSARAWLYARNSGV 61

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
            +GG Y ++  C+P T  PC +H +      C         C   C    YG+ + +DK 
Sbjct: 62  CSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYC-QYGYGKRYEKDKI 120

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y V  +E AI+ EI A GP  A+FA Y+DF HYKSG+Y HT+  +     H+ K+
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGG--HAVKI 178

Query: 295 IGWGTENGTPYWLVINTW 312
           IGWG ENGT  W+V N+W
Sbjct: 179 IGWGVENGTKXWIVANSW 196


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 124/281 (44%), Gaps = 49/281 (17%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDPE   ++P  FDAR +WP    I  + D G C A    +A    SDR  + SKG 
Sbjct: 194 RRIYDPE---SLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGA 248

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
            +  LS +++ SC         ++CS G + R W ++ K G V    Y   G    C+  
Sbjct: 249 DSVLLSAQHLLSC----NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNAQCKLR 304

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
             +     G  P         V  L+       P Y  G                NE  I
Sbjct: 305 KRTDLKTAGCRP--------PVNPLRTELYKVGPAYRLG----------------NETDI 340

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTENGT--- 303
             EIL  GP  AT  +Y DF+ Y+SG+YKHT  A  E+Y    HS ++IGWG +      
Sbjct: 341 MYEILTSGPVQATMKVYQDFFSYESGIYKHT--ATTEHYAFGYHSVRIIGWGEDTSAHRH 398

Query: 304 -----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
                 YWLV+N+WG  WG+ G  +I RG  EC  E  + A
Sbjct: 399 HNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 89/273 (32%), Positives = 123/273 (45%), Gaps = 38/273 (13%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDPE   ++P  FD+R +WP    I  + D G C A    ++    SDR  I SKG 
Sbjct: 192 RRIYDPE---SLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGT 246

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
               LS +++ SC         + CS G + R W F+ + G V    Y  +   +   + 
Sbjct: 247 DAVELSAQHLLSC----NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTETCRLR 302

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             +   SA   P     +    K       P Y                   NE  I +E
Sbjct: 303 KRTDLRSAGCAPPPNPLRTELYK-----VGPAYRLA----------------NETDIMQE 341

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE-----NGTP-- 304
           IL  GP  AT  +Y DF+ Y+SGVYKH+  A+L E+  HS ++IGWG E       TP  
Sbjct: 342 ILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLK 401

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YWLV N+WG  WG+ G  +I +G  EC  E  +
Sbjct: 402 YWLVANSWGQQWGENGLFRIQKGTNECEIESFV 434


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 94/276 (34%), Positives = 130/276 (47%), Gaps = 42/276 (15%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDP+    +P  FDAR +WP    I  + D G C A    +     SDR  I SKG 
Sbjct: 195 RRIYDPD---ALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGA 249

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
           ++  LS +++ SC         + C  G + R W F+ K G V    Y   G    C+  
Sbjct: 250 EDVELSAQHLLSC----NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWTGRNDQCRLR 305

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
             S  +  G     P+   Q++ K+        P Y  G                NE  I
Sbjct: 306 KRSNLNVAGCRKP-PNPLRQELYKV-------GPAYRLG----------------NETDI 341

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
            +EIL  GP  AT  +Y DF+ YK+GVY+H+ +A+L ++  HS ++IGWG E    G P 
Sbjct: 342 MQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPL 401

Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            YWLV N+WG HWG+ G  +I RG  EC  E  + A
Sbjct: 402 KYWLVANSWGRHWGENGLFRIQRGTNECEIESYVLA 437


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 144/315 (45%), Gaps = 54/315 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           F+++ ++ +N     TW A    P  ++   LR  L A     D ++ P       Y P+
Sbjct: 10  FAESIVETVNNHPGATWVAVEYPPEVITTAKLRARLGA----IDLNEGP-----SNYVPD 60

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            S  +PD FDAREQWP  G I  V +   C +   FA      +R  I   G+ +  +S 
Sbjct: 61  TS--LPDNFDAREQWP--GKILPVRNQEQCGSCWAFAVAETTGNRLNILGCGRGD--MSP 114

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SC K+     +  C+ GS   +W ++   G  T              I   S  G 
Sbjct: 115 QDLVSCDKV-----DHGCNGGSPLFSWEWVKHSGITT-----------EECIPYVSGGGR 158

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P+ P              +CTN   G    + K ++          D ++ E+ + GP 
Sbjct: 159 VPSCPK-------------KCTN---GSAIVRTKAKSVGLV----KGDKMQNELYSRGPF 198

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A F++Y+DF  YKSGVY H +   L    H+  ++GWG E+GTPYWL+ N+WG  WG++
Sbjct: 199 EAAFSVYEDFKSYKSGVYHHITGKMLGG--HAVMVVGWGVEDGTPYWLIQNSWGTTWGEQ 256

Query: 320 GTVKILRGKYECAFE 334
           G  KILRGK EC  E
Sbjct: 257 GFFKILRGKNECGIE 271


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 134/317 (42%), Gaps = 38/317 (11%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANT-WTAGRN-FPANLSEEYLRQFLIADAKYFDQS 65
           L G       L       I  +N+  N  WTAG N + AN + E  +  L          
Sbjct: 25  LAGTAKAEHSLGIIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPT----P 80

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
              L G      PE    +P  FDAR QW +C TIG++ D G C A   FAAV A  DR 
Sbjct: 81  PGLLAGVPIKIHPEMD--LPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRF 138

Query: 126 CIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDR 183
           CI      +  LS   + +CC    +     C+ G     W +  + G VT     Y D+
Sbjct: 139 CIHLN--MSVSLSVNDLLACCG---FLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQ 193

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
           TGCQ        H G  P  P+         KC  +C      + + ++KH +   Y V 
Sbjct: 194 TGCQ--------HPGCEPAYPT--------PKCQRKCK--VENQAWKENKHFSVNAYRVH 235

Query: 244 DNEDAIKKEILAHGPTTATFALYD--DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
            N   I  E+  +GP    F      DF HYKSGVYKH +   +    H+ KLIGWGT +
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGG--HAVKLIGWGTSD 293

Query: 302 -GTPYWLVINTWGPHWG 317
            G  YWL+ N W   WG
Sbjct: 294 AGEDYWLLANQWNRGWG 310


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 120/261 (45%), Gaps = 30/261 (11%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W     I  VPD G C +  + +     SDR  I+S+G++   LS + + 
Sbjct: 189 LPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNIL 246

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G +          C P T S           
Sbjct: 247 SCTR-----RQQGCEGGHLDAAWRYLHKKGVLD-------ESCYPYTQSR---------- 284

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +C+ +    LK H     P    G  +D   T    +    E  IK EI   GP  AT 
Sbjct: 285 GTCKVRHSGSLKAHGCRPAP----GVDRDSLYTVGPAYSLSREADIKAEIFHSGPVQATM 340

Query: 264 ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGT 321
            +Y DF+ Y  G+Y+ T+ N       HS KL+GWG E NG  YW+  N+WGP WG+RG 
Sbjct: 341 RVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGY 400

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +ILRG  EC  E  + A  P
Sbjct: 401 FRILRGSNECGIEDYVLASWP 421


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 77/202 (38%), Positives = 103/202 (50%), Gaps = 17/202 (8%)

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           CC  C +     C  G   R W      G VTGGDY    GC+P  + PC +        
Sbjct: 5   CCHTCGF----GCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN-- 58

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           +C  + + K   + RCT   YG     F + HR T  Y+      +I+K+++ +GP  A+
Sbjct: 59  TCAGKPMEK---NHRCTRICYGDQELDFDEDHRYTRDYYYL-TYGSIQKDVMTYGPIEAS 114

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           F +Y DF  YKSG+Y+ T NA    YL  H+ KLIGWG + G PYWL++N+W   WGD G
Sbjct: 115 FDVYSDFPSYKSGIYERTENA---TYLGGHAVKLIGWGEQYGIPYWLMVNSWNEDWGDNG 171

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             KI RG  EC  +    AG P
Sbjct: 172 LFKIRRGTNECGVDNSTTAGVP 193


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 150/325 (46%), Gaps = 49/325 (15%)

Query: 22  SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           S A + +I      W AG  + F  N++E+  R  LI            LP    T   E
Sbjct: 17  SRAELRRIQALNPPWKAGMPKRF-ENITEDEFRGMLIR-PDILGAGSGSLPPSSVTEIQE 74

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P +FD R+++P C T   V D G+C     F+A+G F DRRC+    ++  P S 
Sbjct: 75  PADPIPSQFDFRDEYPQCVT--PVMDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYSQ 132

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG--DYGDRTGCQPSTI-SPCSH 196
           +Y+ SC       +N  C  G  + TW+FL   G+ T     Y D     P+ + SPC  
Sbjct: 133 QYLISCST-----ENHGCDGGDFWPTWSFLTLTGATTAECVKYIDY----PNIVASPC-- 181

Query: 197 HGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
               P +  C++  ++   K H       YG+              V  N  AI   +  
Sbjct: 182 ----PAV--CDDGSQIQLYKAHG------YGQ--------------VSKNVQAIMHMLAT 215

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGP 314
            GP      +Y D  +Y+SGVYKHT    +   LH+ +++G+GT ++GT YW++ N+WG 
Sbjct: 216 GGPVQTMIVVYSDLSYYESGVYKHT-YGTISLGLHALEMVGYGTTDDGTDYWIIRNSWGA 274

Query: 315 HWGDRGTVKILRGKYECAFEYLIAA 339
            WG+ G  +I+RG  EC  E  I A
Sbjct: 275 DWGENGYFRIVRGVNECRIEDEIYA 299


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 99/342 (28%), Positives = 144/342 (42%), Gaps = 52/342 (15%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           LV +     +    +  ++  +  I  + + W          S+    Q L     Y   
Sbjct: 4   LVIVGTIAAMVAATHPVNEEMVAHIKAKTSLWQPHETTTNPFSDLTKEQLLAKCGTYIVP 63

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           S++  PG           + PD FDAR+QW +   I  + D   C A   F A  A SDR
Sbjct: 64  SNKQYPGSPLI-------STPDNFDARQQWGS--KIHAIRDQQQCGACWAFGATEALSDR 114

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGG--DYG 181
             I S G  +   S E + SC      D N   C+ G +   W FL + G V      Y 
Sbjct: 115 FTIASNGSVDVVFSPEDLVSC------DTNDYGCNGGYMDMAWEFLDQHGVVADSCFPYS 168

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
             +G  P+  S C+  GSA    SC +  + + +                          
Sbjct: 169 AGSGFAPACASKCAD-GSAEKKYSCVHGSIRQSQ-------------------------- 201

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
                + IK EI+AHGP    F +Y DF++Y+SGVY  T++       H+ K++G+G EN
Sbjct: 202 ---GVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGG--HAIKILGFGVEN 256

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GTPYWL  N+WGP WG +G  KI +G  EC  E  + +  P+
Sbjct: 257 GTPYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDPQ 296


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 131/308 (42%), Gaps = 47/308 (15%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
           S   ++ I      WT     P  +SE     +  A  K    +      D   +  + +
Sbjct: 22  SQEMVNAIRSSNALWT-----PTEVSENKFANYTEAQIKGLLGTVLSHSSDIPAF-TQIN 75

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A VPD FD+R QW  C  +  + D   C +   FAA  + SDR CI S+G+ N  LS + 
Sbjct: 76  AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQD 133

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       +N  C  G +   W +L K+G  +         C+P      S  G+AP
Sbjct: 134 MVSC-----DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPYK----SASGTAP 177

Query: 202 TLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           + PS C N Q + K KC    T    G                     A K  I   GP 
Sbjct: 178 SCPSKCANGQAIKKYKCQAGSTKQANGAA-------------------ATKSLIQQSGPV 218

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF++YKSG+Y H S        H+ K++GWG +    YW+V N+WG  WG++
Sbjct: 219 ETGFTVYADFFNYKSGIYHHVSGGAEGG--HAVKILGWGKQGSENYWIVANSWGESWGEK 276

Query: 320 GTVKILRG 327
           G   I +G
Sbjct: 277 GFFNIRQG 284


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 131/308 (42%), Gaps = 47/308 (15%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS 81
           S   ++ I      WT     P  +SE     +  A  K    +      D   +  + +
Sbjct: 22  SQEMVNAIRSSNALWT-----PTEVSENKFANYTEAQIKGLLGTVLSHSSDIPAF-TQIN 75

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A VPD FD+R QW  C  +  + D   C +   FAA  + SDR CI S+G+ N  LS + 
Sbjct: 76  AAVPDSFDSRTQWQGC--VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQD 133

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       +N  C  G +   W +L K+G  +         C+P      S  G+AP
Sbjct: 134 MVSC-----DTNNYGCDGGYLNLAWQYLEKKGVAS-------DSCEPYK----SASGTAP 177

Query: 202 TLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           + PS C N Q + K KC    T    G                     A K  I   GP 
Sbjct: 178 SCPSKCSNGQAIKKYKCKAGSTKQANGAA-------------------ATKSLIQQSGPV 218

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
              F +Y DF++YKSG+Y H S        H+ K++GWG +    YW+V N+WG  WG++
Sbjct: 219 ETGFTVYADFFNYKSGIYHHVSGGAEGG--HAVKILGWGKQGSENYWIVANSWGESWGEK 276

Query: 320 GTVKILRG 327
           G   I +G
Sbjct: 277 GFFNIRQG 284


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 85/255 (33%), Positives = 127/255 (49%), Gaps = 19/255 (7%)

Query: 97  CGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS 156
           C ++  + D   C +   F +  A +DR CI S G     LS + V SC K+     +  
Sbjct: 1   CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL----GDMG 56

Query: 157 CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKC 216
           C+ G     +++    G V GG+YGD++GC    + PC+HH ++   P+C ++ V   KC
Sbjct: 57  CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDE-VRAPKC 115

Query: 217 HTRCTNPTYGRGFFQDKHRTTLTYWVDDNED-----AIK--KEILAHGPTTATFALYDDF 269
             +C +    + + + K +    Y V    +     AIK   +I  +GP T  F +  DF
Sbjct: 116 ARKCESED--KDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDF 173

Query: 270 YHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
             YKSGVY+      L   L  H+ K++G+GTE+G  YWLV N+W   WGD G  KI+RG
Sbjct: 174 LAYKSGVYEPK---LLSPPLGGHAIKIMGFGTEDGKDYWLVANSWNEDWGDDGYFKIIRG 230

Query: 328 KYECAFEYLIAAGKP 342
           K  C  E  +  G P
Sbjct: 231 KNACQIEDPVINGGP 245


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 139/333 (41%), Gaps = 44/333 (13%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY----S 81
           ID INR    WTAG +        +    L    +Y   + RP        + +      
Sbjct: 146 IDAINRGNYGWTAGNH------SVFWGMTLDEGIRYRLGTVRPTSSVMNMNEIQMVMSPD 199

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            T+P  F A  +WP  G I    D G CA    F+     SDR  I S G  +  LS + 
Sbjct: 200 ETLPSAFSASNKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY----GDRTGCQPSTISPCSHH 197
           + SC       +   C  G +   W FL +RG V+   Y    GD  G  P+  +PC  H
Sbjct: 258 LLSC----NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPA--APCMMH 311

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                     +    K +    C N    R      ++ T  Y +  +E  I KE++ +G
Sbjct: 312 S--------RHMGRGKRQATAHCPN---SRTHANHIYQATPPYRLSSHEKDIMKELMENG 360

Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE-----NGTPYW 306
           P  A   +++DF+ YKSG+YKHT  S  K E Y     HS K+ GWG E         YW
Sbjct: 361 PVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYW 420

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
              N+WGP WG+ G  +I+RG  EC  E  +  
Sbjct: 421 TAANSWGPTWGENGYFRIVRGANECDIESFVVG 453


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 143/334 (42%), Gaps = 61/334 (18%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
           ID+IN +  +W A RN+      E+  + L    K    +  P      +   ++ YDPE
Sbjct: 147 IDEINSQDLSWRA-RNY-----SEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVQRIYDPE 200

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
              ++P  FDAR +WP    I  + D G C A    +     SDR  + SKG  +  LS 
Sbjct: 201 ---SLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSA 255

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSH 196
           +++ SC         ++CS G + R W ++ K G V    Y   G    C+    +    
Sbjct: 256 QHLLSC----NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNVQCKLRKRTDLKT 311

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G  P         V  L+       P Y  G                NE  I  EIL  
Sbjct: 312 AGCRP--------PVNPLRTELYKVGPAYRLG----------------NETDIMYEILTS 347

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTENGT--------PY 305
           GP  AT  +Y DF+ Y+SG+YKHT  A  E+Y    HS ++IGWG +            Y
Sbjct: 348 GPVQATMKVYQDFFSYESGIYKHT--ATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKY 405

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           WLV+N+WG  WG+ G  +I RG  EC  E  + A
Sbjct: 406 WLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 83/264 (31%), Positives = 120/264 (45%), Gaps = 28/264 (10%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S  +P +F+A E+W +   I  VPD G C +  + +     SDR  I+S+G++   LS +
Sbjct: 184 SDDLPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC +       + C  G +   W +LHK+G +    Y            P + H   
Sbjct: 242 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVLDEKCY------------PYTQHRD- 283

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
               SC+ Q+            P YG    +D   T    +    E  I  EI   GP  
Sbjct: 284 ----SCKIQRHNSRSLKANGCQPAYGVN--RDSLYTVGPAYSLSREADIMAEIYHSGPVQ 337

Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
           AT  +Y DF+ Y  G+Y+ T+ N       HS KL+GWG E +G  YW+  N+WGP WG+
Sbjct: 338 ATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGE 397

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  +ILRG  EC  E  + A  P
Sbjct: 398 HGYFRILRGSNECGIEEYVLASWP 421


>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
          Length = 238

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 82/244 (33%), Positives = 118/244 (48%), Gaps = 15/244 (6%)

Query: 24  AYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA--KYFDQSDRPLPGDRKTYDPEYS 81
           +YID IN  A+TW AG NF  N S+E + + L +         S      D   Y+  Y 
Sbjct: 5   SYIDTINEVASTWKAGVNFDPNTSQEDIVKLLGSTGVESAMKASANEFKMDDVAYNKLYG 64

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            T P  FDAR++W +C TIG V D G C +   F    AF+DR C+ + G  N  LS E 
Sbjct: 65  YT-PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEE 123

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           +  CC  C +     C+ G   + W +    G VTGG+Y    GC+P  + PC       
Sbjct: 124 ITFCCHTCGF----GCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGK 179

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
              +C  +  P+ K H RCT   YG     +++ HR T  ++      +I+K+++ +GP 
Sbjct: 180 N--TCAGK--PREKNH-RCTRMCYGNQDLDYREDHRYTRDFYY-LTYGSIQKDVMTYGPI 233

Query: 260 TATF 263
            ATF
Sbjct: 234 EATF 237


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 153/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSLM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   +++DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 125/269 (46%), Gaps = 28/269 (10%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  F+A ++W   G I    D G CA    F+     SDR  I S G     LS + +
Sbjct: 253 VLPSYFNAADKWS--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNL 310

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SC         + C+ G +   W FL +RG VT         C P +    +H  +AP 
Sbjct: 311 LSC----NTRHQQGCNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNHSPNAPA 359

Query: 203 -LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            +    +    K +   RC NP   R    + +++T  Y +  NE  I KE++ +GP  A
Sbjct: 360 CMMHSRSTGRGKRQAIARCPNP---RSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQA 416

Query: 262 TFALYDDFYHYKSGVYKHTSNA--KLENY----LHSGKLIGWGTE-----NGTPYWLVIN 310
              +++DF+ Y++G+Y+HT+ A  K E Y     HS K+ GWG E     +   YW+  N
Sbjct: 417 ILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWIAAN 476

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           +WG  WG+ G  +I RG+ EC  E  +  
Sbjct: 477 SWGKDWGEHGYFRITRGENECEIETFVVG 505


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 121/261 (46%), Gaps = 30/261 (11%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++N  LS + + 
Sbjct: 187 LPNSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V    Y            P + H      
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDENCY------------PYTQH-----R 282

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +C+ +    LK +  C  P       +D   T    +  + E  I  EI   GP  AT 
Sbjct: 283 DTCKIRHSRSLKANG-CQKPV---NVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATM 338

Query: 264 ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDRGT 321
            +  DF+ Y  GVY+ T+ N K     HS KL+GWG E NG  YW+  N+WG  WG+ G 
Sbjct: 339 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGY 398

Query: 322 VKILRGKYECAFEYLIAAGKP 342
            +ILRG  EC  E  + A  P
Sbjct: 399 FRILRGSNECGIEEYVLASWP 419


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 142/321 (44%), Gaps = 26/321 (8%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFD---QSDRPLPGDRKTYDP 78
           ++  + ++N    TWTA      +      ++ L+    + D   Q  + L G R     
Sbjct: 42  AEDMVKKVNEAKTTWTAEELPRISSMSLNAKKGLMGLKAFHDGGFQKHKQLLGARPKSAS 101

Query: 79  EYSAT-VPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           +  AT +P  FD+R+Q+  C   IG + D   C +    ++     DR CI S G+Q   
Sbjct: 102 KLDATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVH 161

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S + + SC      D ++ C+ G     +    + G VTG       GC+P    P  H
Sbjct: 162 ISAQDILSCAT----DRSQGCNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLP--H 215

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA-IKKEILA 255
                + P          +C  +C N  Y + + QDKH     Y V  ++   I+ EI+ 
Sbjct: 216 TTVEYSTP----------ECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMN 265

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWG 313
           +GP  A   +Y DF  YKSGVY+      L    H+ +++GWG +  T  PYWLV N+W 
Sbjct: 266 NGPVEANMIVYYDFMFYKSGVYQTVFPWPLGG--HAVRIVGWGVDGPTKVPYWLVANSWN 323

Query: 314 PHWGDRGTVKILRGKYECAFE 334
             WG+ G  +I RG  E   E
Sbjct: 324 TDWGEDGYFRIRRGTDESYIE 344


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 93/274 (33%), Positives = 127/274 (46%), Gaps = 41/274 (14%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS + +
Sbjct: 166 LPEFFVAYYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 222

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCSHH 197
            SCC   R+     CS GS+ R W +L KRG V+   Y      + T    +  S     
Sbjct: 223 ISCCAKNRH----GCSSGSIDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASRSDGR 278

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G       C N  + K     +C+ P                Y V  NE  I KEI+ +G
Sbjct: 279 GKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIIHNG 321

Query: 258 PTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PYW 306
           P  A   +++DF+HYKSG+Y+H  ++N K E Y     H+ KL GWGT  G       +W
Sbjct: 322 PVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQGRKEKFW 381

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +V N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 382 IVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 123/272 (45%), Gaps = 39/272 (14%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E S  +P+ FDARE+WP+   I  V D G CA+   F+     +DR  I+S G+   PLS
Sbjct: 306 EMSNFLPESFDARERWPS--FIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNPLS 363

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + SC                     N   +RG    G Y DR  C  S        G
Sbjct: 364 VQQLLSC---------------------NQARQRG--CNGGYLDRAWCVVSDECYTYTSG 400

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
                  C   +   L    RC + +     +    + T  Y +  NE  I  EI+A+GP
Sbjct: 401 QTNQPGECHIPRTAYLDGEIRCPSGSADNRVY----KMTPPYRISTNEREIMTEIMANGP 456

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSG----KLIGWGTENGT----PYWLV 308
             ATF +++DF+ YKSGVY+H   +N K   Y  SG    +++GWG ++ T     YWL 
Sbjct: 457 VQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLC 516

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            N+WG  WG+ G  +ILRG+  C  E  I   
Sbjct: 517 ANSWGEEWGENGLFRILRGENHCDIESFIIGA 548


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 102/330 (30%), Positives = 152/330 (46%), Gaps = 42/330 (12%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
           I  +NR    W A     AN S+ +    L    +Y   + RP      +   +   DP+
Sbjct: 145 IHAVNRGNYGWKA-----ANYSQ-FFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDPQ 198

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P  F++ E+WPN   I    D G CAA   F+     SDR  I+S G     LS 
Sbjct: 199 -NDHLPRYFNSSEKWPN--KIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 255

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCSHHG 198
           + + SC       +   C+ G +   W +L +RG VT   Y  +   Q P+ +  C    
Sbjct: 256 QNLISC----DTRNQGGCAGGRIDGAWWYLRRRGVVTENCYPYQPPQQAPAEVGRCMMQS 311

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            A            K +   RC N TY   +  D +++T  Y +  NE  I KEI+ +GP
Sbjct: 312 RAVGRG--------KRQATQRCPN-TYN--YHNDIYQSTPPYKLSSNEKEIMKEIMENGP 360

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGTP--YWL 307
             A   +++DF+ YK+G+YKHT  S+ K   Y     HS ++ GWG +   +GTP  YW+
Sbjct: 361 VQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDYDGTPRKYWI 420

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             N+WG +WG+ G  +I RG  EC  E  +
Sbjct: 421 AANSWGKNWGENGFFRIARGANECEIEAFV 450


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 112/225 (49%), Gaps = 31/225 (13%)

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D  AC +   F    AF+DR CIKS G     LS   + +C           C  G  + 
Sbjct: 1   DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLF------FGCGGGDPYS 54

Query: 165 TWNFLHKRGSVTGGDY---GDRT---GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHT 218
            W+++H +G  TGGDY    D T   GC P    PC+HH +    P C     PK+ C  
Sbjct: 55  AWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKC-----PKVSCSG 109

Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
                   R F  +   +   Y V+D ++AI+ +    GP +A+F +Y+DF  Y+SGVYK
Sbjct: 110 D------DRHFMLES--SPYHYSVNDAKNAIRTD----GPVSASFTVYEDFLAYRSGVYK 157

Query: 279 HTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
           HTS + L    H+ K+IGWG ++G  YWL +N+W   WGD G  +
Sbjct: 158 HTSGSYLGG--HAVKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFR 200


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 153/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSLM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   +++DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 103/196 (52%), Gaps = 12/196 (6%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            ++  A SDR CI ++G +   +S + + SCC  C Y     C  G   R W +  ++G 
Sbjct: 5   VSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGY----GCQGGWSIRAWYYFAEQGV 60

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
           VTGG+Y  +  C+P  I PC +H   P    C++      +C  RC    Y + +  DKH
Sbjct: 61  VTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDL-ADTPRCKRRC-QLGYPKSYPSDKH 118

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y +  + ++I++EI+ +GP  A F +Y+DF HYK G+YKHTS  K     H+ K+
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGG--HAVKV 176

Query: 295 IGWGTEN----GTPYW 306
           IGWG+E       PYW
Sbjct: 177 IGWGSEQKGSEKIPYW 192


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/276 (32%), Positives = 130/276 (47%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS + +
Sbjct: 205 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 261

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GS+ R W +L KRG V+   Y          +GC  ++ S   
Sbjct: 262 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASRS--D 315

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y +  NE  I KEI+ 
Sbjct: 316 GRGKRHATKPCPN-NIEKSNRIYQCSPP----------------YRISSNETEIMKEIMQ 358

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF+HYKSG+Y+H  +++ + ENY     H+ KL+GWGT  G       
Sbjct: 359 NGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEK 418

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 419 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454


>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
          Length = 239

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/248 (33%), Positives = 120/248 (48%), Gaps = 20/248 (8%)

Query: 38  AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY--SATVPDRFDAREQWP 95
           AG NF  + +EE +++ L   +K     ++      KT D  Y  S  +P  FDAR++W 
Sbjct: 1   AGVNFDPDTTEEVIKRLL--GSKGVQIPNKNNMHMYKTNDVAYISSGKIPKTFDARKKWV 58

Query: 96  NCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK 155
            C TIG V D G C +    +   AF+DR CI + G  N  LS + +  CC  C +    
Sbjct: 59  QCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELLSADEITFCCYTCGF---- 114

Query: 156 SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK 215
            C  G   + W    + G VTGGD+    GC+P  + P   + S      C        K
Sbjct: 115 GCDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPPSGSNSSNSYNHFCRG------K 168

Query: 216 CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSG 275
           C+    N +Y      + HR T  Y+   + +AI+K++L +GP  A+F +YDDF  YKSG
Sbjct: 169 CYGDNQNISY-----SEDHRYTRDYYY-LSYNAIQKDVLLYGPIEASFEVYDDFMIYKSG 222

Query: 276 VYKHTSNA 283
           VY  + NA
Sbjct: 223 VYVKSENA 230


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRSEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 87/256 (33%), Positives = 119/256 (46%), Gaps = 38/256 (14%)

Query: 83  TVPDRFDAREQWPNC-GTIGHVPDTGAC-AAPHIFA-AVGAFSDRRCIKSKGQQNRPLST 139
           ++P+ FD+RE+WP C   I +    G+C A  ++F  +    SDR CI S G+ N  LS 
Sbjct: 1   SLPESFDSREKWPTCIHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLSP 60

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SC        N  C  G ++  W +L   G VT         C P +    S +G 
Sbjct: 61  QDLVSCNWY-----NAGCDGGILWAAWIYLKHTGIVT-------DQCLPYS----SGNGV 104

Query: 200 APTLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           AP+ P  C     P                    K++    Y V    + I  EI  +GP
Sbjct: 105 APSCPKYCNGTSTP----------------IDSVKYKAKDWYEVGSIAEKIMNEIATNGP 148

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             + F++Y DF  YKSGVY H + + L    H+ K++GWG EN   YWLV N+WGP WG 
Sbjct: 149 VQSGFSVYQDFMSYKSGVYTHQTGSFLGG--HAIKIVGWGVENNVKYWLVANSWGPDWGL 206

Query: 319 RGTVKILRGKYECAFE 334
            G  KI RG  EC  E
Sbjct: 207 NGLFKIKRGDNECGIE 222


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 85/274 (31%), Positives = 122/274 (44%), Gaps = 48/274 (17%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A +P+ FDARE WP  G I  V D G C +    +     SDR  I+S G+ N  LS ++
Sbjct: 195 ARLPETFDARENWP--GLIDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC         + CS G + R W  L + G+V+   Y   +G    TI          
Sbjct: 253 LLSC----NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSGLDEDTI---------- 298

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYG------RGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
                    + KL+C        YG      RG   D + +T  Y +   E  I  EI  
Sbjct: 299 ---------MQKLRCRV-----AYGSSQCPERGVTSDLYLSTPPYRIAAREVDIMTEIYQ 344

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-------LHSGKLIGWGTE-----NGT 303
           +GP  ATF + +DF+ Y  GVY++       +         HS K++GWG +     N  
Sbjct: 345 NGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPI 404

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            YWL  N+WG +WG++G  +I+RG  EC  E  +
Sbjct: 405 KYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFV 438


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/276 (32%), Positives = 127/276 (46%), Gaps = 42/276 (15%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDP+    +P  FD+R +W     I +V D G C A    +     +DR  I SKG 
Sbjct: 253 RRIYDPD---ALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGA 307

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
           ++  LS +++ S    C     + C  G + R W F+ K G V    Y   G    C+  
Sbjct: 308 EDAELSAQHLLS----CNNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNGQCKLR 363

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
             +     G       C     P L+       P Y  G                NE  I
Sbjct: 364 KRNNLQAAG-------CRKPPNP-LRTELYKVGPAYRLG----------------NETDI 399

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
            +EIL  GP  AT  +Y DF+ YK+G+Y+H+ +A+L ++  HS ++IGWG E    G P 
Sbjct: 400 MQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPL 459

Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            YWLV+N+WG +WG+ G  KI RG  EC  E  + A
Sbjct: 460 KYWLVVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 84/266 (31%), Positives = 130/266 (48%), Gaps = 23/266 (8%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           T+P  FD R +W +  T+  V D G C A   F+     +DR  I+S+G +  PLS + +
Sbjct: 184 TLPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNL 241

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGS 199
            +C         + C+ G + R WN++ + G V    Y     RTG       P    G+
Sbjct: 242 LAC----NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVP--RRGN 295

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
             T+  C+     + K       P   +G F    R+   Y +   ED I  EIL HGP 
Sbjct: 296 LATM-KCQLVNAAERKSDRSDKPPR--KGLF----RSPPAYRIAPFEDDIMNEILQHGPV 348

Query: 260 TATFALYDDFYHYKSGVYKHT-SNAKLENYLHSGKLIGWGTE----NGTPYWLVINTWGP 314
            AT  ++ DF+ Y+ GVY+++ +N++  +  HS +++GWG +    N T YWLV N+WG 
Sbjct: 349 QATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGR 408

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAG 340
            WG+ G  +I+RG+ E   E  + A 
Sbjct: 409 LWGEDGYFRIVRGENESDIEKFVLAA 434


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 107/352 (30%), Positives = 155/352 (44%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               +GC  ++ S     G       C N  + K     +C+ P                
Sbjct: 314 NATNSGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENYL----HSGK 293
           Y V  +E  I KEI+ +GP  A   +++DF+HYK+G+Y+H  ++N + E +L    H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDH 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
          Length = 322

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 132/303 (43%), Gaps = 48/303 (15%)

Query: 46  LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPD 105
           LS   L  FL+    Y   S +    +  +Y  +  A +P  FDAR QWPNC  I  V D
Sbjct: 4   LSIYILLAFLLVGTVY---SQQQCLDNVVSYTDQDRANIPASFDARTQWPNC--ISPVRD 58

Query: 106 TGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT 165
            G+C++     +    +DR CI S G   + LS +Y+  C K C+ +    C+ G  F  
Sbjct: 59  QGSCSSCWAMTSSSILADRLCIASGGAIKKLLSPQYMVDCAKNCKTNSQSDCNSGCKFGF 118

Query: 166 WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT- 224
            +   +                        +  +  +  SC   K     C ++C + + 
Sbjct: 119 LDISME------------------------YLSNGISAESCLPYKESDATCPSQCKDGSP 154

Query: 225 ----YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT 280
               YG G             + + +DA + EI+ +GP  A F ++   Y+  SG+Y+ T
Sbjct: 155 IQLYYGSGCIS----------IGNLKDA-QLEIMKNGPILAVFQIFTSLYNIGSGLYRGT 203

Query: 281 SNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +       H+ ++IGWG ENGTPYWL +N+WG  +G  G  K+  G+    FE  + + 
Sbjct: 204 GDPAEG---HAARVIGWGEENGTPYWLALNSWGTEFGMDGAFKVPMGENIAGFESQLLSV 260

Query: 341 KPK 343
           KP 
Sbjct: 261 KPN 263


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 88/269 (32%), Positives = 120/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 203 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQN 259

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
           + SC  +      + C  G +   W FL +RG V+   Y    R   +     PC  H  
Sbjct: 260 LLSCDTL----HQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAEAGPAPPCMMHSR 315

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +   RC N         D ++ T  Y +  +E  I KE++ +GP 
Sbjct: 316 A--------MGRGKRQATRRCPNSHTDA---NDIYQVTPAYRLGSDEKEIMKELMENGPV 364

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
            A   +++DF+ YK G+Y HT  S A+ E Y     HS K+ GWG E         YW  
Sbjct: 365 QALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTA 424

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +ILRG  EC  E  +
Sbjct: 425 ANSWGPSWGERGHFRILRGSNECDIESFV 453


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 82/267 (30%), Positives = 120/267 (44%), Gaps = 41/267 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++   LS + + 
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V    Y                       
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDESCY----------------------- 276

Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           P  + +   K++ ++R      C  P       +D   T    +  + E  I  EI   G
Sbjct: 277 PYTQQRDTCKIRHNSRSLRANGCQTPY---NVDRDTFYTVGPAYSLNREADIMAEIFHSG 333

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLE-NYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +  DF+ Y  GVY+ T+  ++     HS KL+GWG E NG  YW+  N+WGP 
Sbjct: 334 PVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWIAANSWGPW 393

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+RG  +ILRG  EC  E  + A  P
Sbjct: 394 WGERGYFRILRGSNECGIEEYVLASWP 420


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 83/260 (31%), Positives = 124/260 (47%), Gaps = 41/260 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VP+ FD RE++P+C  I  V D G C +   F++V  F DRRCI    ++    S +YV 
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKKPVKYSPQYVV 132

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        N +C+ G +   W FL K G+ T         C P      +  G+ PT 
Sbjct: 133 SCDH-----GNMACNGGWLPNAWKFLTKTGTTT-------DECVPYQSGSTTLRGTCPTK 180

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
            +  + KV                      H TT T + D   D  A+ K +   GP   
Sbjct: 181 CADGSSKV----------------------HLTTATSYKDYGLDIPAMMKALSTTGPLQV 218

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
            F +Y DF +Y+SGVY+HT         H+ +++G+GT++ G  YW++ N+WGP WG+ G
Sbjct: 219 AFLVYSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDG 276

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +++RG  +C+ E    AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 90/273 (32%), Positives = 125/273 (45%), Gaps = 30/273 (10%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           DPE    +P  F++ E+WP  G I    D G CAA   F+     SDR  I+S G     
Sbjct: 199 DPERDQ-LPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQ 255

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCS 195
           LS + + SC       +   C+ G +   W FL +RG VT   Y  R   Q P+ +  C 
Sbjct: 256 LSPQNLISC----DTRNQGGCTGGRIDGAWWFLRRRGVVTEDCYPYRPPQQTPAELGRCM 311

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
               +            K +   RC N      +  D +++T  Y +  NE  I KEI  
Sbjct: 312 MQSRSVGRG--------KRQATQRCPNTN---NYQNDIYQSTPPYRLSTNEKEIMKEIQD 360

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF+ YKSG+YKHT  S  K   Y     HS K+ GWG E         
Sbjct: 361 NGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRK 420

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW+  N+WG +WG+ G  +I RG+ EC  E  +
Sbjct: 421 YWIAANSWGKNWGEEGYFRIARGENECEIEAFV 453


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFHLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLSKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 84/267 (31%), Positives = 121/267 (45%), Gaps = 41/267 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++N  LS + + 
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V          C P T             
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ENCYPYT------------- 279

Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
              +++   K++ ++R      C  P       +D   T    +  + E  I  EI   G
Sbjct: 280 ---QHRDTCKIRHNSRSLRANGCQKPV---NVDRDSLYTVGPAYSLNREADIMAEIFHSG 333

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +  DF+ Y  GVY+ T+ N K     HS KL+GWG E NG  YW+  N+WG  
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +ILRG  EC  E  + A  P
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWP 420


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 72/190 (37%), Positives = 105/190 (55%), Gaps = 10/190 (5%)

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
           H  +AVGA SDR CI+S G+Q+  LS   + SCC+ C       C  G     W++    
Sbjct: 41  HAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENC----GSGCDGGFPGPAWDYWVSH 96

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G VTGG   + TGCQP     C HH S    PSC ++     +C  +C    Y   +  D
Sbjct: 97  GIVTGGSKENHTGCQPYPFPKCEHH-SIGKYPSCGDKIYKTPQCKRKCQK-GYTTPYEHD 154

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHS 291
           KH   ++  V  NE AI+KEI+ +GP  A   +++DF +YKSG+Y++T+ + + E+Y+  
Sbjct: 155 KHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYV-- 212

Query: 292 GKLIGWGTEN 301
            ++IGWG EN
Sbjct: 213 -RIIGWGIEN 221


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/302 (31%), Positives = 132/302 (43%), Gaps = 31/302 (10%)

Query: 46  LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFD-AREQWPNCGTIGHVP 104
           +S+  L   ++ D+     ++ P  G   T +P++S      F       P  G      
Sbjct: 30  VSKLKLNSRILQDSIVQKVNENPNAGWEATMNPQFSNYSVGEFKYLLGVKPTPGKELRGV 89

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVF 163
             G C +   F AV + SDR CI      N  LS   + +CC  +C       C  G   
Sbjct: 90  PLGHCGSCWAFGAVESLSDRFCIHYG--MNLSLSVNDLLACCGWMC----GDGCDGGYPI 143

Query: 164 RTWNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT 221
             W +  + G VT     Y D  GC        SH G  P  P+         KC  +C 
Sbjct: 144 DAWRYFVQSGVVTEECDPYFDDIGC--------SHPGCEPGFPT--------PKCERKCA 187

Query: 222 NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS 281
           +    + + + KH +   Y +D +  +I  E+  +GP    F +Y+DF HYKSGVYKH +
Sbjct: 188 DKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHIT 245

Query: 282 NAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
              +    H+ KLIGWGT ++G  YWL+ N W   WGD G  KI RG  EC  E  + AG
Sbjct: 246 GDVMGG--HAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAG 303

Query: 341 KP 342
            P
Sbjct: 304 LP 305


>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
           griseus]
 gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
          Length = 465

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/268 (32%), Positives = 121/268 (45%), Gaps = 30/268 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
            +P  F+A E+WPN   I    D G CA    F+     SDR  I S G     LS + +
Sbjct: 201 VLPRAFEASEKWPN--LIQEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPILSPQNL 258

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG--DRTGCQPSTISPCSHHGSA 200
            SC         + C  G +   W FL +RG V+   Y    R   +  T S C  H  A
Sbjct: 259 LSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFVGREQNEAGTSSRCMMHSRA 314

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                       K +  +RC N   G+    D ++ T  Y +  +E  I KE++ +GP  
Sbjct: 315 --------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQ 363

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
           A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E         YW   
Sbjct: 364 ALMEVHEDFFLYQSGIYSHTPISQGRPEQYRRHGTHSVKITGWGEEKLPDGRTIKYWTAA 423

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
           N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 424 NSWGPWWGERGHFRIVRGTNECDIESFV 451


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/166 (42%), Positives = 93/166 (56%), Gaps = 11/166 (6%)

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--FQDKHRTT 237
           +G   GC+P  + PC  +    +  SC  Q + K   + RCT   YG     + D HR T
Sbjct: 9   FGFAVGCEPYRVPPCPRNEDGTS--SCAGQPIEK---NHRCTRMCYGNQDLDYNDDHRFT 63

Query: 238 LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIG 296
             Y+      +I+K+++ +GP  A+F +YDDFY YKSGVY+ T NA KL    H+ KLIG
Sbjct: 64  RDYYYL-TYGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGG--HAVKLIG 120

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG E G PYWL++N+W   WGD G  KI RG  EC  +    AG P
Sbjct: 121 WGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 166


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 148/326 (45%), Gaps = 57/326 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++ +D +N + ++TW A          EY R+ L   AK      +   G    +   
Sbjct: 1   LAESVVDIVNNDPSSTWVA---------TEYPREILTP-AKMRAMISQIGNGFEGEWTFA 50

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK--SKGQQNRPL 137
            +   P  FD R++WP  G    V + G+C +    AA      R  I+  SKG     +
Sbjct: 51  ENENAPASFDCRQKWP--GKAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGV----M 104

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SC       +N  C+ G   R WN++ K+G  T              I   S  
Sbjct: 105 SPQDLVSC-----ESNNMGCNGGYADRVWNWIQKKGITT-----------EQCIPYVSGS 148

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G  PT PS             +C N +       +  R+ ++ W   N   +  E+  +G
Sbjct: 149 GRVPTCPS-------------KCKNGS-------NIVRSFVSSWGSFNSKTVMDEVANNG 188

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
           P  A F +++DFY+Y+SGVY+H +  + + + H   L+GWGTENG PYWL+ N+WG  WG
Sbjct: 189 PVYACFEVFEDFYNYRSGVYQHKT-GRSQGWHHV-MLMGWGTENGVPYWLLQNSWGSGWG 246

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           ++G  +I RG  +C  + +  +G PK
Sbjct: 247 EKGFFRIRRGTNDCHIDEIFYSGLPK 272


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/329 (29%), Positives = 144/329 (43%), Gaps = 38/329 (11%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG----DRKTYDPEYS 81
           ID +NR    W A     AN S+ +    L    +Y   + RP P     +      + +
Sbjct: 146 IDAVNRGNYGWRA-----ANYSQ-FWGMTLEDGMRYRLGTFRPPPTVMNMNEMHMAMDSN 199

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
             +P  FDA  +WP  G I    D G CA    F+     SDR  I S G     LS + 
Sbjct: 200 EVLPRHFDAATKWP--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + CS G +   W +L +RG VT         C P T S  S   + P
Sbjct: 258 LLSC----DTRNQRGCSGGRLDGAWWYLRRRGVVT-------DECYPFT-SQDSQPAAQP 305

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTA 261
            +    +    K +   RC NP   +    D +++T  Y +  +E  I KE++ +GP  A
Sbjct: 306 CMMHSRSTGRGKRQATARCPNP---QTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQA 362

Query: 262 TFALYDDFYHYKSGVYKHTSNAK------LENYLHSGKLIGWGTE-----NGTPYWLVIN 310
              +++DF+ YKSG+Y+HT+ A+       ++  HS K+ GWG E         YW   N
Sbjct: 363 ILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAAN 422

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           +WG  WG+ G  +I RG  EC  E  +  
Sbjct: 423 SWGRAWGEDGHFRIARGVNECEVESFVVG 451


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/264 (32%), Positives = 123/264 (46%), Gaps = 29/264 (10%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S+ +P +F+A E+W +   I  VPD G C +  + +     SDR  I+S+G++   LS +
Sbjct: 184 SSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQ 241

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC +       + C  G +   W +LHK+G V      D T C P T          
Sbjct: 242 NILSCTR-----RQQGCEGGHLDAAWRYLHKKGVV------DET-CYPYT---------- 279

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
               SC+ +   +      C  P YG    +D   T    +    E  I  EI   GP  
Sbjct: 280 QRRDSCKIRHNSRSLKANGC-RPAYG--VNRDSLYTVGPAYSLKGETDIMAEIYHSGPVQ 336

Query: 261 ATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
           AT  +Y DF+ Y  GVY+ T+ N       HS K++GWG E +G  YW+  N+WGP WG+
Sbjct: 337 ATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGE 396

Query: 319 RGTVKILRGKYECAFEYLIAAGKP 342
            G  +ILRG  EC  E  + A  P
Sbjct: 397 HGYFRILRGSNECGIEEYVLASWP 420


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/268 (32%), Positives = 121/268 (45%), Gaps = 26/268 (9%)

Query: 83  TVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           T+P  F+A ++WP  G I    D G CA    F+     SDR  I S G     LS + +
Sbjct: 200 TLPLAFNASDKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNL 257

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SC       + K C  G +   W FL +RG V+   Y    G + +T        +AP 
Sbjct: 258 LSC----DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAP------AAPC 307

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           +    +    K +    C N    R      ++ T  Y +  +E  I KE++ +GP  A 
Sbjct: 308 MMHSRSMGRGKRQATAHCPN---SRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQAL 364

Query: 263 FALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTENG-----TPYWLVINT 311
             +++DF+ YKSG+YKHT  S  K   Y     HS K+ GWG E         YW   N+
Sbjct: 365 MEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWTAANS 424

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAA 339
           WGP WG++G  +ILRG  EC  E  +  
Sbjct: 425 WGPTWGEKGHFRILRGANECDIESFVVG 452


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 140/335 (41%), Gaps = 52/335 (15%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA--- 82
           I+ IN+    W AG +        +    L    +Y   ++RP P      +  Y+    
Sbjct: 147 INAINQGNYGWQAGNH------SAFWGMTLEEGIRYRLGTNRP-PSSVMNMNEIYTGLGS 199

Query: 83  --TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
              +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS 
Sbjct: 200 GEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSP 256

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISP 193
           + + SC         + C  G +   W FL +RG V+       G   D  G  P    P
Sbjct: 257 QNLLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPAP----P 308

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C  H  A            K +   RC N         D ++ T  Y +  NE  I KE+
Sbjct: 309 CMMHSRA--------MGRGKRQATARCPNSHV---HANDIYQVTPAYRLGSNEKEIMKEL 357

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----G 302
           L +GP  A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E       
Sbjct: 358 LENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDGRT 417

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             YW   N+WGP WG+RG  +ILRG  EC  E  +
Sbjct: 418 LKYWTAANSWGPAWGERGHFRILRGTNECDIESFV 452


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/330 (30%), Positives = 152/330 (46%), Gaps = 42/330 (12%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP------LPGDRKTYDPE 79
           I  +NR    W A     AN SE Y    L    +Y   + RP      +   +   DP+
Sbjct: 170 IQAVNRGNYGWKA-----ANYSELY-GMTLNEGIRYRLGTQRPSRTVMNMNEIQMNMDPQ 223

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P  F++ E+WP  G I    D G CAA   F+     SDR  I+S G     LS 
Sbjct: 224 -TDNLPPYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSP 280

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCSHHG 198
           + + SC       +   C+ G +   W +L +RG VT   Y  +   Q P+ +  C    
Sbjct: 281 QNLISC----DTRNQGGCAGGRIDGAWWYLRRRGVVTEDCYPYQPPHQTPAEVGRC---- 332

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               +    +    K +   RC N    + +  D +++T  Y +  NE  I KEI+ +GP
Sbjct: 333 ----MMQSRSVGRGKRQATQRCPNT---QNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGP 385

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGTP--YWL 307
             A   +++DF+ YK+G+YKHT  S  K   Y     HS ++ GWG +   +GT   YW+
Sbjct: 386 VQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDGTSRKYWI 445

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             N+WG +WG+ G  +I+RG+ EC  E  +
Sbjct: 446 AANSWGKNWGENGYFRIVRGENECEIETFV 475


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/352 (30%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR +L       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPQL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G  +  LS + 
Sbjct: 163 VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 219

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+   Y            P S HG   
Sbjct: 220 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 263

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
            +P+      P    H+R      GRG  Q             D ++ T  Y +  NE  
Sbjct: 264 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 313

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E  
Sbjct: 314 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 373

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 374 PDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 413


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/352 (30%), Positives = 151/352 (42%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATNNGCAMASRS--DGRGKRDATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ K
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N WG  WG+ G  +ILRG  E   E L+ A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 466


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 131/295 (44%), Gaps = 45/295 (15%)

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSD 123
           S R L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +D
Sbjct: 198 SPRLLSMNEMTASLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAAD 254

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
           R  I+SKG+    LS + + SCC   R+     C+ GS+ R W FL KRG V+   Y   
Sbjct: 255 RIAIQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLF 310

Query: 181 ----GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT 236
                   GC  ++ S     G       C N  + K     +C+ P             
Sbjct: 311 KDQNATNDGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP------------- 354

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LH 290
              Y V  NE  I KEI+ +GP  A   +++DF+HYK+G+Y+H   +N +   Y     H
Sbjct: 355 ---YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQTH 411

Query: 291 SGKLIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           + KL GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 412 AVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 148/351 (42%), Gaps = 63/351 (17%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           +I + VF + C  +        D +I   N    +W AGRN                  +
Sbjct: 8   LIALTVFAV-CNALDLNKPVLDDKFIHNHNANGASWVAGRN-----------------PR 49

Query: 61  YFDQSDRPLPGDRKTYDP-----EYSAT---VPDRFDAREQWPNCGTIGHVPDTGACAAP 112
           +  QS   + G   T  P     E S +   VP+ FD+R  WP C  +  V + G C + 
Sbjct: 50  FEGQSIGDILGLLGTKKPRNTPEEVSVSKVAVPNSFDSRTNWPGC--VHAVLNQGQCGSC 107

Query: 113 HIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKR 172
             FAA  + SDR CI S+G  N  LS + + SC      + N+ C+ G     W +L   
Sbjct: 108 WAFAASESLSDRLCIASQGAINVTLSPQALVSC----DIEFNQGCNGGIPQMAWEYLELH 163

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQD 232
           G  T         C P T    S +G+AP              C   C++ +     +Q 
Sbjct: 164 GIPT-------DSCFPYT----SGNGTAP-------------DCQKECSDGSK----YQL 195

Query: 233 KHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
               T T     +  AI+  + A+GP   T  +Y DF  Y SGVY  T  +KL    H+ 
Sbjct: 196 YKGKTFTLKTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGG-HAI 254

Query: 293 KLIGWGTEN--GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           K++GWGT++  G  YW+V N+WG  WG  G   I RG   C  +   +AG+
Sbjct: 255 KIVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAGQ 305


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 81/265 (30%), Positives = 124/265 (46%), Gaps = 40/265 (15%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E S ++P  FD RE++P C  I  V D G C +   F+A  AF DRRC++       P S
Sbjct: 73  EPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQGLDSAGVPYS 130

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            +Y  SC  +     +  C+ G  F  W FL + G+ T         C P T +  +   
Sbjct: 131 QQYTISCDYL-----DLGCAGGLSFSVWTFLTEHGTTT-------LECVPYTDA--NKDI 176

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           S+P   +C +    +L     C +                      N  AI + +   GP
Sbjct: 177 SSPCPDACADGSEIRLVKADGCLD-------------------YSGNVTAIMQALANDGP 217

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENGTPYWLVINTWGPH 315
             A+ A+Y DF +Y+SGVY+H   +++ +  H+ ++IG+G    E+ TPYW+V N+ G  
Sbjct: 218 VQASMAVYRDFLYYRSGVYRHVYGSQISS--HAVEIIGYGAADDEDSTPYWIVKNSLGSG 275

Query: 316 WGDRGTVKILRGKYECAFEYLIAAG 340
           WG+ G   I+RG  EC  E  + +G
Sbjct: 276 WGEEGYFNIVRGSNECDIESAVYSG 300


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/326 (30%), Positives = 138/326 (42%), Gaps = 60/326 (18%)

Query: 22  SDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           +++ ++ IN +  +TW A          EY R  +I  AK+       L G    Y    
Sbjct: 11  AESIVETINNDPTSTWVAA---------EYPRS-VINVAKFRAMLGAEL-GPHMPYVQPL 59

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           S + P  FDAREQWP  G I  V D  +C +    +   A  D + I   G     +S +
Sbjct: 60  SLSEPTEFDAREQWP--GKILPVRDQASCGSCWAHSVAEAMGDAQNIA--GCPRGAMSVQ 115

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC K      + +C+ G + +   +L K G  T            + +   S  G  
Sbjct: 116 DLVSCDK-----TDSACNGGDMKKAQEYLVKTGITT-----------EACVKYVSGSGRV 159

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           P  PS             +C N +          R  L  W       I + ++ +GP +
Sbjct: 160 PACPS-------------KCDNGS-------QIIRYKLQSWKSVEPSEIMQALMEYGPLS 199

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK---LIGWGTENGTPYWLVINTWGPHWG 317
             F +Y DF +Y+SGVY+H S      Y   G    L GWG ENG PYWLV N+WGP WG
Sbjct: 200 CGFMVYSDFMNYRSGVYQHKSG-----YFEGGHAVLLCGWGVENGLPYWLVQNSWGPAWG 254

Query: 318 DRGTVKILRGKYECAFEYLIAAGKPK 343
           ++G  KILRG   C  E  +  G PK
Sbjct: 255 EKGFFKILRGSNHCEIESYVTLGVPK 280


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/352 (29%), Positives = 152/352 (43%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
               +GC  ++ S     G       C N  + K     +C+ P                
Sbjct: 314 NATNSGCAMASRS--DGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGK 293
           Y V  +E  I KEI+ +GP  A   +++DF+HYK+G+Y+H ++   E+        H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 261

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 125/269 (46%), Gaps = 19/269 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +     +  + Y     +ID IN  A TW AG NF  +  +E+  + L   +K
Sbjct: 4   VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                ++      KT+D  Y      +P  FDAR +W  C TIG V D G C +    A 
Sbjct: 62  GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTG
Sbjct: 122 SSAFADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177

Query: 178 GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
           GDY    GC+P  + PC +   A    +C  +  P+   H RCT   YG     F +D  
Sbjct: 178 GDYQSGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHR 232

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            T  +Y++     +I+K+++ +GP  A+F
Sbjct: 233 YTRDSYYL--TYGSIQKDVMTYGPIEASF 259


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/352 (30%), Positives = 150/352 (42%), Gaps = 64/352 (18%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTASLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------ 180
           I+SKG+    LS + + SCC      +   C+ GS+ R W +L KRG V+   Y      
Sbjct: 258 IQSKGRYTANLSPQNLISCCS----KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
                GC  ++ S     G       C N  V K     +C+ P                
Sbjct: 314 NATSNGCAMASRS--DGRGKRHATKPCPN-NVEKSNRIYQCSPP---------------- 354

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGK 293
           Y V  +E  I KEI+ +GP  A   + +DF+HYK+G+Y+H ++A  E+        H+ K
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSANKESEKYRKLQTHAVK 414

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/273 (32%), Positives = 128/273 (46%), Gaps = 36/273 (13%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDP+    +P  F++R +WP    I  + D G C A    +     SDR  I SKG 
Sbjct: 195 RRIYDPD---ALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGA 249

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
           +   LS +++ SC         + C  G + R W F+ K G V          C P T  
Sbjct: 250 ETVELSAQHLLSC----NNRGQQGCKGGYLDRAWLFMRKFGLVD-------EECYPWTGR 298

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
                        C  +K   LK    C NP        + ++    Y +  NE  I +E
Sbjct: 299 N----------DQCRLRKRSNLK-TAGCQNPP--NSLRTELYKVGPAYRLG-NETDIMQE 344

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP--YW 306
           IL  GP  AT  +Y DF+ Y+SGVY+H+ +A+L ++  HS ++IGWG E    G P  YW
Sbjct: 345 ILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYW 404

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
           LV N+WG +WG+ G  +I +G  EC  E  + A
Sbjct: 405 LVANSWGHNWGENGLFRIQKGTNECEIESYVLA 437


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G  +  LS + 
Sbjct: 269 VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 325

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+   Y            P S HG   
Sbjct: 326 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 369

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
            +P+      P    H+R      GRG  Q             D ++ T  Y +  NE  
Sbjct: 370 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 419

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E  
Sbjct: 420 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 479

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 480 PDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 519


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 104/339 (30%), Positives = 147/339 (43%), Gaps = 57/339 (16%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP-----LPGDRKTYDPEY 80
           I+Q+N+    WTA      N S+ +     + D   F     P     L  +  T     
Sbjct: 161 IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213

Query: 81  SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           +  +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS 
Sbjct: 214 TTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTIS 192
           + + SCC   R+     C+ GS+ R W +L KRG V+   Y           GC  ++ S
Sbjct: 271 QNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRS 326

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
                G       C N  V K     +C+ P                Y V  NE  I KE
Sbjct: 327 --DGRGKRHATKPCPNN-VEKSNRIYQCSPP----------------YRVSSNETEIMKE 367

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT--- 303
           I+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ KL GWGT  G    
Sbjct: 368 IMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQ 427

Query: 304 --PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
              +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 428 KEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 43/252 (17%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +A +PD FD+R QW +C  +  + D   C +   FAAV + SDR CI S+G+ N  LS
Sbjct: 73  QINAALPDSFDSRTQWKDC--VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLS 130

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
            + + SC      D +  C  G    T W +L ++G   G D      C+P      S +
Sbjct: 131 PQDMLSC------DASNFCCFGGYLDTAWQYLEQQG--VGSD-----SCEPYK----SGN 173

Query: 198 GSAPTLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           G  P+ PS C N Q + K KC    T    G                    +A K  I  
Sbjct: 174 GDQPSCPSKCSNGQAIKKYKCKAGSTKQAKGA-------------------EATKSLIQQ 214

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPH 315
            GP    F +Y+DF +Y SG+Y H +   +    H+ K++GWG +    YW+V N+WG  
Sbjct: 215 SGPVETGFTIYEDFLNYNSGIYHHVTGGNMGG--HAVKILGWGKQGLENYWIVANSWGED 272

Query: 316 WGDRGTVKILRG 327
           WG++G   I +G
Sbjct: 273 WGEKGYFNIRQG 284


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/280 (32%), Positives = 125/280 (44%), Gaps = 54/280 (19%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G  +  LS + 
Sbjct: 97  VLPRTFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+   Y            P S HG   
Sbjct: 154 LLSC----DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCY------------PFSGHGRDE 197

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
            +P+      P    H+R      GRG  Q             D ++ T  Y +  NE  
Sbjct: 198 AVPA------PPCMMHSR----AMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKE 247

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E  
Sbjct: 248 IMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 307

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 308 PDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 347


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 126/280 (45%), Gaps = 54/280 (19%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 81  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 137

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC K     + + C  G +   W FL +RG V+   Y            P S  G   
Sbjct: 138 LLSCDK----RNQQGCQGGHLDSAWWFLRRRGVVSDHCY------------PFSGQGRTE 181

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
           T P+      P+   H+R      GRG  Q             D ++ T  Y +  +E  
Sbjct: 182 TGPA------PRCMMHSR----AMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKE 231

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y++G+Y HT  S  + E Y     HS K+ GWG E+ 
Sbjct: 232 IMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESL 291

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 292 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 331


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/268 (32%), Positives = 124/268 (46%), Gaps = 30/268 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 97  VLPRAFEASEKWPN---LIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+  D+     C P +    +  G AP
Sbjct: 154 LLSC----DTHNQQGCQGGRLDGAWWFLRRRGVVS--DH-----CYPFSGHERNEAGPAP 202

Query: 202 T-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             +         K +   RC N         D ++ T  Y +  NE  I KE++ +GP  
Sbjct: 203 RCMMHSRAMGRGKRQATARCPNSYV---HANDIYQVTPAYRLGSNEKDIMKELMENGPVQ 259

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
           A   +++DF+ Y+SG+Y HT  S+ + E Y     HS K+ GWG E         YW   
Sbjct: 260 ALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTAA 319

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
           N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 320 NSWGPGWGERGHFRIVRGANECDIESFV 347


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 68/176 (38%), Positives = 94/176 (53%), Gaps = 17/176 (9%)

Query: 172 RGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQ 231
           RG++T GD     GC P    PC+HH +    P C         C  +C NP Y      
Sbjct: 364 RGNLTKGD-----GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKN 418

Query: 232 DKH----RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
           D+H     +   Y V++ ++AI+ +    GP +A++ +Y+DF  YKSGVYKHTS + L  
Sbjct: 419 DRHYMLESSPYQYSVNNAKNAIRTD----GPISASYLVYEDFLAYKSGVYKHTSGSYLGG 474

Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             H+ K+IGWG ENG  YWLV+N+W   WGD+G  KI  G   C  +  +  G PK
Sbjct: 475 --HAVKIIGWGEENGEAYWLVVNSWNEDWGDQGLFKIALGN--CEIDDDLLGGTPK 526


>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
          Length = 237

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/210 (38%), Positives = 107/210 (50%), Gaps = 33/210 (15%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQF---LIADAKYFDQSDRPLPGDR 73
             +  SD  I+ IN++  TW AGRNF  N+   YL++    ++   K        LPG  
Sbjct: 22  SFHPLSDDLINYINKQNTTWQAGRNF-YNVDISYLKKLCGTVLGGPK--------LPG-- 70

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
                  S  +P+ FDAREQW NC TIG + D G+C +   F AV A SDR CI + G+ 
Sbjct: 71  -------SIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRV 123

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           N  +S E + +CC I   D    C+ G     W+F  K+G V+GG Y    GC P TI P
Sbjct: 124 NVEVSAEDLLTCCGIQCGD---GCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPP 180

Query: 194 CSHH--GSAPTL------PSCENQKVPKLK 215
           C HH  GS P        P C N+K+P ++
Sbjct: 181 CEHHVNGSRPPCTGEGDTPRC-NKKLPAIR 209


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/267 (30%), Positives = 120/267 (44%), Gaps = 41/267 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++   LS + + 
Sbjct: 187 LPRSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKETVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V    Y                       
Sbjct: 245 SCTR-----RQQGCDGGHLDAAWRYLHKKGVVDESCY----------------------- 276

Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           P  +++   K++ ++R      C  P       +D   T    +  + E  I  EI   G
Sbjct: 277 PYTQHRDTCKIRHNSRSLRANGCETPV---NVDRDTFYTVGPAYSLNREADIMAEIFNSG 333

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +  DF+ Y  GVY+ T+ N +     HS KL+GWG E NG  YW+  N+WG  
Sbjct: 334 PVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG++G  +ILRG  EC  E  + A  P
Sbjct: 394 WGEKGYFRILRGSNECGIEEYVLASWP 420


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 152/351 (43%), Gaps = 63/351 (17%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTXPLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DY 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+        D 
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
               GC  ++ S     G       C N  + K     +C+ P                Y
Sbjct: 314 NANNGCAMASRS--DGRGKRHATKPCPN-NIEKSNRIYQCSPP----------------Y 354

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKL 294
            V  +E  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ KL
Sbjct: 355 RVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 414

Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 TGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 152/351 (43%), Gaps = 63/351 (17%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---- 68
           LVR EL       I+Q+N+    WTA      N S+ +     + D   F     P    
Sbjct: 155 LVRPEL-------IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPM 200

Query: 69  -LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRC 126
            L  +  T     +  +P+ F A  +WP      H P D   CAA   F+     +DR  
Sbjct: 201 LLSMNEMTAPLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIA 257

Query: 127 IKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DY 180
           I+SKG+    LS + + SCC   R+     C+ GS+ R W +L KRG V+        D 
Sbjct: 258 IQSKGRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQ 313

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
               GC  ++ S     G       C N  + K     +C+ P                Y
Sbjct: 314 NANNGCAMASRS--DGRGKRHATKPCPN-NIEKSNRIYQCSPP----------------Y 354

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKL 294
            V  +E  I KEI+ +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ KL
Sbjct: 355 RVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 414

Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 415 TGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 83/265 (31%), Positives = 122/265 (46%), Gaps = 34/265 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDARE+WP    I  V D G CA+    +     +DR  I + G+ N PLS + + 
Sbjct: 184 LPSSFDAREKWPL--YIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQLL 241

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC         + C  G + R W ++ K G V+   Y   +G   +T  P         +
Sbjct: 242 SC----NQHRQRGCEGGYLDRAWWYIRKLGVVSELCYPYESG---ATQQP-----GECRI 289

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P    +    + C +   +P+         +R T  Y V   E  I  EI+ +GP  ATF
Sbjct: 290 PKSAYRTGAHIDCPSGAADPSV--------YRMTPPYRVSSREQDIMTEIITNGPVQATF 341

Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
            +Y+DF+ Y  GVY+H           K++ Y HS ++IGWG +  T     YWL  N+W
Sbjct: 342 LVYEDFFMYSGGVYQHLDLHEHKEEERKVQGY-HSVRIIGWGEDYSTGPQVKYWLAANSW 400

Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
           G  WG+ G  +ILRG+  C  E  +
Sbjct: 401 GNEWGEDGLFRILRGENHCEIESFV 425


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 83/267 (31%), Positives = 121/267 (45%), Gaps = 28/267 (10%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A E+WP  G +    D G CA    F+     SDR  I+S G   + LS + + 
Sbjct: 221 LPSHFNAAEKWP--GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLL 278

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC       +   C  G V   W +L +RG V+         C P T    + H SAP +
Sbjct: 279 SC----DTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGH-SAPCM 326

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
               +    K +    C N  Y      + +++T  Y +  +E  I KE+  +GP  A  
Sbjct: 327 MQSRSMGRGKRQATNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIM 383

Query: 264 ALYDDFYHYKSGVYKHTSNAKLE------NYLHSGKLIGWGTENGT-----PYWLVINTW 312
            +++DF+ YKSG+Y+ T   + E      +  HS K+ GWG E G       YWL  N+W
Sbjct: 384 EVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDGQTHKYWLAANSW 443

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
           G  WG+ G  +I RG+ EC  E  I  
Sbjct: 444 GRDWGEDGYFRIARGENECEIETFIVG 470


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/276 (32%), Positives = 123/276 (44%), Gaps = 42/276 (15%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           ++ YDP+    +P  FD+R +W     I  + D G C A    +     SDR  I SKG 
Sbjct: 195 KRIYDPD---ALPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGA 249

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPS 189
           +   LS + + SC         + C  G + R W F+ K G V    Y   G    C+  
Sbjct: 250 EAPELSAQQLLSC----NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGKNDQCKLR 305

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
             S     G       C     P L+       P Y  G                NE  I
Sbjct: 306 KRSTLKAAG-------CRKPSHP-LRTELYKVGPAYRLG----------------NETDI 341

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL-ENYLHSGKLIGWGTE---NGTP- 304
            +EIL  GP  AT  +Y DF+ YKSG+Y+H+ +A+L ++  HS ++IGWG E    G P 
Sbjct: 342 MQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPL 401

Query: 305 -YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            YWLV N+WG +WGD G  KI +G  EC  E  + A
Sbjct: 402 KYWLVANSWGYNWGDNGLFKIQKGTNECEIESYVLA 437


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 88/273 (32%), Positives = 119/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 97  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +   RC N         D ++ T  Y +  N+  I KE++ 
Sbjct: 206 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 119/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A ++WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPKAFEASKKWPN---MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
           + SC         + C  G +   W FL +RG V+   Y        +    +PC  H  
Sbjct: 259 LLSC----DTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSR 314

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +   RC N         + ++ T  Y +  +E  I KE++ +GP 
Sbjct: 315 A--------MGRGKRQATRRCPN---SHDDANEIYQVTPAYRLGSDEKEIMKELMENGPV 363

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE-----NGTPYWLV 308
            A   +Y+DF+ YKSG+Y HT  S  + E Y     HS K+ GWG E         YW  
Sbjct: 364 QALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDGRTLKYWTA 423

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +ILRG  EC  E  +
Sbjct: 424 ANSWGPSWGERGYFRILRGSNECDIESFV 452


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/275 (32%), Positives = 124/275 (45%), Gaps = 43/275 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ F +  +WP  G      D   CAA   F+     +DR  I+SKG+    LS + + 
Sbjct: 209 LPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQNLI 266

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-------DRTGCQPSTISPCSH 196
           SCC   R+     C  GS+ R W +L KRG V+   Y        +  GC  ++ S    
Sbjct: 267 SCCVKNRH----GCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASRS--DG 320

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G       C N  + K     +C+ P                Y V  NE  I KEI+ +
Sbjct: 321 RGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQN 363

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----PY 305
           GP  A   +++DF+HYKSG+Y+H +N K E+        H+ KL GWG   G       +
Sbjct: 364 GPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKF 423

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 424 WIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 94/188 (50%), Gaps = 9/188 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A SDR CI S G  N+ LS   + SCCK C Y     C  G     W+F    G V
Sbjct: 1   GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCKDCGY----GCDGGFPPMAWDFWKTHGIV 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGG   +  GC+P     C HH S    P C  +  P  KC   C  P     + +DK R
Sbjct: 57  TGGSKEEPAGCRPYPFPKCQHH-SQGHYPPCPRRIYPTPKCVKHCDTPKID--YQKDKTR 113

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V  +E AI KEIL +GP  ATF +++DF  YKSG+Y H     +    H+ +++
Sbjct: 114 ANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGG--HAIRIL 171

Query: 296 GWGTENGT 303
           GWG ENG 
Sbjct: 172 GWGEENGV 179


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/267 (30%), Positives = 119/267 (44%), Gaps = 41/267 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++   LS + + 
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V    Y                       
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVDENCY----------------------- 276

Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           P  +++   K++ ++R      C  P       +D   T    +  + E  I  EI   G
Sbjct: 277 PYTQHRDTCKIRHNSRSLRANGCQTPV---NVDRDTLYTVGPAYSLNREADIMAEIFHSG 333

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +  DF+ Y  GVY+ T+ N K     HS KL+GWG E NG  YW+  N+WG  
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +ILRG  EC  E  + A  P
Sbjct: 394 WGEHGYFRILRGSNECGIEEYVLASWP 420


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 90/273 (32%), Positives = 130/273 (47%), Gaps = 30/273 (10%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           DPE    +P  F++ E+WP  G I    D G CAA   F+     SDR  I+S G     
Sbjct: 2   DPERDQ-LPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQ 58

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ-PSTISPCS 195
           LS + + SC       +   C+ G +   W +L +RG VT   Y  R   Q P+ +S C 
Sbjct: 59  LSPQNLISC----DTRNQGGCAGGRLDGAWWYLRRRGVVTEDCYPYRPPQQTPAELSRC- 113

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
                  +    +    K +   RC N      +  D +++T  Y +  +E  I KEI  
Sbjct: 114 -------MMQSRSVGRGKRQATQRCPNTN---NYQNDIYQSTPPYRLSTSEKEIMKEIQD 163

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTE---NGT--P 304
           +GP  A   +++DF+ Y SG+YKHT  S  K  +Y     HS K+ GWG E   +GT   
Sbjct: 164 NGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRK 223

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW+  N+WG +WG+ G  +I RG+ EC  E  +
Sbjct: 224 YWIAANSWGKNWGENGYFRIARGENECEIEAFV 256


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 126/280 (45%), Gaps = 54/280 (19%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC K     + + C  G +   W FL +RG V+   Y            P S  G   
Sbjct: 258 LLSCDK----RNQQGCQGGHLDSAWWFLRRRGVVSDHCY------------PFSGQGRTE 301

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQ-------------DKHRTTLTYWVDDNEDA 248
           T P+      P+   H+R      GRG  Q             D ++ T  Y +  +E  
Sbjct: 302 TGPA------PRCMMHSR----AMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKE 351

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y++G+Y HT  S  + E Y     HS K+ GWG E+ 
Sbjct: 352 IMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESL 411

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 412 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 451


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VP+ FD RE++P+C  I  V D G C +   F++V  F DRRC+    ++    S +YV 
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        + +C+ G +   W FL K G+ T         C P      +  G+ PT 
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
            +  + KV                      H  T T + D   D  A+ K +   GP   
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
            F +Y DF +Y+SGVY+HT         H+ +++G+GT++ G  YW++ N+WGP WG+ G
Sbjct: 219 AFLVYSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIRNSWGPDWGEDG 276

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +++RG  +C+ E    AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 67/187 (35%), Positives = 98/187 (52%), Gaps = 9/187 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A +DR CI S     + +S+  + SCC+ C +     C  G   R W+F  + G V
Sbjct: 1   GAVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGF----GCHGGFPPRAWDFWMENGLV 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGG   + +GC+      C+HHG  P  P C  +  P   C+  C  P     +  DK +
Sbjct: 57  TGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPACNKTCDTPEVN--YILDKTK 113

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V ++E AI KEI+ +GP  A F +Y+DF HY+SGVY H+    +    H+ +++
Sbjct: 114 AKSSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGG--HAIRML 171

Query: 296 GWGTENG 302
           GWG ENG
Sbjct: 172 GWGEENG 178


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 88/273 (32%), Positives = 119/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +   RC N         D ++ T  Y +  N+  I KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 119/265 (44%), Gaps = 35/265 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F++ ++W +   I  V D G C +  + +     SDR  I+S+G++   LS + + 
Sbjct: 142 LPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNIL 199

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHGSA 200
           SC +       + C+ G +   W +LHK+G V    Y   G R  C              
Sbjct: 200 SCTR-----RQQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDAC-------------- 240

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                    K+P      R        G  +D+  T    +  +NE  I  EI   GP  
Sbjct: 241 ---------KIPHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNETDIMAEIFMSGPVQ 291

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTE-NGTPYWLVINTWGPHWGD 318
           AT  +Y DF+ Y  G+Y+HT+ ++      HS KLIGWG E +G  YW+  N+WG  WG+
Sbjct: 292 ATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGE 351

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G  +ILRG  EC  E  + A  P 
Sbjct: 352 HGNFRILRGSNECGIEEYVLAAWPN 376


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 82/265 (30%), Positives = 129/265 (48%), Gaps = 34/265 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR++W +   I  + D G C +    +  G  SDR  I S+G+ N  LS++ + 
Sbjct: 258 LPEHFDARDKWGH--LIHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 315

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC         K C  G + R W ++ K G V  GD+     C P  +S  S       +
Sbjct: 316 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 363

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  +      L+C +   + T          + T  Y V   E+ I+ E++ +GP  ATF
Sbjct: 364 PKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 415

Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
            +++DF+ Y  GVY+H+       +++  E Y HS +++GWG ++ T     YWL  N+W
Sbjct: 416 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 474

Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
           G  WG+ G  KILRG+  C  E  +
Sbjct: 475 GTQWGEDGYFKILRGENHCEIESFV 499


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 81/239 (33%), Positives = 108/239 (45%), Gaps = 28/239 (11%)

Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
           G C +   F AV    DR CI      N  LS   + +CC     D    C  G     W
Sbjct: 1   GHCGSCWAFGAVECLQDRFCIHF--NMNISLSVNDLVACCGFMCGD---GCDGGYPIMAW 55

Query: 167 NFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
            +  + G VT     Y D+ GC+        H G  P  P+     V + KC  +     
Sbjct: 56  RYFVRNGVVTDECDPYFDQVGCK--------HPGCEPAYPT----PVCEKKCKVQ----- 98

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
             + + + KH +   Y V+ +   I  E+  +GP    F +Y+DF HYKSGVYKH +   
Sbjct: 99  -NQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGM 157

Query: 285 LENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +    H+ KLIGWGT + G  YWL+ N W   WGD G  KI+RG  EC  E  + AG P
Sbjct: 158 MGG--HAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 214


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 91/301 (30%), Positives = 125/301 (41%), Gaps = 66/301 (21%)

Query: 84  VPDRFDAREQWPNCGTIG------------------------------------HVPDTG 107
           +P  FDAR  WP C TIG                                    ++ D G
Sbjct: 99  LPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQG 158

Query: 108 ACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK-ICRYDDNKSCSHGSVFRTW 166
            C +   F AV + SDR CI      N  LS   + +CC  +C       C  G     W
Sbjct: 159 HCGSCWAFGAVESLSDRFCIHFG--MNISLSVNDLLACCGFLC----GSGCDGGYPLYAW 212

Query: 167 NFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
            +    G VT     Y D TGC        SH G  P  P+         KC  +CT+  
Sbjct: 213 RYFIHHGVVTEECDPYFDATGC--------SHPGCEPGYPT--------PKCVRKCTDEN 256

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
             + + + K      Y +  +   I  E+  +GP    F +Y+DF HY+SGVY++T+   
Sbjct: 257 --QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDV 314

Query: 285 LENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +    H+ KLIGWGT ++G  YW++ N W  +WGD G   I RG  EC  E  + AG P 
Sbjct: 315 MGG--HAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPS 372

Query: 344 N 344
           +
Sbjct: 373 S 373


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 138/330 (41%), Gaps = 51/330 (15%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDP-E 79
            S++ ++ +NR  ++W A  N+P     + L++ LI     F     PL  + +   P  
Sbjct: 131 MSNSVVEGVNRGGSSWRA-YNYP-EFRNKKLKEGLIYKLGTF-----PLNAETRRMGPLR 183

Query: 80  YSATVP--DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           Y   VP   +FDAR +WP  G I  + D G C +    +  G  SDR  I+S G +N  L
Sbjct: 184 YDKDVPYPTQFDARTRWP--GFISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVL 241

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           S + + SC         + C  G +   WNF    G V                      
Sbjct: 242 SPQTLLSC----NVRAQQGCHGGHIDVAWNFARGHGLVD--------------------- 276

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-----DNEDAIKKE 252
                   C   K    +C  R        G      R T  Y +       +E  I  +
Sbjct: 277 ------EKCFPYKASVTRCPFRPRGNLIQDGCMPLVKRRTSRYKLGPPAKLSHEKDIMYD 330

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENGTPYWLVIN 310
           I+  GP  A   +Y DF+HY+ GVY+ +   N +L+ + HS ++IGWG + G  YW+V N
Sbjct: 331 IMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGF-HSVRIIGWGEDRGDRYWVVAN 389

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +WG  WG+ G  +I RG  E   E  +  G
Sbjct: 390 SWGRQWGENGYFRIARGSNEADIESFVVTG 419


>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
           [Loxodonta africana]
          Length = 468

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A ++WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 203 VLPMAFEASKKWPN---LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 259

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC       + + C  G +   W FL +RG V+       G   D+ G     + PC 
Sbjct: 260 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAG----PVPPCM 311

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H  A            K +  +RC N   +G   +Q     T  Y +  NE  I KE++
Sbjct: 312 MHSRA--------MGRGKRQATSRCPNSHVHGNDIYQ----VTPAYRLGTNEKEIMKELM 359

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
            +GP  A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E        
Sbjct: 360 ENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDGRTL 419

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 453


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 148/327 (45%), Gaps = 53/327 (16%)

Query: 22  SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           S A + +I      W AG  + F  N++E+  R  LI   +   +S   LP    T   E
Sbjct: 17  SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P +FD R+++P C  +    D G+C +   F+A+G F DRRC     ++    S 
Sbjct: 75  LVDPIPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
           +++ SC       +N  C  G    TW+FL   G+ T       DYG             
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174

Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            H  ++P    C++    +L K H       YG+              V  +  AI   +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
           +A GP      +Y D  +Y+SGVYKHT    +    H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
           GP WG+ G  +I+RG  EC  E  I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/267 (31%), Positives = 120/267 (44%), Gaps = 41/267 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++W +   I  VPD G C A  + +     SDR  I+SKG++   LS + + 
Sbjct: 187 LPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNIL 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +       + C  G +   W +LHK+G V          C P T             
Sbjct: 245 SCTR-----RQQGCEGGHLDAAWRYLHKKGVVD-------ENCYPYT------------- 279

Query: 204 PSCENQKVPKLKCHTR------CTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
              +++   K++ ++R      C  P       +D   T    +  + E  I  EI   G
Sbjct: 280 ---QHRDTCKIRHNSRSLRANGCQTPV---NVDRDTLYTVGPAYSLNREADIMAEIFHSG 333

Query: 258 PTTATFALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPH 315
           P  AT  +  DF+ Y  GVY+ T+ N K     HS KL+GWG E NG  YW+  N+WG  
Sbjct: 334 PVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWGSW 393

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +ILRG  EC  E  + A  P
Sbjct: 394 WGEHGYFRILRGSNECGIEDYVLASWP 420


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 119/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 57  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 113

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + +C         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 114 LLACDT----HHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 165

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +   RC N         D ++ T  Y +  N+  I KE++ 
Sbjct: 166 MHSRA--------MGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELME 214

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 215 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 274

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 275 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 307


>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
           [Loxodonta africana]
          Length = 437

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A ++WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 172 VLPMAFEASKKWPN---LIHEPLDQGDCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 228

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC       + + C  G +   W FL +RG V+       G   D+ G     + PC 
Sbjct: 229 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAG----PVPPCM 280

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H  A            K +  +RC N   +G   +Q     T  Y +  NE  I KE++
Sbjct: 281 MHSRA--------MGRGKRQATSRCPNSHVHGNDIYQ----VTPAYRLGTNEKEIMKELM 328

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
            +GP  A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E        
Sbjct: 329 ENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDGRTL 388

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 389 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 422


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 150 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 206

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
           + SC         + C  G +   W FL +RG V+   Y    R   + S    C  H  
Sbjct: 207 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 262

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +  +RC N   G+    D ++ T  Y +  +E  I KE++ +GP 
Sbjct: 263 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 311

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
            A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW  
Sbjct: 312 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 371

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 372 ANSWGPWWGERGHFRIVRGTNECDIETFV 400


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 147/338 (43%), Gaps = 56/338 (16%)

Query: 26  IDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP-----LPGDRKTYDPEY 80
           I+Q+N+    WTA      N S+ +     + D   F     P     L  +  T     
Sbjct: 149 IEQVNKGDYGWTA-----QNYSQFW--GMTLEDGFKFRLGTLPPSPMLLSMNEMTAPLPA 201

Query: 81  SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           +  +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS 
Sbjct: 202 TTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 258

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISP 193
           + + SCC   R+     C+ GS+ R W +L KRG V+        D     GC  ++ S 
Sbjct: 259 QNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGCAMASRS- 313

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
               G       C N  + K     +C+ P                Y V  +E  I KEI
Sbjct: 314 -DGRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSSETEIMKEI 355

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT---- 303
           + +GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ KL GWGT  G     
Sbjct: 356 MQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRK 415

Query: 304 -PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
             +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 416 EKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453


>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 255

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/259 (31%), Positives = 120/259 (46%), Gaps = 17/259 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           L  +     V  + Y     +ID IN +A TW AG NF  +  +E+  + L   +K    
Sbjct: 8   LSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKGVQI 65

Query: 65  SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            ++      KT+D  Y      +P  FDAR +W +C TIG V D G C +    A   AF
Sbjct: 66  PNKHNIHMYKTHDEAYDNLFGRIPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAF 125

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTGGDY 
Sbjct: 126 ADRLCVATNADFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTGGDYQ 181

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLT 239
              GC+P  + PC +   A    +C  +  P+   H RCT   YG     F + HR T  
Sbjct: 182 SGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNXDLDFDEDHRYTRD 236

Query: 240 YWVDDNEDAIKKEILAHGP 258
           ++      +I+K+++ +GP
Sbjct: 237 FYY-LTYGSIQKDVMTYGP 254


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 147/327 (44%), Gaps = 53/327 (16%)

Query: 22  SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           S A + +I      W AG  + F  N++E+  R  LI   +   +S   LP    T   E
Sbjct: 17  SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P +FD R+++P C  +    D G+C     F+A+G F DRRC     ++    S 
Sbjct: 75  LVDPIPPQFDFRDEYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
           +++ SC       +N  C  G    TW+FL   G+ T       DYG             
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174

Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            H  ++P    C++    +L K H       YG+              V  +  AI   +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
           +A GP      +Y D  +Y+SGVYKHT    +    H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
           GP WG+ G  +I+RG  EC  E  I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/274 (31%), Positives = 120/274 (43%), Gaps = 44/274 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + +
Sbjct: 203 LPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 259

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTI--SPC 194
            SC K     + + C  G +   W FL +RG V+       G   +  G +P  +  S  
Sbjct: 260 LSCDK----HNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGQERNEAGPEPRCMMHSRA 315

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
              G    +  C N  V                    D ++ T  Y +  NE  I KE++
Sbjct: 316 MGRGKRQAIARCPNHHV-----------------HANDIYQVTPAYRLGSNEKEIMKELM 358

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GT 303
            +GP  A   +++DF+ Y+ G+Y HT  S  K E Y     HS K+ GWG E        
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDGRTL 418

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 419 KYWTAANSWGPAWGERGHFRIVRGTNECDIESFV 452


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/271 (32%), Positives = 122/271 (45%), Gaps = 36/271 (13%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 171 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC         + C  G +   W FL +RG V+  D+     C P +       G AP
Sbjct: 228 LLSC----DTHHQQGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFSGQERDKAGPAP 276

Query: 202 TLPSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
               C     P    K +   RC N         D ++ T  Y +  NE  I KE++ +G
Sbjct: 277 L---CMMHSRPMGRGKRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENG 330

Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYW 306
           P  A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E         YW
Sbjct: 331 PVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYW 390

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
              N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 391 TAANSWGPAWGERGHFRIVRGANECDIESFV 421


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 121/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--GDRTGCQPSTISPCSHHGS 199
           + SC         + C  G +   W FL +RG V+   Y    R   + S    C  H  
Sbjct: 258 LLSC----DTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASPTPRCMMHSR 313

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +  +RC N   G+    D ++ T  Y +  +E  I KE++ +GP 
Sbjct: 314 A--------MGRGKRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPV 362

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
            A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW  
Sbjct: 363 QALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTA 422

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 423 ANSWGPWWGERGHFRIVRGTNECDIETFV 451


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/277 (32%), Positives = 123/277 (44%), Gaps = 46/277 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS + +
Sbjct: 214 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 270

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GS+ R W FL KRG V+   Y           GC  ++ S   
Sbjct: 271 ISCCPKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASRS--D 324

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I KEI+ 
Sbjct: 325 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 367

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN-------YLHSGKLIGWGTENGT----- 303
           +GP  A   +++DF+HYK+G+Y+H +    E          H+ KL GWGT  G      
Sbjct: 368 NGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKE 427

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 428 KFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/270 (31%), Positives = 119/270 (44%), Gaps = 33/270 (12%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
           + SC         K C  G +   W FL +RG V+   Y   G     + S    C  H 
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            A            K +  +RC N         D ++ T  Y +  +E  I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSQVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
             A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW 
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 423 AANSWGPWWGERGHFRIVRGINECDIETFV 452


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 37/265 (13%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +   PD FD RE++P+C  I  V D G C +   F++V +  DRRC     ++    S +
Sbjct: 71  ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           YV SC +      + +C  G +   W FL K G+ T         C P         G+ 
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT    +   +P L   T+  +  YG                  +  AI K +   GP  
Sbjct: 177 PT-KCADGSDLPHLYKATKAVD--YGL-----------------DAPAIMKALATGGPLQ 216

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDR 319
             F +Y DF +Y+SGVY+HT   ++E   H+  ++G+GT++ G  YW++ N+WGP WG+ 
Sbjct: 217 TAFTVYSDFMYYESGVYQHT-YGRVEGG-HAVDMVGYGTDDDGVDYWIIKNSWGPDWGED 274

Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
           G  +I+R   EC  E  +  G  +N
Sbjct: 275 GYFRIIRMTNECGIEEQVIGGFFEN 299


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 147/327 (44%), Gaps = 53/327 (16%)

Query: 22  SDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
           S A + +I      W AG  + F  N++E+  R  LI   +   +S   LP    T   E
Sbjct: 17  SRAELRRIQALNPPWKAGMPKRF-ENVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQE 74

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               +P +FD R+++P C  +    D G+C     F+A+G F DRRC     ++    S 
Sbjct: 75  LVDPIPPQFDFRDEYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQ 132

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPC 194
           +++ SC       +N  C  G    TW+FL   G+ T       DYG             
Sbjct: 133 QHLISCSL-----ENFGCDGGDFQPTWSFLTFTGATTAECVKYVDYG------------- 174

Query: 195 SHHGSAPTLPSCENQKVPKL-KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
            H  ++P    C++    +L K H       YG+              V  +  AI   +
Sbjct: 175 -HTVASPCPAVCDDGSPIQLYKAHG------YGQ--------------VSKSVPAIMGML 213

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
           +A GP      +Y D  +Y+SGVYKHT    +    H+ +++G+GT ++GT YW++ N+W
Sbjct: 214 VAGGPLQTMIVVYADLSYYESGVYKHTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSW 272

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA 339
           GP WG+ G  +I+RG  EC  E  I A
Sbjct: 273 GPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/128 (47%), Positives = 83/128 (64%), Gaps = 5/128 (3%)

Query: 216 CHTRCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
           C + C N  YG  F +D+H T +L      +  +IKKEI+ +GPT+A F++Y+DF  YKS
Sbjct: 8   CSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSYKS 67

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           GVYKHTS   L    H+ ++IGWGTE G  YWLV+N+W   WGD GT KI++G  +C  +
Sbjct: 68  GVYKHTSGGFLGG--HAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGID 123

Query: 335 YLIAAGKP 342
            +I AG P
Sbjct: 124 DMILAGTP 131


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/299 (30%), Positives = 128/299 (42%), Gaps = 45/299 (15%)

Query: 45  NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
           N++   LR  L   +      D P     +  + E    +P  FDAR QW  C  +  + 
Sbjct: 54  NMTISQLRDNLFGLSLMSSDEDTP-----RMANIETRVDIPMNFDARTQWKGC--VPAIR 106

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D   C A   F+A    + R CI + GQ N  LS EY   C  +     NK+C  G +  
Sbjct: 107 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 161

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQ-KVPKLKCHTRCTNP 223
           +W FL            + TG    T  P +  G   +  +C  Q K+  +         
Sbjct: 162 SWTFL------------ENTGTPLDTCIPYASGGGTFSSGTCPTQCKIASMS-------- 201

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
                    K++   T ++    + IK  I+ +G   A F +Y D   YKSGVYKH  + 
Sbjct: 202 -------MSKYKAKNTVYISGINN-IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVST 253

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            L    H+  LIG+G E G+ YWL  N+WGP+WG  G  KI +G  E   E  + AG+P
Sbjct: 254 VLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 144/324 (44%), Gaps = 53/324 (16%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++ +D +N + ++TW A          EY R+ L   AK      +   G    +   
Sbjct: 3   LAESVVDIVNNDPSSTWVA---------TEYPREILTL-AKMTAMISQIGNGFEGEWTFA 52

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +   P  FD R++WP  G    V +  +C +    AA      R  I+  G     +S 
Sbjct: 53  ENENAPASFDCRQKWP--GKAEPVRNQASCGSCWAHAASETMGFRMGIR--GCYKGVMSP 108

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SC       +N  C  G   R WN++ K+G  T         C P      S  G 
Sbjct: 109 QDLVSC-----ESNNMGCEGGYADRVWNWIQKKGITT-------EQCLPYV----SGSGR 152

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            PT PS             +C N +       +  R+ ++ W   N   +  E+  +GP 
Sbjct: 153 VPTCPS-------------KCKNGS-------NIVRSFVSSWGSFNSKTVMDEVANNGPV 192

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
            A F +++DF +YKSG+Y+H +  K + + H   L+GWGTENG PYWL+ N+WG  WG++
Sbjct: 193 YACFEVFEDFLNYKSGIYQHKT-GKSKGWHHV-MLMGWGTENGVPYWLLQNSWGSGWGEK 250

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  +I RG  +C  + +  +G PK
Sbjct: 251 GFFRIRRGTNDCHIDEIFYSGLPK 274


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/271 (32%), Positives = 122/271 (45%), Gaps = 36/271 (13%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC         + C  G +   W FL +RG V+  D+     C P +       G AP
Sbjct: 259 LLSC----DTHHQQGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFSGQERDKAGPAP 307

Query: 202 TLPSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
               C     P    K +   RC N         D ++ T  Y +  NE  I KE++ +G
Sbjct: 308 L---CMMHSRPMGRGKRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENG 361

Query: 258 PTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYW 306
           P  A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E         YW
Sbjct: 362 PVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYW 421

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
              N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 422 TAANSWGPAWGERGHFRIVRGANECDIESFV 452


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/275 (32%), Positives = 126/275 (45%), Gaps = 44/275 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS + +
Sbjct: 217 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSH 196
            SCC   R+     C+ GS+ R W +L KRG V+        D     GC  ++ S    
Sbjct: 274 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGCAMASRS--DG 327

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G       C N  + K     +C+ P                Y V  +E  I KEI+ +
Sbjct: 328 RGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSSETEIMKEIMQN 370

Query: 257 GPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PY 305
           GP  A   + +DF+HYK+G+Y+H  ++N + E Y     H+ KL GWGT  G       +
Sbjct: 371 GPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKF 430

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 WIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 150/351 (42%), Gaps = 62/351 (17%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
           LVR EL       I+ +N+    WTA     F     EE L+ F +           P+ 
Sbjct: 155 LVRPEL-------IEYVNKGDYGWTAKNYSQFWGMTLEEGLK-FRLGTL-----PPSPML 201

Query: 71  GDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCI 127
                  P   AT  +P+ F A  +WP      H P D   CAA   F+     +DR  I
Sbjct: 202 LSMNEVTPSLPATTDLPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAI 258

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY------- 180
           +S G+    LS + + SCC   R+     C+ GSV R W +L KRG V+   Y       
Sbjct: 259 QSNGRYTANLSPQNLISCCTKNRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQN 314

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
            +  GC  ++ S     G       C N  + K     +C+ P                Y
Sbjct: 315 ANNNGCAMASRS--DGRGKRHATKPCPNN-IEKSNVIYQCSPP----------------Y 355

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKL 294
            V  NE  I KEI+ +GP  A   +++DF+HYK+G+Y+H   ++ + E Y     H+ KL
Sbjct: 356 RVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKL 415

Query: 295 IGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            GWG   G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 416 TGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466


>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 238

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/246 (32%), Positives = 117/246 (47%), Gaps = 21/246 (8%)

Query: 24  AYIDQINREANTWTA----GRNFPANL--SEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
           + +D++N + N WTA    GR + ++L  +++    FL    +           + K Y 
Sbjct: 3   SLVDEVNSKQNLWTASTEQGRFYGSSLGDAKKLCGTFLNGTEEL----------EEKVYP 52

Query: 78  PEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
           PE    +PD FDAR+ +  C   IGHV D  AC +   F  V AF+ R CIKS G+ N+ 
Sbjct: 53  PEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQL 112

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG---GDYGDRTGCQPSTISP 193
           LS   + +CC I  +  +  CS G+   +W FLH  G V+G    +     GC P     
Sbjct: 113 LSAADMLACCNIEHFCLSFGCSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPK 172

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT-TLTYWVDDNEDAIKKE 252
           C+HH        C  +      C + C N  YG  F +D+H T +L      +  +IKKE
Sbjct: 173 CAHHQKESDYKPCAKELYDTPSCSSSCPNAKYGTAFDKDRHYTESLLPSRFGSTSSIKKE 232

Query: 253 ILAHGP 258
           I+ +GP
Sbjct: 233 IMTNGP 238


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/268 (32%), Positives = 121/268 (45%), Gaps = 30/268 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 189 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 245

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+  D+     C P         G AP
Sbjct: 246 LLSC----DTHNQRGCHGGRLDGAWWFLRRRGVVS--DH-----CYPFVGREQDEAGPAP 294

Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             +         K +   RC +         D ++ T  Y +  NE  I KE++ +GP  
Sbjct: 295 RCMMHSRAMGRGKRQATARCPS---SHAHANDIYQVTPAYRLGSNEKEIMKELMENGPVQ 351

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
           A   +++DF+ Y+SG+Y HT  S  + E Y     HS K+ GWG E         YW   
Sbjct: 352 ALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 411

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
           N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 412 NSWGPAWGERGHFRIVRGANECDIESFV 439


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 83/265 (31%), Positives = 129/265 (48%), Gaps = 34/265 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR++W +   I  V D G C +    +  G  SDR  I S+G+ N  LS++ + 
Sbjct: 202 LPEHFDARDKWGH--LIHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLL 259

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC         K C  G + R W ++ K G V  GD+     C P  +S  S       +
Sbjct: 260 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 307

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  +      L+C +   + T          + T  Y V   E+ I+ E++ +GP  ATF
Sbjct: 308 PKRDYTNRQGLRCPSGDQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 359

Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
            +++DF+ Y  GVY+H+       +++  E Y HS +++GWG ++ T     YWL  N+W
Sbjct: 360 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 418

Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
           G  WG+ G  KILRG+  C  E  +
Sbjct: 419 GTQWGEDGYFKILRGENHCEIESFV 443


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 197 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 253

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 254 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 305

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 306 MHSQA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 354

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 355 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 414

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 415 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 447


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 88/279 (31%), Positives = 118/279 (42%), Gaps = 52/279 (18%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 97  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTP----PCM 205

Query: 196 HH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
            H      G      SC N  V                    D ++ T  Y +  N+  I
Sbjct: 206 MHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTPVYRLGSNDKEI 248

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-- 301
            KE++ +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E   
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308

Query: 302 ---GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                 YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 123/276 (44%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GS+ R W FL KRG V+   Y           GC  ++ S   
Sbjct: 274 ISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRS--D 327

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I KEI+ 
Sbjct: 328 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 370

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF+HYK+G+Y+H +    E+        H+ KL GWGT  G       
Sbjct: 371 NGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEK 430

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 FWIAANSWGISWGENGYFRILRGVNESDIEKLIIAA 466


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 97  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 206 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 79/258 (30%), Positives = 120/258 (46%), Gaps = 38/258 (14%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VPD FD RE++P+C  I  V D G+C +   F++V +  DRRC     ++    S +YV 
Sbjct: 74  VPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQYVV 131

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        + +C  G +   W FL K G+ T         C P         G+ PT 
Sbjct: 132 SCDH-----GDMACDGGWLQSVWRFLTKTGTTT-------NECVPYQSGTTGARGTCPT- 178

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                          +C +   G      K +  + Y +D   D I K ++  GP    F
Sbjct: 179 ---------------KCAD---GGELSTVKAKKAVDYGLDC--DLIMKALVTGGPLQTAF 218

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTV 322
            +Y DF +Y+ GVY+H S  ++E   H+ +++G+GT E    YW++ N+WGP WG+ G  
Sbjct: 219 TVYSDFMYYEGGVYQHMS-GRVEGG-HAVEMVGYGTDEYDVDYWIIRNSWGPDWGEDGYF 276

Query: 323 KILRGKYECAFEYLIAAG 340
           +I+R   EC  E  +  G
Sbjct: 277 RIIRMTNECGIEEQVMGG 294


>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
 gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
          Length = 498

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 88/262 (33%), Positives = 124/262 (47%), Gaps = 27/262 (10%)

Query: 83  TVPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           ++P  FDAR+++P C   IG V D G C +    AA    +DR CI S G++   LS ++
Sbjct: 256 SLPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQF 315

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
             SC     Y+    C  G V  T      +G   GG   D+  C P    PC H    P
Sbjct: 316 ALSC-----YNSGAGCEGGDVVDTLTLALAKGVPHGGML-DKGACLPYQFEPCDHPCMIP 369

Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA-IKKEILAHGPT 259
            T P           C   C + +     FQ  +   L Y    ++ A I KEI   G  
Sbjct: 370 GTSPEA---------CPATCADGSK----FQLVYPKNLPYTCPPDDIACIAKEIKNRGSV 416

Query: 260 TATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPHWG 317
             TF  +++DFY +K GVYK T ++  E   H+ KLIGWG T+ G  YW+++N+W  +WG
Sbjct: 417 AVTFGPVHEDFYGHKEGVYKVTESSGRELGNHATKLIGWGVTQEGDHYWIMVNSW-RNWG 475

Query: 318 DRGTVKILRGKYECAFEYLIAA 339
           + G  K+  G  E + E  +AA
Sbjct: 476 ENGVGKVRMG--EMSIESGVAA 495


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 129/274 (47%), Gaps = 37/274 (13%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           A +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS +
Sbjct: 214 ADLPEVFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC   R+     C+ GS+ R W FL KRG V+         C P      +++ S 
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVS-------HACYPLFKEQSTNNNSC 319

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRT---TLTYWVDDNEDAIKKEILAHG 257
                 + +   K      C N       F+  +R    +  Y +  NE  I +EI+ +G
Sbjct: 320 AMASRSDGRG--KRHATRPCPNS------FEKSNRIYQCSPPYRISSNETEIMREIIQNG 371

Query: 258 PTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----PYW 306
           P  A   +++DF++YK+G+Y+H  ++N + E Y     H+ KL GWGT  G       +W
Sbjct: 372 PVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFW 431

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 432 IAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 96/342 (28%), Positives = 142/342 (41%), Gaps = 55/342 (16%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           LV +     V    +  ++  +  I  + + W          +     Q L     Y   
Sbjct: 4   LVIIGTIVAVAVATHPINEEMVAHIKAKTSLWQPHETTTNPFNNMTKEQLLAKCGTYIVP 63

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           +++  PG +         TVP+ FDAR+QW +   I  + D   C +   F A  AFSDR
Sbjct: 64  ANKEYPGSKIM-------TVPENFDARQQWGS--KIHAIRDQQQCGSCWAFGATEAFSDR 114

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGG--DYG 181
             I  K   +  LS E + SC      D N   C+ G +   W +L   G+ T     Y 
Sbjct: 115 FAINGK---DVILSPEDLVSC------DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYS 165

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
             +G  P+    C+  GSA          + + KC       + G               
Sbjct: 166 AGSGFAPACSDKCAD-GSA----------MQRFKCAPNSVRQSKGVA------------- 201

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
                  I+ EI++HGP    F +Y DF++Y+SGVY  T+        H+ K++G+G EN
Sbjct: 202 ------QIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGG--HAIKILGYGVEN 253

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           GTPYWL  N+WGP WG  G  KI +G  EC  E  + +  P+
Sbjct: 254 GTPYWLCANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQ 293


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/265 (30%), Positives = 128/265 (48%), Gaps = 34/265 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FD+R++W +   I  V D G C +    +  G  SDR  I S+G+ N  LS++ + 
Sbjct: 198 LPEHFDSRDKWGH--LINPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLL 255

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC         K C  G + R W ++ K G V  GD+     C P  +S  S       +
Sbjct: 256 SC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCLI 303

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P  +      L+C +   + T          + T  Y V   E+ I+ E++ +GP  ATF
Sbjct: 304 PKRDYTDRRGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQATF 355

Query: 264 ALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINTW 312
            +++DF+ Y  GVY+H+       +++  E Y HS +++GWG ++ T     YWL  N+W
Sbjct: 356 VVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGRPIKYWLCANSW 414

Query: 313 GPHWGDRGTVKILRGKYECAFEYLI 337
           G  WG+ G  KILRG   C  E  +
Sbjct: 415 GTQWGEDGYFKILRGDNHCEIESFV 439


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 123/260 (47%), Gaps = 41/260 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           VP+ FD RE++P+C  I  V D G C +   F++V  F DRRC+    ++    S +YV 
Sbjct: 75  VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYVV 132

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC        + +C+ G +   W FL K G+ T         C P      +  G+ PT 
Sbjct: 133 SCDH-----GDMACNGGWLPNVWKFLTKTGTTT-------DECVPYKSGSTTLRGTCPTK 180

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED--AIKKEILAHGPTTA 261
            +  + KV                      H  T T + D   D  A+ K +   GP   
Sbjct: 181 CADGSSKV----------------------HLATATSYKDYGLDIPAMMKALSTSGPLQV 218

Query: 262 TFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWGPHWGDRG 320
            F ++ DF +Y+SGVY+HT         H+ +++G+GT++ G  YW++ N+WGP WG+ G
Sbjct: 219 AFLVHSDFMYYESGVYQHTYGYMEGG--HAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDG 276

Query: 321 TVKILRGKYECAFEYLIAAG 340
             +++RG  +C+ E    AG
Sbjct: 277 YFRMIRGINDCSIEEQAYAG 296


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/251 (31%), Positives = 115/251 (45%), Gaps = 41/251 (16%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +A +PD FD+R QW +C  +  + D   C +   FAA  + SDR CI S+G+ N  LS
Sbjct: 73  QINAALPDSFDSRTQWKDC--VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLS 130

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + SC        N  C  G + + W +L ++G  +         C+P      S +G
Sbjct: 131 PQDMVSC-----DTSNFGCFGGYLDQAWQYLEQQGVSS-------DSCEPYK----SGNG 174

Query: 199 SAPTLPS-CEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
             P+ P+ C N Q + K KC    T    G                    +A K  I   
Sbjct: 175 DQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA-------------------EATKSLIQES 215

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHW 316
           GP    F +Y DFY+Y SGVY H +        H+ K++GWG +    YW+V N+WG  W
Sbjct: 216 GPVETGFTVYQDFYNYNSGVYHHVTGDAEGG--HAVKILGWGKQGLENYWIVANSWGEDW 273

Query: 317 GDRGTVKILRG 327
           G++G   I +G
Sbjct: 274 GEKGYFNIRQG 284


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 101/350 (28%), Positives = 138/350 (39%), Gaps = 65/350 (18%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           G +Y     Y D  NR    W AG +        +    L    +Y   + RP       
Sbjct: 109 GRIYPILGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVMNM 159

Query: 76  YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
           ++          +P  F+A E+WPN   + H P D G CA    F+     SDR  I S 
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
           G     LS + + SC         + C  G +   W FL +RG V+       G   D  
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272

Query: 185 GCQPSTISPCSHH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
           G  P    PC  H      G      SC N  V                    D ++ T 
Sbjct: 273 GPTP----PCMMHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTP 311

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSG 292
            Y +  N+  I KE++ +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS 
Sbjct: 312 VYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSV 371

Query: 293 KLIGWGTEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           K+ GWG E         YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 372 KITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 84/267 (31%), Positives = 121/267 (45%), Gaps = 37/267 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDA   WP  G IG V D G C +    +     SDR  I SKG++   L+ + + 
Sbjct: 185 LPTHFDATNYWP--GFIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIV 242

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC +      ++ CS G +   W++L K G+V    Y   +      I P     +A   
Sbjct: 243 SCVR-----RSQGCSGGHLDTAWSYLRKVGTVNEECYPYISAHNVCKIRPSDTLITA--- 294

Query: 204 PSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            +CE   KV +   +     P +                  +NE  I  EI  HGP  A 
Sbjct: 295 -NCELPMKVDRTNMYK--MGPAFSL----------------NNETDIMLEIKKHGPVQAI 335

Query: 263 FALYDDFYHYKSGVYKHT---SNAKLENYLHSGKLIGWGTE----NGTPYWLVINTWGPH 315
             ++ DF+ YKSG+Y+H+   ++A      HS +LIGWG E      T YW+ +N+WG  
Sbjct: 336 MRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTW 395

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +ILRG  EC  E  + A  P
Sbjct: 396 WGENGRFRILRGSNECEIESYVLASLP 422


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 97  VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 154 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 205

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  + KE++ 
Sbjct: 206 MHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGSNDKEVMKELME 254

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 255 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 314

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 315 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347


>gi|308163309|gb|EFO65659.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 309

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 139/322 (43%), Gaps = 54/322 (16%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGD---RKTYD 77
            + A + QI   A TW AG         E L+    +D K    +D P       R  + 
Sbjct: 16  LTQAELRQIQALAPTWKAG-------IPERLKSLTKSDFKRMLSADSPRTQPSMVRPIHV 68

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           PE     PD FD RE++P C  I  V D G C++    +AV AFS RRC+    Q+    
Sbjct: 69  PESEDPAPDHFDFREEYPQC--ITEVIDIGLCSSSWAHSAVDAFSHRRCLTGLDQEATRY 126

Query: 138 STEYVASCCKICRYDDNKSC----SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP 193
           S +Y+ SC           C    + G +   W+F+   G                 +  
Sbjct: 127 SAQYILSCAS------TNGCFGFSTQGDI--AWDFIATTGV---------------PLES 163

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C  +          N+      C + C + ++   +  D +       V  N + +K+ +
Sbjct: 164 CVKYTDY-------NETQSSWPCPSVCNDNSFLEIYKPDGYEG-----VGFNSERLKRAV 211

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
              GP  A FA+Y+DF +Y  G+Y HT   +   +L S +++G+GT + G  YW+V N W
Sbjct: 212 AFRGPMQAMFAVYEDFTYYLEGIYSHTYGNR-AGFL-SVEIVGYGTSDEGQDYWIVKNYW 269

Query: 313 GPHWGDRGTVKILRGKYECAFE 334
           GP WG+ G  +I+RG+ EC  E
Sbjct: 270 GPDWGEDGYFRIVRGQDECQIE 291


>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 207

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 15/202 (7%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +L    +  + Y     YI++IN +A TW AG NF     +E++ + L +   
Sbjct: 4   VLILLSVILFSVYMTEQAYFLEKDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGV 63

Query: 61  YFDQSDRPLPGDRKTYDPE------YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
                  P   + K Y  E          +P +FDAR++W NC TIG + D G C +   
Sbjct: 64  QI-----PSKVNYKMYKSEDENYDNLLGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWA 118

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            A   AF+DR C+ S G  N+ LS E +  CC  C +     C+ G   + W    K G 
Sbjct: 119 LATSSAFADRLCVASNGNFNQLLSAEELTFCCHKCGF----GCNGGYPIKAWERFMKHGL 174

Query: 175 VTGGDYGDRTGCQPSTISPCSH 196
           VTGGDY  R GC+P  + PC +
Sbjct: 175 VTGGDYKSREGCEPYRVPPCPY 196


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 100/340 (29%), Positives = 142/340 (41%), Gaps = 43/340 (12%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           RG +Y     Y D  NR    W AG +        +    L    +Y   + RP      
Sbjct: 108 RGRVYPVLGTYWDNCNR---CWRAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVTS 158

Query: 75  TYDPEY----SATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKS 129
             +          +P  F+A E+WPN   + H P D G CA    F+     SDR  I S
Sbjct: 159 MNEIHTVLGPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHS 215

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
            G     LS + + SC       + + C  G +   W FL +RG V+  D+     C P 
Sbjct: 216 LGHMTPVLSPQNLLSC----DTHNQQGCRGGHLDGAWWFLRRRGVVS--DH-----CYPF 264

Query: 190 TISPCSHHGSAP-TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
           +       G AP  +         K +    C N    R    D ++ T  Y +  +E  
Sbjct: 265 SGRERDEAGPAPRCMMHSRAMGRGKRQATAHCPN---SRVHTNDIYQVTPAYRLGSSEKE 321

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN- 301
           I KE++ +GP  A   +++DF+ Y+ GVY HT  S+ + E Y     HS K+ GWG E  
Sbjct: 322 IMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETL 381

Query: 302 ----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                  YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 382 PDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 421


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 92/292 (31%), Positives = 129/292 (44%), Gaps = 43/292 (14%)

Query: 68  PLPGDRKTYDPEYSAT--VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDR 124
           P+        P   AT  +P+ F A  +WP      H P D   CAA   F+     +DR
Sbjct: 203 PMLLSMNEVTPSLPATTDLPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADR 259

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---- 180
             I+S G+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y    
Sbjct: 260 IAIQSNGRFTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFK 315

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
             + T    +  S     G       C N  + K     +C+ P                
Sbjct: 316 DQNATNNDCAMASRSDGRGKRHATKPCPNN-IEKSNRIYQCSPP---------------- 358

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGK 293
           Y V  NE  I KEI+ +GP  A   ++DDF+HYK G+Y+H  +++ + E Y     H+ K
Sbjct: 359 YRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIK 418

Query: 294 LIGWGTENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           L GWGT  G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 419 LAGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/279 (31%), Positives = 118/279 (42%), Gaps = 52/279 (18%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTP----PCM 310

Query: 196 HH------GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
            H      G      SC N  V                    D ++ T  Y +  N+  I
Sbjct: 311 MHSRAMGRGKRQATASCPNSHVNN-----------------NDIYQVTPVYRLGSNDKEI 353

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-- 301
            KE++ +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E   
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 413

Query: 302 ---GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
                 YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
          Length = 541

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 79/256 (30%), Positives = 118/256 (46%), Gaps = 11/256 (4%)

Query: 79  EYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           E  + +P+ FDARE+WP C   IG   D G C +    A     SDR CI S G+    L
Sbjct: 271 EPPSDLPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERL 330

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHH 197
           +   + SC ++       SC  G     + F  + G  +GG YGD  GC      PC H 
Sbjct: 331 AASEILSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYGDEKGCAAYPFPPCHHP 390

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
                 P+C   K    +C       T        +H   L +  D + D + +EI   G
Sbjct: 391 CHVQPTPACP-LKSDTAQCQGDLDEHTRNEVA---QHIDKLIHCPDGDYDCMAREIYNSG 446

Query: 258 PTTA-TFALYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWGTE-NGTPYWLVINTW 312
           P ++    +YD+FY YK G Y+ +++++     H G   ++IGW  E +GT  W +IN+W
Sbjct: 447 PVSSYAGTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYSWKIINSW 506

Query: 313 GPHWGDRGTVKILRGK 328
             +WG +G  +I  G+
Sbjct: 507 -LNWGKKGHGRIAVGE 521


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           A +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS +
Sbjct: 214 ADLPEIFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
            + SCC   R+     C+ GS+ R W FL KRG V+   Y      + T    +  S   
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N      + + +C+ P                Y V  NE  I +EI+ 
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIQ 369

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N + E Y     H+ KL GWGT  G       
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEK 429

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 118/265 (44%), Gaps = 38/265 (14%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  VPD FD RE++P+C  I  V D G C +   F++V +  DRRC+    ++    S +
Sbjct: 71  ATQVPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVAGLDKKAVRYSPQ 128

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           YV SC +      + +C  G +   W FL K G+ T         C P         G+ 
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLVKTGTTT-------DECVPYQSGSTGARGTC 176

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT            KC      P Y       K    + Y +D   D I K +   GP  
Sbjct: 177 PT------------KCADGSELPIY-------KATKAVDYGLD--CDLIMKALATGGPLQ 215

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
             F +Y DF +Y+ GVY+H          H+ +++G+GT E    YW++ N+WGP WG+ 
Sbjct: 216 TAFTVYSDFMYYQGGVYQHVYGRAEGG--HAVEMVGYGTDEYDVDYWIIRNSWGPDWGED 273

Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
           G  +I+R   EC  E  +  G  +N
Sbjct: 274 GYFRIIRMTNECGIEEQVIGGFFEN 298


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 207 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 263

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 264 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 315

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 316 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 364

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 365 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 424

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 425 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 457


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 171 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 228 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 279

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 280 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 328

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 329 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 388

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 389 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 100/344 (29%), Positives = 138/344 (40%), Gaps = 53/344 (15%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           G +Y     Y D  NR    W AG +        +    L    +Y   + RP       
Sbjct: 109 GRIYPVLGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTIRPSSSVMNM 159

Query: 76  YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
           ++          +P  F+A E+WPN   + H P D G CA    F+     SDR  I S 
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
           G     LS + + SC         + C  G +   W FL +RG V+       G   D  
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           G  P    PC  H  A            K +    C N         D ++ T  Y +  
Sbjct: 273 GPAP----PCMMHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGS 317

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWG 298
           N+  I KE++ +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG
Sbjct: 318 NDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWG 377

Query: 299 TEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            E         YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 378 EETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 81/150 (54%), Gaps = 5/150 (3%)

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI
Sbjct: 1   CEHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 57

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWG 313
             +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W 
Sbjct: 58  YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWN 115

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
             WGD G  KILRG+  C  E  + AG P+
Sbjct: 116 TDWGDNGFFKILRGQDHCGIESEVVAGIPR 145


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  I KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSYVNN---NDIYQVTPVYRLGSNDKEIMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 124/273 (45%), Gaps = 41/273 (15%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDP    ++P  FD+  +WP  G +  + D G C +          SDR  I SKG+
Sbjct: 71  RRIYDPN---SLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGR 125

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
           +   LS +++ SC +       +SC+ G + R W+++ K G V    +            
Sbjct: 126 EKVTLSAQHLLSCDR----RGQQSCNGGYLDRAWSYIRKIGLVDEQCF------------ 169

Query: 193 PCSHHGSAPTLPSCENQKVPKLK--CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
                   P   + E  ++P+        C  PT      + K++    Y V  NE  I 
Sbjct: 170 --------PYSATNEKCRIPRRGDLVTANCQLPT--NVDRRSKYKVAPAYRVG-NETDIM 218

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG----TP 304
            EIL  GP  AT  +Y DF+ YK G+Y+H+  S      Y HS +++GWG E        
Sbjct: 219 YEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGY-HSVRIVGWGEEYSPEGLKK 277

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW V N+WGP WG+ G  +ILRG  EC  E  +
Sbjct: 278 YWKVANSWGPEWGENGYFRILRGSNECEIESFV 310


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 89/302 (29%), Positives = 137/302 (45%), Gaps = 50/302 (16%)

Query: 45  NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
           N++E+  R  LI   +   +S   LP    T   E    +P +FD R+++P C  +    
Sbjct: 7   NVTEDEFRSMLIRPDRLRARSGS-LPPISITEVQELVDPIPPQFDFRDEYPQC--VKPAL 63

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D G+C     F+A+G F DRRC     ++    S +++ SC       +N  C  G    
Sbjct: 64  DQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSL-----ENFGCDGGDFQP 118

Query: 165 TWNFLHKRGSVTGG-----DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL-KCHT 218
           TW+FL   G+ T       DYG              H  ++P    C++    +L K H 
Sbjct: 119 TWSFLTFTGATTAECVKYVDYG--------------HTVASPCPAVCDDGSPIQLYKAHG 164

Query: 219 RCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYK 278
                 YG+              V  +  AI   ++A GP      +Y D  +Y+SGVYK
Sbjct: 165 ------YGQ--------------VSKSVPAIMGMLVAGGPLQTMIVVYADLSYYESGVYK 204

Query: 279 HTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           HT    +    H+ +++G+GT ++GT YW++ N+WGP WG+ G  +I+RG  EC  E  I
Sbjct: 205 HTYGT-INLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEI 263

Query: 338 AA 339
            A
Sbjct: 264 YA 265


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           A +P+ F A  +WP      H P D   CAA   F+     +DR  I+SKG+    LS +
Sbjct: 214 ADLPEIFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
            + SCC   R+     C+ GS+ R W FL KRG V+   Y      + T    +  S   
Sbjct: 271 NLISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N      + + +C+ P                Y V  NE  I +EI+ 
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIQ 369

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N + E Y     H+ KL GWGT  G       
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEK 429

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 155/349 (44%), Gaps = 64/349 (18%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLI----A 57
           IL  LL     +  +   S A + +I     +W A   + F  N++E+  R  LI     
Sbjct: 2   ILALLLAVVCAKPLV---SRAELRRIQALNPSWVAAMPKRF-ENVTEDEFRGMLINPDRL 57

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
            A+       PL       DP     +P +FD R+++P+C  +  V D G+C     F+A
Sbjct: 58  KARSGSMPSAPLKEINDPTDP-----LPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSA 110

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           +G F  RRC     +     S +++ SC       +N  CS G  F TW+FL + G+ T 
Sbjct: 111 IGMFGSRRCAVGIDKAAVLYSQQHLISCST-----ENFGCSGGDFFPTWSFLTQTGATTA 165

Query: 178 G-----DYGDRTGCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQ 231
                 DYG             S   + PT  +C++  ++   K H       YG+    
Sbjct: 166 ECVKYVDYGS------------SVAAACPT--TCDDGSQIQFYKAHG------YGQ---- 201

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
                     V  +  AI + +++ GP      +Y D  +Y  GVY+HT    + N LH+
Sbjct: 202 ----------VSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHT-YGPISNGLHA 250

Query: 292 GKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAA 339
            +++G+GT ++GT YW + N+WG  WG+ G  +I+RG  EC  E  I A
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYA 299


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 81/265 (30%), Positives = 120/265 (45%), Gaps = 38/265 (14%)

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +   PD FD RE++P+C  I  V D G C +   F++V +  DRRC     ++    S +
Sbjct: 71  ATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSPQ 128

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           YV SC +      + +C  G +   W FL K G+ T         C P         G+ 
Sbjct: 129 YVVSCDR-----GDMACDGGWLPSVWRFLTKTGTTT-------DECVPYQSGSTGARGTC 176

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT            KC      P Y       K    + Y +D   D I K +   GP  
Sbjct: 177 PT------------KCADGSDLPIY-------KATKAVDYGLD--CDLIMKALATGGPLQ 215

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDR 319
             F +Y DF +Y+ GVY+HT   ++E   H+ +++G+GT E    YW++ N+WGP WG+ 
Sbjct: 216 TAFTVYSDFMYYEGGVYQHT-YGRVEGG-HAVEMVGYGTDEYDVDYWIIRNSWGPDWGED 273

Query: 320 GTVKILRGKYECAFEYLIAAGKPKN 344
           G  +I+R   EC  E  +  G  +N
Sbjct: 274 GYFRIIRMTNECGIEEQVIGGFFEN 298


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/267 (31%), Positives = 124/267 (46%), Gaps = 37/267 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDA   WP  G IG V D G C +    +     SDR  I SKG++   L+ + + 
Sbjct: 186 LPTHFDATTYWP--GFIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQII 243

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT- 202
           SC +      ++ CS G +   WN++ K G+V    Y   +      I P     +A   
Sbjct: 244 SCVR-----RSQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQNACKIRPSDTLITANCD 298

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           LP+    KV +   +     P +                  +NE  I  EI  HGP  A 
Sbjct: 299 LPT----KVDRTNMYK--MGPAFSL----------------NNETDIMIEIKKHGPVQAI 336

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENY---LHSGKLIGWGTE-NG---TPYWLVINTWGPH 315
             ++ DF+ YKSG+Y+H++ +   +     HS +LIGWG E NG   T YW+ +N+WG  
Sbjct: 337 LRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRW 396

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG+ G  +I+RG+ EC  E  + A  P
Sbjct: 397 WGENGRFRIVRGQNECEIESYVLASLP 423


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 99/339 (29%), Positives = 152/339 (44%), Gaps = 63/339 (18%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYF 62
           +++F L   L+ GE        ++ INR  A TW+A          EY R  +I  A+  
Sbjct: 1   MIIFFL-VVLISGE------PLVNIINRNPAATWSA---------HEYSRD-IITRARLT 43

Query: 63  DQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
             +   + G  + +  E S  VP+ FDAR++WPN   I  V D   C +   F+   +  
Sbjct: 44  LLAPLAI-GPVEKFTIEDSFYVPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLG 100

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNK-SCSHGSVFRTWNFLHKRGSVTGGDYG 181
           DR  I   G+ +  LS + + SC      D N   C+ G    +W ++   G  T     
Sbjct: 101 DRFGILGCGKGH--LSPQDLISC------DSNDLGCNGGYQENSWTWVLTTGITT----- 147

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYW 241
               C P       +   +  +PSC +          RC N +          R T+  +
Sbjct: 148 --ESCWP-------YRSGSGRIPSCPH----------RCVNGSV-------LQRNTINNY 181

Query: 242 VDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN 301
              +   ++ E+  +GP   T+ +Y+DF++Y  G+YKH S  K+    H+  L+GWG E+
Sbjct: 182 RRLDSSELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGG--HAVVLMGWGIED 239

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           G  YWLV N+WG  WG++G  +ILRG  EC  E    AG
Sbjct: 240 GVKYWLVQNSWGYEWGEQGYFRILRGSNECGIESSAYAG 278


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 85/276 (30%), Positives = 125/276 (45%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S+G+    LS + +
Sbjct: 109 LPEFFVASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNL 165

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GS+ R W +L KRG V+   Y           GC  ++ S   
Sbjct: 166 ISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 219

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N      + + +C+ P                Y V  NE  I +EI+ 
Sbjct: 220 GRGKRHATKPCPNNFEKSNRIY-QCSPP----------------YRVSSNETEIMREIMQ 262

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF+HYK+G+Y+H ++   E+        H+ KL GWGT  G       
Sbjct: 263 NGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEK 322

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 323 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 91/191 (47%), Gaps = 8/191 (4%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
           F A  A SDR CI S+G+    +S + V SCC K C       C  G     W +  K G
Sbjct: 5   FGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKC----GNGCEGGYPIEAWKYWVKTG 60

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
             TGG Y  ++GC+P  I PC HH +      C   +     C  +C    Y   +  DK
Sbjct: 61  ICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCI-AAYKTPYSDDK 119

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
           H  T  Y V      I+KEI+ +GP  A + +Y+DFY Y  GVY HT  A++    H+ +
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGG--HAVR 177

Query: 294 LIGWGTENGTP 304
           ++GWG     P
Sbjct: 178 ILGWGVRQQDP 188


>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Nomascus leucogenys]
          Length = 436

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 138/344 (40%), Gaps = 53/344 (15%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           G +Y     Y D  NR    W AG +        +    L    +Y   + RP       
Sbjct: 109 GRIYPVLGTYWDNCNR---CWQAGNH------SAFWGMTLDEGIRYRLGTMRPSSSVMNM 159

Query: 76  YDP----EYSATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSK 130
           ++          +P  F+A E+WPN   + H P D G CA    F+     SDR  I S 
Sbjct: 160 HEIYTVLNPGEVLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL 216

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRT 184
           G     LS + + SC         + C  G +   W FL +RG V+       G   D  
Sbjct: 217 GHMTPVLSPQNLLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEA 272

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
           G  P    PC  H  A            K +    C N         D ++ T  Y +  
Sbjct: 273 GPAP----PCMMHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGS 317

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWG 298
           N+  + KE++ +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG
Sbjct: 318 NDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWG 377

Query: 299 TEN-----GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
            E         YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 378 EETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421


>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
 gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
          Length = 467

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 86/270 (31%), Positives = 118/270 (43%), Gaps = 33/270 (12%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 201 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 257

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY---GDRTGCQPSTISPCSHHG 198
           + SC         K C  G +   W FL  RG V+   Y   G     + S    C  H 
Sbjct: 258 LLSC----DTHHQKGCRGGRLDGAWWFLRCRGVVSDNCYPFSGREQNDEASPTPRCMMHS 313

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
            A            K +  +RC N         D ++ T  Y +  +E  I KE++ +GP
Sbjct: 314 RA--------MGRGKRQATSRCPNSHVDS---NDIYQVTPVYRLASDEKEIMKELMENGP 362

Query: 259 TTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWL 307
             A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW 
Sbjct: 363 VQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLI 337
             N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 423 AANSWGPWWGERGHFRIVRGTNECDIETFV 452


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 86/268 (32%), Positives = 122/268 (45%), Gaps = 30/268 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+  D+     C P +       G AP
Sbjct: 259 LLSC----DTHNQQGCRGGHLDGAWWFLRRRGVVS--DH-----CYPFSGRERDEAGPAP 307

Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             +         K +    C N    R    D ++ T  Y +  +E  I KE++ +GP  
Sbjct: 308 RCMMHSRAMGRGKRQATAHCPN---SRVHTNDIYQVTPAYRLGSSEKEIMKELMENGPVQ 364

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
           A   +++DF+ Y+ GVY HT  S+ + E Y     HS K+ GWG E         YW   
Sbjct: 365 ALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 424

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
           N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 425 NSWGPAWGERGHFRIVRGANECDIESFV 452


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 88/274 (32%), Positives = 123/274 (44%), Gaps = 42/274 (15%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ F A  +WP  G      D   CAA   F+     +DR  I+S G+    LS + + 
Sbjct: 217 LPEFFIASYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLI 274

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSHH 197
           SCC   RY     CS GS+ R W +L KRG V+        D     GC  ++ S     
Sbjct: 275 SCCLKHRY----GCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMASRS--DGR 328

Query: 198 GSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
           G       C N  + K     +C+ P                Y V  NE  I KEI+ +G
Sbjct: 329 GKRHATTPCPNN-IEKSNRIYQCSPP----------------YRVSSNETQIMKEIMKNG 371

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNA--KLENY----LHSGKLIGWGTENGT-----PYW 306
           P  A   +++DF++YK+G+Y+H ++     E Y     H+ KL GWGT  G       +W
Sbjct: 372 PVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFW 431

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 432 IAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
          Length = 210

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 71/197 (36%), Positives = 97/197 (49%), Gaps = 12/197 (6%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTY 76
             +  SD  I+ IN++  TW AGRNF  N+   YL++              P   +R  +
Sbjct: 22  SFHPLSDDMINYINKQNTTWQAGRNF-YNVDISYLKKLC---GTVLGGPKLP---ERVGF 74

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
             + +  +P+ FDAREQW NC TI  + D G+C +   F AV A SDR CI + G+ N  
Sbjct: 75  SEDIN--LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVE 132

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           +S E + +CC I   D    C+ G     WNF  ++G V+GG Y    GC P TI PC H
Sbjct: 133 VSAEDLLTCCGIQCGD---GCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189

Query: 197 HGSAPTLPSCENQKVPK 213
           H +    P       PK
Sbjct: 190 HVNGSRPPCTGEGDTPK 206


>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Nomascus leucogenys]
          Length = 467

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 118/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D  G  P    PC 
Sbjct: 259 LLSC----DTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N         D ++ T  Y +  N+  + KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATAHCPNSHVNN---NDIYQVTPVYRLGSNDKEVMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  S  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 98/188 (52%), Gaps = 8/188 (4%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGS 174
            ++  A SDR CI SKG +   +S + + SCC  C Y     C  G   + W F  + G 
Sbjct: 5   VSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGY----GCDGGWPIKAWQFFAREGV 60

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSC-ENQKVPKLKCHTRCTNPTYGRGFFQDK 233
           VTGG+YG +  C+P  I+PC HHG  P    C ++ + P+  C  +C +  Y   + +DK
Sbjct: 61  VTGGNYGRQGCCRPYEITPCGHHGREPYYGECYDDAQTPR--CKRKCQS-GYKTTYKKDK 117

Query: 234 HRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGK 293
                 Y + ++  AI++EI+ HGP  A + +Y+DF +Y  G+YKHT+  +   +     
Sbjct: 118 RYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVKNN 177

Query: 294 LIGWGTEN 301
            +G G  N
Sbjct: 178 WMGQGKGN 185


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 90/286 (31%), Positives = 125/286 (43%), Gaps = 45/286 (15%)

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
           QS R +    + Y+P     +P  FD+R QW N   I  V D G C A    + V   SD
Sbjct: 217 QSTRQMLPVTRHYNPN---DLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASD 271

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY--- 180
           R  I SKG +   LS +++ SC         + C  G + R W F+ K G V    Y   
Sbjct: 272 RFAIMSKGIEKVQLSGQHLISC----NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWL 327

Query: 181 -GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
            G    C+       S  G       C+ +    L+       P Y  G           
Sbjct: 328 SGRSDKCRIPRRGKLSDAG-------CQRRNSYNLRNEMYKVGPAYRLG----------- 369

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTS--NAKLENYLHSGKLIGW 297
                NE  I +EIL  GP  AT  ++ DF+HY+SG+Y H+   + +   Y HS +++GW
Sbjct: 370 -----NETDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGY-HSVRIVGW 423

Query: 298 GTE----NGTP--YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           G E    NG P  +W V N+WG  WG+ G  +I+RG  EC  E  +
Sbjct: 424 GEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFV 469


>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 203

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 68/190 (35%), Positives = 95/190 (50%), Gaps = 24/190 (12%)

Query: 154 NKSCSHGSVFRTWNFLHKRGSVTGGDY------GDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +K C+ G+     +FL   G VTG D+       +  GC P     C+H    PT    E
Sbjct: 9   SKGCNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNH---VPT----E 61

Query: 208 NQKVPKLK---------CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           N + PK K         C T CTN  Y +   +D HR      V ++  +IK+EI  +GP
Sbjct: 62  NSEYPKCKDVAHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGP 121

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
             + F +Y+DF +YKSGVY  T+   L    H  K+IGWG ++   YWL +N+W   WGD
Sbjct: 122 VFSAFKMYEDFRYYKSGVYVPTTKEVLS--FHLVKIIGWGADSVQEYWLAMNSWNEEWGD 179

Query: 319 RGTVKILRGK 328
            G +K+  GK
Sbjct: 180 HGLIKMAFGK 189


>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 359

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 86/257 (33%), Positives = 117/257 (45%), Gaps = 26/257 (10%)

Query: 84  VPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P  FDAR++WP C   IG V D G C +    A     +DR CI S G + R LS +Y 
Sbjct: 105 LPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRELSPQYP 164

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG-DRTGCQPSTISPCSHHGSAP 201
            SC     YD    C  G V    +    +G V GG     +T C P    PC H     
Sbjct: 165 LSC-----YDGGSGCQGGDVAVAMHEATTKGMVFGGMLNRSKTACLPYEFEPCEH----- 214

Query: 202 TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL----TYWVDDNEDA-IKKEILAH 256
               C+ Q V   +C     + T     F+   +        Y    N+ A I +EI+ +
Sbjct: 215 ---PCQVQGVIPHECPAHVDDGTCLGNTFKLADQKVFPKSDVYTCPPNDWACIAQEIMTY 271

Query: 257 GPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGTENGT--PYWLVINT 311
           GP   TF  ++ DFY Y +GVY      K E  L  H+ KLIGWG +  T  PYWL++N+
Sbjct: 272 GPVAVTFGTVHSDFYGYHAGVYTVREEDKNEEGLGMHATKLIGWGFDEATGHPYWLMMNS 331

Query: 312 WGPHWGDRGTVKILRGK 328
           W  +WG  G  ++  G+
Sbjct: 332 W-DNWGIHGLGRVGVGE 347


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S+G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GSV R W +L KRG V+   Y           GC  ++ S   
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I +EI+ 
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N   E Y     H+ KL GWGT  G       
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 82/266 (30%), Positives = 129/266 (48%), Gaps = 36/266 (13%)

Query: 84  VPDRFDAREQWPNCGTIGH-VPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ FDAR++W   G + H V D G C +    +     SDR  I S+G+ N  LS++ +
Sbjct: 184 LPEHFDARDKW---GPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQL 240

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            SC         K C  G + R W ++ K G V  GD+     C P  +S  S       
Sbjct: 241 LSC----NQHRQKGCEGGYLDRAWWYIRKLGVV--GDH-----CYP-YVSGQSREPGHCL 288

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           +P  +      L+C +   + T          + T  Y V   E+ I+ E++ +GP  AT
Sbjct: 289 IPKRDYTNRQGLRCPSGSQDST--------AFKMTPPYKVSSREEDIQTELMTNGPVQAT 340

Query: 263 FALYDDFYHYKSGVYKHT-------SNAKLENYLHSGKLIGWGTENGT----PYWLVINT 311
           F +++DF+ Y  GVY+H+       +++  E Y HS +++GWG ++ T     YWL  N+
Sbjct: 341 FVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGY-HSVRVLGWGVDHSTGKPIKYWLCANS 399

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLI 337
           WG  WG+ G  K+LRG+  C  E  +
Sbjct: 400 WGTQWGEDGYFKVLRGENHCEIESFV 425


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 91/298 (30%), Positives = 124/298 (41%), Gaps = 45/298 (15%)

Query: 45  NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
           N++   LR  L   +      D P     +  + E    +P  FDAR QW  C  +  + 
Sbjct: 106 NMTISQLRDNLFGLSLMSSDEDTP-----RMANIETRIDIPMNFDARTQWKGC--VPAIR 158

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D   C A   F+A    + R CI + GQ N  LS EY   C  +     NK+C  G +  
Sbjct: 159 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 213

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQP-STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP 223
           +W FL   G+           C P ++       G+ PT     +  + K K      N 
Sbjct: 214 SWTFLENTGT-------PLDSCIPYASGRGTFSSGTCPTQCKIASMSMSKYKAK----NT 262

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            Y  G                  + IK  I+ +G   A F +Y D   YKSGVYKH  N 
Sbjct: 263 VYISGI-----------------NNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENT 305

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
            L    H+  LIG+G E G+ YWL  N+WGP+WG  G  KI +G  E   E  + AG+
Sbjct: 306 VLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGE 359


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 95/200 (47%), Gaps = 8/200 (4%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRG 173
            +A    SDR CI S  +    +S + + +CC  +C       C+ G     W    K+G
Sbjct: 5   VSAAETISDRICIASNAKTILSISADDINACCGMVC----GNGCNGGYPIEAWRHYVKKG 60

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDK 233
            VTGG Y D+TGC+P    PC HH +      C +   P  +             + +D 
Sbjct: 61  YVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDL 120

Query: 234 HRTTLTYWVDDNEDA-IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG 292
           H  T+ +     E A I K I  HG       +++DF HY  GVY HT+ A L    H+ 
Sbjct: 121 HFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGG--HAV 178

Query: 293 KLIGWGTENGTPYWLVINTW 312
           K++GWG +NGTPYWL+ N+W
Sbjct: 179 KMLGWGVDNGTPYWLIANSW 198


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/150 (42%), Positives = 84/150 (56%), Gaps = 6/150 (4%)

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
           PC H  SA   P C N+     +C  +C NP YG  + +D H+ T  Y +        KE
Sbjct: 1   PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGT-QYRIPGY--TAMKE 56

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I  +GP TA+F +Y DF +Y+SGVY   S   +     + K++GWG ENGTPYWL  N++
Sbjct: 57  IYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTT--QAVKILGWGEENGTPYWLAANSF 114

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             +WGD G VKILRG  EC  E  + AG P
Sbjct: 115 NTYWGDNGFVKILRGANECYIEEFMYAGLP 144


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 82/272 (30%), Positives = 124/272 (45%), Gaps = 39/272 (14%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
           R+ YDP    ++P  FD+  +WP  G +  + D G C +          SDR  I SKG+
Sbjct: 197 RRIYDPN---SLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGR 251

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
           +   LS +++ SC +       +SC+ G + R W+++ K G V    +            
Sbjct: 252 EKVTLSAQHLLSCDR----RGQQSCNGGYLDRAWSYIRKIGLVDEQCF------------ 295

Query: 193 PCSHHGSAPTLPSCENQKVPKLK--CHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
                   P   + E  ++P+        C  PT      + K++    Y V  NE  I 
Sbjct: 296 --------PYSATNEKCRIPRRGDLVTANCQLPTNVDR--RSKYKVAPAYRVG-NETDIM 344

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENG----TPY 305
            EIL  GP  AT  +Y DF+ YK G+Y+H+  +  +    HS +++GWG E        Y
Sbjct: 345 YEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKY 404

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           W V N+WGP WG+ G  +ILRG  EC  E  +
Sbjct: 405 WKVANSWGPEWGENGYFRILRGSNECEIESFV 436


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 126/276 (45%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S+G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GSV R W +L KRG V+   Y           GC  ++ S   
Sbjct: 274 ISCCAKKRH----GCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I +EI+ 
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N   E Y     H+ KL GWGT  G       
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHGQKEK 430

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 118/262 (45%), Gaps = 23/262 (8%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A E+WP  G +    D G CA    F+     SDR  I+S G   + LS + + 
Sbjct: 236 LPSHFNAAEKWP--GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLL 293

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC       +   C  G V   W +L +RG V+         C P T    + H SAP +
Sbjct: 294 SC----DTRNQHGCRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTNGH-SAPCM 341

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
               +    K +    C N  Y      + +++T  Y +  +E  I KE+  +GP  A  
Sbjct: 342 MQSRSMGRGKRQATNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIM 398

Query: 264 ALYDDFYHYKSGVYKHTSNAKLE------NYLHSGKLIGWGTENGTPYWLVINTWGPHWG 317
            +++DF+ YKSG+Y+HT   + E      +  HS K+ G        YWL  N+WG  WG
Sbjct: 399 EVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITGGRDGQTHKYWLAANSWGRDWG 458

Query: 318 DRGTVKILRGKYECAFEYLIAA 339
           + G  +I RG+ EC  E  I  
Sbjct: 459 EDGYFRIARGENECEIETFIVG 480


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 137/323 (42%), Gaps = 36/323 (11%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           F+   ID +N++ +      N+     + +     +   K  ++S         T D + 
Sbjct: 26  FTKDMIDSLNQDPSVKWEAANYDQFAGKSFAELRKLLGGKRGEESSSE-EARYNTRDVKS 84

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +  +PD FD+R +WP C  I  + + G C +   FA  G FSDR CI +    N  +S E
Sbjct: 85  TVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVVISPE 142

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           ++  C K      + +C  G  + +W F    G            C P T     +    
Sbjct: 143 FLIECDKT-----SFACQGGYGYYSWKFFMNTGI-------PLESCVPYTKDSLVYG--- 187

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                         +C + CT+     G     ++    Y++       + EI+ +GP  
Sbjct: 188 ---------NTTNAQCRSTCTD-----GSPLKLYKAASAYYIYSPITNYQTEIMTNGPVE 233

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NGTPYWLVINTWGPHWGDR 319
           A F +Y DFY YKSG+Y+ T+ +      H+ K++GW ++ NGTPYW+  N WG  WG  
Sbjct: 234 ADFDVYSDFYSYKSGIYQKTAGSTYVG-GHAVKVLGWASDSNGTPYWIAQNQWGTSWGMG 292

Query: 320 GTVKILRGK--YECAFEYLIAAG 340
           G   I RG     C F+  + AG
Sbjct: 293 GYFYIYRGNSTLNCKFDNYMIAG 315


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 140/324 (43%), Gaps = 54/324 (16%)

Query: 21  FSDAYIDQINREANTWTAGRNFPAN-LSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++  + INR  N+     ++PA+ +S E LR  L   A++     RP     K     
Sbjct: 10  LAESIPETINRNPNSTWVAIDYPASVISHEKLRSKL--GARFTPHRVRPYRDSNK----- 62

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
               VPD FDARE+WP+   I  V D G C +   F+      DR  +   G     ++ 
Sbjct: 63  ----VPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDR--LGVLGCSRGDIAP 114

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           E + SC     +DD   C  G +   W++  + G  T                 C  + +
Sbjct: 115 EDLVSCDI---FDDG--CDGGFIDMAWDWCQENGLTT---------------EECIPYKA 154

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
              +PS          C   C +   G   +    RT +  +   + D I+ EI  +GP 
Sbjct: 155 GEGVPS---------PCPETCED---GSAIY----RTPIESYRYIDADDIQGEIYEYGPV 198

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
           +  F +Y DF  YKSGVY H   A      H+  ++GWG E+  PYWLV N+WG  WG+ 
Sbjct: 199 SMGFIVYSDFMSYKSGVYVH--QAGYIEGGHAVLIVGWGVEDEVPYWLVQNSWGTDWGEN 256

Query: 320 GTVKILRGKYECAFEYLIAAGKPK 343
           G  KILRG   C  E  + AG P+
Sbjct: 257 GFFKILRGSDHCECESNVTAGYPE 280


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/189 (34%), Positives = 92/189 (48%), Gaps = 9/189 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A SDR CI S G  N+ LS   + SCC+ C +     C  G     W++    G V
Sbjct: 1   GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCENCGF----GCRGGYPAVAWDYWKTHGIV 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGG   D +GC+      C HH      P C  +  P  +C  +C  P  G  + +DK R
Sbjct: 57  TGGSKEDPSGCRSYPFPKCEHHVQG-HYPPCPRELYPTPECVQQCDTPDVG--YLEDKTR 113

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
             ++Y +  +E +I KEI+  GP  A F +Y+DF  Y SGVY H   A +    H+ +++
Sbjct: 114 ANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSG--HAVRIL 171

Query: 296 GWGTENGTP 304
           GWG     P
Sbjct: 172 GWGELGNVP 180


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/268 (31%), Positives = 120/268 (44%), Gaps = 30/268 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEAAEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC       + + C  G +   W FL +RG V+  D+     C P         G AP
Sbjct: 259 LLSC----DTHNQQGCRGGRLDGAWWFLRRRGVVS--DH-----CYPFVGREQDEAGPAP 307

Query: 202 -TLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             +         K +   RC +         D ++ T  Y +  NE  I KE++ +GP  
Sbjct: 308 RCMMHSRAMGRGKRQATARCPSSHV---HANDIYQVTPAYRLGTNEKEIMKELMENGPVQ 364

Query: 261 ATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLVI 309
           A   +++DF+ Y+ G+Y HT  S  + E Y     HS K+ GWG E         YW   
Sbjct: 365 ALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAA 424

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLI 337
           N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 425 NSWGPAWGERGHFRIVRGANECDIESFV 452


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/265 (31%), Positives = 128/265 (48%), Gaps = 44/265 (16%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A VP  FD+R +WP+C  +  + +   C +   F+A    SDR CI S G+ +  LS +Y
Sbjct: 12  AAVP-AFDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQY 68

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAP 201
           + SC        +  C  G +   W FL   G  +     D+  C P T    S +G   
Sbjct: 69  MVSC-----DSTDYGCDGGYLNNAWAFLAGTGIPS-----DK--CAPYT----SQNGDVA 112

Query: 202 TLPS-CENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
             PS C++    KL               ++ K+   L     ++  +I +++  +GP  
Sbjct: 113 ACPSKCQDGSSVKL---------------YKAKNPQQL-----NDIPSIMEDMQQNGPVQ 152

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT--PYWLVINTWGPHWGD 318
           A F++Y DF  YKSGVY H S + L    H+ K++GWG ++ T  PYW++ N+WGP WG 
Sbjct: 153 AAFSVYRDFMSYKSGVYHHVSGSLLGG--HAIKMVGWGVDSATNKPYWIIANSWGPSWGL 210

Query: 319 RGTVKILRGKYECAFEYLIAAGKPK 343
            G   ILRG  EC  E  + +G+ +
Sbjct: 211 NGFFWILRGSDECGIEDNVWSGQAQ 235


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 111/225 (49%), Gaps = 20/225 (8%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDA E WP C TI  + D  +C +    AA  A SDR C    G ++  +S   + SCC 
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +C Y     C+ G     W +    G V+  +Y     CQP     C+HH ++  L  C 
Sbjct: 60  VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +      C++ CT+    +     K+R   +Y +   E++ K+E+L +GP   +F++Y 
Sbjct: 109 GE-YDTPTCNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGPFEVSFSVYA 162

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W
Sbjct: 163 DFLAYTGGVYKHVAGTFLGG--HAVRIVGWGELNGEPYWKIANSW 205


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 111/225 (49%), Gaps = 20/225 (8%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDA E WP C TI  + D  +C +    AA  A SDR C    G ++  +S   + SCC 
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +C Y     C+ G     W +    G V+  +Y     CQP     C+HH ++  L  C 
Sbjct: 60  VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +      C++ CT+    +     K+R   +Y +   E++ K+E+L +GP   +F++Y 
Sbjct: 109 GE-YDTPTCNSTCTD----KKVPLIKYRGNTSYLLS-GEESFKRELLLNGPFEVSFSVYA 162

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W
Sbjct: 163 DFLAYTGGVYKHVAGIFLGG--HAVRIVGWGELNGEPYWKIANSW 205


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/267 (32%), Positives = 129/267 (48%), Gaps = 47/267 (17%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR++WP+   I  + D G CA+    +     +DR  + ++G+QN  LS +   
Sbjct: 80  LPTSFDARQKWPD--FIHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFL 137

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISP--C----SHH 197
           SC +       K C  G + R W ++ K G V+   Y   +G   +T  P  C    S H
Sbjct: 138 SCNQ----HRQKGCEGGYLDRAWWYIRKFGVVSEECYPYISG---TTRKPEICYMQKSKH 190

Query: 198 GSAPTLPSCE-NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            +    PS   N +V                      +RTT +Y V   E  I  EIL +
Sbjct: 191 ANGRQCPSGHPNSRV----------------------YRTTPSYRVSSREQDIMSEILTN 228

Query: 257 GPTTATFALYDDFYHYKSGVYKH--TSNAKLENYLHSGKLIGWGTE--NGTP--YWLVIN 310
           GP  ATF ++ DF  + +GVYKH  T   ++E Y HS +L+GWG +   G P  YW+  N
Sbjct: 229 GPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGY-HSVRLLGWGEDYSTGIPVKYWIAAN 285

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLI 337
           +WG +WG+ GT +ILRG+  C  E  +
Sbjct: 286 SWGTNWGENGTFRILRGENHCEIESFV 312


>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
          Length = 357

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 127/288 (44%), Gaps = 45/288 (15%)

Query: 49  EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
           ++ + +  +DAK+   +     G +    PE    +P+ ++ RE  P C     + + G 
Sbjct: 97  DFFKDWKFSDAKFIFNNHLTFKG-KIPQCPESGVIIPESYNFREVQPECAQ--PIYNQGN 153

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS--CSHGSVFRTW 166
           C++ +  AAV A SDR C    G+    LS +   SC       DNK+  C  GSV R  
Sbjct: 154 CSSSYSIAAVSATSDRLCKVRNGEFQDQLSPQSPISC-------DNKNYRCGGGSVTRVL 206

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
               K+G VT       T C P T +  +         +CE  K+               
Sbjct: 207 EVGKKQGFVT-------TSCLPYTGTEDAKDNCDALFTNCEKYKI--------------- 244

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLE 286
               QD       Y V  +E+ IK+EIL +GP  A   ++ DF  YK G+Y+    +   
Sbjct: 245 ----QD-------YCVISSEENIKREILNNGPVVAVIQVFKDFLVYKGGIYEVVEGSSKF 293

Query: 287 NYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
            Y H+ K+IGWG ++G  YW++ N+WG  WG +G   +  G+ +   E
Sbjct: 294 QYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLE 341


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/332 (30%), Positives = 141/332 (42%), Gaps = 56/332 (16%)

Query: 26  IDQINREANT-WTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSAT 83
           I+QIN + ++ WTAG       ++ +  R  ++      D S+ P+    K +       
Sbjct: 38  IEQINSDKDSLWTAGETEIFKGMTMKEFRSSMLGLRLDRDYSEVPV----KVHSSTALKD 93

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ F+  E WPN   +  + D   C +   FAA    SDR  I S G  N+ LS E + 
Sbjct: 94  LPESFNCYENWPN--YMHPIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLV 151

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC K      +  C  G + + W++L   G VT         C P      +  G AP+ 
Sbjct: 152 SCDK-----GDMGCQGGYLDKAWDYLKTNGIVT-------ESCFPYA----AQKGVAPS- 194

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                       C   C +     G    K++ +  Y +   ED I KEI  +GP  A F
Sbjct: 195 ------------CRISCVD-----GEPYKKYKASDYYQLTTEED-IMKEIYLNGPVEAGF 236

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-------NGTPYWLVINTWGPHW 316
            +Y  F  YKSGVY H     +E   H+ K++GWG E         T YW+  N+W   W
Sbjct: 237 RVYTSFMSYKSGVYHHRILDIMEGG-HAIKIVGWGVEPPKRFWQKPTKYWICANSWTADW 295

Query: 317 GDRGTVKILRGK-----YECAFEYLIAAGKPK 343
           G  G  KI RGK      EC  E  + AG PK
Sbjct: 296 GMNGFFKIRRGKNRFGQSECGIEDQVFAGHPK 327


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/225 (33%), Positives = 106/225 (47%), Gaps = 21/225 (9%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDA E WPNC TI  + D   C +    AA  A SDR C +  G ++  +S   + SCC 
Sbjct: 1   FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRG-GVRDLRISAGDLLSCCN 59

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
            C       C+ G     W +  + G V+         CQP    PC+HH ++     C 
Sbjct: 60  AC----GLGCNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPCS 108

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +      C+  CTN          K++  ++Y +   ED  K+E+  +GP    F +Y+
Sbjct: 109 VEYDTPF-CNITCTNT-----IPPIKYKGRISYSLSGEED-YKRELFLYGPFEVAFTVYE 161

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           DF  Y  GVYKH S   L    H+ +L+GWG  NGTPYW + N+W
Sbjct: 162 DFVAYSDGVYKHFSGNALGG--HAVRLVGWGNLNGTPYWKIANSW 204


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 120/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 171 ALPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 227

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D+ G  P    PC 
Sbjct: 228 LLSC----NTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAP----PCM 279

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N   G     + ++ T  Y +  N+  I KE++ 
Sbjct: 280 MHSRA--------MGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELME 328

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  +  + E Y     HS K+ GWG E         
Sbjct: 329 NGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLK 388

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 389 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 421


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 120/273 (43%), Gaps = 40/273 (14%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 ALPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT------GGDYGDRTGCQPSTISPCS 195
           + SC         + C  G +   W FL +RG V+       G   D+ G  P    PC 
Sbjct: 259 LLSC----NTHHQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAP----PCM 310

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
            H  A            K +    C N   G     + ++ T  Y +  N+  I KE++ 
Sbjct: 311 MHSRA--------MGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELME 359

Query: 256 HGPTTATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTP 304
           +GP  A   +++DF+ YK G+Y HT  +  + E Y     HS K+ GWG E         
Sbjct: 360 NGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLK 419

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLI 337
           YW   N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 420 YWTAANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 154/350 (44%), Gaps = 64/350 (18%)

Query: 4   ILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAG--RNFPANLSEEYLRQFLI----A 57
           IL  LL     +  +   S A + +I      W A   + F  N++E+  R  LI     
Sbjct: 2   ILALLLAVVCAKPLV---SRAELRRIQALNPPWVAAMPKRF-ENVTEDEFRGMLINPDRL 57

Query: 58  DAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
            A+       PL       DP     +P +FD R+++P+C  +  V D G+C     F+A
Sbjct: 58  KARSGSMPSAPLKEINDPTDP-----LPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSA 110

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
           +G F  RRC     +     S +++ SC       +N  CS G  F TW+FL + G+ T 
Sbjct: 111 IGMFGSRRCAVGIDKAAVLYSQQHLISCST-----ENFGCSGGDFFPTWSFLTQTGATTA 165

Query: 178 G-----DYGDRTGCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQ 231
                 DYG             S   + PT  +C++  ++   K H       YG+    
Sbjct: 166 ECVKYVDYGS------------SVAAACPT--TCDDGSQIQFYKAHG------YGQ---- 201

Query: 232 DKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHS 291
                     +  +  AI + +++ GP      +Y D  +Y  GVY+HT    + N LH+
Sbjct: 202 ----------LSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHT-YGPISNGLHA 250

Query: 292 GKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
            +++G+GT ++GT YW + N+WG  WG+ G  +I+RG  EC  E  I A 
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
          Length = 220

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 115/235 (48%), Gaps = 19/235 (8%)

Query: 35  TWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQW 94
           TW A +NFP N   E + + L+   +    +  P+  +   Y    +  VP+ FD+R +W
Sbjct: 1   TWKAKQNFPENTPREDIVR-LLGSKRLLGLNKSPIKENDILYVD--NGEVPEFFDSRLEW 57

Query: 95  PNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDN 154
            NC TIG V + G C +       GAF+DR CI + G+ N  +S E +  CC  C +   
Sbjct: 58  KNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELTFCCHTCGF--- 114

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH----HGSAPTLPSCENQK 210
             C+ G+  + W +  + G VTGG+Y    GCQPS + PC      H S    P+  N K
Sbjct: 115 -GCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEGHNSCSGQPTERNHK 173

Query: 211 VPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
             K KC+   T       + ++ ++T   Y++ +    ++K+ + +GP  A+F +
Sbjct: 174 CSK-KCYGDET-----INYKKNHYKTKDAYYLSNT--TMQKDTMVYGPIEASFDV 220


>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
          Length = 236

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/240 (30%), Positives = 112/240 (46%), Gaps = 14/240 (5%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           L  +     +  + Y     +ID IN +A TW AG NF    S+E++ + L   ++    
Sbjct: 3   LSVIFVSVYMTEQAYFLEKDFIDNINEQATTWKAGVNFDPKTSKEHIMKLL--GSRGVQI 60

Query: 65  SDRPLPGDRKTYDPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
            ++      K+ D +Y+ T +P  FDAR +W +C TIG V D G C +    A   AF+D
Sbjct: 61  PNKNNMNLYKSEDADYNNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFAD 120

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
           R C+ +    N  LS E +  CC  C +     C+ G   + W    K+G VTGGDY   
Sbjct: 121 RLCVATNADFNELLSAEEITFCCHTCGF----GCNGGYPIKAWKRFSKKGLVTGGDYKSG 176

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTTLTYW 241
            GC+P  + PC +        +C  +    ++ + RCT   YG     F + HR T  Y+
Sbjct: 177 EGCEPYRVPPCPNDDQGNN--TCAGK---PMESNHRCTRMCYGDQDLDFDEDHRYTRDYY 231


>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
          Length = 358

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 80/286 (27%), Positives = 123/286 (43%), Gaps = 41/286 (14%)

Query: 49  EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
           ++ + +  +DAK+   +     G  +   PE    +P+ ++ RE  P C    +    G 
Sbjct: 97  DFFKDWKFSDAKFIFNNHLTFKGKIQQC-PESGVIIPESYNFREAQPECAQPIYF--QGN 153

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C++ +  AAV A SDR C    G+    LS +   SC      D N  C  GSV R    
Sbjct: 154 CSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPISC-----DDKNYKCGGGSVTRVLEV 208

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
             K+G V+       T C P + +  + +       +CE     K K H  C        
Sbjct: 209 GKKQGFVS-------TSCLPYSGTEDAKNNCDALFSNCE-----KYKIHDYC-------- 248

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
                        V   E+ IK+EIL +GP  A   ++ DF  YK GVY+    +    Y
Sbjct: 249 -------------VVSGEENIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQY 295

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
            H+ K+IGWG ++G  YW++ N+WG  WG +G   +  G+ +   E
Sbjct: 296 GHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLE 341


>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
          Length = 218

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 115/233 (49%), Gaps = 19/233 (8%)

Query: 38  AGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNC 97
           A +NFP N  +E + + L+   +       P+  + + Y    ++ VP+ FD+R +W  C
Sbjct: 1   AKQNFPENTPKEQIVR-LLGSKRLLGVPKSPIKENDEFYMD--NSEVPEFFDSRLEWKYC 57

Query: 98  GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSC 157
            TIGHV + G C +       GAF+DR C+ + G+ N+ +S E V  CC  C +     C
Sbjct: 58  KTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEVTFCCHRCGF----GC 113

Query: 158 SHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH----HGSAPTLPSCENQKVPK 213
           + G+  R W +  + G VTGGDY    GCQP  + PC      H S    P+  N K  K
Sbjct: 114 NGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDKGHNSCSGQPTERNHKCSK 173

Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALY 266
            KC+   T       +  D ++T   Y++ +    ++K+ + +GP  A+F +Y
Sbjct: 174 -KCYGDDT-----VDYKSDHYKTKDAYYLSNT--TMQKDTMVYGPIEASFDVY 218


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 72/119 (60%), Gaps = 2/119 (1%)

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
           Y   +  DK    + Y V  N++AI KE++ HGP    F +Y DF +YKSGVY+H S A 
Sbjct: 2   YNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGAL 61

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           L    H+ +L+GWG EN  PYWL+ N+W   WGD G  KI+RGK EC  E  + AG PK
Sbjct: 62  LGG--HAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPK 118


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 86/276 (31%), Positives = 127/276 (46%), Gaps = 41/276 (14%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           A +P+ F +  +WP      H P D   CAA   F+     +DR  I+S+G+    LS +
Sbjct: 214 ADLPEVFISSYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQ 270

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTGCQPSTISPCS 195
            + SCC   R+     C+ GS+ R W FL KRG V+   Y      + T    +  S   
Sbjct: 271 NLISCCAKKRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSD 326

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N      + + +C+ P                Y V  NE  I +EI+ 
Sbjct: 327 GRGKRHATKPCPNSFEKSNRIY-QCSPP----------------YRVSSNETEIMREIIR 369

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENYL----HSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N + E Y     H+ KL GWGT  G       
Sbjct: 370 NGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEK 429

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 430 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 144/324 (44%), Gaps = 59/324 (18%)

Query: 23  DAYIDQINREANTWTAGRNFP-ANLSEEYLRQFLIAD-----AKYFDQSDRPLPGDRKTY 76
           + +++ +N+E  +W AG N   A ++   ++  L AD     A+Y         G+ ++ 
Sbjct: 280 EQHVNYLNQEEMSWKAGVNERFAGMTYADVKGLLGADTSPHIAEYL--------GETRSQ 331

Query: 77  DPEYSAT-VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
           D   + T VP  F+A  QW   G +  + D   C +   F+A    SDR  I    Q N+
Sbjct: 332 DFYDNITDVPSEFNAVTQWK--GLVQPIRDQQQCGSCWAFSAAEVLSDRNAI----QHNK 385

Query: 136 P---LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
               LS E + SC ++     ++ C+ G++   W +L   G VT         C P T  
Sbjct: 386 AEPVLSPEDLVSCDRV-----DQGCNGGNLGTAWTYLKNTGIVT-------DACFPYT-- 431

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKE 252
             +  G AP             KC T C +     G    K++    Y V+  E+ ++KE
Sbjct: 432 --AGGGDAP-------------KCETSCKD-----GSSWTKYKAASAYAVNGVEN-MQKE 470

Query: 253 ILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           I+ HGP    F +Y  F  YKSGVY       +    H+ K++GWGTE G  YWLV N+W
Sbjct: 471 IMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWLVANSW 530

Query: 313 GPHWGDRGTVKILRGKYECAFEYL 336
              WGD G  KI  G    + + +
Sbjct: 531 NTSWGDEGYFKIAVGAESISLDVV 554


>gi|145513975|ref|XP_001442898.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410259|emb|CAK75501.1| unnamed protein product [Paramecium tetraurelia]
          Length = 358

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 80/286 (27%), Positives = 123/286 (43%), Gaps = 41/286 (14%)

Query: 49  EYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA 108
           ++ + +  +DAK+   +     G  +   PE    +P+ ++ RE  P C    +    G 
Sbjct: 97  DFFKDWKFSDAKFIFNNHLTFKGKIQQC-PESGVIIPESYNFREAQPECAQPIYF--QGN 153

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C++ +  AAV A SDR C    G+    LS +   SC      D N  C  GSV R    
Sbjct: 154 CSSSYSIAAVSATSDRLCKSKNGEFQDQLSPQSPISC-----DDKNYKCGGGSVTRVLEV 208

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG 228
             K+G V+       T C P + +  + +       +CE     K K H  C        
Sbjct: 209 GKKQGFVS-------TSCLPYSGTEDAKNNCDALFSNCE-----KYKIHDYC-------- 248

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
                        V   E+ IK+EIL +GP  A   ++ DF  YK GVY+    +    Y
Sbjct: 249 -------------VVSGEENIKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQY 295

Query: 289 LHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
            H+ K+IGWG ++G  YW++ N+WG  WG +G   +  G+ +   E
Sbjct: 296 GHAVKVIGWGKQDGVNYWVIENSWGDTWGLKGLAYVAVGQNQLQLE 341


>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
          Length = 228

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 83/244 (34%), Positives = 118/244 (48%), Gaps = 19/244 (7%)

Query: 18  LYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
            +  SD  ++ IN++  TW AG NF  N+   YL++               L G +    
Sbjct: 2   FHPLSDELVNFINKQNTTWQAGHNF-FNVEVSYLKKLC----------GTFLGGPKLPRR 50

Query: 78  PEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNR 135
            E++  +  P+ FDAREQWPNC TI  + D G+C +   F AV A SDR CI + G  N 
Sbjct: 51  VEFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNV 110

Query: 136 PLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
            +S E + +CC     D             WNF  K+G V+GG Y    GC+P +I PC 
Sbjct: 111 EVSAEDMLTCCGGQCGDGCNGGYPSGA---WNFWTKKGLVSGGLYDSHVGCKPYSIPPCE 167

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           HH +  + P+C  +     +C   C  P Y   + +DKH    +Y V  +E+ IK EI  
Sbjct: 168 HHVNG-SRPACTGEG-DTPRCSKTC-EPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYK 224

Query: 256 HGPT 259
           +GP 
Sbjct: 225 NGPV 228


>gi|239799410|dbj|BAH70626.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 265

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 39/286 (13%)

Query: 5   LVFLLGCTLVRGELYK-----FSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIAD 58
           ++FL+   L+   L +       D  ID+     +T   G N  P ++ EE+L   +++ 
Sbjct: 4   VLFLVSTMLLNSYLSEQATLFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHL---MLSG 60

Query: 59  AKYFDQSDRPL----PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHI 114
            +  + + +        +R+ +  E    +   FDAR++WP+C TIG VP+ G       
Sbjct: 61  TRGVEATSKSKMLHKTRNRRCFRVEIDHQIDQEFDARKRWPHCKTIGEVPNDGNSLLSWA 120

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSV--FRTWNFLHKR 172
           +   G F+DR CI + G  N+ LSTE + SC  I      K    GSV  +  W +L   
Sbjct: 121 YVPTGVFADRMCIATNGTYNQLLSTEELISCSGI------KEDEFGSVNDYYVWEYLKNH 174

Query: 173 GSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGF--F 230
           G V+GG Y    GCQPS I P    G+ PT  S EN       C  RC    YG     +
Sbjct: 175 GLVSGGKYNTNNGCQPSKIPPI---GNLPT-GSYEN------TCEKRC----YGNNTINY 220

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD-DFYHYKSG 275
              H     ++  + ED I++E+  +GP +  F ++D DF+ YKSG
Sbjct: 221 NQDHVKIKNHYDIEYED-IQREVQNYGPVSMAFRVFDNDFFLYKSG 265


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 127/286 (44%), Gaps = 49/286 (17%)

Query: 79  EYSATVPDRFDARE--------QWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
           E  AT+P+  D  E         W +   IG    +  CAA   F+     +DR  I+S 
Sbjct: 204 EMRATLPETTDLPEFFIAFLQMAWMDSWAIG----SKNCAASWAFSTASVAADRIAIQSN 259

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-----GDRTG 185
           G+    LS + + SCC   R+     C+ GS+ R W +L KRG V+   Y      + + 
Sbjct: 260 GRYTANLSPQNLISCCAKNRH----GCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISN 315

Query: 186 CQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDN 245
              +  S     G       C N  + K     +C+ P                Y V  N
Sbjct: 316 NTCAMTSKADGRGKRHATRPCPN-NIEKSNRIYQCSPP----------------YRVSSN 358

Query: 246 EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGT 299
           E  I KEI+ +GP  A   +++DF+HYK+G+Y+H  ++N + E Y     H+ KL GWGT
Sbjct: 359 ETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGT 418

Query: 300 ENGT-----PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
             G       +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 419 LKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 89/275 (32%), Positives = 122/275 (44%), Gaps = 44/275 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG------DYGDRTGCQPSTISPCSH 196
            SCC   R+     C  GSV R W +L KRG V+        D     GC  ++ S    
Sbjct: 274 ISCCARKRH----GCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRS--DG 327

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
            G       C N        H   +N  Y         + +  Y V  NE  I KEI+ +
Sbjct: 328 RGKRHATTPCPN--------HIEKSNRIY---------QCSPPYRVSSNETQIMKEIMQN 370

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK--LENY----LHSGKLIGWGTENGT-----PY 305
           GP  A   +++DF+ YK+G+Y+H ++     E Y     H+ KL GWGT  G       +
Sbjct: 371 GPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKF 430

Query: 306 WLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           W+  N+WG  WG+ G  KILRG  E   E LI A 
Sbjct: 431 WIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 84/269 (31%), Positives = 119/269 (44%), Gaps = 32/269 (11%)

Query: 83  TVPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
            +P  F+A E+WPN   + H P D G CA    F+     SDR  I S G     LS + 
Sbjct: 202 VLPTAFEASEKWPN---LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQN 258

Query: 142 VASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG--DRTGCQPSTISPCSHHGS 199
           + SC         + C  G +   W FL +RG V+   Y    R   +   + PC  H  
Sbjct: 259 LLSC----NTHHQQGCRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSR 314

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
           A            K +    C N   G     + ++ T  Y +  N+  I KE++ +GP 
Sbjct: 315 A--------TGRGKRQATAHCPN---GHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPV 363

Query: 260 TATFALYDDFYHYKSGVYKHT--SNAKLENY----LHSGKLIGWGTEN-----GTPYWLV 308
            A   +++DF+ YK G+Y HT  +  + E Y     HS K+ GWG E         YW  
Sbjct: 364 QALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDGRKLKYWTA 423

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WGP WG+RG  +I+RG  EC  E  +
Sbjct: 424 ANSWGPAWGERGHFRIVRGVNECDIESFV 452


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 125/276 (45%), Gaps = 45/276 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR  I+S+G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNL 273

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R    + C+  SV R W +L KRG V+   Y           GC  ++ S   
Sbjct: 274 ISCCAKKR----RGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRS--D 327

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I +EI+ 
Sbjct: 328 GRGKRHATTPCPN-SIEKSNRIYQCSPP----------------YRVSSNETEIMREIMQ 370

Query: 256 HGPTTATFALYDDFYHYKSGVYKH--TSNAKLENY----LHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF++YK+G+Y+H  ++N   E Y     H+ KL GWGT  G       
Sbjct: 371 NGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEK 430

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 431 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 110/225 (48%), Gaps = 20/225 (8%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDA E WP C TI  + D  +C +    AA  A SDR C    G ++  +S   + SCC 
Sbjct: 1   FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLG-GVRDLRISAGDLMSCCD 59

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +C Y     C+ G     W +    G V+  +Y     CQP     C+HH ++  L  C 
Sbjct: 60  VCGY----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +      C++ CT+    +     K+R   T  +   E++ K+E+L +GP   +F++Y 
Sbjct: 109 GE-YDTPTCNSTCTD----KKIPLIKYRGN-TSCILSGEESFKRELLLNGPFEVSFSVYA 162

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W
Sbjct: 163 DFVAYTGGVYKHVTGVFLGG--HAVRIVGWGELNGEPYWKIANSW 205


>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
          Length = 182

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 107/194 (55%), Gaps = 12/194 (6%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           +  GAF+DR C+ + G+ N+ LS E +A     C  D  K C  G   + W +   +G  
Sbjct: 1   STTGAFADRLCVSTGGKFNQLLSPEELA----FCCKDCGKGCGGGYPIKAWKYFRTQGVT 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGGDY  + GC P  + PC +     T   C  Q  P  + H +C    YG+   Q++++
Sbjct: 57  TGGDYDTKEGCMPYKVPPCYNKQGKNT---CGGQ--PMERNH-QCPKTCYGKTTVQNRYK 110

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
           T   Y V ++   I++++  +GP  A+F +YDDF  YKSG+Y+ T  AK +   HS K+I
Sbjct: 111 TKSEY-VMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQG-GHSIKII 168

Query: 296 GWGTENGTPYWLVI 309
           GWG +NGTPYWL +
Sbjct: 169 GWGQQNGTPYWLAV 182


>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
          Length = 215

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 73/214 (34%), Positives = 100/214 (46%), Gaps = 14/214 (6%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKT 75
           G  Y     +I+ IN +A TW AG NF  N  +E+  + L   +K     +R      KT
Sbjct: 1   GTAYFLQKDFIENINEQATTWKAGVNFNPNTPKEHFLKML--GSKGVQIPNRNNIHLYKT 58

Query: 76  YDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQ 132
            D  Y      +P  FDAR +W +C TIG V D G C +    A   AF+DR C+ + G 
Sbjct: 59  DDAAYDNLFGRIPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGD 118

Query: 133 QNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTIS 192
            N+ LS E +  CC  C +     C+ G   + W    K G VTGGDY    GC+P  + 
Sbjct: 119 FNQLLSAEEITFCCHTCGF----GCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVP 174

Query: 193 PCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
           PC +  S     +C  + + K   + RCT   YG
Sbjct: 175 PCPYDESGNN--TCAGKPMEK---NHRCTRMCYG 203


>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
          Length = 202

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 98/203 (48%), Gaps = 12/203 (5%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCC-KICRYDDNKSCSHGSVFRTWNFLHKRGS 174
           +A    +DR C++SKG+  R +S   + SCC + C Y     C  G+  R W  + + G 
Sbjct: 6   SAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGY----GCRGGANIRAWKHVMRNGV 61

Query: 175 VTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
            TGG  G + GC+P    PC  H        C  +     +C   C        + +D++
Sbjct: 62  CTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRY 121

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
                Y+V ++  AI +EI+  GP    +  Y DF  YK GVY+HT+  +     HS K+
Sbjct: 122 YAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGG--HSIKI 179

Query: 295 IGWGT---ENGT--PYWLVINTW 312
           +GWG     NGT  PYWLV N+W
Sbjct: 180 MGWGNYKHPNGTVIPYWLVANSW 202


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 94/329 (28%), Positives = 143/329 (43%), Gaps = 39/329 (11%)

Query: 26  IDQINREANTWTAGR-NFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
           I+ IN     WTA    F   L++    ++ +  A+  D+    +      +    S+ +
Sbjct: 233 IEAINEGDFGWTASNFTFLWGLTQLEGYKYKLGTARVPDE----VRNMNAMHPLSVSSNL 288

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P  FD+R +WP  G++    D         F+     SDR  I+SK      LS +++ S
Sbjct: 289 PKTFDSRTKWP--GSLSLPRDQENEGTSWAFSTTSVLSDRLAIQSKNFTVVELSPQHLVS 346

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C     +  ++      + RTW +L K+G V+   Y +        I  C     +    
Sbjct: 347 C-----FSSHEGRGE-RLDRTWWYLRKKGVVSTVCYPESRSKSTQGIGSCGLVAHSSGAH 400

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
            C N  V         +N  Y         +T+  Y V  NE+ I KEI  +GP  A   
Sbjct: 401 ICPNGNVIS-------SNEIY---------KTSPVYRVSSNEENIMKEIFENGPVQAVMR 444

Query: 265 LYDDFYHYKSGVYKHTS--NAKLE----NYLHSGKLIGWGTE----NGTPYWLVINTWGP 314
           +  DF+ YKSGVY  T+  N  +E    N  HS K+IGWG +    N   YW+V N+WG 
Sbjct: 445 VQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWIVQNSWGA 504

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +WG+ G  +I +G  EC  E +I A  P+
Sbjct: 505 NWGEGGYFRIRKGVNECGIEEMILAAWPQ 533


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 71/225 (31%), Positives = 110/225 (48%), Gaps = 20/225 (8%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDA E WP C T+  + D  +C +    AA  A SDR C    G ++  +S   + SCC 
Sbjct: 1   FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLG-GVRDLRISAGDLMSCCD 59

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +C +     C+ G     W +    G V+  +Y     CQP     C+HH ++  L  C 
Sbjct: 60  VCGF----GCNGGYPEVAWEYYAVHGIVS--EY-----CQPYPFPSCAHHVNSSDLSPCS 108

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            +      C++ CT+    +     K+R   +Y V   E+  K+E++ +GP   +F++Y 
Sbjct: 109 GE-YDTPTCNSTCTD----KKIPLIKYRGNTSY-VLSGEEPFKRELILNGPFEVSFSVYA 162

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           DF  Y  GVYKH +   L    H+ +++GWG  NG PYW + N+W
Sbjct: 163 DFVAYTGGVYKHVAGIFLGG--HAVRIVGWGELNGEPYWKIANSW 205


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 67/169 (39%), Positives = 94/169 (55%), Gaps = 6/169 (3%)

Query: 168 FLHKRGSVTGGDYGDRTGCQPSTISPCSH-HGSAPTLPSCENQKVPKLKCHTRCT-NPTY 225
           +    G  TGG+Y D+ GC+P TI PC   + +  T   C     P   C  RCT N T+
Sbjct: 29  WWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVPCPGYHTPV--CEERCTSNITW 86

Query: 226 GRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
              + Q KH     Y V      I+ EI+ +GP  A+F +YDDF+ YKSG+Y HT+  + 
Sbjct: 87  PISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQ- 145

Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           E  + + K+IGWG +NG PYWL ++ WG  +G+ G ++ILRG  E   E
Sbjct: 146 EGGMDT-KIIGWGVDNGVPYWLCVHQWGTDFGENGFMRILRGVNEVHIE 193


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/274 (31%), Positives = 123/274 (44%), Gaps = 42/274 (15%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P+ F A  +WP  G I    D   CAA   F+     +DR  I SKG+    LS +++ S
Sbjct: 225 PEFFVAWHEWP--GWIHDPLDQRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLIS 282

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG------DRTGCQPSTISPCSHHG 198
           C    +Y     C  GS+   W++L K G V+   Y        +T C+ S++      G
Sbjct: 283 CDTRNQY----GCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQTSCEMSSVF--DAEG 336

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
               +  C N+  P        +N  Y  G         L Y +   +  I KEI  +GP
Sbjct: 337 KRQAIQPCPNRWEP--------SNHIYQCG---------LPYRISSQDADIMKEIKENGP 379

Query: 259 TTATFALYDDFYHYKSGVYKHT------SNAKLENYLHSGKLIGWGTENGTP-----YWL 307
             A   +YDDF+ YKSG+YKH       +  + +   HS K++GWGT          +W+
Sbjct: 380 VQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLRDAEGQRQKFWI 439

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
             N+WG  WG+ G  +ILRG+ EC  E  + A K
Sbjct: 440 AANSWGNSWGENGYFRILRGQNECDIEKTVIASK 473


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/266 (32%), Positives = 121/266 (45%), Gaps = 33/266 (12%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P+ FDAR +W   G +  V D G CA    F+     SDR  I+S+G     LS + + 
Sbjct: 200 LPEEFDARIRWS--GLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDLM 257

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC    R      C  G   R W FL   G V+         C P        H SA   
Sbjct: 258 SCLNGGR---RVVCQGGHPDRGWRFLLNYGGVS-------EECYPYE----GVHSSANAT 303

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
                ++ P      RC  PT   G  + KH +T  Y V  NE+ I +EI A+GP  A  
Sbjct: 304 CRIPRRRDPIED--ARC--PT---GRTEQKHFSTPPYRVPANEEDIMQEIYANGPVQALI 356

Query: 264 ALYDDFYHYKSGVYKHTSNAK------LENYLHSGKLIGWGTENG----TPYWLVINTWG 313
            + +DF+ Y+SGVY+HT  A+        +  HS +++GWG +        YWL  N+WG
Sbjct: 357 LVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYRPIKYWLCANSWG 416

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAA 339
             WG+ G  +I+RG+ E   E  + A
Sbjct: 417 HGWGENGYFRIVRGEDESQIESFVLA 442


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/259 (31%), Positives = 117/259 (45%), Gaps = 25/259 (9%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P+ F A   WP+   I    D   C A   F+     +DR  I S GQ    LS + + S
Sbjct: 223 PEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLIS 280

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C       + + C+ GS+   W +L   G V+         C PS      HH  +P+  
Sbjct: 281 C----DTGNQRGCNGGSIDGAWRYLTTHGVVS-------YACYPSFWK---HHLDSPSEN 326

Query: 205 SC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            C  + +  K   +  C N           +R    Y V   E  I +EI+A GP  A  
Sbjct: 327 QCYVSSEYGKNHTNGPCPNALEDSNRL---YRCGSHYRVSSKETDIMEEIMAKGPVQAIM 383

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-----TPYWLVINTWGPHWGD 318
            +Y+DF+ YK G+Y+H+  A  +   HS KL+GWG+  G       +W+  N+WG +WG+
Sbjct: 384 KVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGE 443

Query: 319 RGTVKILRGKYECAFEYLI 337
            G  +ILRG+ EC  E LI
Sbjct: 444 NGYFRILRGQNECDIEKLI 462


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/163 (38%), Positives = 86/163 (52%), Gaps = 12/163 (7%)

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH----RTTLTY 240
           GC P    PC+HH +    P C     P   C  +C NP Y      D+H     +   Y
Sbjct: 2   GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
            V+D ++AI+ +    GP +A+F +Y+DF  Y+SGVYKHTS + L    H+ K+IGWG +
Sbjct: 62  SVNDAKNAIRTD----GPVSASFTVYEDFLAYRSGVYKHTSGSYLGG--HAVKIIGWGEK 115

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +G  YWL +N+W   WGD G  KI  G   C  +  +  G PK
Sbjct: 116 SGQAYWLAVNSWNEDWGDHGLFKIALG--NCGIDDDLLGGTPK 156


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/258 (31%), Positives = 113/258 (43%), Gaps = 38/258 (14%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR QW  C  +  + D   C A   F+A    + R CI + GQ N  LS EY  
Sbjct: 3   IPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQV 60

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
            C  +     NK+C  G +  +W FL   G+                +  C  + S    
Sbjct: 61  QCDTM-----NKACQGGYLKYSWTFLENTGT---------------PLDTCIPYASGRGT 100

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            S          C T+C   +     ++ K+   +T       + IK  I+ +G   A F
Sbjct: 101 FSSGT-------CPTQCKIASMSMSKYKAKNTRYIT-----GINNIKTAIMTYGSVQAGF 148

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +Y D   YKSGVYKH  +  L    H+  LIG+G E G+ YWL  N+WGP+WG  G  K
Sbjct: 149 TVYRDLTGYKSGVYKHVVSTVLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFK 206

Query: 324 ILRGKYECAFEYLIAAGK 341
           I +G  E   E  + AG+
Sbjct: 207 IAQG--EGGIENQVYAGE 222


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/267 (33%), Positives = 119/267 (44%), Gaps = 29/267 (10%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P  F A  +WP    I    D   C A   F+     +DR  I SKGQ    LS + + S
Sbjct: 223 PAIFSAIYEWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLIS 280

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C       +   C+ GS+   W +L   G V+         C PS  +   H G     P
Sbjct: 281 C----DTRNQHGCNGGSIDGAWRYLKTHGVVS-------YACYPSFWN--KHLG-----P 322

Query: 205 SCENQKVPKLKCHTRCTNPTYGRGFFQDK--HRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
           S ENQ     +     TN      F +    +R    Y V   E  I KEI   GP  A 
Sbjct: 323 SAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHYRVSSKETDIMKEIKDRGPVQAI 382

Query: 263 FALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENG--TPYWLVINTWGPHWG 317
             +Y+DF+ YK G+Y+H+  A  +   HS KL+GWG    +NG    +W+  N+WG  WG
Sbjct: 383 MKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWG 442

Query: 318 DRGTVKILRGKYECAFEYLIAA--GKP 342
           + G  +ILRG+ EC  E LI A  G+P
Sbjct: 443 ENGYFRILRGQNECDIEKLILATLGQP 469


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 123/299 (41%), Gaps = 47/299 (15%)

Query: 45  NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
           N++   LR  L   +      D P     +    E    +P  FDAR QW  C  +  + 
Sbjct: 54  NMTISQLRDNLFGLSLMSSDEDTP-----RMASIETRVDIPMNFDARTQWKGC--VPAIR 106

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D   C A   F+A    + R CI + G+ N  LS EY   C  +     NK+C  G +  
Sbjct: 107 DQQTCGACWAFSANYVLAHRLCIATNGKTNVVLSPEYQVQCDTM-----NKACQGGYLKY 161

Query: 165 TWNFLHKRGSV--TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTN 222
           +W FL   G+   T   Y    G   S        G+ PT     +  + K K      N
Sbjct: 162 SWTFLENTGTPLDTCIPYASGRGTFSS--------GTCPTQCKIASMSMSKYKAK----N 209

Query: 223 PTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSN 282
             Y  G                  + IK  I+ +G   A F +Y D   YKSGVYKH  +
Sbjct: 210 TVYISGI-----------------NNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVS 252

Query: 283 AKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
             L    H+  LIG+G E G+ YWL  N+WGP+WG  G  KI +G  E   E  + AG+
Sbjct: 253 TVLGG--HAVALIGFGVEGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGE 307


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 124/269 (46%), Gaps = 35/269 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  F+A ++WP  G I    D G C A   F+     SDR  I+S G     LS + + 
Sbjct: 200 LPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLI 257

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC    R+ D   C+ G +   W F+ +RG VT         C P   SP     SA  +
Sbjct: 258 SC--DTRHQD--GCAGGRIDGAWWFMRRRGVVT-------QDCYP--FSPPEQ--SAVEV 302

Query: 204 PSCENQKVP----KLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
             C  Q       K +    C N      +  D +++T  Y +  NE+ I KEI+ +GP 
Sbjct: 303 ARCMMQSRAVGRGKRQATAHCPN---SHSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPV 359

Query: 260 TATFALYDDFYHYKSGVYKHTS------NAKLENYLHSGKLIGWGTENG-----TPYWLV 308
            A   +++DF+ YKSG+++HT       +   ++  HS ++ GWG E         YW+ 
Sbjct: 360 QAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWIG 419

Query: 309 INTWGPHWGDRGTVKILRGKYECAFEYLI 337
            N+WG +WG+ G  +I RG  EC  E  +
Sbjct: 420 ANSWGKNWGEDGYFRIARGVNECDIETFV 448


>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
          Length = 218

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 68/229 (29%), Positives = 113/229 (49%), Gaps = 16/229 (6%)

Query: 40  RNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGT 99
           +NFP N+ +E + + L+   +       P+  +  +Y  +    +P  FDAR +W  C T
Sbjct: 3   QNFPENMLKEQMVR-LLGSKRLTGVPKTPVKENDISYVED--GGIPKAFDARLEWKYCKT 59

Query: 100 IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSH 159
           IG V D G C +       GAF+DR CI +KG  N  +S E +  CC +C       C+ 
Sbjct: 60  IGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELISAEELTFCCHLC----GIGCNG 115

Query: 160 GSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTR 219
           G+  R W +  + G VTGG+Y    GCQP  + PC++        SC  Q+  +   + +
Sbjct: 116 GNPLRAWQYFKRHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHY--SCSGQQKER---NHK 170

Query: 220 CTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
           C    YG     + +D ++T   Y++  N   ++K+++ +GP  A+F +
Sbjct: 171 CLKTCYGDKTVDYKRDHYKTKDAYYL-SNTTTMQKDVILYGPIEASFDV 218


>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
          Length = 243

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/238 (32%), Positives = 108/238 (45%), Gaps = 16/238 (6%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           L  +     V  + Y     +ID IN +A TW AG NF  +  +E+  + L   +K    
Sbjct: 6   LSVIFVSVYVTEQTYFLQKDFIDNINNQATTWKAGVNFDPDTPKEHFLKML--GSKGVQI 63

Query: 65  SDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            ++      KT+D  Y      +P  FDAR +W +C TIG V D G C +    A   AF
Sbjct: 64  PNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAF 123

Query: 122 SDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYG 181
           +DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTGGDY 
Sbjct: 124 ADRLCVATNADFNELLSAEEITFCCYSCGF----GCNGGYPIKAWERFKKRGLVTGGDYQ 179

Query: 182 DRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRG--FFQDKHRTT 237
              GC+P  + PC +   A    +C  +  P+   H RCT   YG     F + HR T
Sbjct: 180 SGEGCEPYRVPPCPY--DAEGHNTCAGK--PRESNH-RCTRMCYGNQDLDFDEDHRYT 232


>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 305

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/251 (28%), Positives = 112/251 (44%), Gaps = 37/251 (14%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           PDR D R+  P C       D   C+  + FA +GA S RRCI     Q   LS +++ S
Sbjct: 82  PDRLDYRQTHPEC--FFEPEDQKECSCCYAFATIGALSTRRCIAKLDSQAVSLSVQHMVS 139

Query: 145 CCKICRYDDNKS-CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           C      D+ ++ C  G    +W FL   G V       ++ C P T     + G  P +
Sbjct: 140 C------DNGEAGCLGGEFESSWAFLETEGVV-------KSDCLPYTSGETGNSGECPMM 186

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
             C++  + +   H +  + +                   +N + I   +LA GP    F
Sbjct: 187 --CQDGTLVEDAFHYKAASAS-----------------PLNNYNEIMVSLLADGPVQTGF 227

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
            +++DF +Y  G+Y     + L    H+  ++G+G+ N   YW+V N+WGP WG+ G  +
Sbjct: 228 YVHEDFLYYVGGIYHKVYGSSLGG--HAVLIVGYGSMNDHDYWIVRNSWGPDWGENGYFR 285

Query: 324 ILRGKYECAFE 334
           ILRG  EC  E
Sbjct: 286 ILRGTNECGIE 296


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 124/287 (43%), Gaps = 43/287 (14%)

Query: 50  YLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGA- 108
           +L    I    Y + S R   G   T+D   ++ +P  FD+R++W +C     V D G  
Sbjct: 3   FLITLFILLISYTELS-RAQCGASPTFD---ASNLPASFDSRQKWSDC--FSPVRDQGQK 56

Query: 109 CAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNF 168
           C++     A G  +DR C+ S G+  + LS + +  C +    + N  C  G +     +
Sbjct: 57  CSSCWAMTATGVLADRLCVASGGKVKKVLSPQELIDCDR----NGNLGCGGGRLDTPLAY 112

Query: 169 LHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLK-CHTRCTNPTYGR 227
               G VT                             CE+ K  +   C   C + T   
Sbjct: 113 FRDNGVVT---------------------------EKCESYKATQASSCSNTCDDGTSFS 145

Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLEN 287
                K+ +   Y +   E A K +I  +GP  A F LY D Y+YKSGVY  + +A  + 
Sbjct: 146 N--TTKYHSKDCYRLSSIEQA-KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKE 202

Query: 288 YLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
             H+G++IGWG E+G  YWL  N+WG  WG +G  KI  G  E  FE
Sbjct: 203 -THAGRVIGWGVEDGVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFE 248


>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 199

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 101/198 (51%), Gaps = 7/198 (3%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADA 59
           ++ +L  +L    +  + Y     YI++IN +A+TWTAG NF P+   E+ L+       
Sbjct: 4   VLILLSVILFSVYMTEQAYFLEKDYINKINEKASTWTAGFNFDPSTPKEDILKLLGSKGV 63

Query: 60  KYFDQSDRPL-PGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAV 118
           +   + +  +   + + YD  +   +P +FDAR++W +C TIG V D G C +    +  
Sbjct: 64  QTPSKINLKMYKSEDENYDNLF-GRIPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTS 122

Query: 119 GAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGG 178
            AF+DR C+ + G  N+ LS E +  CC  C Y     C+ G   + W    K G VTGG
Sbjct: 123 SAFADRLCVATNGDFNQLLSAEELTFCCHKCGY----GCNGGYPIKAWERFKKHGLVTGG 178

Query: 179 DYGDRTGCQPSTISPCSH 196
           +Y    GC+P  + PC +
Sbjct: 179 EYKSGEGCEPYRVPPCPY 196


>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 245

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 83/257 (32%), Positives = 116/257 (45%), Gaps = 24/257 (9%)

Query: 83  TVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           T+P  FD RE+WP C   +    D G C +    A     +DR CI + G     LS   
Sbjct: 1   TLPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQ 60

Query: 142 VASCCKI--CRYDDNK----SCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCS 195
           + SC K+    +D       SC  G     +      G V+GG +GD   C P   +PC 
Sbjct: 61  LLSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAPCQ 120

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
           H    P  P+   Q      C T C N        Q    T+L     ++ + +  E+  
Sbjct: 121 H----PCNPNHVAQ------CPTTCRNKNVNLS-SQRYEVTSLVTCGTNDFNCMALELFY 169

Query: 256 HGPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWG-TENGTPYWLVIN 310
           HGP ++    ++D+FY YKSGVY  + +       H G   ++IGWG TE+GT YW V N
Sbjct: 170 HGPVSSYVGDVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGTTESGTRYWKVYN 229

Query: 311 TWGPHWGDRGTVKILRG 327
           +W  +WGD+G  KI  G
Sbjct: 230 SW-LNWGDQGYGKIAVG 245


>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
          Length = 219

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/235 (29%), Positives = 112/235 (47%), Gaps = 21/235 (8%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYS--ATVPDRFDAREQ 93
           W A +NFP  +++E + + L + +         L    K YD +Y+    VPD FDAR +
Sbjct: 1   WKAKQNFPEYMTKEQIVRLLGSKS-----VKGALKSPIKEYDSKYTNDVEVPDFFDARIE 55

Query: 94  WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDD 153
           W  C TIG V + G C +       GAF+DR C+ + G  N  +S E +  CC  C +  
Sbjct: 56  WKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEELTFCCHTCGF-- 113

Query: 154 NKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPK 213
              C+ G+  R W +  + G VTGG+Y    GCQP  + PC          SC  Q+  +
Sbjct: 114 --GCNGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEGHN--SCSGQRTER 169

Query: 214 LKCHTRCTNPTYGR---GFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFAL 265
              + RC+   YG     +    ++T   Y++ +N   ++ + + +GP  ++F +
Sbjct: 170 ---NHRCSKSCYGNTTSDYKNGHYKTKDAYYLTNN--TMQIDTMIYGPIESSFDV 219


>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 308

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 145/338 (42%), Gaps = 66/338 (19%)

Query: 21  FSDAYIDQINREANTWTAGRNFPA---NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYD 77
            +   + QI   A  W AG   P    NL++   ++ L A +     S       R    
Sbjct: 16  LTQVELRQIQALAPAWKAG--IPERLKNLTKNDFKKMLSAGSPRTQSSIV-----RPVRV 68

Query: 78  PEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL 137
           PE    VPD FD RE++P C  I  V D G C++   ++AV AFS RRC+    Q+    
Sbjct: 69  PENEDPVPDHFDFREEYPQC--ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRY 126

Query: 138 STEYVASCCKICRYDDNKSCSHGSVFRT--WNFLHKRG-----SVTGGDYGDRTGCQPST 190
           S +Y+ SC           C   S   +  W+F+   G      V   DY D+T  +P  
Sbjct: 127 SAQYILSC------SSTNGCFGFSTRESIAWDFIATTGIPLESCVKYTDY-DQTQSRP-- 177

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
                                    C + C + ++   +  D +       V  N + +K
Sbjct: 178 -------------------------CPSTCDDDSFLEVYKPDGYEG-----VGLNCERLK 207

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVI 309
           + +   GP  A F +Y+DF +Y  G+Y +T   ++     S +++G+GT + G  YW+V 
Sbjct: 208 RAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVG--FLSVEIVGYGTSDEGQDYWIVK 265

Query: 310 NTWGPHWGDRGTVKILRGKYECAFE-----YLIAAGKP 342
           N WGP WG+ G  +I+RG+ EC  E      +I+  KP
Sbjct: 266 NYWGPGWGEDGYFRIVRGQNECQIENSAYGAIISPNKP 303


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/106 (48%), Positives = 66/106 (62%), Gaps = 2/106 (1%)

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIG 296
           T  Y+V     AI+ EI+ +GP    F +Y+D Y YKSGVY+HT+   L    H+ K+IG
Sbjct: 112 TSAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGG--HAIKIIG 169

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WGT+NG PYWL+ N+WG  WG+ G  KI RG  EC  E  + AGK 
Sbjct: 170 WGTQNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAGKA 215


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 78/267 (29%), Positives = 125/267 (46%), Gaps = 46/267 (17%)

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + + T+P+ FD+R++WPNC  I  + D   C +   FA+    SDR CI S+GQ N  LS
Sbjct: 120 DLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDLS 177

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP--STISPCSH 196
            + + SC       +N  CS G +  + +FL   G V+         C+P  +  + C  
Sbjct: 178 PQDLVSCSY-----ENFGCSGGQLTESVDFLIYEGIVS-------EKCKPYMNQDTYCKF 225

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
                    C+N K P            Y + F + K    L+     + + I+ E++ +
Sbjct: 226 --------KCQNDKQP------------YTKYFCEQKSMLILS-----DIEEIQLELMTN 260

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGTPYWLVINTWGPH 315
           GP     ++Y+D  +YK GVY++T+  ++    H+ K+IGWG TE G  +W   N WG  
Sbjct: 261 GPMMVGLSVYEDLMNYKEGVYEYTTGNQVGG--HAIKIIGWGHTEKGELFWKCQNQWGKD 318

Query: 316 WGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG  G + I  G  E   + ++    P
Sbjct: 319 WGMGGYINIKAG--ELGMDTMVLGCMP 343


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 55/118 (46%), Positives = 70/118 (59%), Gaps = 2/118 (1%)

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
           Y   + +DKH    +Y V +NE  I  EI  +GP    F++Y DF  YKSGVY+H S   
Sbjct: 2   YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEI 61

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +    H+ +++GWG ENGTPYWLV N+W   WGD G  KILRG+  C  E  I AG P
Sbjct: 62  MGG--HAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 87/297 (29%), Positives = 124/297 (41%), Gaps = 43/297 (14%)

Query: 45  NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP 104
           N++   LR  L   +      D P     +  + E    +P  FDAR QW  C  +  + 
Sbjct: 69  NMTISQLRDNLFGLSLMSTDEDTP-----RMENIETRMDIPMNFDARTQWRGC--VPAIR 121

Query: 105 DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFR 164
           D   C A   F+A    + R CI + GQ N  LS EY   C  +     NK+C  G +  
Sbjct: 122 DQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM-----NKACQGGYLKY 176

Query: 165 TWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPT 224
           +W FL   G+                +  C  + S     S          C T+C   +
Sbjct: 177 SWTFLENTGT---------------PLDTCIPYASGRGTFSSGT-------CPTQCKIAS 214

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
                ++ K+   +T       + IK  I+ +G   A F +Y D   YKSGVYKH  +  
Sbjct: 215 MSMSKYKAKNTRYIT-----GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTV 269

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           L    H+  LIG+G E G+ YWL  N+WG +WG  G  KI +G  E   E  + AG+
Sbjct: 270 LGG--HAVALIGFGVEGGSNYWLAANSWGANWGMSGYFKIAQG--EGGIENQVYAGE 322


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 115/271 (42%), Gaps = 45/271 (16%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P RFDA E W   G +    D G C +   F+     SDR  I SKG++   L+ + + 
Sbjct: 187 LPTRFDASEHWT--GLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQML 244

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +C +       + CS G +   W +L + G V                  C  + +A  +
Sbjct: 245 ACVR-----RQQGCSGGHLDTAWQYLRRTGVVN---------------EECYPYIAAQNV 284

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD-----DNEDAIKKEILAHGP 258
               N           C  P         K   TL Y +      +NE  I  EI   G 
Sbjct: 285 CKISNDDT---LITANCELPV--------KVNRTLMYKMGPAFSLNNETDIMAEIKDRGT 333

Query: 259 TTATFALYDDFYHYKSGVYKHTSNA---KLENYLHSGKLIGWGTE----NGTPYWLVINT 311
             A   +Y DF+ Y+SG+Y+H++ A   +  +  HS +LIGWG E    +   YW+ IN+
Sbjct: 334 VQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINS 393

Query: 312 WGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           WG  WG+ G  +ILRG  EC  E  + A  P
Sbjct: 394 WGQWWGENGRFRILRGSNECDIESYVLASNP 424


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 86/276 (31%), Positives = 120/276 (43%), Gaps = 49/276 (17%)

Query: 84  VPDRFDAREQWPNCGTIGHVP-DTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ F A  +WP      H P D   CAA   F+     +DR      G+    LS + +
Sbjct: 217 LPEFFIASYKWPG---WTHGPLDQKNCAASWAFSTASVAADR----IXGRYTANLSPQNL 269

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY-------GDRTGCQPSTISPCS 195
            SCC   R+     C+ GS+ R W FL KRG V+   Y           GC  ++ S   
Sbjct: 270 ISCCAKNRH----GCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRS--D 323

Query: 196 HHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILA 255
             G       C N  + K     +C+ P                Y V  NE  I KEI+ 
Sbjct: 324 GRGKRHATKPCPNN-IEKSNRIYQCSPP----------------YRVSSNETEIMKEIMQ 366

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN------YLHSGKLIGWGTENGT-----P 304
           +GP  A   +++DF+HYK+G+Y+H +    E+        H+ KL GWGT  G       
Sbjct: 367 NGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEK 426

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           +W+  N+WG  WG+ G  +ILRG  E   E LI A 
Sbjct: 427 FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/332 (27%), Positives = 145/332 (43%), Gaps = 47/332 (14%)

Query: 19  YKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLI--ADAKYFDQSDRPLPGDRKTY 76
           +K +  +I+QIN   ++W AG  +P    E++ R  LI  A  +      RP P      
Sbjct: 173 FKTNLDFIEQINSAQSSWQAGV-YPE--YEKFTRNDLIRRAGGRKSRLPHRPRPAPVSEE 229

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRP 136
               +A +P+ FD R+       +  + D G C + + FA++G    R  + +   Q   
Sbjct: 230 TRLAAAQLPESFDWRKVM-GLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFV 288

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSH 196
           LS + + SC K      ++ C  G  +           +  G Y +  G       P   
Sbjct: 289 LSPQEIVSCGKY-----SQGCEGGFPY-----------LIAGKYAEDFGVVLEECYPYEG 332

Query: 197 HGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAH 256
             S     SC++         +RC     GRG+  + +R    ++   NE+ ++ E++ +
Sbjct: 333 KDS-----SCKDT--------SRC-----GRGYATN-YRYVGGFYGGCNEELMQLELVKN 373

Query: 257 GPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGT--ENGTPYWLVIN 310
           GP    F +Y DF HYK GVY+HT  +      E   H+  L+G+G   E G  +W V N
Sbjct: 374 GPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWTVKN 433

Query: 311 TWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 434 SWGEKWGEEGFFRIRRGTDECAIESIAVAADP 465


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/209 (33%), Positives = 96/209 (45%), Gaps = 26/209 (12%)

Query: 137 LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG--GDYGDRTGCQPSTISPC 194
           LS   + +CC     D    C  G     W +  + G VT     Y D  GC+       
Sbjct: 5   LSVNDLLACCGFMCGD---GCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCK------- 54

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            H G  P  P+         KC  +C      + + + KH +   Y ++ +   I  E+ 
Sbjct: 55  -HPGCEPAYPT--------PKCEKKCKEQN--QVWQEKKHFSIDAYRINSDPHDIMAEVY 103

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTEN-GTPYWLVINTWG 313
            +GP    F +Y+DF HYKSGVYKH +   +    H+ KLIGWGT + G  YWL+ N W 
Sbjct: 104 KNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGG--HAVKLIGWGTSDAGEDYWLLANQWN 161

Query: 314 PHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             WGD G  KI+RGK EC  E  + AG P
Sbjct: 162 RGWGDDGYFKIIRGKNECGIEEGVVAGMP 190


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score =  109 bits (273), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 47/94 (50%), Positives = 69/94 (73%), Gaps = 2/94 (2%)

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWL 307
           AI+KEI+ +GP  A F +Y+DF +YKSG+YKH +  KL ++ H+ ++IGWG EN TPYWL
Sbjct: 2   AIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHIT-GKLFSW-HAIRIIGWGEENNTPYWL 59

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEYLIAAGK 341
           + N+W   WG+ G  +ILRG++EC+ E  + AG+
Sbjct: 60  IPNSWNEDWGENGNFRILRGRHECSIESEVTAGR 93


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 141/339 (41%), Gaps = 59/339 (17%)

Query: 17  ELYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPGD 72
           E +K++  ++D IN   N+W A       + EEY    L Q +     Y     RP    
Sbjct: 272 EHFKYNYDFVDAINAAQNSWIA------TVYEEYEKLSLDQMIKRRGGYSYPYPRPKSAP 325

Query: 73  RKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIKS 129
                 + ++T+P  +D    W N   + +V    +   C + + FA++G    R  IK+
Sbjct: 326 LTHEILQKTSTLPKSWD----WRNVNGVNYVSPVRNQANCGSCYAFASLGMLESRIRIKT 381

Query: 130 KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPS 189
              Q   LS + + SC +      ++ C  G  +           + GG Y    G    
Sbjct: 382 NNSQVPVLSPQEIVSCSEY-----SQGCEGGFPY-----------LIGGKYAQDFGLVEE 425

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
              P   + S  T   C                      ++  ++     ++   NE  +
Sbjct: 426 ECFPYQAYDSPCTPKKCSR--------------------YYTSEYHYVGGFYGGCNEALM 465

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHT----SNAKLENYLHSGKLIGWGTEN--GT 303
           K E++ +GP T  F +YDDF HY++G+Y HT    +    E   H+  L+G+GT+   G 
Sbjct: 466 KHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTNHAVLLVGYGTDEKTGE 525

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            YW+V N+WG  WG+ G  +ILRG  ECA E +  A  P
Sbjct: 526 DYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATP 564


>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 236

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/229 (30%), Positives = 105/229 (45%), Gaps = 18/229 (7%)

Query: 2   IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           + +L  L+     R   Y    SD  ++ IN+   TW AG NF  N    Y+++      
Sbjct: 5   VSLLCVLVALANARSIPYFPPLSDDLVNHINKLNTTWKAGHNF-HNADMSYVKKLC---G 60

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
            +      P     +  D      +PD FD+R QWPNC TI  + D G+C +   F AV 
Sbjct: 61  TFLGGPKLP-----ERVDFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C+ +  + +  +S E + SCC    ++    C+ G     W +  +RG V+GG 
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCT---NPTY 225
           Y    GC+P +I PC HH +  T P C  +     +C   C    +P+Y
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNG-TRPPCTGEGGSTPRCSRHCEPGYSPSY 220


>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
           yakuba]
          Length = 174

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/167 (37%), Positives = 92/167 (55%), Gaps = 10/167 (5%)

Query: 12  TLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIA--DAKYFDQSD-RP 68
           TL  GE    SD +I+ +  +A TWT GRNF A+++E ++R+ +    DA  F  +D R 
Sbjct: 15  TLSAGEPSLLSDEFIELVRSKAKTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALADKRE 74

Query: 69  LPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIK 128
           + GD      +    +P+ FD+R+QWPNC TIG + D G+C +   F AV A SDR CI 
Sbjct: 75  VLGDLYMNSVD---EIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 131

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           S G+ N   S + + SCC  C +     C+ G     W++  ++G V
Sbjct: 132 SGGKVNFHFSADDLVSCCHTCGF----GCNGGFPGAAWSYWTRKGIV 174


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 92/337 (27%), Positives = 140/337 (41%), Gaps = 51/337 (15%)

Query: 4   ILVFLLGCTLVRGELYK---FSDAYIDQINR-EANTWTAGRNFPANLSEEYLRQFLIADA 59
           I + ++G +L+ G +      S A +  I   +  TW      P     + L +      
Sbjct: 61  IEIKMIGASLLLGAVLAAPAVSHADLHTIKALDGLTWVP--ELPKRFMGKSLDEVKAMFG 118

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
              D S RP    R++  P   A  P+ +D R+++P+C  I  V D G C +   F++V 
Sbjct: 119 PLVDTS-RPAITMRRSTTPPVGA--PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQ 173

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
            F+D RC           S +YV  C +      +  C+ G     +NFLH  G+V    
Sbjct: 174 TFADHRCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPVNAFNFLHNTGTVLASC 228

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
            G   G                      +  V K  C  +C + +               
Sbjct: 229 VGYTAG----------------------DDAVVKF-CPQKCDDGSAVENVVATS------ 259

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG- 298
                   +    +LAHGP  ATF +  DF +YKSGVY+H     L    H+ ++IG+G 
Sbjct: 260 ---GSKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGG--HAVEIIGYGV 314

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEY 335
           T++G  YW V N+WGP WG+ G  +I+RG  EC  E+
Sbjct: 315 TDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEH 351


>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
          Length = 396

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 121/273 (44%), Gaps = 29/273 (10%)

Query: 73  RKTYDPEYSATVPDRFDAREQWPNC-GTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           + ++DPE S  +P +FDAR++W  C G IG V D G C +    AA    +DR CI    
Sbjct: 136 KASFDPE-SLGLPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIAHG- 193

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD-RTGCQPST 190
            +   LS +Y  SC     Y     C  G+V  T     ++G  TGG +GD  + C P  
Sbjct: 194 -KTEELSPQYALSC-----YSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPYE 247

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
              C H         C+       +C T C + T        +  +        +   I 
Sbjct: 248 FEACDH--------PCQVPGTIAEECPTTCADGTPISETEMMRPTSEPYECPPGDWKCIT 299

Query: 251 KEILAHGPTTATFA-LYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE--------N 301
           +E+  +G    TF  + DDFY +K GVY+     K    LH+ K+IGWG E         
Sbjct: 300 QELHKYGSMAVTFGPVCDDFYGHKHGVYEQPEGGKPLG-LHATKIIGWGFEGDDEETGKG 358

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
           G PYW++IN+W  +WG+ G  +I  G+     E
Sbjct: 359 GKPYWIMINSW-QNWGEHGVGRIGIGEMSIESE 390


>gi|403359042|gb|EJY79178.1| Cysteine protease [Oxytricha trifallax]
          Length = 366

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 82/264 (31%), Positives = 123/264 (46%), Gaps = 44/264 (16%)

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           SD    G  K+ D E    +P+++D RE +P+C  +  V + G C++ +I AA+   +DR
Sbjct: 88  SDTQNIGPCKSKDDE-ETIIPEKYDWREVYPDC--VQPVVNQGNCSSSYITAALSTVADR 144

Query: 125 RCIKSKGQQNRP--LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
            C  +K    +P  LS + +  C K      +  C  G V RT+N            +G 
Sbjct: 145 ICQTTK----KPIQLSAQELLDCDK-----SSYQCDGGYVSRTFN------------WGK 183

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
           R G  P    P +       +  CE+  +   +C  R  N  Y            + Y +
Sbjct: 184 RKGFIPEQCYPYTG-----VVGECEDDHLETNEC--RVNNMFY----------RVIDYCL 226

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-N 301
             +E  +KKEIL +GP  A   +Y DF  YK GVY  T +A   N  H  K++GW  + +
Sbjct: 227 ASDELGLKKEILKNGPVVAQMVIYTDFLTYKEGVYHRTEDAFKFNGQHVVKIVGWDRQGD 286

Query: 302 GTPYWLVINTWGPHWGDRGTVKIL 325
           G  +W+V N+WG  WG+ G VKIL
Sbjct: 287 GNDFWIVENSWGSDWGEDGYVKIL 310


>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
 gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
          Length = 205

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 97/208 (46%), Gaps = 18/208 (8%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           LV  L  +     L   S   +D IN+   TW AG NF  N+   Y+++      K    
Sbjct: 9   LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLCGTLLK---- 63

Query: 65  SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
                 G +     +Y+  V  PD FD R+QWPNC T+  + D G+C +   F A  A S
Sbjct: 64  ------GPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S  + +  +S+E + SCC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQK 210
             GC+P +I PC HH +  T P C  ++
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEE 200


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/119 (44%), Positives = 69/119 (57%), Gaps = 2/119 (1%)

Query: 225 YGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK 284
           Y   + +DKH    +Y V D+E  I  EI  +GP    F ++ DF  YKSGVYKH +   
Sbjct: 6   YSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDV 65

Query: 285 LENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           +    H+ +++GWG ENG PYWLV N+W   WGD G  KILRG+  C  E  I AG P+
Sbjct: 66  MGG--HAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 122


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 92/328 (28%), Positives = 145/328 (44%), Gaps = 49/328 (14%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFL--IADAKYFDQSDRPLPGDRKTYDP 78
            +D +I   N +   W A RN  A      + Q +  +   K  + +  P     K  D 
Sbjct: 39  LNDKFIQNHNSKNAPWVAKRN--ARFEGHTIGQVMAMMGTKKVINNNAAP---SIKIVD- 92

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
              A++P  FDAREQWP C  +  V +   C +   F++  A SDR CI SKGQ N  LS
Sbjct: 93  ---ASIPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLS 147

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + +C  I     N+ C+ G     W ++  +G  T         C P T    + +G
Sbjct: 148 PQALVACDDI----GNQGCNGGVPQLAWEYMEWKGLPT-------FECYPYT----AGNG 192

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +  T             C  +C + +    +++ K  +  T    ++   I+ EI+ +GP
Sbjct: 193 TDGT-------------CQRQCADGS-AMTYYRAKPFSMTTC---NSVACIQNEIITYGP 235

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTP--YWLVINTWGPHW 316
              T  +Y DF  Y SGVY +   A+L    H+ +++GWGT+  +   YW+V N+W   W
Sbjct: 236 VVGTMMVYQDFMSYSSGVYVYDGTAELLGG-HAIEIVGWGTDATSKLDYWIVKNSWSAAW 294

Query: 317 GDR-GTVKILRGKYECAFEYLIAAGKPK 343
           G   G   I RG   C  ++  +A + K
Sbjct: 295 GGLDGYFWIQRGTNMCGIDHDASASQAK 322


>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
 gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
          Length = 207

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 97/208 (46%), Gaps = 18/208 (8%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           LV  L  +     L   S   +D IN+   TW AG NF  N+   Y+++      K    
Sbjct: 9   LVSGLSVSWAWPRLPPLSHQMVDYINKANTTWKAGPNF-HNVDYSYVKRLCGTLLK---- 63

Query: 65  SDRPLPGDRKTYDPEYSATV--PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFS 122
                 G +     +Y+  V  PD FD R+QWPNC T+  + D G+C +   F A  A S
Sbjct: 64  ------GPKLPTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAIS 117

Query: 123 DRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD 182
           DR CI S  + +  +S+E + SCC  C       C+ G     W+F    G VTGG Y  
Sbjct: 118 DRVCIHSNAKVSVEISSEDLLSCCDSC----GMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173

Query: 183 RTGCQPSTISPCSHHGSAPTLPSCENQK 210
             GC+P +I PC HH +  T P C  ++
Sbjct: 174 HVGCRPYSIPPCEHHVNG-TRPPCTGEE 200


>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
          Length = 239

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/197 (32%), Positives = 91/197 (46%), Gaps = 9/197 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++ +L  +     +  + Y     +ID IN  A TW AG NF  +  +E+  + L   +K
Sbjct: 4   VLMLLSVIFVSFYLTEQAYFLQKDFIDNINERATTWKAGVNFDPDTPKEHFLKML--GSK 61

Query: 61  YFDQSDRPLPGDRKTYDPEYS---ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAA 117
                ++      KT+D  Y      +P  FDAR +W  C TIG V D G C +    A 
Sbjct: 62  GVQIPNKHNIHMYKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMAT 121

Query: 118 VGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTG 177
             AF+DR C+ +    N  LS E +  CC  C +     C+ G   + W    KRG VTG
Sbjct: 122 SSAFADRLCVATNTDFNELLSAEEITFCCHSCGF----GCNGGYPIKAWERFKKRGLVTG 177

Query: 178 GDYGDRTGCQPSTISPC 194
           GDY    GC+P  + PC
Sbjct: 178 GDYQSGEGCEPYRVPPC 194


>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
          Length = 188

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 81/160 (50%), Gaps = 5/160 (3%)

Query: 185 GCQPSTISPCSHHGSAPTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
           GCQP TI PC      P   SC    +     C  +C NP Y   F  D ++     +  
Sbjct: 31  GCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGK---YYK 87

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY-LHSGKLIGWGTENG 302
            +     K+I  +GP T  F +Y D   YKSGVY++   +  + + +HS K+ GWG ENG
Sbjct: 88  LSPYMAMKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENG 147

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            PYWLV N++G  WG  GT KI RG   C F+  + AG P
Sbjct: 148 VPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 187


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 85/272 (31%), Positives = 120/272 (44%), Gaps = 39/272 (14%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P  F A   WP    I    D   C A   F+     +DR  I S+GQ    LS + + S
Sbjct: 223 PVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLIS 280

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGD--RTGCQPSTISPC---SHHGS 199
           C       +   C+ G++   W +L   G V+   Y    +   +PS  + C   S +G 
Sbjct: 281 C----DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYGK 336

Query: 200 APTLPSCEN--QKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHG 257
             T   C N  +K  +L                   +R    Y V   E  I KEI+  G
Sbjct: 337 NYTNGPCPNALEKSNRL-------------------YRCASHYRVSSKETNIMKEIMDKG 377

Query: 258 PTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT---ENG--TPYWLVINTW 312
           P  A   +Y+DF+ YK G+Y+H+  A  +   HS KL+GWG    +NG    +W+  N+W
Sbjct: 378 PVQAIMKVYEDFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSW 437

Query: 313 GPHWGDRGTVKILRGKYECAFEYLIAA--GKP 342
           G  WG+ G  +ILRG+ EC  E LI A  G+P
Sbjct: 438 GKSWGENGYFRILRGQNECDIEKLILATSGQP 469


>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
          Length = 201

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 7/185 (3%)

Query: 14  VRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP--LPG 71
           V  + Y     +I+ IN +A TW AG NF  N  +E+  + L +        +       
Sbjct: 1   VTEQAYFLQRDFIENINEQATTWKAGVNFDPNTPKEHFLKLLGSKGVQIPNLNNINLYKT 60

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKG 131
           D   YD  +   +P  FDAR +W +C TIG V D G C +    A   AF+DR C+ + G
Sbjct: 61  DDAAYDNLF-GLIPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNG 119

Query: 132 QQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTI 191
             N  LS E +  CC  C +     C  G   + W   +K G VTGG+Y    GC+P  +
Sbjct: 120 DFNELLSAEEITFCCHTCGF----GCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPYRV 175

Query: 192 SPCSH 196
            PC +
Sbjct: 176 PPCPY 180


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 93/333 (27%), Positives = 145/333 (43%), Gaps = 56/333 (16%)

Query: 25  YIDQINREANTWTAGRNFPA----NLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
           YI+QIN   + WTA   +P      L+E  +R       K F      +  DR + + + 
Sbjct: 173 YINQINSAQSLWTA-TEYPEYEDFTLAELNMRSGRPTVPKSFAGPRLRMKRDRLSRNSDE 231

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
               P +FD R    N   +  V + GAC + + F+++  +  R  + SK    R +S +
Sbjct: 232 FIYFPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQ 290

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            V SC +       + C+ G  +           +  G YG+  G    +  P  ++G  
Sbjct: 291 DVVSCSEYA-----QGCAGGFPY-----------LIAGKYGEDFGLVEESCFP--YNGKD 332

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD-----NEDAIKKEILA 255
                 E  K  K KC                +H TT  Y+V       NE  + +E++ 
Sbjct: 333 ------EPCKETKSKCR---------------RHSTTNYYYVGGFYGACNEYLMMRELVK 371

Query: 256 HGPTTATFALYDDFYHYKSGVYKHTSNAKLEN----YLHSGKLIGWGTE--NGTPYWLVI 309
           +GP + +F +Y DF HYK G+Y+HT      N      H+  L+G+GT+  +G  YW+V 
Sbjct: 372 NGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWIVK 431

Query: 310 NTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           N+WG  WG+ G  +ILRG  EC+ E    A  P
Sbjct: 432 NSWGTKWGENGFFRILRGVDECSIENEAVAVTP 464


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 25/259 (9%)

Query: 85  PDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVAS 144
           P+ F A   WP+   I    D   C A   F+     +DR  I S GQ    LS + + S
Sbjct: 223 PEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLIS 280

Query: 145 CCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLP 204
           C       +   C  G++   W +L   G V+         C PS      H   +P+  
Sbjct: 281 C----DTKNQHGCGGGNIEGAWRYLKTHGVVS-------YACYPSFWK---HSLDSPSEN 326

Query: 205 SC-ENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            C  + +  K   +  C N           +R    Y +   E  I +EI+A GP  A  
Sbjct: 327 HCYVSSEYGKNHTNGPCPNALEDSNRL---YRCASHYRISSKETDIMEEIMAKGPVQAIM 383

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG-----TPYWLVINTWGPHWGD 318
            +Y+DF+ YK G+Y+H+  A  +   HS KL+GWG+  G       +W+  N+WG +WG+
Sbjct: 384 KVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGE 443

Query: 319 RGTVKILRGKYECAFEYLI 337
            G  +ILRG+ EC  E LI
Sbjct: 444 NGYFRILRGQNECDIEKLI 462


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 74/254 (29%), Positives = 111/254 (43%), Gaps = 37/254 (14%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A  PDR D R+  P C       D   C+  + FA +GA S RRCI     Q   LS ++
Sbjct: 79  AGSPDRLDYRQTHPEC--FFEPEDQKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQH 136

Query: 142 VASCCKICRYDDNKS-CSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           + SC      D  ++ C  G    +W FL   G+V       ++ C P T       G  
Sbjct: 137 MVSC------DSGEAGCQGGEFESSWAFLETEGAV-------KSDCLPYTSGETGKSGEC 183

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT  +C++    +   H +  + +          R +       N + I   +LA GP  
Sbjct: 184 PT--TCQDGTPVESAFHYKAASAS----------RLS-------NYNEIMVSLLADGPVQ 224

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +++DF +Y  G+Y       L    H+  ++G+G+ N   YW+V N+WG  WG+ G
Sbjct: 225 TGFYVHEDFLYYVGGIYHKVYGTSLGG--HAVLIVGYGSMNNHDYWIVRNSWGSDWGENG 282

Query: 321 TVKILRGKYECAFE 334
             +ILRG  EC  E
Sbjct: 283 YFRILRGTNECGIE 296


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 76/271 (28%), Positives = 121/271 (44%), Gaps = 44/271 (16%)

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDR 124
           + RP    + +  P   A  P+ +D RE++P+C  I  V D G+C +   F+++  F+D 
Sbjct: 126 TSRPTITMKHSTKPPVGA--PESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADH 181

Query: 125 RCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRT 184
           RC           S +YV  C +      +  C+ G     +NFLH  G+V         
Sbjct: 182 RCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPVNAFNFLHNTGTV--------- 227

Query: 185 GCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDD 244
                 ++ C  + +        +  V K  C  +C + +            +       
Sbjct: 228 ------LTSCVEYTAG-------DDAVVKF-CPQKCDDGSAVENIVATSGAKS------- 266

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENGT 303
              +    +LAHGP  ATF +  DF +YKSGVY+H     L    H+ +++G+G T++G 
Sbjct: 267 --GSAIDVLLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGG--HAVEIVGYGVTDSGL 322

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
            YW V N+WGP WG+ G  +I+RG  EC  E
Sbjct: 323 DYWTVRNSWGPDWGEDGYFRIVRGGDECGIE 353


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 138/315 (43%), Gaps = 54/315 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++ ++ +N + ++TW A   +PA++      +FL     Y  + +        ++D +
Sbjct: 10  LAESIVETVNNDPSSTWVA-VEYPASVITR--AKFLARLGTYVTKYEE------TSFDLD 60

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P+ FD+REQWP  G I  V D  +C +   F+      DR  IK  G     +S 
Sbjct: 61  NA--LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIK--GCDFGDMSP 114

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SC        +  C+ G +   W +    G  T         C P      S  G 
Sbjct: 115 QDLVSCDTT-----DMGCNGGYMDHAWAWTKSHGITT-------EKCMPYQ----SGSGR 158

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P  P+             +C N   G    ++K  +    +   N   + +E+  +GP 
Sbjct: 159 VPACPA-------------KCVN---GSAIVRNKSVS----YKKLNAQQMMEELYENGPI 198

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
           +  F +Y DF +YKSGVY H +        H+   +GWG E+ TPYWL  N+WGP WG++
Sbjct: 199 SVAFTVYYDFMNYKSGVYVHKTGGIAGG--HAVLCVGWGVEDNTPYWLCQNSWGPAWGEK 256

Query: 320 GTVKILRGKYECAFE 334
           G  KILRG   C  E
Sbjct: 257 GHFKILRGSNHCGIE 271


>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 217

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 87/157 (55%), Gaps = 5/157 (3%)

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
           +++ +P  FD+R++WPNC +IGH+ + G C + +  AA  A SDR CI+S G +N  +S 
Sbjct: 57  FTSGLPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSA 116

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SCC +C +     C  GS+F +W++  + G V+GGDY    GCQP TI PC     
Sbjct: 117 QQIISCCYLCGH----GCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNE 172

Query: 200 APTLPSCEN-QKVPKLKCHTRCTNPTYGRGFFQDKHR 235
            P   SC    +     C  +C NP Y   F  D ++
Sbjct: 173 KPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYK 209


>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 305

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/254 (27%), Positives = 109/254 (42%), Gaps = 37/254 (14%)

Query: 82  ATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEY 141
           A  PDR D R+  P C       D   C+  + FA +GA S RRCI        PLS ++
Sbjct: 79  ADSPDRLDYRQTHPEC--FFEPEDQSDCSCCYAFATLGALSTRRCIAKLDASVVPLSAQH 136

Query: 142 VASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
           + SC      D  ++   G  F T W FL   G++          C P         G  
Sbjct: 137 MVSC------DHGEAGCQGGGFNTSWAFLETEGAIM-------RDCLPYVSGETGLSGEC 183

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
           PT  +C++  +     H +  + ++ +                 N + I   +L  GP  
Sbjct: 184 PT--TCQDGTLLNDTIHYKAVSASHLK-----------------NYNEIMTSLLNEGPVQ 224

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
             F +++DF +Y  G+Y  T  + +    H+  ++G+G+ N   YW+V N+WG  WG+ G
Sbjct: 225 TGFYVHEDFLYYVGGIYHKTYGSSIGG--HAVLIVGYGSMNNHDYWIVRNSWGSDWGENG 282

Query: 321 TVKILRGKYECAFE 334
             +ILRG  EC  E
Sbjct: 283 YFRILRGTNECGIE 296


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 87/316 (27%), Positives = 143/316 (45%), Gaps = 56/316 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPG-DRKTYDP 78
            +++ ++ +N + ++TW A   +PA++         I  AK+  +    +   + +TY+ 
Sbjct: 10  LAESIVETVNNDPSSTWVA-VEYPASV---------ITRAKFLARLGTHVEEYEERTYES 59

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           + +  +P+ FDAREQWP    I  V D  +C +   F+      DR  I   G+ +  +S
Sbjct: 60  DNA--LPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGH--MS 113

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            + + SC        +  C+ G + + W +    G VT  +      C P      S  G
Sbjct: 114 PQDLVSC-----DTTDMGCNGGYMDKAWAWTKSHG-VTNEE------CMPYQ----SGGG 157

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
             P  P+             +C N   G    + K ++   +        +++E+  +GP
Sbjct: 158 RVPACPA-------------KCVN---GSTIVRTKSQSFTHF----TASQMQQELYENGP 197

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
            +  F +Y DF +YKSGVY H +        H+   IGWG E+ TPYWL  N+WGP WG+
Sbjct: 198 LSVAFTVYYDFMNYKSGVYVHKTGGVAGG--HAVLCIGWGVEDNTPYWLCQNSWGPAWGE 255

Query: 319 RGTVKILRGKYECAFE 334
           +G  KILRG   C  E
Sbjct: 256 KGHFKILRGSNHCGIE 271


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 77/132 (58%), Gaps = 16/132 (12%)

Query: 219 RCTNPTYGRGF--FQDKHRTT-----LTYWVDDNEDAIKKEILAHGPTTATFALYDDFYH 271
           RCT   YG     + D HR T     LTY       +I+K++L +GP  A+F +YDDF  
Sbjct: 3   RCTRMCYGNQDLDYDDDHRFTRDFYYLTY------GSIQKDVLNYGPIEASFDVYDDFPS 56

Query: 272 YKSGVYKHTSNA-KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYE 330
           YKSGVY+ T NA KL    H+ KLIGWG E GTPYWL++N+W   WGD G  KI RG  E
Sbjct: 57  YKSGVYQRTPNATKLGG--HAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDE 114

Query: 331 CAFEYLIAAGKP 342
           C  +    AG P
Sbjct: 115 CRIDSATTAGVP 126


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 56/115 (48%), Positives = 70/115 (60%), Gaps = 3/115 (2%)

Query: 231 QDKHRTTLTYWVDDN-EDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
           +DKH     Y +    E  I+ EI+ +GP  A+F +Y DF HY SGVYK    +KL    
Sbjct: 147 EDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGG- 205

Query: 290 HSGKLIGWGTENGT-PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
           H+ ++IGWG ENGT PYWLV N+W   WGD+G  KI RGK EC  E  I AG P+
Sbjct: 206 HAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260



 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 6/107 (5%)

Query: 5   LVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQ 64
           L  ++ CT  + EL   SD YI+Q+N +   W AGRNF  + S   +++ L         
Sbjct: 8   LAAVVSCTFAQPELDFLSDEYIEQLNSKNLPWKAGRNFERDTSLYNIQRLLSVG------ 61

Query: 65  SDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAA 111
           +  P       +  +    +P+ FDAR+QW  C +I  + D   C +
Sbjct: 62  TINPPSEFETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGS 108


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 136/323 (42%), Gaps = 53/323 (16%)

Query: 21  FSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEY 80
            S+  ++ +N++  TW A   +P   +E+ L+  LI     F     PL     +Y  + 
Sbjct: 131 MSEDLVNDVNQQGTTWRA-TTYP-EFNEKKLKDGLIYKLGTF-----PLNVTVISYSKD- 182

Query: 81  SATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
               PD FDAR +W   G I  + D   C +    +      DR  I+S G +N  +S++
Sbjct: 183 -GQYPDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQ 239

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SC         + C+ G++   ++F+   G V+                        
Sbjct: 240 TLLSC----HLKGQRGCNGGNLDIAFDFVKTHGLVS------------------------ 271

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
                   Q  P     T+C      R     ++R  + + +   ED I  +I+  GP  
Sbjct: 272 -------EQCFPYEGAVTQCRIGNDCR-----RYRVGVPFSISKEED-IMYDIMTSGPAL 318

Query: 261 ATFALYDDFYHYKSGVYKHTSNA-KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
               +Y DF+HY+ G+Y+HT +  +L   LHS +++GWG +    YW+V N+WG  WG++
Sbjct: 319 GIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIVGWGEDAEDKYWIVANSWGTSWGEK 378

Query: 320 GTVKILRGKYECAFEYLIAAGKP 342
           G  +I RG      E  +    P
Sbjct: 379 GYFRIARGHSGTGIESSVLTVLP 401


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 92/179 (51%), Gaps = 9/179 (5%)

Query: 107 GACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTW 166
           G+C A   F A  A SDR CI S G+ +  +S+E + +CC  C       C+ G     W
Sbjct: 1   GSCWA---FGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSC----GMGCNGGYPSAAW 53

Query: 167 NFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYG 226
           +F    G V+GG Y    GC+P TI PC HH +  T P C  +     +C  +C +  Y 
Sbjct: 54  DFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNG-TRPPCTGEGGDTPQCILQCES-GYT 111

Query: 227 RGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKL 285
             +  DKH    +Y V  +E+ I+ EI  +GP    F +Y+DF  YK+GVY+H + + +
Sbjct: 112 PSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAV 170


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 88/188 (46%), Gaps = 9/188 (4%)

Query: 116 AAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
            AV A +DR CI S     + +S   + SCC+ C +     C  G   R W+F  + G V
Sbjct: 1   GAVEAMTDRLCIHSNATIKKHISATDLLSCCESCGF----GCHGGFPPRAWDFWMENGLV 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHR 235
           TGG   + +GC+      CSHHG     P C         C   C  P     +  DK  
Sbjct: 57  TGGSKENPSGCRSYPFPRCSHHGKG-KYPPCPKTIFDTPNCVDHCDKPDID--YAADKTH 113

Query: 236 TTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLI 295
              +Y V  NE  I KEI+ +GP  A F +Y+DF  YKSG+Y H+    L    H+ +++
Sbjct: 114 AKSSYNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGG--HAIRML 171

Query: 296 GWGTENGT 303
           GWG E G 
Sbjct: 172 GWGEEKGV 179


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 138/315 (43%), Gaps = 54/315 (17%)

Query: 21  FSDAYIDQINRE-ANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
            +++ ++ +N + ++TW A   +PA++      +FL     Y  + +        ++D +
Sbjct: 10  LAESIVETVNNDPSSTWVA-VEYPASVITR--AKFLARLGTYVTKYEE------TSFDLD 60

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLST 139
            +  +P+ FD+REQWP  G I  V D  +C +   F+      DR  IK  G     ++ 
Sbjct: 61  NA--LPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIK--GCDYGDMAP 114

Query: 140 EYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGS 199
           + + SC        +  C+ G +   W +    G  T         C P      S  G 
Sbjct: 115 QDLVSCDTT-----DMGCNGGYMDHAWAWTKSHGVTT-------EKCMPYQ----SGSGR 158

Query: 200 APTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPT 259
            P  P+             +C N   G    ++K  +    +   N   + +E+  +GP 
Sbjct: 159 VPACPA-------------KCVN---GSAIVRNKSVS----YKKLNAQQMMEELYENGPI 198

Query: 260 TATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDR 319
           +  F +Y DF +YKSGVY H +        H+   +GWG E+ TPYWL  N+WGP WG++
Sbjct: 199 SVAFTVYYDFMNYKSGVYVHKTGGIAGG--HAVLCVGWGVEDNTPYWLCQNSWGPAWGEK 256

Query: 320 GTVKILRGKYECAFE 334
           G  KILRG   C  E
Sbjct: 257 GHFKILRGSNHCGIE 271


>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
 gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
          Length = 261

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 97/207 (46%), Gaps = 15/207 (7%)

Query: 2   IHILVFLLGCTLVRGELY--KFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADA 59
           + +L  L+     R   Y    SD  ++ IN+   TW AG NF  N    Y+++      
Sbjct: 5   VSLLCVLVALANARSIPYFPPLSDDLVNHINKLNTTWKAGHNF-HNADMSYVKKLC---G 60

Query: 60  KYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVG 119
            +      P     +  D      +PD FD+R QWPNC TI  + D G+C +   F AV 
Sbjct: 61  TFLGGPKLP-----ERVDFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVE 115

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           A SDR C+ +  + +  +S E + SCC    ++    C+ G     W +  +RG V+GG 
Sbjct: 116 AISDRICVHTNAKVSVEVSAEDLLSCCG---FECGMGCNGGYPSGAWRYWTERGLVSGGL 172

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSC 206
           Y    GC+P +I PC HH +  T P C
Sbjct: 173 YDSHVGCRPYSIPPCEHHVNG-TRPPC 198


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 147/333 (44%), Gaps = 64/333 (19%)

Query: 23  DAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSA 82
           +A+I  IN+ A TW AG++       ++     ++ A+       P P  R +Y  + S 
Sbjct: 50  EAFIQLINKYAKTWQAGKS-------KFFEGKRLSHARRLIGLGLPTPEQRASYPKKNSL 102

Query: 83  -----------------TVPDRFDAR--EQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
                             +PD ++A     +  C  +  + +   C +   F+     +D
Sbjct: 103 MMGEEANSLEKYLVKMDALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFSISEMVAD 162

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
           R CI ++G+ N  +S +++ SC       DN  C+ G     + F+   G V+       
Sbjct: 163 RFCIGTRGKINTIMSPQWMVSC----DTADN-GCNGGEFPTAFQFVETTGLVS------- 210

Query: 184 TGCQPSTISPCSHHGSAPTLP-SCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWV 242
            GC P      S +G  P  P SC N +   ++  T+ +     R F  +  ++      
Sbjct: 211 DGCVPYQ----SGNGFVPPCPNSCANGEDINVRYRTKNS-----RNFDVNDMKS------ 255

Query: 243 DDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TEN 301
                 ++  ILA+GP  + F +Y DFY+Y+SG YKH +   +    H+ K++GWG T++
Sbjct: 256 ------VQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGG--HAIKVVGWGVTQS 306

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
             PYW+V N+W   WG  G   ILRG  EC+ E
Sbjct: 307 NVPYWIVANSWSDEWGMNGYFWILRGTNECSIE 339


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score =  106 bits (265), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 69/101 (68%), Gaps = 2/101 (1%)

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT 299
           Y + ++  AI+K+I+ +GP  AT+ +Y+DF HY+SG+YKH +  K    LH+ K+IGWG 
Sbjct: 4   YQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRK--TGLHAVKVIGWGE 61

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAG 340
           E GTPYW+V N+W   WG+ G  ++ RG  +C FE  +AAG
Sbjct: 62  EKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 102


>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
          Length = 155

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 55/158 (34%), Positives = 85/158 (53%), Gaps = 8/158 (5%)

Query: 155 KSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL 214
           K C  G   + W +   +G  TGGDY  + GC P  I PC       T   C  + + + 
Sbjct: 3   KGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT---CAGKPLER- 58

Query: 215 KCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKS 274
             + +C    YG    Q +++    Y V ++ + ++++++ +GP  A+F L+DD   YKS
Sbjct: 59  --NHQCPKTCYGSTTVQKRYKVKNEY-VLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKS 115

Query: 275 GVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTW 312
           G+Y+ T  AK  +  HS K+IGWG ENG PYWL +N+W
Sbjct: 116 GIYQKTPKAKFLS-GHSIKIIGWGKENGVPYWLAVNSW 152


>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
 gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
          Length = 273

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 75/256 (29%), Positives = 113/256 (44%), Gaps = 25/256 (9%)

Query: 84  VPDRFDAREQWPNCG-TIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYV 142
           +P+ FDAR +WP C   IG   D G C +    A     SDR CI+S G+ +  LS   +
Sbjct: 18  LPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSPFQL 77

Query: 143 ASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPT 202
            +C +      +  C  G     + F    G VTGG + D+  C P   +PC H      
Sbjct: 78  LACAQ-----GSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHHPCEVFP 132

Query: 203 LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTAT 262
            P+C     P             G+  F+ K       +   +   +  EI  +GP ++ 
Sbjct: 133 TPAC-----PATCVGGSNDGVQNGKASFKVKAIVDCPSF---DYGCVANEIYHNGPVSSY 184

Query: 263 FA-LYDDFYHYKSGVYKHTSNAKLENYLHSG---KLIGWGT------ENGTPYWLVINTW 312
              +Y++FY YKSGV++ + +       H G   K+IGWG       E    YW+V+N+W
Sbjct: 185 AGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEGEGYYWIVVNSW 244

Query: 313 GPHWGDRGTVKILRGK 328
             +WGD G  +I  G+
Sbjct: 245 -LNWGDDGVGRIAVGE 259


>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 234

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 84/176 (47%), Gaps = 11/176 (6%)

Query: 161 SVFRTWNFLHKRGSVTGG-----DYGDRTGCQPSTISPCSH-HGSAPTLPSC-ENQKVPK 213
           S F   NF  +   ++G      + G+  GC P     C+H  G     P C + + +P 
Sbjct: 12  SAFNRRNFRFESFKLSGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRDLPA 71

Query: 214 LKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYK 273
             C T C N  YG    +D HR      +    + IK+EI  +GP  A   LY+DF  YK
Sbjct: 72  --CATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRFYK 129

Query: 274 SGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKY 329
           SGVY H +   L    H+ KLIGWG E+G  YWL +N W   WGD G +K+    Y
Sbjct: 130 SGVYVHKTGQMLA--AHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKLASSVY 183


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/338 (25%), Positives = 144/338 (42%), Gaps = 49/338 (14%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
           ++   +Y ++  ++ QIN    +WTA     +     E+ +R+   A  +      RP P
Sbjct: 158 MLTSRVYNYNHDFVKQINTVQKSWTASVYPEYEGMSIEDLVRR---AGGRNSRIPVRPRP 214

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSK 130
               T D +Y   +P+ +D R        +  V + G+C + + FA++G    R  I+S+
Sbjct: 215 APMPT-DQKYQG-LPNEWDWR-NIAGFNFVSPVRNQGSCGSCYAFASMGMLESRIQIQSQ 271

Query: 131 GQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPST 190
             Q   LS + V SC        ++ C  G  +           +  G Y +  G     
Sbjct: 272 LSQKPILSPQQVVSCSNY-----SQGCDGGFPY-----------LIAGKYLNDFGI---- 311

Query: 191 ISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIK 250
                           E    P +   + CT     + ++  ++     ++   NE  +K
Sbjct: 312 ---------------VEESDFPYIGSDSPCTLKDSYQRYYTAEYHYVGGFYGGCNEAYMK 356

Query: 251 KEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGT--ENGTP 304
            E++  GP +  F +YDDF HY+SGVY HT      N      H+  L+G+GT  + G  
Sbjct: 357 LELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEK 416

Query: 305 YWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           YW+V N+WG  WG++G  +I RG  ECA E +  +  P
Sbjct: 417 YWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANP 454


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 129/317 (40%), Gaps = 62/317 (19%)

Query: 36  WTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDR--KTYDPEYSATVPDRFDAREQ 93
           W AG N       E  +     DA     +   L  D       P+ + ++P  ++  E+
Sbjct: 25  WVAGEN-------ERFKGMTFKDASVISGNAHKLRPDTIPLARPPKINISIPMSYNFTER 77

Query: 94  WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPL--STEYVASCCKICRY 151
           +P C     V D G C +   FA   +FS R C K     N+P+  S  ++ +C +    
Sbjct: 78  FPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRK----YNKPVLFSQSHLVACDR---- 127

Query: 152 DDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV 211
             N  C  G     W ++  RG            CQP                   +  +
Sbjct: 128 -RNSGCGGGIEVNAWRYIDLRGL-------PLDSCQPY------------------DGNI 161

Query: 212 PKLKCHTRCTN--PTYGRGFFQDKHRTTLTYWVDDNEDAIKKE---ILAHGPTTATFALY 266
            K  C  +CTN   TY   F +        YW      +I++    I+  GP T +  +Y
Sbjct: 162 TKYNCSKKCTNESETYEAQFTE--------YWSVARYASIEEMQIGIMTEGPVTTSLKVY 213

Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILR 326
            D  +YKSG+Y HT    L +  H+ ++IGWGT+NG  YW++ N+W   WG  G   I R
Sbjct: 214 SDLMYYKSGIYTHTKGEFLGH--HAVEIIGWGTKNGIDYWIISNSWNTTWGMNGLFLIKR 271

Query: 327 GKYECAFEYLIAAGKPK 343
           G  EC  E  + AGK K
Sbjct: 272 GVNECHIEDYVCAGKVK 288


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 112/266 (42%), Gaps = 35/266 (13%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDA + WP    +G   D G C +    +     SDR  I SKG++   L+ + + 
Sbjct: 296 LPSHFDAADHWPR--LVGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +C +       ++CS G +   W +L + G V    Y          I    + G     
Sbjct: 354 ACVR-----RQQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKNQCKI----NDGDTLVS 404

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
            +CE           R   P Y                  +NE  I  EI   G   A  
Sbjct: 405 ANCELPANVNRTAMYR-MGPAYSL----------------NNETDIMTEIKERGTVQAIL 447

Query: 264 ALYDDFYHYKSGVYKHTSNA---KLENYLHSGKLIGWGTE----NGTPYWLVINTWGPHW 316
            +Y DF+ Y++G+Y+H++ A   +  +  HS +LIGWG E    +   YW+ +N+WG  W
Sbjct: 448 RVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWW 507

Query: 317 GDRGTVKILRGKYECAFEYLIAAGKP 342
           G+ G  +ILRG  EC  E  + A  P
Sbjct: 508 GENGRFRILRGTNECEIESYVLASNP 533


>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
          Length = 228

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 104/234 (44%), Gaps = 10/234 (4%)

Query: 1   MIHILVFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAK 60
           ++   +  L   LV   + K  +A  + +N + + W A    P +++ E +++ L+    
Sbjct: 4   VVFASLVALATGLVIPIVPKTPEAITEYVNSKQSLWKA--EIPKHITIEQVKKRLMRTEF 61

Query: 61  YFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
               S    P            T+P  FDAR QWP+C +I ++ D   C +   FAA  A
Sbjct: 62  VAPHS----PDAEFVKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEA 117

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
            SDR CI S G  N  LS E V SCC  C Y     C  G     W +L K G  TGG Y
Sbjct: 118 ASDRFCIASNGAVNTLLSAEDVLSCCSNCGY----GCEGGYPINAWKYLVKSGFCTGGSY 173

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
             + GC+P +++PC       T P+C         C  +CTN  Y   +  DKH
Sbjct: 174 EAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKH 227


>gi|308811264|ref|XP_003082940.1| cysteine proteinase (ISS) [Ostreococcus tauri]
 gi|116054818|emb|CAL56895.1| cysteine proteinase (ISS) [Ostreococcus tauri]
          Length = 362

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 126/281 (44%), Gaps = 43/281 (15%)

Query: 74  KTYDP-----EYSATVPDRFDAREQWPNCGT-IGHVPDTGACAAPHIFAAVGAFSDRRCI 127
           KT+DP          +PD FD RE+WP C   +    D GAC +    A   A +DR CI
Sbjct: 73  KTWDPTKIKLHAGGRLPDTFDVREKWPKCAALVSEAVDQGACGSCWAVAPAKAMTDRLCI 132

Query: 128 KSKGQQNRPLSTEYVASCCKICR----YDDNKSCSHGSVF-----RTWNFLHKRGSVTGG 178
            + G  N  +S   + SC         YD+N +   G          +   H+ G V+GG
Sbjct: 133 ATNGAVNTHVSAIQLLSCNSHSNSAYTYDENLAGGSGGCMGGYPTEAYETAHRVGVVSGG 192

Query: 179 DYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKL--KCHTRCTNPTYGRGFFQDKHRT 236
             GD+  C P   +PC H    P  P+  N   P+   +  T+  N T          R 
Sbjct: 193 LNGDQDTCMPYPFAPCHH----PCEPN-HNAVCPRTCQRSATQTANTT----------RY 237

Query: 237 TLTYWVD---DNEDAIKKEILAHGPTTATFA--LYDDFYHYKSGVYKHTSNAKLENYLHS 291
            + + V    ++ D +  EI   GP T TF   +YD+FY Y+ GVYK + +       H 
Sbjct: 238 AVGHLVQCGLNDYDCMASEIFERGPVT-TFVGDVYDEFYQYERGVYKLSKDPAARGKNHG 296

Query: 292 G---KLIGWG-TENGTPYWLVINTWGPHWGDRGTVKILRGK 328
           G   ++IGWG +  G  YW V N+W  +WG+RG  +I  G+
Sbjct: 297 GHVMEVIGWGKSAEGVRYWKVYNSW-LNWGERGYGEIAVGE 336


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 75/272 (27%), Positives = 121/272 (44%), Gaps = 44/272 (16%)

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSD 123
            + RP    R +  P   A  P+ +D R+++P+C  I  V D G+C +   F+++  F+D
Sbjct: 58  NTSRPAITRRHSTKPPVGA--PESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFAD 113

Query: 124 RRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDR 183
            RC           S +YV  C +      +  C+ G   + ++FLH  G+V        
Sbjct: 114 HRCRSGLDATGVSYSVQYVLDCDR-----KDHGCNGGEPTKAFDFLHSTGTV-------- 160

Query: 184 TGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVD 243
                  ++ C  + +           V K  C   C + +     F      +      
Sbjct: 161 -------LTSCVDYTAGA-------DNVVKF-CPKTCDDGSAVENVFAASGSKS------ 199

Query: 244 DNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENG 302
               +    +L+HGP  ATF +  DF +YKSGVY+H     L    H+ +++G+G T++G
Sbjct: 200 ---GSAIDVLLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGG--HAVEVVGYGVTDSG 254

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFE 334
             YW V N+WGP WG+ G  +I+RG  EC  E
Sbjct: 255 LDYWTVRNSWGPDWGEDGYFRIVRGSDECGIE 286


>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
          Length = 446

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 88/346 (25%), Positives = 142/346 (41%), Gaps = 58/346 (16%)

Query: 8   LLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLI----ADAKYFD 63
           L+  + ++  +YK +  YI Q+N  ++TW A       +  EY    LI     +     
Sbjct: 146 LIDESQMKSSVYKPNPDYIRQLNEASSTWKA------TIYAEYEGMHLIDLHRRNGGSRS 199

Query: 64  QSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGA 120
           +   P  G  K      +  +P+ +D    W N   +  V    + G C + + F+++  
Sbjct: 200 RVSSPGRGLLKEETKMAAVNLPESWD----WRNVDGVDFVSPVRNQGGCGSCYAFSSMAM 255

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
              R  + S   Q    S + +  CC+      ++ C  G  +           + GG Y
Sbjct: 256 NEARIRVMSNNTQMPVFSPQDIVDCCQY-----SQGCDGGFPY-----------LVGGKY 299

Query: 181 GDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTY 240
            +  G    +  P             E++K     C  R          +  ++R    Y
Sbjct: 300 AEDFGLVDESCDPYVG----------EDRKCKSTSCSRR----------YATRYRYVGGY 339

Query: 241 WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIG 296
           +   NE  +K   L  GP + +F +YDDF HYKSGVY+H+      N      H+  L+G
Sbjct: 340 YGACNEQEMKLA-LQRGPLSVSFMVYDDFMHYKSGVYRHSGLTDKYNPFEITNHAVLLVG 398

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           +G + GT YW+V N+WG  WG+ G  +ILRG  ECA E +     P
Sbjct: 399 YGADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444


>gi|161343881|tpg|DAA06121.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 182

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 77/148 (52%), Gaps = 11/148 (7%)

Query: 53  QFLIADAKYFDQSDRPLPGDRKTYDPEYSATVPDRFDAREQWPNCG-TIGHVPDTGACAA 111
           Q LI    Y    D  L  +RKT+D  Y   +P  FDAR+ + NC   IG V D G CA+
Sbjct: 39  QKLIQKTNY----DSWLKKNRKTFDINYKTDIPKEFDARQYFFNCANVIGDVKDQGNCAS 94

Query: 112 PHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKS-CSHGSVFRTWNFLH 170
               A    F+DR CI + G   + LS + + SC      DD KS C+ GS F+ W F+ 
Sbjct: 95  SWAVAVASTFTDRLCIATNGTFTQNLSAQNLMSCG-----DDEKSGCNGGSAFKAWEFIT 149

Query: 171 KRGSVTGGDYGDRTGCQPSTISPCSHHG 198
            +G VTGG++    GCQP    PC H+G
Sbjct: 150 GKGIVTGGNFDSNEGCQPYKNRPCDHYG 177


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 81/258 (31%), Positives = 114/258 (44%), Gaps = 28/258 (10%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           +DARE W N   I    D G C A      V   +DR  I SK   +  LS +++ SC  
Sbjct: 199 YDAREVWGN--YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLLSCNN 256

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
           +    + + C  G + R WN++ K G +T   Y            P     S   +P  +
Sbjct: 257 L----NQQGCQGGHLTRAWNWIRKFGLITEECY------------PWQGRMSTCAVPKKK 300

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            + + +     R  N    R      HR    Y V   E+ I  EIL  GP  A   +  
Sbjct: 301 KETMAQCPSRVRSNND---RTTKTRLHRVGPVYRVA-TEEGIMHEILTSGPVQAVMKVSR 356

Query: 268 DFYHYKSGVYKHTSNAK-LENYLHSGKLIGWGTE----NGTPYWLVINTWGPHWGDRGTV 322
           DF+ YKSGVYK ++ A       HS +++GWG E        YW+  N+WG  WG+ G  
Sbjct: 357 DFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGENGYF 416

Query: 323 KILRGKYECAFE-YLIAA 339
           +IL+G  EC  E ++IAA
Sbjct: 417 RILKGVDECEIEDFVIAA 434


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 97/344 (28%), Positives = 142/344 (41%), Gaps = 33/344 (9%)

Query: 6   VFLLGCTLVRGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQS 65
           + +L     R  L+  S   ++ IN+      AG NF   +   YLR+       +  +S
Sbjct: 24  LLVLASAGSRTYLHPLSKXLVNYINKPNTMQQAGHNF-HKMXISYLRR---PCGTFPGRS 79

Query: 66  DRPLPGDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRR 125
             P    R  +  + +  +P+ FD  EQWP+      + D G+        A+ A SD  
Sbjct: 80  KLP---QRVKFAXDIN--LPESFDPXEQWPD-XPXREIRDQGSYGFCWALGALEAISDWI 133

Query: 126 CIK-----SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDY 180
           CI      ++G  +  +S E   +C  +C       C+ G     WNF   +G V+GG Y
Sbjct: 134 CIHPNVGGAQGGNHVEVSAEDKLTC--LC----GDGCNGGXPNEGWNFWTGKGLVSGGLY 187

Query: 181 GDRTGCQ--PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTL 238
               GC+  PS + PC HH      P       PK  C   C     G+ +  DKH    
Sbjct: 188 DSHVGCRLFPSLL-PCKHHIHG--XPYVXTGDSPK--CSMTCEP---GQTYKXDKHYGCS 239

Query: 239 TYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG 298
           +Y + D+   I   I  +      F++Y DF  YK   Y+  +        H+  ++G  
Sbjct: 240 SYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGG--HAICILGCK 297

Query: 299 TENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            EN T YWLV N W   WGD G  KILRG+     E  + A  P
Sbjct: 298 VENSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIP 341


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 143/341 (41%), Gaps = 55/341 (16%)

Query: 13  LVRGELYKFSDAYIDQINREANTWTAGR--NFPANLSEEYLRQFLIADAKYFDQSDRPLP 70
           ++   LY ++  ++ QIN    +WTA     +     E+ +R+   A  +      RP P
Sbjct: 158 MLTSRLYNYNHDFVKQINEVQKSWTATAYPEYEGMTIEDLIRR---AGGRNSRIPMRPRP 214

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGT---IGHVPDTGACAAPHIFAAVGAFSDRRCI 127
               T D +Y   +P  +D    W N      +  V +  +C + + F+++G    R  I
Sbjct: 215 APLPT-DEKYQG-LPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLESRIQI 268

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
           +S+  Q   LS + V SC        ++ C  G  +           +  G Y    G  
Sbjct: 269 RSQLSQKPILSPQQVVSCSNY-----SQGCEGGFPY-----------LIAGKYVSDYGI- 311

Query: 188 PSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNED 247
                              E   +P     + CT     + ++  ++     ++   NE 
Sbjct: 312 ------------------VEESDLPYTGSDSPCTLKDSQQKYYTAEYHYVGGFYGGCNEA 353

Query: 248 AIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGT--EN 301
            +K E++  GP +  F +YDDF HY+SGVY HT      N      H+  L+G+GT  + 
Sbjct: 354 YMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQT 413

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           G  YW+V N+WG  WG++G  +I RG  ECA E +  + +P
Sbjct: 414 GEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEP 454


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/180 (37%), Positives = 87/180 (48%), Gaps = 23/180 (12%)

Query: 166 WNFLHKRGSVTG--GDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNP 223
           W +    G VT     Y D TGC        SH G  PT P+         KC  +C + 
Sbjct: 4   WLYFKYHGVVTQECDPYFDNTGC--------SHPGCEPTYPT--------PKCERKCVSR 47

Query: 224 TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
               G  + KH     Y ++ +   I  E+  +GP    F +Y+DF HYKSGVYK+ +  
Sbjct: 48  NQLWG--ESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGT 105

Query: 284 KLENYLHSGKLIGWGT-ENGTPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           K+    H+ KLIGWGT ++G  YWL+ N W   WGD G  KI RG  EC  E  + AG P
Sbjct: 106 KIGG--HAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 163


>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
          Length = 163

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/169 (38%), Positives = 89/169 (52%), Gaps = 18/169 (10%)

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR CI + G+ N  LS E +A CC  C +     C  G   + W +  K G VTGGD
Sbjct: 5   AFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCHGGYPIKAWEWFKKHGLVTGGD 60

Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
           Y    GCQP  + PC    +G+     +C  +   K   + RCT   YG     F +D H
Sbjct: 61  YDSGEGCQPYRVPPCPLDEYGNN----TCRGKPAEK---NHRCTRMCYGNQELDFKEDHH 113

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            T   Y++      I+K+++A+GP  A+F +YDDF +YKSGVY  T NA
Sbjct: 114 WTRDAYYL--TYTTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENA 160


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 96/202 (47%), Gaps = 13/202 (6%)

Query: 115 FAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKI-CRYDDNKSCSHGSVFRTWNFLHKRG 173
            ++  A SD  C++S       +S   + SCC I C Y     C  G     + ++ +  
Sbjct: 5   VSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGY----GCQGGWSIEAYKWMQRER 60

Query: 174 SVTGGDYGDRTGCQPSTISPCSHHGSAPTLP---SCENQKVPKLKCHTRCTNPTYGRGFF 230
                +  DR  C+P  + P    G+ P  P    C     P  KC   C    Y + + 
Sbjct: 61  CCYRWENTDRRVCKP--VRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYY-KSYQ 117

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
           +DKH  T  Y++ +NE +I++EI  +GP  A F +Y DF +YK G+Y H    +     H
Sbjct: 118 EDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTG--AH 175

Query: 291 SGKLIGWGTENGTPYWLVINTW 312
           + K++GWG EN T YWL+ N+W
Sbjct: 176 AVKVVGWGRENATDYWLIANSW 197


>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
          Length = 224

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 78/256 (30%), Positives = 111/256 (43%), Gaps = 41/256 (16%)

Query: 86  DRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASC 145
           D +DA E++ +C       D  +C + + FAA   +S R C ++ GQ N  LS + + SC
Sbjct: 4   DEYDASERFSSCKAF-TPKDQKSCGSCYAFAAAAVYSARLCAQTGGQFNIDLSPQQIVSC 62

Query: 146 CKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPS 205
                 + N  CS G+   T+  ++  G V G        C P         G A     
Sbjct: 63  ------NSNDGCSGGNAIDTFEQMYTSGRVPGW-------CMPYLAKDVGGGGPA----- 104

Query: 206 CENQKVPKLKCHTRCT-NPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFA 264
                     C   C+  P Y         + +    + DN   I+ EIL++GP  A F 
Sbjct: 105 ----------CSDVCSLGPDY-------SVKASSLGVIQDNVRQIQSEILSNGPVFAAFW 147

Query: 265 LYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGWGT--ENGTPYWLVINTWGPHWGDRG 320
           +Y DF  Y  GVY  +  A  +     H+  ++GWGT  E G  YWL+ N+W   WGD+G
Sbjct: 148 VYSDFMAYTGGVYSASKEALAQGKTGGHAVMMVGWGTDKETGQDYWLLQNSWSEKWGDKG 207

Query: 321 TVKILRGKYECAFEYL 336
             KI RG  EC  E L
Sbjct: 208 RFKIKRGVDECGIESL 223


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 109/236 (46%), Gaps = 40/236 (16%)

Query: 88  FDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCK 147
           FDAR +W  C  +  + D   C +   F+A    SDR CI S G  +  LS EY+  C  
Sbjct: 87  FDARTKWGKC--VHPIRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCDS 144

Query: 148 ICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTLPSCE 207
                 +  C  G +   W FL   G  +     D+  C P T    S +G   +     
Sbjct: 145 T-----DYGCDGGYLNNAWAFLAGTGIPS-----DK--CDPYT----SGNGDVGS----- 183

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
                   C T CT+ +  + +            +DD    I+K+I A+GP  A F++Y 
Sbjct: 184 --------CPTSCTDGSAIKLYKAKSSSVAQLSSIDD----IQKDIQANGPVQAAFSVYQ 231

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TENG--TPYWLVINTWGPHWGDRG 320
           DF+ YKSGVY+H S +      H+ K++GWG T +G  TPYW+V N+W  +WG  G
Sbjct: 232 DFFSYKSGVYRHVSGSLAGG--HAIKIVGWGVTSDGKDTPYWIVANSWNTNWGQEG 285


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 2/130 (1%)

Query: 199 SAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGP 258
           +A   P C ++ + +  C T C N +Y     QD HR      +  +   IK+EI  +G 
Sbjct: 16  AASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGT 75

Query: 259 TTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGD 318
                ++Y+DF  YKSGVY HT+   +   +HS K+IGWG E+G  YWL +N+W   WGD
Sbjct: 76  VLGVISMYEDFRLYKSGVYVHTTGGLVG--VHSLKIIGWGVESGQDYWLAVNSWNEEWGD 133

Query: 319 RGTVKILRGK 328
            G +K+  G+
Sbjct: 134 HGMIKLAVGE 143


>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 97

 Score =  102 bits (254), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 47/96 (48%), Positives = 64/96 (66%), Gaps = 4/96 (4%)

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
           D IKKEI+ +GPT+AT ++Y+DF  Y+SGVYKHTS   +   +HS ++IGWG E G  YW
Sbjct: 3   DNIKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMG--VHSVEIIGWGIEKGVDYW 60

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
           LV+N+W   WGD GT KI +G  +C    ++    P
Sbjct: 61  LVMNSWNEDWGDNGTFKIAQG--DCGINDMVLGAPP 94


>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 131

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 3/98 (3%)

Query: 231 QDKHRTTLTY-WVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL 289
           +D+H T     ++ +  D IKKEI+ +GPT+A+F+ Y+DF  YKSGVYKHTS   L +  
Sbjct: 12  RDRHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGD-- 69

Query: 290 HSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           HS ++IGWGTE G  YWLV+N+W   WGD GT KI +G
Sbjct: 70  HSVEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG 107


>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 309

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 132/322 (40%), Gaps = 56/322 (17%)

Query: 22  SDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRP---LPGDRKTYDP 78
           + A + QI      W AG   P     E L+     D K    +  P   +P     +  
Sbjct: 17  TQAKLRQIQALGPIWKAG--IP-----ERLKNLTETDFKRLVSAKDPRGQIPTLHLIHTY 69

Query: 79  EYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLS 138
           E    +PD FD RE++P C  I  V D G C++    + V AF  RRC+    Q+    S
Sbjct: 70  ESEDPIPDHFDFREEYPQC--ITEVIDMGTCSSSWAHSPVEAFGHRRCMNGVDQEATRYS 127

Query: 139 TEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRG-----SVTGGDYGDRTGCQPSTISP 193
            +Y+ SC       +      G    +W+F+   G      V   DY D+T       S 
Sbjct: 128 AQYILSCATT----NGCLAFPGQGVVSWDFIATTGIPLESCVKYTDY-DKTESSYPCPSL 182

Query: 194 CSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEI 253
           C+ + S     S                +   G GF               N + +++ I
Sbjct: 183 CNDNSSLVLYKS----------------DGYEGVGF---------------NPEKLRRAI 211

Query: 254 LAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT-ENGTPYWLVINTW 312
              GP  A F +Y+DF +Y  G+Y H        YL S +++G+GT + G  YW+V N W
Sbjct: 212 ALRGPMQAMFTVYEDFAYYLEGIYSHVYGGT-AGYL-SVEIVGYGTSDEGQDYWIVKNYW 269

Query: 313 GPHWGDRGTVKILRGKYECAFE 334
           G +WG+ G  +I+RG+ EC  E
Sbjct: 270 GSNWGEDGYFRIVRGQNECQIE 291


>gi|48762487|dbj|BAD23813.1| cathepsin B-N [Tuberaphis taiwana]
          Length = 163

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 86/169 (50%), Gaps = 18/169 (10%)

Query: 120 AFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGD 179
           AF+DR CI + G+ N  LS E +A CC  C +     CS G   R W    K G VTGG+
Sbjct: 5   AFADRLCIATDGEFNELLSAEELAFCCHKCGF----GCSGGYPIRAWERFKKHGLVTGGN 60

Query: 180 YGDRTGCQPSTISPC--SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGR---GFFQDKH 234
           Y    GCQP  + PC    +G+     +C  +   K   + RCT   YG     F +D H
Sbjct: 61  YDSGEGCQPYRVPPCPLDEYGNN----TCRGKPAEK---NHRCTRMCYGNQDLDFKEDHH 113

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNA 283
            T   Y++      I+ +ILA+GP  A+F +YDDF  YKSGVY    NA
Sbjct: 114 YTRDAYYL--TYGTIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENA 160


>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
          Length = 575

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 134/340 (39%), Gaps = 58/340 (17%)

Query: 16  GELYKFSDAYIDQINREANTWTAGRNFPANLSEEY----LRQFLIADAKYFDQSDRPLPG 71
           G LYK++  ++  IN    +WTA       +  EY    LR+ +     +  +  RP P 
Sbjct: 277 GRLYKYNHNFVKAINAMQKSWTA------TVYMEYETLTLREMIRRSGGHGQRVPRPKPV 330

Query: 72  DRKTYDPEYSATVPDRFDAREQWPNCGTIGHVP---DTGACAAPHIFAAVGAFSDRRCIK 128
                  +    +P  +D    W N   + +V    +  +C + + FA+VG    R  I 
Sbjct: 331 ALTAEIQKKILHLPASWD----WRNVHGVNYVSPVRNQESCGSCYSFASVGMLEARIRIL 386

Query: 129 SKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQP 188
           +   Q   LS + V SC +       + C  G  +           +  G +    G   
Sbjct: 387 TNNTQTPILSPQEVVSCSQYA-----QGCEGGFPY-----------LVAGKHAQDFGL-- 428

Query: 189 STISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDA 248
                             E    P       CT     R ++  ++     ++   NE  
Sbjct: 429 -----------------VEEACFPYTGTDAPCTMKEGCRRYYSSEYHYVGGFYGGCNEAL 471

Query: 249 IKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAK----LENYLHSGKLIGWGTEN--G 302
           +K E++ HGP    F +YDDF HY  G+Y HT         E   H+  L+G+GT++  G
Sbjct: 472 MKLELVHHGPMAVAFEVYDDFLHYHRGIYHHTGLTDPFNPFELTNHAVLLVGYGTDSATG 531

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
             YW+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 532 IQYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATP 571


>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
           gigas]
          Length = 464

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 80/261 (30%), Positives = 116/261 (44%), Gaps = 37/261 (14%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAR  W +   I  V D   CA+   F+ V   +DR  I+S+G     LS +++ 
Sbjct: 193 LPIHFDARINWTS--WIHPVRDQKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLV 250

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           SC         + C  GS  + W F+ +RG +T         C P T S         T 
Sbjct: 251 SCNT---GRGQRGCRGGSTEKAWWFVKRRGIIT-------EECYPYTASDGECLDGETTC 300

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P+  N    K+  +                   T  Y V  +E+ IK EI  +GP  ATF
Sbjct: 301 PN-ANSSTAKIVLYV------------------TPPYRVRQDEEDIKAEIYRNGPVQATF 341

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-----TENGTPYWLVINTWGPHWGD 318
            +  DF+ Y+SGVY+HT  A L     S ++IGWG           YW+ +N+WG  WG+
Sbjct: 342 RVSSDFFMYRSGVYRHT-GADLGESRLSVRIIGWGEKTNKKGKKRKYWICLNSWGTKWGE 400

Query: 319 RGTVKILRGKYECAFEYLIAA 339
           +G  +I+RG+     E  + A
Sbjct: 401 KGAFRIVRGENHLGIEENVLA 421


>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 185

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/121 (42%), Positives = 71/121 (58%), Gaps = 4/121 (3%)

Query: 208 NQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYD 267
            Q VP   C T CTN  Y +   +D HR      V ++  +IK+EI  +GP  ++F +Y+
Sbjct: 55  QQPVPP--CRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSFKMYE 112

Query: 268 DFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRG 327
           DF +YKSGVY  T+  K  +  HS K+IGWG  +G  YWL +N+W   WGD G +K+  G
Sbjct: 113 DFRYYKSGVYVPTT--KESSTSHSIKIIGWGGASGREYWLAVNSWNEEWGDHGLIKMAFG 170

Query: 328 K 328
           K
Sbjct: 171 K 171


>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 183

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 98/190 (51%), Gaps = 10/190 (5%)

Query: 117 AVGAFSDRRCIKS-KGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSV 175
           AV + SDR CI S + + N  LS   + SCC  C +     C  G +   W++    G V
Sbjct: 1   AVTSMSDRVCIHSNQNKTNVQLSARDLLSCCTSCGF----GCVGGWIGDAWDYWRDNGIV 56

Query: 176 TGGDYGDRTGCQPSTISPCSHHGSAPTLPSCENQKV-PKLKCHTRCTNPTYGRGFFQDKH 234
           TGGDY D++ C P    P  H  S  T      Q + P   C ++C     G  + +DK 
Sbjct: 57  TGGDYQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLYPTPPCVSKCQEGYPGE-YEKDKI 115

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
               +Y +D N   I+KEIL +GP  A   +Y DF +YK+GVY+HT+   L    H+ +L
Sbjct: 116 FALSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGG--HAIRL 173

Query: 295 IGWG-TENGT 303
           +GWG T++GT
Sbjct: 174 LGWGKTKDGT 183


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.455 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,192,513,516
Number of Sequences: 23463169
Number of extensions: 283793301
Number of successful extensions: 505548
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4652
Number of HSP's successfully gapped in prelim test: 1385
Number of HSP's that attempted gapping in prelim test: 493140
Number of HSP's gapped (non-prelim): 8303
length of query: 344
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 201
effective length of database: 9,003,962,200
effective search space: 1809796402200
effective search space used: 1809796402200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)