BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10826
         (175 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/84 (57%), Positives = 64/84 (76%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+H+G + A+ + ++D   YK GVY +T G++ GGHAVKIIGWGVE+GV YWL  N
Sbjct: 199 IQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIAN 258

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  +G+ G FKIRRGT+E +IE
Sbjct: 259 SWGTSFGEKGFFKIRRGTNECQIE 282


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 48/84 (57%), Positives = 64/84 (76%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+H+G + A+ + ++D   YK GVY +T G++ GGHAVKIIGWGVE+GV YWL  N
Sbjct: 245 IQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIAN 304

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  +G+ G FKIRRGT+E +IE
Sbjct: 305 SWGTSFGEKGFFKIRRGTNECQIE 328


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/104 (50%), Positives = 68/104 (65%), Gaps = 4/104 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+H G + A+ + ++D   YK GVYQ+T G++ GGHAVKIIGWG E+GV YWL  N
Sbjct: 249 IQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGTENGVDYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA----GRVDRDRSSD 100
           SWG  +GD G FK+RRGT+E  IE   V+     G  D  R  D
Sbjct: 309 SWGTTFGDSGFFKMRRGTNEVGIEGNVVAGTAKLGTHDEKREDD 352


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  108 bits (271), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +  A   ++D  +YK GVYQH  G + GGHA+KI+GWGVEDGV YWLC N
Sbjct: 238 IQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGVEDGVPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG+D   IES +V AG
Sbjct: 298 SWNTDWGDNGYFKILRGSDHCGIES-EVVAG 327


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  108 bits (271), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 52/103 (50%), Positives = 67/103 (65%), Gaps = 1/103 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G + AA + + D   YK GVY+H  G + GGHA+KI+GWGVE+G KYWLC NSWG
Sbjct: 252 EIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNSWG 311

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
           E WGD G FKI RG +   IE+  V AG     +  ++ E++Y
Sbjct: 312 EDWGDHGFFKIVRGENHLGIET-DVHAGLPHYRKHKEMFEYDY 353


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  108 bits (270), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 51/91 (56%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G +  A   ++D ++YK GVYQH  G   GGHA+KI+GWGVEDGV YWLC N
Sbjct: 238 IQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGVEDGVPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FK  RG+D   IES +V AG
Sbjct: 298 SWNTDWGDNGFFKFLRGSDHCGIES-EVVAG 327


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  108 bits (270), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 51/90 (56%), Positives = 60/90 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA + + D + YK GVYQH  GE  GGHAV+I+GWG E GV YWL  N
Sbjct: 238 IQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEESGVPYWLVAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WGD GLFKIRRG +ES  E   V+A
Sbjct: 298 SWNEDWGDKGLFKIRRGNNESGFEDSIVAA 327


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  108 bits (270), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 61/82 (74%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F  G +VAA++ + D + YK G+YQ+T G + G HAVKI+GWG +DG+ YWLC N+WG
Sbjct: 247 EVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWGEDDGIDYWLCANTWG 306

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WG GG+FKIRRG +E  IE+
Sbjct: 307 NSWGMGGMFKIRRGRNECGIEN 328


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/96 (51%), Positives = 64/96 (66%), Gaps = 1/96 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V     ++D+  YK GVY+HT G + GGHA+KIIGWG ++G+ YWL  N
Sbjct: 124 IQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 183

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           SWG  WG+ G FKIRRG +E  IE+  V AG+ D D
Sbjct: 184 SWGTKWGENGFFKIRRGVNECGIEN-NVVAGKADVD 218


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  107 bits (268), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/85 (56%), Positives = 59/85 (69%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+H G + A+    +D   YK GVY H  G + GGHAVKIIGWG E+GV YWL  N
Sbjct: 250 IQYEIYHNGPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTENGVDYWLVAN 309

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  +G+ G FKIRRGT+E +IES
Sbjct: 310 SWGTSFGEKGFFKIRRGTNECQIES 334


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  107 bits (266), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 47/90 (52%), Positives = 59/90 (65%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G + AA   + D ++YK GVYQH  G+M GGHAVKI+GWG E+G  YWL  NSW 
Sbjct: 240 ELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKILGWGEENGTPYWLVANSWN 299

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
             WGD G FKI+RG DE  IES  V+   +
Sbjct: 300 SDWGDKGFFKIKRGNDECGIESEMVAGAPL 329


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 65/94 (69%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A+   ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 211 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIAN 270

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG DE  IES +V+AGR++
Sbjct: 271 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 303


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 60/89 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+ +GS  A+   + D + Y  GVYQ+T G   GGHA+K++GWGVE+G  YWLC N
Sbjct: 255 IKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGVENGTPYWLCAN 314

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG+ G FKI RG++E  IES  V+
Sbjct: 315 SWNSSWGENGFFKILRGSNECGIESGMVA 343


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 64/89 (71%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+H G + A+ + ++D   YK GVY +  G++ GGHAVKIIGWG E+ V YWL  N
Sbjct: 248 IQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTENDVDYWLVAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  +G+GG FKIRRGT+E +IES  V+
Sbjct: 308 SWGIKFGEGGFFKIRRGTNECQIESNVVA 336


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI+  G +  A   ++D ++YK GVYQH  G   GGHA+K++GWG E+G  YWLC N
Sbjct: 238 IQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWGEENGTPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG+D   IES
Sbjct: 298 SWNTDWGDNGFFKILRGSDHCGIES 322


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA   + D ++YK GVYQH  GEM GGHAV+I+GWGVEDG  YWL  N
Sbjct: 263 IMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGN 322

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 323 SWNTDWGDSGFFKILRGQDHCGIES-EIVAG 352


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVYQ T      GGHAVK+IGWGVE+G  YWL V
Sbjct: 244 IQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGTDE RI+S
Sbjct: 304 NSWNAQWGDNGLFKIRRGTDECRIDS 329


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVYQ T      GGHAVK+IGWGVE+G  YWL V
Sbjct: 34  IQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 93

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGTDE RI+S
Sbjct: 94  NSWNAQWGDNGLFKIRRGTDECRIDS 119


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A+ + + D   YK GVY+H  G M GGHAVK+IGWGVE+G KYWLC N
Sbjct: 240 IKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVENGTKYWLCSN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SWGE WG+ G FKI RG +   IES  V AG
Sbjct: 300 SWGEDWGERGFFKIVRGENHCGIES-DVHAG 329


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA   + D ++YK GVYQH  GEM GGHAV+I+GWGVEDG  YWL  N
Sbjct: 239 IMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGRDHCGIES-EIVAG 328


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G +  A   ++DL+ YK GVYQHT G++ GGHA+KIIGWGVE GV YW   N
Sbjct: 248 IATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVESGVDYWWVAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI++G DE  IES Q+ AG
Sbjct: 308 SWNNDWGDNGFFKIKKGVDECGIES-QIVAG 337


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  105 bits (262), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A    ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 248 IQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG DE  IES +V+AGR++
Sbjct: 308 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 340


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  105 bits (262), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 53/94 (56%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   +QD + Y+ GVY+H  G   GGHA+KI+GWGVE G KYWL  N
Sbjct: 159 IQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVEAGNKYWLVAN 218

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WGD G FKI RG DE  IES  V AG VD
Sbjct: 219 SWNEDWGDKGTFKIARGDDECGIES-SVVAGMVD 251


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  105 bits (261), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 48/96 (50%), Positives = 64/96 (66%), Gaps = 1/96 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A+ + ++D   YK GVY++  G+M GGHA+KIIGWG E+G  YWL  N
Sbjct: 236 IQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGTENGTAYWLIAN 295

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           SWG  WG+ G FKIRRG +E  IE+  V AG+ D D
Sbjct: 296 SWGTKWGENGFFKIRRGVNECGIEN-NVVAGKADVD 330


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A    ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 162 IQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG DE  IES +V+AGR++
Sbjct: 222 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 254


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A+   ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 248 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG DE  IES +V AGR++
Sbjct: 308 SWNEDWGENGYFRIVRGRDECSIES-EVIAGRIN 340


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +EI+  G + AA   + D ++YK GVYQH  GEM GGHAV+I+GWGVE+G  YWL  N
Sbjct: 240 IMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGVENGTPYWLVGN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 300 SWNTDWGDNGFFKILRGRDHCGIES 324


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 173 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 232

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 233 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 268


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 48/87 (55%), Positives = 58/87 (66%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI + G +  A   ++D + YK GVYQH  GE  GGHAVK+IGWGVE+   YWL VNSW 
Sbjct: 260 EIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTPYWLIVNSWN 319

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSA 90
           E WGD G FKI RG++E  IE   V+A
Sbjct: 320 ETWGDQGTFKILRGSNECGIEDEVVTA 346


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 251 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGVENGTPYWLVAN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 311 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 346


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D ++YK GVYQHT G   GGHA+K++GWG EDGV YWLC N
Sbjct: 238 IQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWGEEDGVPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G FKI RG+D   IES
Sbjct: 298 SWNTDWGENGFFKILRGSDHCGIES 322


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +  A   ++D ++YK GVYQH  G   GGHA+K++GWG E+GV YWLC N
Sbjct: 238 IQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWGEENGVPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FK  RG+D   IES
Sbjct: 298 SWNTDWGDNGFFKFLRGSDHCGIES 322


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G IVA+   ++DL  YK+GVYQH  GE  GGH +KI GWG+E+G  YWL  N
Sbjct: 240 IQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGIENGTPYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG DE  IE   VSAG
Sbjct: 300 SWNTDWGNNGFFKIPRGKDECGIE-IDVSAG 329


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 223 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 282

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 283 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 312


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/108 (48%), Positives = 70/108 (64%), Gaps = 3/108 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  +I  +G IVA +  ++D + YK+GVY    G   GGHAV+IIGWG +D + YWL  N
Sbjct: 439 MMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDNIPYWLVAN 498

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV--DRDRSSDLEEFEY 106
           SW   +G+ GLFKIRRG DE  IES+ VSAGR    ++ S++   F+Y
Sbjct: 499 SWNTTFGEDGLFKIRRGFDECGIESY-VSAGRAKCKQNISNNFNSFKY 545


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DL+ YK+GVYQH  G+M GGHA++I+GWGVE+G KYWL  N
Sbjct: 244 IQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENGTKYWLIAN 303

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES  +SAG
Sbjct: 304 SWNSDWGDNGFFKILRGEDHLGIES-SISAG 333


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E+F+FG   A    + D + YK GVY+HT G   G H+VK++GWGVE+ VKYWLC N
Sbjct: 281 IMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVKYWLCAN 340

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WGDGG FKI RG D    E+
Sbjct: 341 SWGAQWGDGGFFKIVRGEDHLSFET 365


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 145 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 204

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 205 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 240


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/96 (53%), Positives = 61/96 (63%), Gaps = 1/96 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G +  A   + DL+ YK GVY+HT GE  GGHA+KI+GWGVE+G KYWL  N
Sbjct: 240 IKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGVENGNKYWLIAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           SW   WGD G FKI RG D   IES  + AG    D
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIES-SIVAGEPSYD 334


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/97 (51%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA   + D ++YK GVYQH  GEM GGHAV+I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPCTDQ 334


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 57/82 (69%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F+FG + A+   + D I YK GVY+HT G   G H+VKI+GWGVE+G K+WLC NSWG
Sbjct: 290 ELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHSVKIVGWGVENGTKFWLCANSWG 349

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WG+ G FKI RG D   +ES
Sbjct: 350 AEWGENGFFKIIRGEDHLSVES 371


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 1/103 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G + AA   +QDL  YK GVY+H  G M+GGHAVK++GWGVE+G+KYWL  NSWG
Sbjct: 241 EIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVENGLKYWLVANSWG 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
           + WGD G FKI RG +   IE   V AG    ++  +L    +
Sbjct: 301 DDWGDNGFFKIVRGENHCGIEK-DVHAGLPSFNKHKELAGIHF 342


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/98 (52%), Positives = 65/98 (66%), Gaps = 1/98 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G + AA   +QDL  YK GVY+H  G M+GGHAVK++GWGVE+G+KYWL  NSWG
Sbjct: 241 EIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVENGLKYWLVANSWG 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           + WGD G FKI RG +   IE   V AG    ++  +L
Sbjct: 301 DDWGDNGFFKIVRGENHCGIEK-DVHAGLPSFNKHKEL 337


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVYQ T      GGHAVK+IGWGVE+G+ YWL V
Sbjct: 74  IQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMV 133

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGTDE  I+S
Sbjct: 134 NSWSAQWGDNGLFKIRRGTDECGIDS 159


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 166 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 225

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 226 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 261


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A  + ++D + YK GVYQ T G  +GGHA+KI+GWGVEDG  YWL  N
Sbjct: 242 IQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTPYWLAAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WGD G FKI RG +E  IES
Sbjct: 302 SWNEDWGDKGYFKILRGQNECGIES 326


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 161 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 220

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 221 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 256


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++DL+ YK+GVYQH  GE  GGHA+KI+GWGVE+   YWL  N
Sbjct: 238 IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTPYWLVAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG+DE  IE  Q+ AG
Sbjct: 298 SWNTDWGNNGFFKILRGSDECGIED-QIVAG 327


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G IVA+I  ++DL  YK GVYQH  GE+ GGH +KI+GWGVE+   YWL  N
Sbjct: 239 IQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG+DE  IE  Q+ AG
Sbjct: 299 SWNTDWGNNGFFKILRGSDECGIED-QIVAG 328


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 109 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 168

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 169 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 204


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 60/86 (69%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVYQ T      GGHAVK+IGWGVE+G+ YWL V
Sbjct: 244 IQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMV 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGTDE  I+S
Sbjct: 304 NSWSAQWGDNGLFKIRRGTDECGIDS 329


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 60/89 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVY+HT+G+  GGHAVKI+GWGVE+G KYWL  N
Sbjct: 242 IRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG D   IES  V+
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVA 330


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA   + D ++YK GVYQH  GEM GGHAV+I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGRDHCGIES-EIVAG 328


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/95 (50%), Positives = 64/95 (67%), Gaps = 1/95 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A I  H D + YK GVY+H  G++   H+V+IIGWG+E+ + YWLC N
Sbjct: 238 IRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           SW E WG  G FKI RG++E  IESF V+AG+VD 
Sbjct: 298 SWNEDWGLNGYFKILRGSNECEIESF-VNAGKVDN 331


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 60/90 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVY+HT+G+  GGHAVKI+GWGVE+G KYWL  N
Sbjct: 242 IRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG D   IES  V+ 
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 331


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/94 (51%), Positives = 64/94 (68%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A I  H D + YK GVY+H  G++   H+V+IIGWG+E+ + YWLC N
Sbjct: 155 IRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCAN 214

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG  G FKI RG++E  IESF V+AG+VD
Sbjct: 215 SWNEDWGLNGYFKILRGSNECEIESF-VNAGKVD 247


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVYQHT G   GGHAVKI+GWGVE+G KYWL  N
Sbjct: 242 IKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGVENGSKYWLIAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG D   IES  V+
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVT 330


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 219

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 249


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 162 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 222 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 251


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 53  IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 112

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 113 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 148


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 176 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 235

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 236 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 271


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 57/81 (70%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           +++  G + AA   + D + YK GVY +T G++ GGHA+KI+GWGV+DG KYWLC NSW 
Sbjct: 251 DLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDGTKYWLCANSWS 310

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ GLF+I RG +E  IE
Sbjct: 311 RSWGENGLFRILRGNNECHIE 331


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 111 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 170

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 171 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 200


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 60/90 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G I  A+  ++D + YK GVYQH  G+  GGHAVK++GWGVE+G  YW  VN
Sbjct: 253 IMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGVENGTPYWTIVN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WGD G FKI RG +E  IES  V+A
Sbjct: 313 SWNESWGDKGTFKILRGKNECGIESSCVTA 342


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G + AA   + D ++YK GVYQH  GE+ GGHA+KI+GWG E G  YWL  NSW 
Sbjct: 238 ELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAIKILGWGEESGTPYWLAANSWN 297

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
             WGD G FKI+RG DE  IES  V+   ++
Sbjct: 298 GDWGDKGFFKIKRGNDECGIESEMVAGTPLN 328


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 95  IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 154

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 155 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 190


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G +  A   + D I YK GVYQH  GE  GGHA++++GWG E+ V YWLC N
Sbjct: 154 IQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVLGWGYENDVPYWLCAN 213

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG+DE  IES  V+ 
Sbjct: 214 SWNTDWGDKGYFKILRGSDECGIESSIVAG 243


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I++ G +VAA   ++D   YK G+Y+H  G   GGHAVK+IGWG E G  YWL VN
Sbjct: 234 IQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGTERGTPYWLAVN 293

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G F+I RG DE  IES
Sbjct: 294 SWGSQWGESGTFRILRGVDECGIES 318


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 1/92 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +  A   ++D ++YK GVYQH  G   GGHA+K++GWG E+GV YWLC N
Sbjct: 259 IKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWGEENGVPYWLCAN 318

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
           SW   WGD G FKI RG D   IES ++ AG 
Sbjct: 319 SWNTDWGDNGFFKILRGADHCGIES-EIVAGN 349


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G    A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D+  YK GVY+HT G + GGHA+KIIGWG ++G+ YWL  N
Sbjct: 230 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 289

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  WG+ G  K+RRG +E  IE   V+
Sbjct: 290 SWGANWGENGFLKMRRGVNECGIERAVVA 318


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++DL+ YK+GVY+H  G   GGHA+KI+GWGVE+G  YWL  N
Sbjct: 239 IQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGVENGTPYWLIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG+DE  IE   VSAG
Sbjct: 299 SWNTDWGNNGFFKILRGSDECGIE-IDVSAG 328


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 251 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVGN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 311 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 346


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D+  YK GVY+HT G + GGHA+KIIGWG ++G+ YWL  N
Sbjct: 230 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 289

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  WG+ G  K+RRG +E  IE   V+
Sbjct: 290 SWGANWGENGFLKMRRGVNECGIERAVVA 318


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/89 (52%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E++  G +  A   + DL+ YK GVY+HTVG   GGHA+KI+GWGVE+G KYWL  N
Sbjct: 245 IKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYWLIAN 304

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG D   IES  V+
Sbjct: 305 SWNSDWGDNGFFKILRGEDHCGIESSIVA 333


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 230 IMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVGN 289

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 290 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 325


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +  A   ++D ++YK GVYQH  G   GGHA+K++GWG E+G  YWLC N
Sbjct: 238 IQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWGEENGTPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG+D   IES
Sbjct: 298 SWNTDWGDNGYFKILRGSDHCGIES 322


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G + AA   ++D   YK GVY+HT G+  GGHA+KIIGWG E G  YWL  N
Sbjct: 236 IQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVAN 295

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG+ G FKI RG D+  IES  V AG+ 
Sbjct: 296 SWGVNWGESGFFKIYRGDDQCGIES-AVVAGKA 327


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++DL+ YK+GVYQH  G + GGHA+KI+GWGVE+   YWL  N
Sbjct: 239 IQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG DE  IE   VSAG
Sbjct: 299 SWNTDWGNNGFFKILRGKDECGIE-IDVSAG 328


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 65/94 (69%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI  +G + A+I  + D + YK GVY+H  G +    +V+IIGWG+E+G+ YWLC N
Sbjct: 243 IRREIMLYGPVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGIENGIPYWLCAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG  G FKI RG++E  IE+F V+AGRVD
Sbjct: 303 SWNEEWGLNGFFKILRGSNECEIEAF-VNAGRVD 335


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  G+M GGHA++I+GWG E+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWGEENGVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGDGG F+I RG D   IES +V AG    D+
Sbjct: 299 SWNTDWGDGGFFRILRGQDHCGIES-EVVAGIPRTDQ 334


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 64/91 (70%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+F  G +  A   ++DL+ YK+GVYQHT G+M GGHA++I+GWGVE+  K+WL  N
Sbjct: 241 IQKELFTNGPVEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTKFWLIAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG+D   IES  ++AG
Sbjct: 301 SWNSDWGDNGYFKILRGSDHLGIES-SIAAG 330


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +  A   + D   YK GVY H  G+ +GGHAVKIIGWG E GV YWL  N
Sbjct: 239 IQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  +GD G FKIRRGT+E  IES  V+
Sbjct: 299 SWGTSFGDKGFFKIRRGTNECGIESNVVA 327


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 59/86 (68%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVYQ T      GGHAVK+IGWGVE+G  YWL V
Sbjct: 245 IQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGTDE  I+S
Sbjct: 305 NSWNAQWGDNGLFKIRRGTDECGIDS 330


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 61/89 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE G  YWL  N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 60/89 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D+  YK GVY+HT G + GGHA+KIIGWG ++G+ YWL  N
Sbjct: 187 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 246

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  WG+ G  K+RRG +E  IES  V+
Sbjct: 247 SWGADWGENGFLKMRRGVNECGIESAVVA 275



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 40/60 (66%)

Query: 9  GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
          G + A+   ++D  IYKKGVYQ+T G++ G HA+KI+GWG E G  YWL  NSWG   G 
Sbjct: 4  GPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQCGS 63


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWG E+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGTENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPRTDQ 334


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/85 (56%), Positives = 58/85 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A ++ + D   Y+ G+YQ T  E  GGHAVKI+GWGVEDGVKYWL  N
Sbjct: 232 IQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGVEDGVKYWLVAN 291

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ GLF+I RG DE  IES
Sbjct: 292 SWNERWGENGLFRIIRGRDEVGIES 316


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E++  G + AA   ++D ++YK GVYQH  G+M GGHA+KI+GWG E+   YWL  N
Sbjct: 237 IMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWGKENNTPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG DE  IES
Sbjct: 297 SWNTDWGDNGFFKILRGKDECGIES 321


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D ++YK GVYQH  G + GGHA+K++GWG EDG+ YWLC N
Sbjct: 238 IQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWGEEDGIPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG++   IES
Sbjct: 298 SWNTDWGDNGFFKILRGSNHCGIES 322


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 50/98 (51%), Positives = 61/98 (62%), Gaps = 2/98 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A E ++DL+ YKKGVYQH  GE  GGHA++I+GWG E G  YWL  N
Sbjct: 239 IQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGTEKGTPYWLIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS 98
           SW   WGD G FKI RG D   IES  V+   + +D S
Sbjct: 299 SWNSDWGDNGTFKILRGEDHCGIESSIVAG--IPKDSS 334


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DL+ YK GVYQH  G+M GGHA++I+GWGVE+ V YWL  N
Sbjct: 76  IQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILGWGVENDVPYWLIAN 135

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG+D   IES Q+SAG
Sbjct: 136 SWNTDWGNNGFFKILRGSDHCGIES-QISAG 165


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 47/84 (55%), Positives = 56/84 (66%)

Query: 2   QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNS 61
           Q EI   G + A    ++D   YK GVYQHT G + GGHA+KI+GWGVE+G KYWL  NS
Sbjct: 263 QTEIMTNGPVEADFTVYEDFPTYKSGVYQHTTGGVLGGHAIKILGWGVEEGTKYWLVANS 322

Query: 62  WGELWGDGGLFKIRRGTDESRIES 85
           W   WGD G FKI RG++E  IES
Sbjct: 323 WNNEWGDNGFFKILRGSNECGIES 346


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G + AA   + D + YK GVYQHT G+  GGHA+KIIGWGV+DG  YW+  N
Sbjct: 248 IATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQDGTDYWIVAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW + WG+ G F I++GTDE  IES QV AG
Sbjct: 308 SWNDSWGNDGFFWIKKGTDECGIES-QVVAG 337


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 46/90 (51%), Positives = 57/90 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D + YK GVYQH  G+M GGHA++++GWGVEDGV YWL  N
Sbjct: 242 IMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGVEDGVPYWLAAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG D   IES  V+ 
Sbjct: 302 SWNTDWGDNGFFKILRGKDHCGIESEMVAG 331


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D  +YK GVYQH  GE  GGHA+KI+GWGVE+G  YWLC N
Sbjct: 240 IMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES 324


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D  +YK GVYQH  GE  GGHA+KI+GWGVE+G  YWLC N
Sbjct: 240 IMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES 324


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  102 bits (253), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EIF  G + AA   + D + YK GVYQH  G+M GGHAV+I+GWGVE+G  YWL  N
Sbjct: 240 IMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 300 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 329


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  102 bits (253), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 50/93 (53%), Positives = 62/93 (66%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  FG + A+   ++D + YK GVYQ+  G   GGHAVKIIGWGVE  V YWL VN
Sbjct: 233 IQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVPYWLVVN 292

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW E WG+ GLFKI RG++   IE   + AGR+
Sbjct: 293 SWNEGWGENGLFKILRGSNHVGIEG-GIYAGRL 324


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  102 bits (253), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY+HT G+  GGHA+KIIGWG E G  YWL  N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG+ G FKI RG D+  IES  V AG+ 
Sbjct: 297 SWGTSWGESGFFKIFRGDDQCGIES-AVVAGKA 328


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  102 bits (253), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 49/93 (52%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY+HT G+  GGHA+KIIGWG E G  YWL  N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG+ G FKI RG D+  IES  V AG+ 
Sbjct: 297 SWGTSWGESGFFKIFRGDDQCGIES-AVVAGKA 328


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 64/94 (68%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A+   ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 253 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG DE  IES +V AG+++
Sbjct: 313 SWNEDWGENGYFRIVRGRDECFIES-EVIAGQIN 345


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 56/84 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M+ EI+  G IV + + + D   Y+ GVY+H  G   G HAV++IGWGVE+GVKYWLC N
Sbjct: 195 MKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGVENGVKYWLCAN 254

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WG+ G FKI RG +   +E
Sbjct: 255 SWNERWGENGFFKIVRGENHVGVE 278


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DL+ YK+GVY H  G+M GGHA++I+GWGVEDG KYWL  N
Sbjct: 247 IQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGVEDGTKYWLIAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES  ++AG
Sbjct: 307 SWNSDWGDNGFFKILRGEDHLGIES-SIAAG 336


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  +I+  G +  A   + D  +YK GVYQH  GE  GGHA+KI+GWGVE+G  YWLC N
Sbjct: 240 IMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES-EVVAG 329


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 52/85 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   Y  GVYQHT G   GGHA+KI+GWG E+GV YWL  N
Sbjct: 246 IQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGTENGVPYWLVAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG DE  IES
Sbjct: 306 SWNPSWGDSGFFKIIRGKDECGIES 330


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 59/85 (69%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +++EI   G + AA   + D++ YK GVY HT G   GGHAVK++GWGVED  +YWL  N
Sbjct: 176 IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDEEEYWLVAN 235

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WGD G FKI+RG+DE  IES
Sbjct: 236 SWGPDWGDQGFFKIKRGSDECGIES 260


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 1/101 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G + AA   + DL  YK G+Y+H  G +SGGHAVK++GWGVE+GVKYWL  NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
             WG+ G FKI RG +   IE   + AG  +  R  +  ++
Sbjct: 339 REWGENGFFKIVRGENHCGIEE-NIHAGLPNFHRQGEAGKY 378


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE  GGHA++I+GWGVE+G  YWL  N
Sbjct: 240 IMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGVENGTPYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    D+
Sbjct: 300 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPRTDQ 335


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 60/90 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE    YWL  N
Sbjct: 211 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 270

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WG+ GLF+I RG DE  IES  V+ 
Sbjct: 271 SWNEDWGEKGLFRIVRGRDECSIESHVVAG 300


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G + AA   + DL  YK G+Y+H  G +SGGHAVK++GWGVE+GVKYWL  NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
             WG+ G FK+ RG +   IE   + AG  +  R  +  ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 63/97 (64%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +  A   + D ++YK GVYQHT G++ GGHA++I+GWG E+GV YWL  N
Sbjct: 239 IKAEIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEENGVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    D+
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES-EIVAGIPRTDQ 334


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G + AA   + DL  YK G+Y+H  G +SGGHAVK++GWGVE+GVKYWL  NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
             WG+ G FK+ RG +   IE   + AG  +  R  +  ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 56/87 (64%)

Query: 3   LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSW 62
           +E+  +G    A + + D + YK GVY HT GE  GGHAVK++GWGV++G  YW   NSW
Sbjct: 248 IELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGVQNGTPYWKIANSW 307

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVS 89
              WGD G F IRRGTDE  IES  V+
Sbjct: 308 NSDWGDNGYFLIRRGTDECGIESTGVA 334


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G + AA   + DL  YK G+Y+H  G +SGGHAVK++GWGVE+GVKYWL  NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
             WG+ G FK+ RG +   IE   + AG  +  R  +  ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVAN 219

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES 244


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/87 (58%), Positives = 61/87 (70%), Gaps = 3/87 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--GGHAVKIIGWGVEDGVKYWLC 58
           MQ +   +G I A+ + + D + Y+ GVYQ T G  S  GGHAVK+IGWGVE+G  YWL 
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFMNYESGVYQRT-GNASYLGGHAVKMIGWGVEEGTPYWLM 299

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIES 85
           VNSWGE WGD G+FKI RGTDE  IES
Sbjct: 300 VNSWGEQWGDKGMFKILRGTDECGIES 326


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY+HT G+  GGHA+KIIGWG E G  YWL  N
Sbjct: 273 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 332

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG+ G F+I RG D+  IES  V AG+ 
Sbjct: 333 SWGNSWGESGFFRIFRGDDQCGIES-AVVAGKA 364


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/87 (58%), Positives = 61/87 (70%), Gaps = 3/87 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--GGHAVKIIGWGVEDGVKYWLC 58
           MQ +   +G I A+ + + D + Y+ GVYQ T G  S  GGHAVK+IGWGVE+G  YWL 
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFMNYESGVYQRT-GNASYLGGHAVKMIGWGVEEGTPYWLM 299

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIES 85
           VNSWGE WGD G+FKI RGTDE  IES
Sbjct: 300 VNSWGEQWGDKGMFKILRGTDECGIES 326


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A + ++D   Y KG+Y+HT G   GGHAVK+IGWG E+G+ YW+C N
Sbjct: 253 IQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIGWGTENGIPYWICSN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RGTDE  IES
Sbjct: 313 SWNSDWGENGFFRILRGTDECGIES 337


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI + G +  A+  ++D   YK GVYQH  G+  GGHA++I+GWGVE+GV YWL  N
Sbjct: 247 IQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVEEGVPYWLIAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G  K+ RG D   IES Q++AG
Sbjct: 307 SWNTDWGDNGYIKLLRGKDHCGIES-QITAG 336


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E+F  G + AA   ++D ++YK GVYQH  G   GGHA+KI+GWG E+GV YWL  N
Sbjct: 238 IMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWGEENGVPYWLAAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 298 SWNTDWGDNGYFKILRGEDHCGIES-EIVAG 327


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 57/84 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+   ++D + YK GVYQH  GE +GGHA+KI+GWGVE+   YWL  N
Sbjct: 242 IQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WGD G FKI RG++E  IE
Sbjct: 302 SWNEDWGDKGYFKILRGSNECGIE 325


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +  A   ++D   YK GVYQH  G   GGHA+K+IGWG E+GV YWLC N
Sbjct: 238 IQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWGEENGVPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG++   IES +V AG
Sbjct: 298 SWNTDWGDNGFFKILRGSNHCGIES-EVVAG 327


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 65/93 (69%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG DE  IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I + G + A     QD   Y+ G+Y H  G+  GGHA+KI+GWG ED V YWLC N
Sbjct: 218 IQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGWGTEDNVDYWLCAN 277

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  WG  G FKIRRGTDE  IE
Sbjct: 278 SWGANWGIQGYFKIRRGTDECGIE 301


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G I  A   ++D + YK GVYQH  G   GGHAVK++GWGVE+G  YW+ VN
Sbjct: 253 IMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVENGTPYWIIVN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WGD G FKI RG +E  IES  V+A
Sbjct: 313 SWNESWGDKGTFKILRGQNECGIESECVTA 342


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D ++YK GVYQH  G + GGHAV+++GWG E+GV YWL  N
Sbjct: 261 LQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEENGVPYWLLAN 320

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG +E  IES  V+
Sbjct: 321 SWNTEWGDKGFFKIYRGRNECGIESEAVA 349


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+   G + A  E + D   YK GVYQH  G + GGHA+K++GWG EDGV YWLC N
Sbjct: 258 IKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWGEEDGVPYWLCAN 317

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+GG FKI RG +   IES
Sbjct: 318 SWNTDWGEGGFFKILRGKNHCGIES 342


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 61/92 (66%), Gaps = 1/92 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G + A+   ++DL +Y+ GVYQH  G   G HA+K++GWG+ DGVKYW  VN
Sbjct: 218 IQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWGILDGVKYWTIVN 277

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
           SW E WG  GL  IRRG DE  IES  V AG+
Sbjct: 278 SWAEDWGFDGLLLIRRGVDECGIES-DVVAGQ 308


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 56/86 (65%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A    + D  +YK G+Y H  G  +G HA++IIGWGVE+GVKYWL  NSW 
Sbjct: 239 EIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVKYWLTANSWN 298

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
             WG+ G F+I RGTDE RIES  V+
Sbjct: 299 VGWGENGYFRILRGTDECRIESIVVA 324


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   ++D + YK GVYQH  GEM GGHA++I+GWGVE+G++YWL  N
Sbjct: 241 IMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVENGIRYWLAAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FK  RG +   IES ++ AG    D+
Sbjct: 301 SWNIDWGDNGFFKFLRGKNHCGIES-EIIAGIPRTDQ 336


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVYQHT G   GGHA+KI+GWGVE+G KYWL  N
Sbjct: 202 IKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGVENGSKYWLIAN 261

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G  KI RG D   IES
Sbjct: 262 SWNSDWGDNGFLKILRGEDHCGIES 286


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES 323


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 58/93 (62%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY+HT G+  GGHA+KIIGWG E G  YWL  N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG+ G FKI RG D+  IE   V AG+ 
Sbjct: 297 SWGTNWGESGFFKILRGDDQCGIEG-AVVAGKA 328


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   ++D ++YK GVYQH  GE  GGHA++I+GWGVE+G  YWL  N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 300 SWNTDWGDNGFFKILRGEDHCGIES-EIVAG 329


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK+IGWG+EDG+ YW C N
Sbjct: 267 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTCAN 326

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 327 SWNTDWGEDGFFRILRGVDECGIES 351


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G + A    + D   YK GVYQ    EM GGHA++I+GWG EDGV YWL  N
Sbjct: 96  IKTEIYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRILGWGTEDGVPYWLVAN 155

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKIRRG DE  IE   ++AG
Sbjct: 156 SWNEDWGDKGYFKIRRGNDECGIED-DINAG 185


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G + AA   ++D ++YK GVYQH  G+M GGHA+KI+GWG E+   YWL  NSW 
Sbjct: 239 ELYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENNTPYWLAANSWN 298

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WG+ G FKI RG DE  IES +V AG
Sbjct: 299 TDWGNQGFFKILRGGDECGIES-EVVAG 325


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 99.8 bits (247), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           MQ E++  G   AA   ++D   YK GVY H  G+M GGHAV ++GWGVEDG  YWL  N
Sbjct: 188 MQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGVEDGTPYWLIQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G FKI RG +E  IE+
Sbjct: 248 SWGTTWGEQGFFKILRGKNECGIET 272


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score = 99.8 bits (247), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +  A   ++D ++YK GVYQH  G   GGHA+KI+GWG E+G+ YWLC N
Sbjct: 154 IKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWGEENGIPYWLCAN 213

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G FKI RG++   IES
Sbjct: 214 SWNTDWGNNGFFKILRGSNHCGIES 238


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score = 99.8 bits (247), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/90 (52%), Positives = 59/90 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G + AA   + DL+ YK GVY+H  G+  GGHA+KIIGWGVE+G KYWL  N
Sbjct: 121 IKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNKYWLIAN 180

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WG+ G FKI RG D   IES  V+ 
Sbjct: 181 SWNTDWGNNGFFKILRGEDHCGIESSIVAG 210


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score = 99.8 bits (247), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/88 (53%), Positives = 58/88 (65%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI  +G + A    + D + YK GVYQH  G   GGHAVKI+GWG E+GV YWLC NSW 
Sbjct: 246 EILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCANSWN 305

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGDGG FKI RG +  +IE+  ++AG
Sbjct: 306 TDWGDGGFFKILRGYNHCKIEA-DINAG 332


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 61/86 (70%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A++E + D   YK GVY+ +      GGHAVK+IGWG EDGV YWL V
Sbjct: 247 IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPYWLMV 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW E+WGD GLFKIRRGT+E  +++
Sbjct: 307 NSWSEMWGDKGLFKIRRGTNECSVDN 332


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
          kowalevskii]
          Length = 93

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 50/88 (56%), Positives = 57/88 (64%), Gaps = 1/88 (1%)

Query: 4  EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
          EI  +G +  A   + D   YK GVYQH  GE  GGHA+KI+GWG EDG  YWL  NSW 
Sbjct: 3  EIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHDYWLVANSWN 62

Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
          E WGD G FKI RG DE  IES Q++AG
Sbjct: 63 EDWGDQGFFKILRGVDECGIES-QITAG 89


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 61/86 (70%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A++E + D   YK GVY+ +      GGHAVK+IGWG EDGV YWL V
Sbjct: 247 IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPYWLMV 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW E+WGD GLFKIRRGT+E  +++
Sbjct: 307 NSWSEMWGDKGLFKIRRGTNECSVDN 332


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 112 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 171

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 172 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 201


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 219

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES 244


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A+   ++D + YK G+Y+H  GE  GGHA++IIGWGVE+   YWL  N
Sbjct: 248 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ G F+I RG DE  IES
Sbjct: 308 SWNEDWGENGYFRIVRGRDECFIES 332


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 59/94 (62%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI H+G + AA   + D   YK GVY+HT G   GGHA+KIIGWG E G  YWL  N
Sbjct: 253 IQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGTEGGDDYWLINN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW   WGD G FKI RG++E  IE  +V A  VD
Sbjct: 313 SWNSDWGDKGTFKILRGSNECGIEG-EVVAATVD 345


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/95 (50%), Positives = 59/95 (62%), Gaps = 1/95 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   YK GVYQHT G M GGHA++I+GWG E+G  YWL  N
Sbjct: 235 IQTEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGTENGTPYWLVAN 294

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           SW E WG  G FKI RG D+  IES Q++AG   +
Sbjct: 295 SWNEDWGAMGYFKIIRGKDDCGIES-QITAGMPKK 328


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G + A    + D + YK GVYQH  G++ GGHA++I+GWG E+G  YWL  N
Sbjct: 243 IQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGTENGTPYWLVAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKI RG DE  IE   ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIED-DINAG 332


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 65/93 (69%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A ++ ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTSYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG DE  IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/89 (52%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVY+HTVG   GGHA+KI+GWGVE+G KY L  N
Sbjct: 245 IKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYRLIAN 304

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG D   IES  V+
Sbjct: 305 SWNSDWGDNGFFKILRGEDHCGIESSIVA 333


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A + ++D   Y  G+Y+HT G+  GGHAVK++GWG E+G  YW+C N
Sbjct: 255 IQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICAN 314

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG+ G F+I RG DE +IES  V+
Sbjct: 315 SWNSDWGENGFFRILRGVDECQIESSVVA 343


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           MQ +   +G I A+ + + D   Y+ GVYQ T      GGHAVK+IGWGVE+G  YWL V
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVEEGTPYWLMV 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWGE WGD G+FKI RGTDE  +ES
Sbjct: 301 NSWGEQWGDKGMFKILRGTDECGVES 326


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +  ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTSYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG DE  IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E++  G +  A   + DL+ YK GVY+H  G+  GGHA+KI+GWGVE+G KYWL  N
Sbjct: 240 IKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNKYWLIAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG D   IES  V+ 
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 329


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 62/94 (65%), Gaps = 2/94 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI + G + AA   + D   Y+ GVY+H  G++ GGHAVKIIGWG+++G  YWL  N
Sbjct: 226 IQTEITN-GPVEAAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQNGAPYWLMAN 284

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SWG  WG+ G FK+ RG DE  IES  + AG+ D
Sbjct: 285 SWGPYWGENGFFKMLRGVDECGIES-TIVAGKPD 317


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 53/85 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   YK GVYQHT G   GGHA+KI+GWG E+G  YWL  N
Sbjct: 162 IQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAIKILGWGEENGTPYWLVAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI+RG DE  IES
Sbjct: 222 SWNSDWGDEGFFKIKRGNDECGIES 246


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G + A    + D + YK GVYQH  G++ GGHA++I+GWG E+G  YWL  N
Sbjct: 243 IQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGTENGTPYWLVAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKI RG DE  IE   ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIED-DINAG 332


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score = 99.4 bits (246), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 63/91 (69%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +EI   G +       +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL  N
Sbjct: 246 IMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F++RRG +E  IE+ +++AG
Sbjct: 306 SWNEGWGEKGYFRMRRGNNECGIEA-RINAG 335


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 47/90 (52%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G +  A   + DL+ YK GVY+HT G+  GGHAVKI+GWGVE+  KYWL  N
Sbjct: 241 IRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNKYWLIAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG D   IES  V+ 
Sbjct: 301 SWNSDWGDNGFFKILRGEDHCGIESSIVTG 330


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 61/97 (62%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   ++D ++YK GVYQH  GE  GGHA++++GWGV++G  YWL  N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGVDNGTPYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    +R
Sbjct: 300 SWNTDWGDNGFFKILRGEDHCGIES-EIVAGIPSTER 335


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E++  G +  A   + DL+ YK GVY+H  G+  GGHA+KI+GWGVE+G KYWL  N
Sbjct: 240 IKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGVENGNKYWLIAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG D   IES  V+ 
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 329


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +  A   ++DL+ YK+GVYQH  G+M GGHA++I+GWGVE+  KYWL  N
Sbjct: 242 IRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENNTKYWLIAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES  ++AG
Sbjct: 302 SWNSDWGDNGFFKILRGEDHLGIES-SIAAG 331


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A    + D   YK GVYQ    EM GGHA++I+GWG EDGV YWL  N
Sbjct: 121 IKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGTEDGVPYWLVAN 180

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKIRRG DE  IE+  ++AG
Sbjct: 181 SWNEDWGDKGYFKIRRGNDECGIEN-DINAG 210


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 55/86 (63%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           MQLEI   G I AA   + D + YK GVYQ T  + S GGHA+K++GWGVE+G KYWL  
Sbjct: 245 MQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKYWLIA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD G FK  RG D   IES
Sbjct: 305 NSWNTDWGDNGYFKFLRGVDHCGIES 330


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 55/90 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   + D + YK GVYQH  G + G HAV++IGWG E+G  YWL  N
Sbjct: 234 IQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIGWGEEEGTPYWLVAN 293

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD GLFKI RG+DE   E    +A
Sbjct: 294 SWNTDWGDNGLFKILRGSDECEFEGDMAAA 323


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWG E+GV YWL  N
Sbjct: 162 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKENGVPYWLIAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I RG++   IE   V AG V
Sbjct: 222 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 253


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 63/92 (68%), Gaps = 2/92 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G ++A  + ++D  +Y++GVY HT G + G HAVKIIGWG E+G  YWL  NSWG+ WG 
Sbjct: 230 GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGA 289

Query: 69  -GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
            GG+FKIRRGT+E +IE   +  G V +D  S
Sbjct: 290 LGGVFKIRRGTNECKIEQ-SIITGHVRKDEKS 320


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 60/90 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE    YWL  N
Sbjct: 186 IQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 245

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WG+ GLF+I RG DE  IES  V+ 
Sbjct: 246 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 275


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 62/89 (69%), Gaps = 1/89 (1%)

Query: 3   LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSW 62
           +EI   G +       +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL  NSW
Sbjct: 248 MEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANSW 307

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVSAG 91
            E WG+ G F++RRG +E  IE+ +++AG
Sbjct: 308 NEGWGEKGYFRMRRGNNECGIEA-RINAG 335


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score = 99.0 bits (245), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E+F  G +  A   ++D ++YK GVYQH  G   GGHA+KI+GWG E+GV YWL  N
Sbjct: 238 IMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWGEENGVPYWLAAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 298 SWNTDWGDNGYFKILRGEDHCGIES-EIVAG 327


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 99.0 bits (245), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWGVE+ V YWL  N
Sbjct: 253 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F++ RG +E  IE  +V AG V
Sbjct: 313 SWHNDWGEEGYFRMIRGINECGIEQ-EVVAGHV 344


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 98.6 bits (244), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWGVE+ V YWL  N
Sbjct: 253 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F++ RG +E  IE  +V AG V
Sbjct: 313 SWHNDWGEEGYFRMIRGINECGIEQ-EVVAGHV 344


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score = 98.6 bits (244), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 56/84 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI + G + AA   + D + YK GVY+H  G + GGHA++I+GWGVE+G  YWL  N
Sbjct: 241 IQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGVENGTPYWLVAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI +G+D   IE
Sbjct: 301 SWNTDWGDNGTFKILKGSDHCGIE 324


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score = 98.6 bits (244), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           +IF  G + A  + ++D++ Y  GVY+H  G + GGHAVK+IGWGVEDG KYWL  NSWG
Sbjct: 220 DIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVEDGTKYWLVANSWG 279

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
            +WGD G FK+ RG +   IE   V AG
Sbjct: 280 RVWGDDGFFKMVRGENHCGIEE-NVHAG 306


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA     D + YK GVYQH  GEM GGHAV+I+GWGVE+   YWL  N
Sbjct: 239 IMAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVENDTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES +V AG
Sbjct: 299 SWNTDWGDHGFFKILRGRDHCGIES-EVVAG 328


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+   YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES 323


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G +  A   + D   YK GVY+HT G+  GGHA+KI+GWG E+G  YWL  NSW 
Sbjct: 250 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDDYWLVANSWN 309

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
             WGD G FKI RG DE  IES Q+SAG 
Sbjct: 310 PDWGDQGFFKILRGQDECGIES-QISAGE 337


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 59/90 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE    YWL  N
Sbjct: 88  IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 147

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WG+ GLF+I RG DE  IES  V+ 
Sbjct: 148 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 177


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+   YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES 323


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 60/99 (60%), Gaps = 1/99 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +      + D   YK G+Y+H  G   GGHAVKI+GWGVE+G  YWL  N
Sbjct: 240 IQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWGVENGTPYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           SW   WG+ G F+IRRGT+E  IES  V AG  D  R+S
Sbjct: 300 SWNVNWGEKGYFRIRRGTNECGIES-SVVAGIPDLKRNS 337


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 65/93 (69%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGRI 340


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWG E GV YWL  N
Sbjct: 250 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 309

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I RG++   IE   V AG V
Sbjct: 310 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 341


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE+ GGHA++I+GWGVE+G  YWL  N
Sbjct: 26  IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 85

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG D   IES
Sbjct: 86  SWNTDWGDNGFFKILRGQDHCGIES 110


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 58/85 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I  +G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE    YWL  N
Sbjct: 249 IQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ GLF++ RG DE  IES
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIES 333


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK++GWG+E+G+ YW C N
Sbjct: 266 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGIENGIPYWTCAN 325

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 326 SWNTDWGEDGFFRILRGVDECGIES 350


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G +  A   + D   YK GVY+HT G+  GGHA+KI+GWG E+G  YWL  NSW 
Sbjct: 250 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDDYWLVANSWN 309

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
             WGD G FKI RG DE  IES Q+SAG 
Sbjct: 310 PDWGDQGFFKILRGQDECGIES-QISAGE 337


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ +I   G +  A   + D + YK GVYQHT G   GGHA++++GWG EDG  YWLC N
Sbjct: 237 IKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWGEEDGTPYWLCAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG+D   IES ++SAG
Sbjct: 297 SWNTDWGDNGYFKILRGSDHCGIES-EISAG 326


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 59/90 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGVE    YWL  N
Sbjct: 123 IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 182

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WG+ GLF+I RG DE  IES  V+ 
Sbjct: 183 SWNEDWGEKGLFRIVRGRDECSIESHVVAG 212


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/85 (55%), Positives = 52/85 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   + D   YK GVYQHT G   GGHAVKI+GWG E+   YWL  N
Sbjct: 240 IQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENNTPYWLIAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG DE  IES
Sbjct: 300 SWNPTWGDKGYFKIIRGKDECGIES 324


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 59/90 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G + AA   ++D + YK GVY+H  G+  GGHA+KI+GWGVE+   YW+ VN
Sbjct: 248 IMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTPYWIVVN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW + WGD G FKI RG +E  IE+  V+A
Sbjct: 308 SWNQTWGDNGTFKILRGKNECGIEAQVVTA 337


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +  A   + D + YK GVYQH  G   GGHA++I+GWG E+G  YWLC N
Sbjct: 238 IKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWGEENGTPYWLCAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD GLFKI RG+D   IES ++SAG
Sbjct: 298 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 327


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 48/110 (43%), Positives = 65/110 (59%), Gaps = 1/110 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A E ++D ++Y  G+Y HT G++ GGHAVK++GWGVE GV YWL  N
Sbjct: 282 IQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVAN 341

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFEYDTD 109
           SW   WG+ G F+I RG DE  IES  V    +++R        +  D D
Sbjct: 342 SWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKKYHRRYRLDND 391


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 60/91 (65%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI + G +  A   + D + YK GVYQH  G   GGHA++++GWG E+G  YWLC N
Sbjct: 237 IKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD GLFKI RG+D   IES ++SAG
Sbjct: 297 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 326


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 56/90 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E++  G + AA   ++D  +YK GVYQH  G   GGHAVKI+GWG E+G  +WL  N
Sbjct: 233 IMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWGEENGTPFWLVAN 292

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG DE  IES  V+ 
Sbjct: 293 SWNSDWGDNGYFKILRGHDECGIESEMVAG 322


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D + YK GVYQH  G++ GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/93 (53%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++IIGWGV  E  V YWL 
Sbjct: 245 IQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES Q+SAG
Sbjct: 305 ANSWNTDWGDNGFFRILRGKDHCGIES-QISAG 336


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 55/86 (63%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G + AA + + D + YK GVY+HT G   GGHAVKIIG+G E G  YWL  NSW 
Sbjct: 245 ELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTESGQDYWLVANSWN 304

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
           E WGD G FKI +G DE  IES  V+
Sbjct: 305 EDWGDKGFFKIAKGKDECGIESSIVA 330


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A E ++D ++Y  G+Y HT G++ GGHAVK++GWGVE GV YWL  N
Sbjct: 241 IQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 301 SWNTDWGEDGFFRIIRGIDECGIES 325


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWG E GV YWL  N
Sbjct: 162 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I RG++   IE   V AG V
Sbjct: 222 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 253


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A  + ++D   YK GVY++  G   GGHAVKIIGWGVE+ V YWL  N
Sbjct: 231 IQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGVEENVNYWLIAN 290

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  +G+ G FK+RRG +E  IE++ V+
Sbjct: 291 SWGTGFGENGFFKMRRGNNECGIENYVVA 319


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D + YK GVYQH  G++ GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG D   IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 75/123 (60%), Gaps = 4/123 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G ++A     +D   +K GVY +  G+  G H+VK+IGWG E+G+ YWL  N
Sbjct: 204 IQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGTEEGIPYWLIAN 263

Query: 61  SWGELWGD-GGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIESSSDTK 119
           SWG  WG+ GG FK+RRGT+E  IE  +++AG+V  + +   EE    T+ TI+ S    
Sbjct: 264 SWGSEWGELGGFFKMRRGTNECWIEQ-EMTAGKVHIEGNERTEEM--TTNATIQGSGQKG 320

Query: 120 RAF 122
           ++ 
Sbjct: 321 QSL 323


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 53/85 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   YK GVYQH VG   GGHA++I+GWG E+GV YWL  N
Sbjct: 238 IQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTENGVPYWLIAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FK+ RG D+  IES
Sbjct: 298 SWNPSWGDKGYFKMIRGKDDCGIES 322


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+ E + D + Y  G+Y+H  G M GGHAVK++GWG++ GV YWL  N
Sbjct: 283 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAAN 342

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG +E  IES
Sbjct: 343 SWNTDWGEDGYFRILRGVNECGIES 367


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 54/83 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+F  G + AA   + DL+ YK GVY+HT G   GGHA+KIIGWGVE+  KYWL  N
Sbjct: 201 IKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIAN 260

Query: 61  SWGELWGDGGLFKIRRGTDESRI 83
           SW   WGD G FKI RG D   I
Sbjct: 261 SWNSDWGDNGFFKILRGEDHCGI 283


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 45/89 (50%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A      D   YK G+YQHT G ++G HAV+I+GWGVE+G KYWL  N
Sbjct: 240 IRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGVENGTKYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G FKI RG++   IES  V+
Sbjct: 300 SWNSDWGDNGYFKILRGSNHVDIESAIVA 328


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +  A   + D + YK GVYQH  G   GGHA++++GWG E+G  YWLC N
Sbjct: 237 IKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD GLFKI RG+D   IES ++SAG
Sbjct: 297 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 326


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A + ++D   Y  G+Y+HT G+  GGHAVK++GWG E+G  YW+C N
Sbjct: 140 IQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICAN 199

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 200 SWNSDWGENGFFRILRGVDECEIES 224


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G+M GGHA++I+GWGVE+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG +   IES ++ AG    D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+ + G +  A++ + D + YK GVY+H  G+  GGHAVK++GWGV+DG+ YW   N
Sbjct: 246 LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG DE  IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNDECGIESSGVA 334


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 44/87 (50%), Positives = 56/87 (64%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI + G +V  ++ ++D   YK GVYQH  G   GGHAVKIIGWG E GV YWL  N
Sbjct: 230 IQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIAN 289

Query: 61  SWGELWGD-GGLFKIRRGTDESRIESF 86
           SWG  W D  G +KI RG +  +IE++
Sbjct: 290 SWGAKWADLDGFYKILRGKNHCKIETY 316


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G+M GGHA++I+GWGVE+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG +   IES ++ AG    D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 44/89 (49%), Positives = 55/89 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G I+   E  QD   Y  GVY+H  GE  G H VKI+GWGVE+GV YWL  N
Sbjct: 228 IQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGVENGVPYWLIAN 287

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  WGD G FK+ RG +E  IE++  +
Sbjct: 288 SWGSSWGDHGFFKMLRGQNECGIENYPYA 316


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G +VAA   + D   YK G+Y+H  G  +GGHAV+I+GWG + GV YWL  N
Sbjct: 225 IQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQGGVPYWLVAN 284

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I RG+DE  IE   V AG+V
Sbjct: 285 SWNTDWGENGYFRILRGSDECGIED-GVVAGQV 316


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 63/91 (69%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +EI   G +       +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL  N
Sbjct: 21  IMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIAN 80

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F++RRG +E  IE+ +++AG
Sbjct: 81  SWNEGWGEKGYFRMRRGNNECGIEA-RINAG 110


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 46/90 (51%), Positives = 56/90 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   + D + YK GVY+H  GE  GGHAV+I+GWG E G  YWL  N
Sbjct: 241 IKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGTESGTPYWLVAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG+DE  IES  V+ 
Sbjct: 301 SWNTDWGDKGYFKILRGSDECGIESSIVAG 330


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 60/101 (59%), Gaps = 1/101 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI++ G +  A + + D   YK GVY H  G+   GHAVKIIGWG E  V YWL  N
Sbjct: 236 IQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKPSGHAVKIIGWGTEKKVDYWLVAN 295

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           SW   +G+ G FKIRRGT+E  IE   V AG     R++ L
Sbjct: 296 SWSTTFGENGFFKIRRGTNECGIEE-NVVAGLPKSKRNARL 335


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 54/82 (65%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + +    +QD + YK GVY H  G   GGHA+KI+GWGVE+ VKYWL  NSWG
Sbjct: 142 EIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVENNVKYWLVANSWG 201

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WG  GLFKI+RG +E  IE+
Sbjct: 202 PDWGLNGLFKIKRGDNECGIEA 223


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DLI+YK GVY+H  G+  GGHA++IIGWGVE  + YWL  N
Sbjct: 247 IQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDIPYWLVAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG D   IES  +SAG
Sbjct: 307 SWNTDWGNNGFFKILRGKDHCGIES-SISAG 336


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 56/90 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   YK GVY+H  G + GGHA++I+GWG E+GV YWL  N
Sbjct: 271 IQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWGSENGVAYWLVAN 330

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG+DE  IES  V+ 
Sbjct: 331 SWNTDWGDKGYFKILRGSDECGIESSVVAG 360


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/88 (54%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA     D   YK GVYQH  GE  GGHA+KI+GWGVE+   YWL  NSW 
Sbjct: 240 EIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENNTPYWLVANSWN 299

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG+DE  IE  +V AG
Sbjct: 300 PSWGDNGFFKILRGSDECGIED-EVVAG 326


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           MQ+ I   G +  +++ + DL+ YK G+Y HT GE  G HAV+IIGWG ++G+ YW+  N
Sbjct: 196 MQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTKNGIDYWIISN 255

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG  GLF I+RG +E  IE + V AG+V
Sbjct: 256 SWNTTWGMNGLFLIKRGVNECHIEDY-VCAGKV 287


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score = 96.7 bits (239), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + AA     D + YK GVY+H  GE+ GGHA++I+GWG E+GV YWL  N
Sbjct: 240 IMAEIYKNGPVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWGKENGVPYWLVGN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES +V AG    D+
Sbjct: 300 SWNVDWGDNGFFKILRGEDHCGIES-EVVAGIPRTDQ 335


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 57/86 (66%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G +V A   ++D + Y KGVY+H  G+  GGHAVK+IGWG+E+  KYWL  NSW 
Sbjct: 235 ELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNSWN 294

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
             WG+ G FKI RG +   IES+ V+
Sbjct: 295 TTWGENGFFKIIRGKNCCAIESYVVA 320


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK+IGWG+EDG+ YW   N
Sbjct: 281 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTVAN 340

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 341 SWNTDWGEDGFFRILRGVDECGIES 365


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+ E + D + Y  G+Y+H  G M GGHAVK++GWG++ GV YWL  N
Sbjct: 149 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAAN 208

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG +E  IES
Sbjct: 209 SWNTDWGEDGYFRILRGVNECGIES 233


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           MQ E++  G +  A   + D + YK GVY H  G ++GGHAV  IGWGVED   YWLC N
Sbjct: 188 MQQELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDNTPYWLCQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G FKI RG++   IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 53/82 (64%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           ++  +G +    E + D   Y  GVY+H  G + GGHAV+++GWGVEDG  YWL  NSW 
Sbjct: 249 DLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANSWN 308

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WGDGG FKIRRG +E  IES
Sbjct: 309 TDWGDGGYFKIRRGVNECGIES 330


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 52/82 (63%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G +  A   ++D + YK GVYQH  G   GGHA+KI+GWG E+GV YWL  NSW 
Sbjct: 241 ELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAIKILGWGEENGVPYWLAANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WGD G FKI RG D   IES
Sbjct: 301 TDWGDNGYFKILRGEDHCGIES 322


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D ++YK GVY+H  GEM GGHA++I+GWG E+GV YWL  N
Sbjct: 240 IMAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG D   IES ++ AG    D+
Sbjct: 300 SWNVDWGDSGFFKIVRGEDHCGIES-EIVAGIPRTDQ 335


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+++ G I      ++D   Y KG+Y+H  G   GGHAV ++GWG+EDGVKYWL  N
Sbjct: 189 LQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGIEDGVKYWLVQN 248

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SWG  WG+ G F+I RG++E  IES    AG VD
Sbjct: 249 SWGYEWGEQGYFRILRGSNECGIES-SAYAGDVD 281


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G + AA   + D   YK GVY++T G   GGHA+KI+GWGVE+ V YWL  N
Sbjct: 237 IKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENNVPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WGD G FKI RG++E  IE+  V AG V
Sbjct: 297 SWNPDWGDKGFFKILRGSNECGIEA-SVVAGMV 328


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +      ++D  +YK G+Y H  G   GGHAVK++GWGV++G  YWL  N
Sbjct: 238 IQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGVDNGTPYWLAAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW  +WG+ G F+I RG DE  IES  V AG  D +R
Sbjct: 298 SWNTVWGEKGYFRILRGVDECGIESAAV-AGMPDLNR 333


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 52/86 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +V   E   D   YK GVY+H  G   G HAV++IGWGVE+GVKYWL  N
Sbjct: 227 IMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVENGVKYWLVAN 286

Query: 61  SWGELWGDGGLFKIRRGTDESRIESF 86
           SWG  WGD G FK  RG +   IE F
Sbjct: 287 SWGVRWGDKGFFKFVRGENHLGIEDF 312


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 52/85 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D + YK GVYQH  G+  GGHA++I GWGVE+   YWL  N
Sbjct: 236 IQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIAN 295

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG+D   IES
Sbjct: 296 SWNTDWGDSGTFKILRGSDHCGIES 320


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 59/90 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA + ++D + YK G+Y+H  G + GGHA++IIGWGV+    YWL  N
Sbjct: 123 IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKRTPYWLIAN 182

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW E WG+ GLF+I RG DE  IES  V+ 
Sbjct: 183 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 212


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 46/85 (54%), Positives = 57/85 (67%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           MQ +   +G I A+ + + D + Y+ GVYQ T   +  GGHAVK+IGWG EDG  YWL V
Sbjct: 241 MQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPYWLMV 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSWGE WG  G+FKI RGT+E  IE
Sbjct: 301 NSWGEQWGANGMFKILRGTNECGIE 325


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 44/77 (57%), Positives = 51/77 (66%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + AA   + D + YK GVYQH  G+M GGHAV+I+GWGVE+G  YWL  NSW   WGD
Sbjct: 42  GPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGD 101

Query: 69  GGLFKIRRGTDESRIES 85
            G FKI RG D   IES
Sbjct: 102 NGFFKILRGQDHCGIES 118


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+  G +      +QD + YK GVY H  G   GGHA+KIIGWGVE GV YWL  N
Sbjct: 223 IQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGVEGGVDYWLVAN 282

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG  G FKI RG +E  IE   V AG  D  R
Sbjct: 283 SWSTDWGIDGTFKILRGHNECGIED-DVYAGPADLSR 318


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 57/86 (66%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+ + G +  A++ + D + YK GVY+H  G+  GGHAVK++GWGV+DG+ YW   NSW 
Sbjct: 249 ELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWN 308

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
             WGD G F I+RG DE  IES  V+
Sbjct: 309 TDWGDKGYFLIQRGNDECGIESSGVA 334


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 61/91 (67%), Gaps = 3/91 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +EI   G +       +D  +YK G+YQ+T G + GGH   IIGWGVE+GVKYWL  N
Sbjct: 246 IMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHG--IIGWGVENGVKYWLAAN 303

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F+IRRGT+E  IES +++AG
Sbjct: 304 SWNEGWGENGYFRIRRGTNECGIES-RINAG 333


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score = 95.5 bits (236), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   ++D ++YK GVYQH  GE  GGHA++I+GWGVE+G  YWL  N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG  G FKI RG D   IES
Sbjct: 300 SWNTDWGITGFFKILRGEDHCGIES 324


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI+ +G +      + D + YK GVY H  G + GGHAV I+GWGVED V YWL  N
Sbjct: 188 IQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDEVPYWLVQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SWG  WG+ G FKI RG+D    ES  V+AG
Sbjct: 248 SWGTDWGENGFFKILRGSDHCECES-NVTAG 277


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK GVY+H +GE  GGHAVK++GWG +DGV YW   N
Sbjct: 246 LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGTQDGVPYWKVAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG +E +IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNNECKIESGGVA 334


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 56/88 (63%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA   ++D + YK GVYQH  G+  GGHAVKI+GWG ++G  YW+  NSW 
Sbjct: 256 EIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWGEDNGTPYWIVANSWN 315

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WG+ G F I RG DE  IES Q+ AG
Sbjct: 316 PDWGNQGFFNILRGKDECGIES-QIVAG 342


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWGVE+ V YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVENSVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD GLFKI RG D   IES ++ AG
Sbjct: 299 SWNVDWGDNGLFKILRGEDHCGIES-EIVAG 328


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK+IGWG++DG+ YW   N
Sbjct: 265 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 324

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 325 SWNTDWGEDGFFRILRGVDECGIES 349


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK+IGWG++DG+ YW   N
Sbjct: 266 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 325

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 326 SWNTDWGEDGFFRILRGVDECGIES 350


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score = 95.5 bits (236), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 62/94 (65%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A    ++D + YK G+Y+H  G++   HA++IIGWGVE+   YWL  N
Sbjct: 226 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVENNTPYWLIPN 285

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW E WG+ G F+I RG  E  IES +V+AGR++
Sbjct: 286 SWNEDWGENGNFRILRGRHECSIES-EVTAGRIN 318


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score = 95.5 bits (236), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +  A E ++D + Y  GVY HT G++ GGHAVK+IGWG++DG+ YW   N
Sbjct: 256 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 315

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 316 SWNTDWGEDGFFRILRGVDECGIES 340


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +Q EI+  G + A +  + D   Y  GVY+HT GE+ GGHA++++GWGV EDG  YWL  
Sbjct: 248 IQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAA 307

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW   WG+ G F+I RG+D   IES  VSAG
Sbjct: 308 NSWNPSWGEKGFFRILRGSDHCGIES-DVSAG 338


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G   AA+  + D + Y+ GVYQH  G   GGHAV+++GWGVEDG  YWL  N
Sbjct: 250 IQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGVEDGTPYWLLAN 309

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G F+I RG DE  IES
Sbjct: 310 SWNYDWGDNGYFRILRGQDECGIES 334


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +Q EI+  G + A +  + D   Y  GVY+HT GE+ GGHA++++GWGV EDG  YWL  
Sbjct: 248 IQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAA 307

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW   WG+ G F+I RG+D   IES  VSAG
Sbjct: 308 NSWNPSWGEKGFFRILRGSDHCGIES-DVSAG 338


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + +    +QDL +YK GVYQH VG   G HAV++IGWG E GV YWL  N
Sbjct: 239 IQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVPYWLIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           S+GE WG+ G FK  RG++   IES  ++ 
Sbjct: 299 SYGEDWGEHGYFKFLRGSNHLGIESVVIAG 328


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 46/100 (46%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSW 62
           E++  G +  +   ++D   YK GVY+H  G++ GGHAVK+IGWG  EDG  YWL  N W
Sbjct: 247 ELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQW 306

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLE 102
              WGD G FKIRRGTDE  IE   V+     R+ + +L+
Sbjct: 307 NRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELD 346


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + +    +QDL +YK GVYQH VG   G HAV++IGWG E GV YWL  N
Sbjct: 239 IQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVPYWLIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           S+GE WG+ G FK  RG++   IES  ++ 
Sbjct: 299 SYGEDWGEHGYFKFLRGSNHLGIESVVIAG 328


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 61/91 (67%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G +VA    ++D + YKKG+Y++T G   GGHAV+I+GWGVE+ VKYW+  N
Sbjct: 251 IQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVKYWIIAN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G F++ RG ++  IE   VSAG
Sbjct: 311 SWNTDWGEDGFFRMVRGINDCGIEE-SVSAG 340


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 222 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 281

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG +   IES
Sbjct: 282 SWNADWGDNGFFKILRGENHCGIES 306


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG +   IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DLI+YK GVYQH  G   GGHA++I+GWGVE+   YWL  N
Sbjct: 251 IQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRILGWGVENKTPYWLIAN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FK+ RG D   IES  ++AG
Sbjct: 311 SWNTDWGNNGFFKMLRGEDHCGIES-AIAAG 340


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 54/82 (65%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G +VA    ++D   Y  G+YQ T     GGHA++IIGWG E+G+ YWL  NSW 
Sbjct: 423 EIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIGWGEENGIPYWLIANSWN 482

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             +G+ G F+IRRGT+E RIES
Sbjct: 483 TTFGEKGFFRIRRGTNECRIES 504


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           MQ +I  +G I ++ + + D I YK GVY +       GGH+VK IGWGVE  V YWL +
Sbjct: 237 MQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMM 296

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSW   WGDGG FKIRRGT+E ++E
Sbjct: 297 NSWNSTWGDGGYFKIRRGTNECQVE 321


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A    ++D + YK GVYQ       GGHA++++GWGVE+GV YWL  N
Sbjct: 280 IQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGVENGVPYWLAAN 339

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG+DE  IES
Sbjct: 340 SWNTEWGDKGFFKILRGSDECGIES 364


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D   Y  GVY HT G   GGHAVK++GWGV++G  YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ G F+I RG +E  IES
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIES 342


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 53/78 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  +I+  G + AA   + D + YK GVY +T G++ GGHA+KI+GWGV+D  KYWLC N
Sbjct: 133 IMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDNTKYWLCAN 192

Query: 61  SWGELWGDGGLFKIRRGT 78
           SW   WG+ GLF+I RG 
Sbjct: 193 SWSRSWGENGLFRILRGN 210


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A    + D + YK GVYQ    E  GGHA++++GWGVE+GV YWL  N
Sbjct: 253 IQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVENGVPYWLAAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG+DE  IE
Sbjct: 313 SWNTEWGDKGFFKILRGSDECGIE 336


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A   A+ D + YK GVYQH   ++ G HA++I+GWG ED   YWL  N
Sbjct: 129 IRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNPYWLLAN 188

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FK+ RG +E  IESF V+AG
Sbjct: 189 SWNEDWGDHGYFKMLRGVNECDIESF-VNAG 218


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 236 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 295

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 296 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 327


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D   Y  GVY HT G   GGHAVK++GWGV++G  YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ G F+I RG +E  IES
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIES 342


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G + A+ + + D + YK GVYQH  G+  GGHAVKIIGWGV+    YW+  N
Sbjct: 373 IMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVAN 432

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F I RG+DE  IE
Sbjct: 433 SWNNDWGNNGFFNILRGSDECGIE 456


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 166 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 225

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG +   IES
Sbjct: 226 SWNADWGDNGFFKILRGENHCGIES 250


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY+ T      GGHAVK+IGWGVE+G  YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNAQWGDKGLFKIRRGTNECGIDN 330


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A IE ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A    + D + YK GVYQ    +  GGHA++I+GWG E+GV YWL  N
Sbjct: 242 IKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGTENGVPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKI RG DE  IE   ++AG
Sbjct: 302 SWNEDWGDKGYFKILRGNDECGIED-DINAG 331


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 54/85 (63%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G +  A   ++D   YK GVY H  G   GGHA++++GWG E+G KYWLC NSW 
Sbjct: 257 EIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWGEENGEKYWLCGNSWN 316

Query: 64  ELWGDGGLFKIRRGTDESRIESFQV 88
             WG+ G FKI+RG +E  IES  V
Sbjct: 317 TDWGNNGFFKIKRGVNECGIESEMV 341


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY HT GE  GGHA++I+GWG ++G  YWL  N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGTDNGTPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F+I RGT+E  IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI + G + AA   + D   Y+ GVY+HT G + GGHA+ I+GWG E G  YWL  N
Sbjct: 240 IQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGTESGSPYWLVKN 299

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WGDGG FKI RG
Sbjct: 300 SWNPSWGDGGFFKILRG 316


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q++I   G + AA   + D + YK GVY+H  G + GGHA++I+GWG+E G  YWL  N
Sbjct: 239 IQMDIMTNGPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGMEKGTPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG+D   IE
Sbjct: 299 SWNTDWGDNGTFKILRGSDHCGIE 322


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 160 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 219

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG +   IES
Sbjct: 220 SWNADWGDNGFFKILRGENHCGIES 244


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/98 (48%), Positives = 59/98 (60%), Gaps = 1/98 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A    + D   YK G+Y H  G+  GGHAVKI+GWGVE+G KYWL  N
Sbjct: 240 IQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKILGWGVENGTKYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS 98
           SW   WG+ G F+I RG +E  IES  V AG  D  R+
Sbjct: 300 SWNINWGEKGYFRILRGRNECGIES-AVVAGIPDLTRN 336


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + A+   ++D  +YK G+Y+HT GE+ G HAVK+IGWG E+   YWL  N
Sbjct: 245 IQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTDYWLIAN 304

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW + WG+ G F+I RG ++  IE   V+AG +D
Sbjct: 305 SWHDDWGENGYFRIIRGINDCGIEE-NVAAGLID 337


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 59/88 (67%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA E ++D   YK+GVY H+ GE  GGHA++I+GWG E+G  YWL  NSW 
Sbjct: 241 EIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEENGTPYWLIANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           E WG+ G FK+ RG +E  IE  +V+AG
Sbjct: 301 EGWGEDGYFKMLRGKNECGIED-EVTAG 327


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A E ++D + Y  G+Y HT G++ GGHAVK+IGWG++ G  YWL  N
Sbjct: 282 IQKEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGIDQGTPYWLIAN 341

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG+ G F+I RG DE  IES
Sbjct: 342 SWNTDWGEEGFFRILRGVDECGIES 366


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG +   IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG +   IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 65/93 (69%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G++ A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 216 IQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 275

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 276 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 307


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++DLI+YK GVY+H  G+  GGHA++IIGWGVE    YWL  N
Sbjct: 247 IQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDTPYWLIAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G FKI RG D   IES  +SAG
Sbjct: 307 SWNTDWGNNGFFKILRGKDHCGIES-SISAG 336


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           MQ +I  +G I ++ + + D I YK GVY +       GGH+VK IGWGVE  V YWL +
Sbjct: 237 MQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMM 296

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSW   WGDGG FKIRRGT+E ++E
Sbjct: 297 NSWNNTWGDGGNFKIRRGTNECQVE 321


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 53/85 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  E++  G I  A   + D + YK GVY H  G ++GGHAV  +GWGVED   YWLC N
Sbjct: 188 MMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G FKI RG++   IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 40/89 (44%), Positives = 58/89 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK GVY+H +G+  GGHAVK++GWG +DGV YW   N
Sbjct: 246 LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGTQDGVPYWKVAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG +E +IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNNECKIESGGVA 334


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 57/93 (61%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            Q EI   G +V A   ++D   YKKG+Y+HT G+  GGHA+KIIGWG E GV YWL  N
Sbjct: 162 TQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 221

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I  G++   IE   V AG V
Sbjct: 222 SWHNDWGENGYFRILCGSNHCGIEE-NVVAGHV 253


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 53/85 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  E++  G I  A   + D + YK GVY H  G ++GGHAV  +GWGVED   YWLC N
Sbjct: 188 MMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G FKI RG++   IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 58/92 (63%), Gaps = 1/92 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI + G +    + ++D   Y  G+Y+H  GE  G HAVK++GWG E+GV YW+C N
Sbjct: 254 IQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGTENGVDYWICAN 313

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
           SW   WG+ G F+I RG +E  IES  V AG+
Sbjct: 314 SWNSDWGENGFFRILRGENECGIES-NVVAGK 344


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 58/95 (61%), Gaps = 1/95 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   H D + YK G+Y++  G   GGHAV+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           SW E WG+ G F+I RG DE  IES +V+ G   R
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGGLPHR 342


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 216 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 275

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 276 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 307


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G + A      D + YK GVYQH   ++ GGHA++I+GWG E+G  YWL  N
Sbjct: 243 IQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGTENGTPYWLAAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKI RG DE  IE   ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIEE-DINAG 332


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 63/93 (67%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 FQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 59/89 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A +E ++D   Y  G+YQHT G   GGHAVKIIGWG E+ V YW+  N
Sbjct: 228 IQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAAN 287

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SWG  +G+ G F+I RG++ + IES+ V+
Sbjct: 288 SWGTGFGEDGFFRILRGSNCAGIESYIVA 316


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 58/97 (59%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G I  A   ++D   Y  GVY HT G+  GGHAVKI+GWGV++G  YWL  N
Sbjct: 107 IQTEILAHGPIEVAFTVYEDFYQYTTGVYVHTAGKSLGGHAVKILGWGVDNGTPYWLVAN 166

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG+ G F+I RG +E  IE   V AG  D DR
Sbjct: 167 SWNVNWGEKGYFRIIRGLNECGIEHSAV-AGLPDLDR 202


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 58/93 (62%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + AA   ++D   Y +G+Y+H  G   GGHAV+I+GWG E G  YWL  N
Sbjct: 253 IQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEEKGTAYWLVAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F+I RG++E  IE   V AGRV
Sbjct: 313 SWNTDWGENGYFRILRGSNECGIEE-NVVAGRV 344


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSW 62
           E++  G +  +   ++D   YK GVY+H  G++ GGHAVK+IGWG  EDG  YWL  N W
Sbjct: 247 EVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQW 306

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLE 102
              WGD G FKIRRGT+E  IE   V+     R+ + +L+
Sbjct: 307 NRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELD 346


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY HT G+  GGHA++I+GWG ++G  YWL  N
Sbjct: 242 IQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F+I RGT+E  IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 56/84 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI++ G +  +    +D   YK GVY +  G+++G HAVKIIGWG E+ V YWL  N
Sbjct: 260 IQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGTENKVDYWLVAN 319

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  +G+ G FKIRRGT+E  IE
Sbjct: 320 SWGTDFGEKGFFKIRRGTNECGIE 343


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 171 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 230

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG +   IES
Sbjct: 231 SWNVDWGDNGFFKILRGENHCGIES 255


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GE  GGHA++I+GWGV++G  YWL  N
Sbjct: 230 IMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWGVDNGTPYWLAAN 289

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG+ G F+I RG D   IES ++ AG
Sbjct: 290 SWNTDWGEDGFFRILRGQDHCGIES-EIVAG 319


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G+M GGHA++I+ WGVE+GV YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG +   IES ++ AG    D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            Q +I  +G + AA + ++D +  K G+ +H  G + GGH ++IIGWGVE G  YWL  N
Sbjct: 249 FQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ GLF++ RG DE  IES  V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 39/80 (48%), Positives = 52/80 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+ E + D + Y  G+Y+H  G + GGHAVKI+GWG++ GV YWL  N
Sbjct: 237 IQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 296

Query: 61  SWGELWGDGGLFKIRRGTDE 80
           SW   WG+ G F+I RG DE
Sbjct: 297 SWNNDWGEDGYFRILRGADE 316


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D  +Y  GVY HT G   GGHAVK++GWGV++G  YWLC N
Sbjct: 258 IQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WG+ G F+I RG +E  IE
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIE 341


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 47/107 (43%), Positives = 61/107 (57%), Gaps = 13/107 (12%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH--------TVGEMSGGHAVKIIGWGVEDG--- 52
           EI   G + A    H+D  +YK GVYQH             SG H+V+I+GWGV+     
Sbjct: 450 EIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGV 509

Query: 53  -VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDR 97
            +KYWLC NSWGE WG+ GLF+I RG +   IESF + A G+  + R
Sbjct: 510 PIKYWLCANSWGEEWGENGLFRILRGENHCDIESFIIGAWGKGSKKR 556


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 55/90 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + AA   + D +  K GVY+H  G + GGHA++I+GWGVE G  YWL  N
Sbjct: 240 IQLEIMDNGPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGVEKGTPYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WGD G FKI RG+D   IE   V+ 
Sbjct: 300 SWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 58/99 (58%), Gaps = 1/99 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +      + D   YK GVY H  G   GGHAVK++GWGV++G  YWL  N
Sbjct: 246 IQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGVDNGTPYWLAAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           SW   WG+ G F+I RG +E  IES QV AG  D +R +
Sbjct: 306 SWNTNWGENGYFRILRGVNECGIES-QVVAGMPDLERHN 343


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 53/85 (62%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  EI+  G +  A   ++D   YK GVY+H  G M GGHAVK+IGWG  EDG  YWL  
Sbjct: 245 IMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKIRRGT+E  IE
Sbjct: 305 NQWNRGWGDDGYFKIRRGTNECGIE 329


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY HT G+  GGHA++I+GWG ++G  YWL  N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F+I RGT+E  IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/84 (53%), Positives = 51/84 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G + AA   + D   YK GVYQH  G   GGHAVK+IGWG+E    YWL  N
Sbjct: 238 IQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGMEGSTPYWLIAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG DE  IE
Sbjct: 298 SWNSDWGDMGFFKILRGQDECGIE 321


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   ++D   YK GVY HT G+  GGHA++I+GWG ++G  YWL  N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F+I RGT+E  IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 41/90 (45%), Positives = 53/90 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E+   G +  A   ++D ++YK GVYQH  G   GGHA+K++GWG E G  YWL  N
Sbjct: 238 IMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWGEEGGTPYWLAAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WG+ G FKI RG D   IES  V+ 
Sbjct: 298 SWNTDWGENGFFKILRGKDHCGIESEMVAG 327


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   H D + YK G+Y++  G   GGHAV+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F+I RG DE  IES +V+ G
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGG 338


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 57/85 (67%), Gaps = 1/85 (1%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +VA    ++D   YKKG+Y H  G+  G HA+KIIGWGVE+G+ YWL  NSW + WG+
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVENGLPYWLIANSWHDDWGE 319

Query: 69  GGLFKIRRGTDESRIESFQVSAGRV 93
            GLF+I RG +E  IE  +V AG V
Sbjct: 320 QGLFRIVRGINECGIEQ-EVVAGHV 343


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 63/93 (67%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G   A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 55/84 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G +VA    ++D   YK G+Y+HT G M+GGHAVKIIGWG E G  YWL  N
Sbjct: 83  IQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIAN 142

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW + WG+ G +++ RG +  RIE
Sbjct: 143 SWHDDWGEKGFYRMIRGINNCRIE 166


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 58/90 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A  E ++D + YK GVYQHT G+  GGH VK+IGWG ++   YW+C N
Sbjct: 213 IQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGTQNNELYWICNN 272

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SW   WG+ G+F I+ G +E  IES  V+A
Sbjct: 273 SWTTYWGNQGVFWIKAGVNECGIESDVVAA 302


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 59/102 (57%), Gaps = 1/102 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 247 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLLA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           N W   WGD G FKIRRGT+E  IE   V+    DR+   D+
Sbjct: 307 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDV 348


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 40/87 (45%), Positives = 54/87 (62%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G +  A   ++D ++YK GVY+H  G   GGHA+K++GWG E G+ YWL  NSW 
Sbjct: 241 ELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWGEEGGIPYWLAANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSA 90
             WG+ G FKI RG D   IES  V+ 
Sbjct: 301 TDWGENGFFKIVRGEDHCGIESEMVAG 327


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 56/88 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +VAA   +QD   YKKG+Y H  G  +G HAVK++GWG E+   YWL  N
Sbjct: 250 IRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWGRENATDYWLIAN 309

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQV 88
           SW   WG+ G F+I RGT+E  IE+  V
Sbjct: 310 SWNTDWGESGYFRIVRGTNECGIEAQMV 337


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IGWGVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  I+S +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIDS-EIAAGLI 340


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   H D + YK G+Y++  G   GGHAV+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F+I RG DE  IES +V+ G
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGG 338


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 47/89 (52%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G + AA   + D + Y  GVY+HT G   GGHAVKI+G+GVE+G KYWL  NSW 
Sbjct: 119 ELVTRGPVEAAFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGVENGDKYWLVANSWN 178

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
             WGD G FKI RG DE  IE  Q+ AG 
Sbjct: 179 PDWGDQGFFKILRGVDECGIEG-QIVAGE 206


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 55/88 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +VA  + ++D   YKKGVY H  GE++G HAVKIIGWG  + V YWL  N
Sbjct: 255 IKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGNDVPYWLVAN 314

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQV 88
           SW   WGD G F+I RGTD   IE   V
Sbjct: 315 SWNTDWGDNGYFRIVRGTDNCEIERQMV 342


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 50/84 (59%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G +  A   + D   YK GVYQH  G   GGHA++++GWGVEDG  YWL  N
Sbjct: 280 IMTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVEDGTPYWLVAN 339

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG +E  IE
Sbjct: 340 SWNSDWGDNGFFKILRGQNECGIE 363


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +VA+   ++D   YK G+Y+HT GE+ G HAVK+IGWG E+   +WL  N
Sbjct: 246 IQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW   WG+ G F+I RGT++  IE   ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIIRGTNDCGIEG-TIAAGIVD 338


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +VA+   ++D   YK G+Y+HT GE+ G HAVK+IGWG E+   +WL  N
Sbjct: 246 IQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW   WG+ G F+I RGT++  IE   ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIIRGTNDCGIEG-TIAAGIVD 338


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G + A    + D  +YK GVYQ    +  GGHA++++GWGVE GV YWL  N
Sbjct: 248 IQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGVEKGVPYWLAAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG+DE  IE
Sbjct: 308 SWNTEWGDKGFFKILRGSDECGIE 331


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G +    E  +D   Y  GVY+H VG+  G HA++I+GWG E+G  YWL  N
Sbjct: 246 IQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTENGTPYWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           S+G+ WGD G FK+ RG++   IES
Sbjct: 306 SYGDTWGDKGFFKMLRGSNHLGIES 330


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 56/91 (61%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + LE+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  N
Sbjct: 252 IMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIAN 311

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG +E  IES  V+AG
Sbjct: 312 SWNSDWGDKGYFKIVRGKNECGIES-DVNAG 341


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+ E + D + Y  G+Y+H  G + GGHAVKI+GWG++ GV YWL  N
Sbjct: 281 IQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 340

Query: 61  SWGELWGD---GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           SW   WG+    G F+I RG DE  IES  + AG   +D  S
Sbjct: 341 SWNNDWGEDVFSGYFRILRGADECGIES-GIVAGIPRKDARS 381


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G++ GGHA++I+GWG+E+GV YWL  N
Sbjct: 30  IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 89

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WGD G FKI RG +   IES ++ AG
Sbjct: 90  SWNVDWGDNGFFKILRGENHCGIES-EIVAG 119


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E++  G +  A   + DL+ YK GVY+H  G+  GGHA+KI+GWGVE+  KYWL  N
Sbjct: 241 IRAELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNKYWLVAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG +   IE
Sbjct: 301 SWNTDWGDNGFFKILRGENHCGIE 324


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY  T      GGHAVK+IGWG E GV YWL V
Sbjct: 240 IQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPYWLMV 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 300 NSWNDQWGDRGLFKIRRGTNECGIDN 325


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 55/90 (61%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G   A +  ++D + YK G+YQH  G++ G   VK+IGWGV  GV+YWL  NSWG
Sbjct: 251 ELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYWLAANSWG 310

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
             WGD G FKIRRG +E   E + +S   V
Sbjct: 311 TSWGDKGFFKIRRGYNECLFEDYFISGRPV 340


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score = 92.4 bits (228), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 50/85 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI   G + AA   + D   YK GVY+H  G   GGHA+K +GWG EDG  YWL  N
Sbjct: 243 IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWGNEDGKDYWLVAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG DE  IES
Sbjct: 303 SWNPDWGDNGFFKILRGRDECGIES 327


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score = 92.4 bits (228), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY  T      GGHAVK+IGWG E GV YWL V
Sbjct: 241 IQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWGEEYGVPYWLLV 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 301 NSWNDQWGDQGLFKIRRGTNECGIDN 326


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G I  A   ++D   Y  GVY HT G   GGHAVKI+GWGV++G  YWL  N
Sbjct: 247 IQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG+ G F+I RG +E  IE   V AG  D DR
Sbjct: 307 SWNINWGEKGYFRIIRGLNECGIEHSAV-AGIPDLDR 342


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 48/95 (50%), Positives = 58/95 (61%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +Q EI   G + AA   ++D   Y+ G+Y HT G   GGHAVK+IGWGV +DG KYWL  
Sbjct: 256 IQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWLAA 315

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSW   WG+ G F+I RG D   IES  V AG  D
Sbjct: 316 NSWNSDWGENGYFRIVRGVDHCGIES-AVVAGMPD 349


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 48/97 (49%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G I  A   ++D   Y  GVY HT G   GGHAVKI+GWGV++G  YWL  N
Sbjct: 247 IQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG+ G F+I RG +E  IE   V AG  D DR
Sbjct: 307 SWNINWGEKGYFRIIRGLNECGIEHSAV-AGIPDLDR 342


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +VAA + +QD   Y+ G+Y H  G  +G HAVK++GWG E+G  YWL  N
Sbjct: 247 IREEIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRENGTDYWLIAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG+ G F+I RG++E  IE   VS
Sbjct: 307 SWNTDWGENGYFRIARGSNECGIEGQMVS 335


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWGEEHGVLYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW E WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNEGWGDNGLFKIRRGTNECGIDN 330


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 9/103 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDG 52
           +Q+EI   G + A+ E  +D  +Y  GVY+HT            S  H+VK++GWGVE+G
Sbjct: 318 IQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENG 377

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVD 94
           +KYWL  NSWG  WG+ G FKI RG +E  IES+ V+  G+VD
Sbjct: 378 IKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVAVWGKVD 420


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G+  GGHAVK+IGWG  EDG  YWL  
Sbjct: 10  IMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGEDYWLLA 69

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKIRRGT+E  IE  +V AG
Sbjct: 70  NQWNRGWGDDGYFKIRRGTNECDIED-EVVAG 100


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 58/95 (61%), Gaps = 3/95 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
           +Q ++  +GS+ AA     D + Y  GVY H  G   GGHAVK+IGWG ++  G  YWL 
Sbjct: 464 IQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLI 523

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
            NSW   WG+GGLF+I RG +E  IE  Q+ AG V
Sbjct: 524 ANSWNPSWGEGGLFRILRGVNECGIEG-QIVAGEV 557


>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
          Length = 125

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 54/81 (66%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           +I+  G +VA    + D + YK GVYQH  G+  G HAV+++GWGVE+ + YWL  NSW 
Sbjct: 32  QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 91

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
           + WGD G FKI RG +E+ IE
Sbjct: 92  DHWGDHGTFKILRGENEADIE 112


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 55/85 (64%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + A    ++D   Y+ GVY HT G M GGH++KIIGWGV+ GVKYWL  N
Sbjct: 256 IQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315

Query: 61  SWGELWG-DGGLFKIRRGTDESRIE 84
           SW   WG DGG F++ RG +   IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
           ++ EI+  G +  A   ++D I Y+ GVY+H  G+  GGHA++I+GWGV++G + YWL  
Sbjct: 247 IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW   WG  G FKI RG+DE  IE  Q++AG
Sbjct: 307 NSWNSDWGSDGFFKILRGSDECGIEG-QINAG 337


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+GV YWL  NSW 
Sbjct: 253 EVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANSWN 312

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 313 SDWGDNGYFKIIRGRNECGIES-DVNAG 339


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 55/84 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF+ G +VA    ++D   YK G+Y   +G  +G HAVKIIGWG E+GVKYWL  N
Sbjct: 254 IKREIFNNGPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWGEENGVKYWLIAN 313

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G F++ RGT+   IE
Sbjct: 314 SWNTDWGENGFFRMLRGTNLCDIE 337


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q +I  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNDQWGDQGLFKIRRGTNECGIDN 330


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G +  A   + D   YK GVY+HT G   GGHA+KI+GWG E G  YWL  NSW 
Sbjct: 275 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGTEGGDDYWLVANSWN 334

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WG+ G FKI RG DE  IES Q++AG
Sbjct: 335 PDWGNQGTFKILRGRDECGIES-QIAAG 361


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+   G I      ++D + YK G+YQH  G+  GGHAVK++GWGVEDGV+YW   N
Sbjct: 227 IKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGVEYWKIAN 286

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WG+ G F+I  G +E  IES  V+
Sbjct: 287 SWNEDWGENGYFRIIAGKNECGIESDGVA 315


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 40/91 (43%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++L+I   G + AA + ++D  +YK+G+Y+H  G  +GGHAVKIIGWG ++G  YWL  N
Sbjct: 246 IRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTDYWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW + WG+ G F++ RG ++  IE   ++AG
Sbjct: 306 SWSKDWGESGFFRMVRGENDCEIEDM-ITAG 335


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E++  G +  +   ++D   YK GVY++T G+  GGHAVK++GWG EDG  YWL  N
Sbjct: 240 IMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G FKI RG++E  IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECGIE 323


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
           ++ EI+  G +  A   ++D I Y+ GVY+H  G+  GGHA++I+GWGV++G + YWL  
Sbjct: 249 IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 308

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW   WG  G FKI RG+DE  IE  Q++AG
Sbjct: 309 NSWNTDWGSDGFFKILRGSDECGIEG-QINAG 339


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E++  G +  +   ++D   YK GVY++T G+  GGHAVK++GWG EDG  YWL  N
Sbjct: 240 IMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G FKI RG++E  IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECGIE 323


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q +I  +G I A+ E + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNDQWGDQGLFKIRRGTNECGIDN 330


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 41/72 (56%), Positives = 49/72 (68%)

Query: 7   HFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELW 66
           +F  + A+   ++D + YK GVY+HT GE  GGHAVKIIGWG E G  YW+ VNSW E W
Sbjct: 745 NFDQVSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWNEDW 804

Query: 67  GDGGLFKIRRGT 78
           GD GLFKI  G 
Sbjct: 805 GDHGLFKIALGN 816


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q +I  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 44/95 (46%), Positives = 59/95 (62%), Gaps = 2/95 (2%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCVNSW 62
           EI   G +      ++D + YK GVY+H  G   GGHA++IIGWG++ + + YWLC NSW
Sbjct: 178 EILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQNHIPYWLCANSW 237

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
              WGD G FKI RGT+E  IES  V+AG  +  +
Sbjct: 238 NNQWGDQGYFKILRGTNECGIESM-VTAGLPNLHK 271


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + A    ++D   Y  GVY HT G M GGH++KIIGWGV+ GVKYWL  N
Sbjct: 256 IQLEIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315

Query: 61  SWGELWG-DGGLFKIRRGTDESRIE 84
           SW   WG DGG F++ RG +   IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q +I  +G I A+ E + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 250 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 309

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           N W   WGD G FKIRRGT+E  IE   V+    DR+
Sbjct: 310 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 346


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 1/102 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G I  + E  +D   YK GVY+H  G   GGHAVK+IGWG  +DGV YW  V
Sbjct: 238 LMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTIV 297

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           NSW   WG+ GLF+I RG +E  IES+ V+    D+   S +
Sbjct: 298 NSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDKGLHSAM 339


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 56/90 (62%), Gaps = 1/90 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A+ + ++D + YK G+Y H  G+    H VKIIGWG E+G  YW  VNSW 
Sbjct: 231 EIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEENGEAYWKAVNSWN 290

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
             WG+ GLF+IR GT+E  IES QV  G V
Sbjct: 291 SEWGENGLFRIRLGTNECTIES-QVEGGLV 319


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 56/85 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +VA+   ++D   YK G+Y+HT GE+ G HAVKIIGWG E+   +WL  N
Sbjct: 253 IQREIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWGKENNTDFWLIAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW + WG+ G F+I RG +E  IE+
Sbjct: 313 SWHQDWGEKGYFRIVRGKNECGIET 337


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score = 92.0 bits (227), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 38/73 (52%), Positives = 48/73 (65%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298

Query: 61  SWGELWGDGGLFK 73
           SW   WGD G FK
Sbjct: 299 SWNTDWGDNGFFK 311


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 92.0 bits (227), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           +  +G +      + D + Y+ GVYQH  G   GGHAV + GWGVE+G+ YWL  NSWG 
Sbjct: 192 LMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGVENGLPYWLVQNSWGP 251

Query: 65  LWGDGGLFKIRRGTDESRIESF 86
            WG+ G FKI RG++   IES+
Sbjct: 252 AWGEKGFFKILRGSNHCEIESY 273


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score = 91.7 bits (226), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 248 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 307

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           N W   WGD G FKIRRGT+E  IE   V+    DR+
Sbjct: 308 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 344


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  ++ V YWL 
Sbjct: 246 IQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEEVPYWLI 305

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW + WGD G F+I RG D   IES  +SAG
Sbjct: 306 ANSWNDDWGDKGFFRILRGEDHCGIES-SISAG 337


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 58/97 (59%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+    +  A     D + YK GVY+H  G+M GGHA++I+GWGV +GV YWL  N
Sbjct: 239 IMAEIYKNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVGNGVPYWLAAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG +   IES ++ AG    D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 56/92 (60%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  EDG  YWL  
Sbjct: 244 IMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLA 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKIRRGT+E  IE   V+AG
Sbjct: 304 NQWNREWGDDGYFKIRRGTNECGIEE-DVTAG 334


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 58/97 (59%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A     D + YK GVY+H  G+M GGHA++I+ WGVE+GV YW   N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WGD G FKI RG +   IES ++ AG    D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + A    ++D   Y  GVY HT G M GGH++KIIGWGV+ GVKYWL  N
Sbjct: 256 IQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315

Query: 61  SWGELWG-DGGLFKIRRGTDESRIE 84
           SW   WG DGG F++ RG +   IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 181 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 240

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           N W   WGD G FKIRRGT+E  IE   V+    DR+
Sbjct: 241 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 277


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  A E ++D   YK GVY+H  G   GGHAVK+IGWG  +DGV YW  V
Sbjct: 246 LMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIV 305

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           NSW   WG+ GLF+I RG DE  IES  V+
Sbjct: 306 NSWNTNWGEDGLFRIVRGNDECGIESNAVA 335


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  E  V YWL 
Sbjct: 243 IQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLI 302

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 303 GNSWNTDWGDNGFFRILRGQDHCGIES-SISAG 334


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 49/84 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   + D   Y  GVY H  G + GGHAVKI+GWGV+    YW+  N
Sbjct: 213 IQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGVDGTTPYWIVAN 272

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  WG  G F I+RG DE  IE
Sbjct: 273 SWGTSWGQAGFFWIKRGNDECGIE 296


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +VA+   ++D   YK G+Y+HT GE+ G HAVK+IGWG E+   +WL  N
Sbjct: 246 IQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW   WG+ G F+I RG+++  IE   ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIVRGSNDCGIEG-TIAAGIVD 338


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY  T      GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  NSW 
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   + D + YK G+Y+H  G + GGHAV+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WG+ G F+I RG D   IES  V+AG
Sbjct: 309 SWNEDWGEKGYFRILRGKDVCGIES-AVTAG 338


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/88 (51%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+GV YWL  NSW 
Sbjct: 253 EVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANSWN 312

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 313 SDWGDNGYFKIIRGRNECGIES-DVNAG 339


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/79 (54%), Positives = 53/79 (67%), Gaps = 1/79 (1%)

Query: 2   QLEIFHFGSIVAAIEAHQDLIIYKKGVYQ-HTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           QLEIF  G IVAA + ++D  +YK GVY+ H      G HAVK+IGWG ++G+ YWL  N
Sbjct: 253 QLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLVQN 312

Query: 61  SWGELWGDGGLFKIRRGTD 79
           SW   WGD GLFKI RG +
Sbjct: 313 SWDYDWGDKGLFKIARGNE 331


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  NSW 
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/84 (52%), Positives = 50/84 (59%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G + AA   + D   YK GVYQH  G   GGHAVK+IGWG E    YWL  N
Sbjct: 238 IQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGTEGSTPYWLIAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G FKI RG DE  IE
Sbjct: 298 SWNTDWGNMGFFKILRGQDECGIE 321


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI + G +  A   ++DLI+YK GVY+H  G+  GGHA++I+GWGV  ++ + YWL 
Sbjct: 245 IQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 ANSWNTDWGDNGFFRIVRGKDHCGIES-SISAG 336


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G +    + H DL +YK GVY  T        H  K+IGWGVE+GV YWL V
Sbjct: 234 IQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVENGVDYWLLV 293

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSWG  WG  GLFKI+RGTDE  +ES  V AG
Sbjct: 294 NSWGYEWGQNGLFKIKRGTDECSVES-HVYAG 324


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G ++A+   + D   YK G+Y HT G+  GG   KIIGWGV++GV YWLCV+
Sbjct: 261 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 320

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
            WG  +G+ G  +I RG +E  IE  QV A + D D+
Sbjct: 321 QWGTDFGENGFVRILRGVNEVNIEH-QVLAAQPDLDK 356


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/91 (48%), Positives = 55/91 (60%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A    + D   YK GVYQ       G HA++I+GWG E+GV YWL  N
Sbjct: 242 IKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGTENGVPYWLAAN 301

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW E WGD G FKIRRG +E  IE   ++AG
Sbjct: 302 SWTEHWGDKGYFKIRRGNNECGIEE-DINAG 331


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 1  MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
          +Q EI  +G + A    ++D + YK G+Y+H  G++   HA++IIGWG E+   YWL  N
Sbjct: 3  IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWGEENNTPYWLIPN 62

Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
          SW E WG+ G F+I RG  E  IES +V+AGR++
Sbjct: 63 SWNEDWGENGNFRILRGRHECSIES-EVTAGRIN 95


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  NSW 
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +QLEI   G + A +  ++D + YK GVY+H  G+    HAVKI GWG E G  YWL  N
Sbjct: 274 IQLEIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAAN 333

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
            W + WG+GG FKI RG++ + IE   V AG    D   + E F
Sbjct: 334 PWSKRWGNGGFFKILRGSNHAEIED-HVMAGIPKLDLVDEEEHF 376


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/70 (60%), Positives = 48/70 (68%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A+   ++D + YK GVY+HT GE  GGHAVKIIGWG E G  YWL VNSW E WGD
Sbjct: 195 GPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGD 254

Query: 69  GGLFKIRRGT 78
            GLFKI  G 
Sbjct: 255 HGLFKIALGN 264


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
           +Q EI   G +VA+   ++D  +YK GVY+HT G + G HAVK++GWGV+     KYWL 
Sbjct: 244 IQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLI 303

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
            NSW   WG+ G F+  RG ++  IE   V+AG VD D
Sbjct: 304 ANSWHNDWGENGYFRFIRGINDCEIED-TVAAGIVDVD 340


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 58/86 (67%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK G+Y+ T      GGHAVK+IGWG + G+ YWL V
Sbjct: 101 IQKDVMTYGPIEASFDVYSDFPSYKSGIYERTENATYLGGHAVKLIGWGEQYGIPYWLMV 160

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW E WGD GLFKIRRGT+E  +++
Sbjct: 161 NSWNEDWGDNGLFKIRRGTNECGVDN 186


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  NSW 
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +   +D + YK G+Y++T G   G H V+IIGWG+E+G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES  V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/72 (59%), Positives = 49/72 (68%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G I A+   ++D + YK GVY+HT G   GGHAVKIIGWG E+G  YWL VNSW E WGD
Sbjct: 444 GPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 503

Query: 69  GGLFKIRRGTDE 80
            GLFKI  G  E
Sbjct: 504 QGLFKIALGNCE 515


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY  T      GGHAVK+IGWG E GV YWL V
Sbjct: 240 IQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPYWLMV 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKIRRGT+E  I++
Sbjct: 300 NSWNDQWGDKGLFKIRRGTNECGIDN 325


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/102 (43%), Positives = 62/102 (60%), Gaps = 1/102 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +Q EI+  G +  +   ++D   YK GVY+H  GE+ GGHAVK IGWG  +DG  YW+  
Sbjct: 241 IQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVA 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           NSW   WG+ G F+I RG++E  IES  V+   + +   SD+
Sbjct: 301 NSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 342


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 56/92 (60%), Gaps = 1/92 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +VAA     D   Y+KG+Y H  G   GGHAVKIIGWG E GV YW+  N
Sbjct: 251 IQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGTEHGVPYWIIAN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
           SW   WG+ G F++ RG ++  IE+  V AG+
Sbjct: 311 SWHSDWGEDGYFRMVRGINDCGIET-NVVAGK 341


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/85 (52%), Positives = 51/85 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + AA   + D   YK GVY+       GGHAVK+IGWG EDG+ YWL  N
Sbjct: 241 IQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIPYWLIAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WGD G FKI RG DE  IES
Sbjct: 301 SWNSDWGDHGYFKIVRGQDECGIES 325


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
           ++ EI+  G +  A   ++D I Y+ GVY+H  G+  GGHA++I+GWGV++G + YWL  
Sbjct: 89  IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 148

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW   WG  G FKI RG+DE  IE  Q++AG
Sbjct: 149 NSWNTDWGSDGFFKILRGSDECGIEG-QINAG 179


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 51/80 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D   Y  GVY HT G   GGHAVK++GWGV++G  YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317

Query: 61  SWGELWGDGGLFKIRRGTDE 80
           SW E WG+ G F+I RG +E
Sbjct: 318 SWNEDWGENGYFRIIRGVNE 337


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/92 (48%), Positives = 56/92 (60%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKIRRGT+E  IE   V+AG
Sbjct: 305 NQWNREWGDDGYFKIRRGTNECGIEE-DVTAG 335


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/87 (49%), Positives = 54/87 (62%), Gaps = 1/87 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSW 62
           E++  G +  +   ++D   YK GVY+H  GE+ GGHAVK+IGWG  D G  YWL  N W
Sbjct: 265 EVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQW 324

Query: 63  GELWGDGGLFKIRRGTDESRIESFQVS 89
              WGD G FKIRRGT+E  IE   V+
Sbjct: 325 NRGWGDDGYFKIRRGTNECGIEDDAVA 351


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 51/84 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G +      + D + YK GVYQH  G   GGHAV +IGWGVEDGV YWL  N
Sbjct: 188 IQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLIGWGVEDGVPYWLLQN 247

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  WG+ G FKI RG +E   E
Sbjct: 248 SWGPAWGEKGHFKIIRGKNECGCE 271


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 51/77 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   AA   ++D + YK GVY+HT G   GGHAV+IIGWG E GV YWL +N
Sbjct: 273 IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 332

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G FKI +G
Sbjct: 333 SWNEEWGDHGTFKIVQG 349


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  E+ + YWL 
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGVWGEEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 GNSWNTDWGDNGFFRILRGQDHCGIES-SISAG 336


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/121 (42%), Positives = 70/121 (57%), Gaps = 17/121 (14%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGE-------MSGGHAVKIIGWGVEDG--- 52
           EI+  G + A I   +D  +Y+ GVY+HT + E        SG H+V+I+GWGV+     
Sbjct: 345 EIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYR 404

Query: 53  -VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV-----SAGRVDRDRSSDLEEFEY 106
            +KYWLC NSWG  WG+ G F+I RG DES+IESF +     S     R +++   E EY
Sbjct: 405 PIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLAVWGRSYASYYRQQAAQQREREY 464

Query: 107 D 107
           D
Sbjct: 465 D 465


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +   +D + YK G+Y++T G   G H V+IIGWG+E+G  YWL  N
Sbjct: 249 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES  V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G   GGHA++I+GWGV  ++ V YWL 
Sbjct: 243 IQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWLI 302

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 303 GNSWNTDWGDNGFFRILRGEDHCGIES-AISAG 334


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 70/124 (56%), Gaps = 8/124 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A +  H+D  +YK GVY++  G M G H +K+IGWGVE G +YWL VN
Sbjct: 210 IKQEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVN 269

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIESSSDTKR 120
           SW E WGD G  K+  G +    ES Q    +V R   ++L+E         ES + T++
Sbjct: 270 SWNEEWGDQGKIKLAVGKNALDEESRQ----QVPRRAVNELDE----DAMMAESGAKTQK 321

Query: 121 AFCR 124
           A  +
Sbjct: 322 AMAQ 325


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 53/81 (65%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+ + G + A  E  +D + YK G+YQH  G+  G H V ++GWG E+GV YWL  NSWG
Sbjct: 185 EVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQNSWG 244

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+IRRGT++  I+
Sbjct: 245 SGWGEKGFFRIRRGTNDCHID 265


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 55/86 (63%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY +       GGHAVK+IGWG EDG  YWL V
Sbjct: 242 IQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWGEEDGTPYWLMV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD G FKIRRGT+E  +++
Sbjct: 302 NSWNTQWGDNGFFKIRRGTNECGVDN 327


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/97 (48%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G I  A   ++D   Y  GVY HT G   GGHAVKI+GWGV++G  YWL  N
Sbjct: 247 IQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG+ G F+I RG +E  IE   V AG  D  R
Sbjct: 307 SWNVAWGEKGYFRIIRGLNECGIEHSAV-AGIPDLAR 342


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL  NSW 
Sbjct: 241 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           + WG+ G FK+ RG +E  IE  +V+AG
Sbjct: 301 DGWGEDGYFKMLRGKNECGIED-EVTAG 327


>gi|302848309|ref|XP_002955687.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
           nagariensis]
 gi|300259096|gb|EFJ43327.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
           nagariensis]
          Length = 846

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLII-YKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           M  EI+H G I   I    D    YK G+Y+ T G+    H V+++GWGVEDGVKYW+  
Sbjct: 695 MMSEIYHRGPITCGIACPDDFTWHYKGGIYKDTSGDTELDHDVEVVGWGVEDGVKYWVVR 754

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWG  WG+ G F++ RG +  +IES
Sbjct: 755 NSWGTYWGEMGFFRVERGVNALQIES 780


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +   +D + YK G+Y++T G   G H V+IIGWG+E+G  YWL  N
Sbjct: 249 IQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES  V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL  NSW 
Sbjct: 241 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           + WG+ G FK+ RG +E  IE  +V+AG
Sbjct: 301 DGWGEDGCFKMLRGKNECGIED-EVTAG 327


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL  NSW 
Sbjct: 132 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 191

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           + WG+ G FK+ RG +E  IE  +V+AG
Sbjct: 192 DGWGEDGYFKMLRGKNECGIED-EVTAG 218


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 66/105 (62%), Gaps = 3/105 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   ++D + YK GVY+H  G   GGHAVKIIGWG +   +YWL +N
Sbjct: 289 IKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQNEQYWLVMN 348

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFE 105
           SW   WGD G+FKI  G  E  I+S +V+AG    +R+S +E+ E
Sbjct: 349 SWNVNWGDQGIFKIAIG--ECGIDS-EVTAGIPKYERTSGVEQSE 390


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  E+ + YWL 
Sbjct: 235 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLI 294

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 295 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 326


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  E+ + YWL 
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 51/77 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   AA   ++D + YK GVY+HT G   GGHAV+IIGWG E GV YWL +N
Sbjct: 42  IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 101

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G FKI +G
Sbjct: 102 SWNEEWGDHGTFKIVQG 118


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score = 90.5 bits (223), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 53/81 (65%), Gaps = 1/81 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDE 80
           NSW + WGD GLFKIRRGT+E
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNE 322


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G ++A+   ++D   YK G+Y HT G+  GG   KIIGWGV++GV YWLCV+
Sbjct: 258 IQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 317

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
            WG  +G+ G  +I RG +E  IE  QV A   D D+
Sbjct: 318 QWGTDFGENGFVRILRGVNEVNIE-HQVLAALPDVDK 353


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI  +G + A +   +D + YK G+Y++T G   G H V+IIGWG+E+G  YWL  N
Sbjct: 223 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 282

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  +ES  V AGR+
Sbjct: 283 TWNEDWGEKGYFRIVRGRNECSVESV-VVAGRL 314


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 53/81 (65%), Gaps = 1/81 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ E + D   YK GVY         GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDE 80
           NSW + WGD GLFKIRRGT+E
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNE 322


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 62/102 (60%), Gaps = 1/102 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +Q EI+  G +  +   ++D   YK GVY+H  G++ GGHAVK IGWG  +DG  YW+  
Sbjct: 230 IQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVA 289

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           NSW   WG+ G F+I RG++E  IES  V+   + +   SD+
Sbjct: 290 NSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 331


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK GVY+H  G++ GGHAVK++GWG + GV YW   N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG++E  IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK GVY+H  G++ GGHAVK++GWG + GV YW   N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG++E  IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 3/88 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+ E + D + Y  G+Y+H  G + GGHAVKI+GWG++ GV YWL  N
Sbjct: 296 IQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 355

Query: 61  SWGELWGD---GGLFKIRRGTDESRIES 85
           SW   WG+    G F+I RG DE  IES
Sbjct: 356 SWNTDWGEDVFSGYFRILRGVDECGIES 383


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 63/95 (66%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +Q EI  +G + A +  +++ + YK+GVY+ T GE+ G H VK+IGWGV E G++YWL +
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAM 286

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSW   WG+ GLFKI RG +   IE   V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGLVD 320


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 57/89 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK GVY+H  G++ GGHAVK++GWG + GV YW   N
Sbjct: 251 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 310

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG++E  IES  V+
Sbjct: 311 SWNTDWGDKGYFLIQRGSNECGIESGGVA 339


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A    ++D + YK GVYQHT G+  GGH VKI G+G  +GV YW   NSW 
Sbjct: 224 EISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTLNGVNYWSVANSWT 283

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G+F I+RG+DE  IE  +V AG
Sbjct: 284 TSWGDNGIFLIKRGSDECGIED-EVVAG 310


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 51/77 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   AA   ++D + YK GVY+HT G   GGHAV+IIGWG E GV YWL +N
Sbjct: 152 IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 211

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G FKI +G
Sbjct: 212 SWNEEWGDHGTFKIVQG 228


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G++ GGHAVK+IGWG  +DG  YWL  
Sbjct: 245 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKIRRGT+E  IE
Sbjct: 305 NQWNRGWGDDGYFKIRRGTNECGIE 329


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/91 (48%), Positives = 58/91 (63%), Gaps = 8/91 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-----DGVKY 55
           EI   G + A +  +QD  +YK G+Y+H+       SG H+V+IIGWG E       +KY
Sbjct: 402 EILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKY 461

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           WL VNSWG  WG+ GLFKI+RGT+E  IES+
Sbjct: 462 WLVVNSWGYNWGENGLFKIQRGTNECEIESY 492


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/90 (45%), Positives = 54/90 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q ++   G I A  E + D + Y  G+Y H  G   G  +V+IIGWGV  GV YWLC N
Sbjct: 241 IQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVWQGVPYWLCAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SWG  WG+ G F++ RGT+E  +ES  VS 
Sbjct: 301 SWGRQWGENGTFRVLRGTNECGLESNCVSG 330


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G++ GGHAVK+IGWG  +DG  YWL  
Sbjct: 246 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLA 305

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKIRRGT+E  IE
Sbjct: 306 NQWNRGWGDDGYFKIRRGTNECGIE 330


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 41/90 (45%), Positives = 56/90 (62%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E+F+ G +  +   ++D   Y+ GVY+H  G   GGHAVK+IGWG  +DG+ YWL  
Sbjct: 238 IMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLIA 297

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           NSW   WG+GG FKI RG +E  IE   V+
Sbjct: 298 NSWNTAWGEGGYFKIARGVNECGIERDPVA 327


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNADWGDNGLFKIRRGTNECGIDN 330


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E  I++
Sbjct: 305 NSWNADWGDNGLFKIRRGTNECGIDN 330


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 57/85 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A  + ++D+++YK GVY+H  GE  G HAV+IIGWG + G+ YWL  N
Sbjct: 241 IRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWGRDGGIPYWLIAN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           S+G+ WGD G FK  RG++   IES
Sbjct: 301 SYGDDWGDHGYFKFVRGSNHLGIES 325


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 55/85 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +    + ++D+ +YK GVY+H  GE  G HAV+IIGWG E G+ YWL  N
Sbjct: 246 IRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGREGGIPYWLISN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           S+GE WGD G FKI RG +   IES
Sbjct: 306 SYGEDWGDHGYFKIVRGINHLGIES 330


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 63/95 (66%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
           +Q EI  +G + A +  +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 286

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSW   WG+ GLFKI RG +   IE   V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGIVD 320


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 61/94 (64%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  GVYQHT    G  +G H+VK++GWG E +GVKYW
Sbjct: 326 IMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYW 385

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 386 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 419


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ +I  +G + A+ + + DL +YK G+Y+ +   +  GGH++KIIGWG EDG  YWL V
Sbjct: 239 IEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WGD G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGRNECGIER-AVTAG 329


>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
 gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
          Length = 488

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 55/99 (55%), Gaps = 9/99 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---------GEMSGGHAVKIIGWGVED 51
           M  E++H G +  A E + D   YK GVY H+          G     HAV ++GWG E+
Sbjct: 385 MMYELYHGGPLAIAFEVYDDFFNYKGGVYTHSTALKTKIAEPGWEETNHAVLLVGWGEEN 444

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           GV YWL  NSWG  WG  G FKI+RGTDE   ES  VSA
Sbjct: 445 GVPYWLVKNSWGTSWGINGFFKIKRGTDECDCESEAVSA 483


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 89.4 bits (220), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 63/93 (67%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G + A +E ++D + YK G+Y++T G+   GHAV++IG GVE+G  YWL  N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGVENGTAYWLAAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           +W E WG+ G F+I RG +E  IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+   G++  A   ++D ++YK+GVY H  G   GGHAVK+IG+G EDG  YWL VN
Sbjct: 406 IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVN 465

Query: 61  SWGELWGDGGLFKIRRGTDESRIES-FQVSAGRVDRDRSSDL 101
           SW E WGD G FKI  G  E+ I+  F     +V  D+++ L
Sbjct: 466 SWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPNDKNASL 505


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 56/93 (60%), Gaps = 1/93 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +V++   + D   Y KG+Y+HT G+  G HA+KIIGWG E  V YW+  N
Sbjct: 253 IQREIMRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGTEKNVPYWIIAN 312

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SW   WG+ G F++ RGT+   IE   V AG V
Sbjct: 313 SWHNDWGEKGFFRMVRGTNHCGIEE-DVVAGHV 344


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G +       QD  +Y+ G+Y H  G+  G HAV++IGWGVE+GV YWL  NSW 
Sbjct: 249 EIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWN 308

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           E WG+ G F++ RG +E  IES +V AG
Sbjct: 309 EEWGENGYFRMVRGRNECGIES-EVVAG 335


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+   G++  A   ++D ++YK+GVY H  G   GGHAVK+IG+G EDG  YWL VN
Sbjct: 406 IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVN 465

Query: 61  SWGELWGDGGLFKIRRGTDESRIES-FQVSAGRVDRDRSSDL 101
           SW E WGD G FKI  G  E+ I+  F     +V  D+++ L
Sbjct: 466 SWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPNDKNASL 505


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEESGVPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKI+RGT+E  +++
Sbjct: 305 NSWNTDWGDKGLFKIQRGTNECGVDN 330


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/115 (42%), Positives = 65/115 (56%), Gaps = 4/115 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G    +   ++D   YK GVY+H  G   GGHAVK+IGWG  EDG  YWL  
Sbjct: 240 IMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLA 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIES 114
           N W   WGD G FKI RGT+E  IE   V+AG +   ++ D+E    D D+ + S
Sbjct: 300 NQWNRSWGDDGYFKIIRGTNECGIE--DVTAG-MPSTKNLDIESGVRDDDSLVAS 351


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 3/97 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY---QHTVGEMSGGHAVKIIGWGVEDGVKYWL 57
           +Q EI   G +VAA+E  +  + YK GVY   +       G HAVK+IGWG +  + YWL
Sbjct: 247 IQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWGEQKRIPYWL 306

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            VNSW   +G+ GLFKIRRGT+E  IE+  V+AG  +
Sbjct: 307 VVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGLAE 343


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 41/74 (55%), Positives = 50/74 (67%)

Query: 7   HFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELW 66
           +F  + A+   ++D + Y+ GVY+HT G+  GGHAVKIIGWG E G  YWL VNSW E W
Sbjct: 329 NFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDW 388

Query: 67  GDGGLFKIRRGTDE 80
           GD GLFKI  G  E
Sbjct: 389 GDNGLFKIALGNCE 402


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+   G +    E + D   YK GVYQH  G + GGHAV+++GWG E+ V YWL  NSW 
Sbjct: 29  ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 88

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
             WGD G FKI RG +E  IES  V+AG
Sbjct: 89  TDWGDNGYFKIIRGKNECGIES-DVNAG 115


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   ++D   Y  GVY HT G   GGHAVKI+GWGV++G  YWL  N
Sbjct: 248 IQTEILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGVDNGTPYWLVAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
           SW   WG+ G F+I RG +E  IE   V AG  D  R
Sbjct: 308 SWNVNWGEKGYFRIIRGLNECGIEHSAV-AGIPDLTR 343


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G +  A   + D  +YK GVY+    +  GGHA++I+GWGVE+GV +WL  N
Sbjct: 246 IQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVENGVPFWLVAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG++E  IE
Sbjct: 306 SWNTEWGDKGYFKILRGSNECGIE 329


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 2/91 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
           +Q EI + G + A +  + D + YK GVYQ       GGHAV+I+GWGV+   K  YWL 
Sbjct: 259 IQYEIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLV 318

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
            NSW   WG+ G F+IRRGTDES IES+ V+
Sbjct: 319 ANSWNTDWGEDGYFRIRRGTDESYIESWGVN 349


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 66/105 (62%), Gaps = 3/105 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + AA   ++D   YK GVY+H  G   GGHAVKIIGWG++   +YWL +N
Sbjct: 284 IKREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQNEQYWLVMN 343

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFE 105
           SW   WGD G+FKI  G  E  I+S +V+AG    +++S +E+ E
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDS-EVTAGIPKYEKTSGVEQSE 385


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 40/70 (57%), Positives = 47/70 (67%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A+   ++D + YK GVY+HT G   GGHAVKIIGWG + G  YWL VNSW E WGD
Sbjct: 299 GPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 358

Query: 69  GGLFKIRRGT 78
            GLFKI  G 
Sbjct: 359 KGLFKIALGN 368


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 51/81 (62%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+ + G + A  E  +D   Y+ GVYQH  G   G H V ++GWG E+GV YWL  NSWG
Sbjct: 183 EVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQNSWG 242

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+IRRGT++  I+
Sbjct: 243 SGWGEKGFFRIRRGTNDCHID 263


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 60/102 (58%), Gaps = 1/102 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG  E G  YWL V
Sbjct: 244 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIV 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           NSW   WG+ G FKIRRGT+E  IE   V+     R+ + +L
Sbjct: 304 NSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL 345


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  ++ + YWL 
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 GNSWNTDWGDQGFFRILRGQDHCGIES-SISAG 336


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 46/107 (42%), Positives = 61/107 (57%), Gaps = 9/107 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS----GGHAVKIIGWGVE-----DGVK 54
           EI   G + A ++ +QD   YK GVY  +  E      G H+VKI+GWG E       +K
Sbjct: 333 EIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIK 392

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
           YWL  NSWG+ WG+ G FKIRRGT+E  IE F ++A     D S ++
Sbjct: 393 YWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWAETNDPSREI 439


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY  T      GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPYWLLV 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW + WGD GLFKI RGT+E  I++
Sbjct: 302 NSWNDQWGDQGLFKILRGTNECGIDN 327


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV  ++ + YWL 
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I  + + + D   YK G+Y +       GGH+VK+IGWG E GV YWL V
Sbjct: 243 IQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 302

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E R+++
Sbjct: 303 NSWNADWGDKGLFKIRRGTNECRVDN 328


>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
          Length = 575

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 42/99 (42%), Positives = 61/99 (61%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y +G+Y HT         E++  HAV ++G+G +   
Sbjct: 472 MKLELVHHGPMAVAFEVYDDFLHYHRGIYHHTGLTDPFNPFELTN-HAVLLVGYGTDSAT 530

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A
Sbjct: 531 GIQYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAA 569


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 54/90 (60%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E+   G +  A   ++D   YK GVY+H  G+  GGHAVK+IGWG  EDG  YWL  
Sbjct: 227 IMAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLA 286

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G FKI+RGT+E  IE   V+
Sbjct: 287 NQWNRGWGDDGYFKIKRGTNECGIEGAVVA 316


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 41/70 (58%), Positives = 48/70 (68%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A+   ++D + YK GVY+HT G   GGHAVKIIGWG E+G  YWL VNSW E WGD
Sbjct: 234 GPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 293

Query: 69  GGLFKIRRGT 78
            GLFKI  G 
Sbjct: 294 HGLFKIALGN 303


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 59/91 (64%), Gaps = 8/91 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVE-----DGVKY 55
           EI   G + A +  +QD  IYK G+Y+H+   E+  SG H+V+IIGWG E       +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKY 403

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           WL  NSWG  WGD GLFKI++GT+E  IES+
Sbjct: 404 WLVANSWGYNWGDNGLFKIQKGTNECEIESY 434


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 53/85 (62%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E+   G +  A   ++D   YK GVY+H  G++ GGHAVK+IGWG  +DG  YWL  
Sbjct: 213 IMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLLA 272

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKIRRGT+E  IE
Sbjct: 273 NQWNRGWGDDGYFKIRRGTNECGIE 297


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  GEM GGHAVK+IGWG   DG  YWL  
Sbjct: 242 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLLA 301

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RG +E  IE
Sbjct: 302 NQWNRGWGDDGYFKIIRGKNECGIE 326


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVYQH  G+  GGHA++I+GWGV   + + YWL 
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGNEKIPYWLI 304

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 47/96 (48%), Positives = 63/96 (65%), Gaps = 11/96 (11%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVED-------GV 53
           EI   G + A ++ +QD   Y+ G+Y HT + E+  SG H+V+IIGWG ED        +
Sbjct: 338 EILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWG-EDISTDSGLPI 396

Query: 54  KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           KYWL VNSWG+ WG+ GLF+IRRG +E  IESF V+
Sbjct: 397 KYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 50/77 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G +  A   + D   Y+ GVY  T  +++GGHA+KI+G+GVE+G  YWLC N
Sbjct: 203 IQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGVENGTPYWLCAN 262

Query: 61  SWGELWGDGGLFKIRRG 77
           SWG  WG  G FKI++G
Sbjct: 263 SWGPAWGMSGFFKIKQG 279


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 58/93 (62%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEA-HQDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           +Q E+ ++G +  A      D  +YK GVY+ T   E       K+IGWGVE+GV YWL 
Sbjct: 238 IQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLL 297

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           VNSWG  WG  GLFKI+RGTDE  IE+F V AG
Sbjct: 298 VNSWGYEWGQNGLFKIKRGTDECNIETF-VHAG 329


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 50/82 (60%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EIF  G +V +     D  IYKKGVY     + +G HAVKIIGWGV+DG+KYWL  NSW 
Sbjct: 252 EIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWN 311

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WGD G  +  RG +   IES
Sbjct: 312 NDWGDEGYVRFLRGDNHCGIES 333


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G +       QD  +Y+ G+Y H  G+  G HAV++IGWGVE+GV YWL  NSW 
Sbjct: 157 EIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWN 216

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           E WG+ G F++ RG +E  IES +V AG
Sbjct: 217 EEWGENGYFRMVRGRNECGIES-EVVAG 243


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G +  A   ++DLI+YK GVY+H  G+  GGHA++I+GWGV  +  V YWL 
Sbjct: 247 IQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWLI 306

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW   WGD G F+I RG D   IES  +SAG
Sbjct: 307 GNSWNTDWGDNGFFRIVRGEDHCGIES-AISAG 338


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +VAA + + D  IY+ GVY +T G + G  AVKIIGWG E+G  YWL  NSWG+ WG 
Sbjct: 228 GPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGA 287

Query: 69  -GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
            GG FKIRRGT+E   E   + AG+V    S+
Sbjct: 288 LGGFFKIRRGTNECGFEE-SIIAGQVREGGST 318


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D   YK GVY +       GGHAVK+IGWG E GV YWL V
Sbjct: 243 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWGEEYGVPYWLMV 302

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW E WGD G FKI+RGT+E  +++
Sbjct: 303 NSWNEDWGDHGFFKIQRGTNECGVDN 328


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 53/86 (61%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A+   + D   Y+ GVYQH +G +SG H+VKI+GWG E+G  YWL  N
Sbjct: 226 IQAEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWGRENGTDYWLVAN 285

Query: 61  SWGELWGD-GGLFKIRRGTDESRIES 85
           SWG  WG  GG FK  RG +   IES
Sbjct: 286 SWGRDWGRLGGFFKFLRGENHCDIES 311


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 50/85 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G I+ + + +QD   Y  GVY H  G  +G H VKI+GWG E    YWL  N
Sbjct: 226 IQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKEQDYWLIAN 285

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G FKI RG +E  IE+
Sbjct: 286 SWGSSWGEHGFFKILRGKNECGIEN 310


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G M GGHAVK+IGWG  D G  YWL  
Sbjct: 291 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 350

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RGT+E  IE
Sbjct: 351 NQWNRGWGDDGYFKIIRGTNECGIE 375


>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
           C
          Length = 441

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 336 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 394

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 395 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 438


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGV-KYWLC 58
           +QLEI   G +VA+   + D I Y  GVY+     ++ GGHAV+IIGWG+E+G   YWL 
Sbjct: 166 IQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLV 225

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW E WGD GLFKI RG +E  IE  +++AG
Sbjct: 226 SNSWNERWGDQGLFKIWRGKNECGIEE-EITAG 257


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 52/84 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  E++  G +      ++D   YK GVY++  G+  GGHAVK+IGWG E+G  YWL  N
Sbjct: 240 IMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVAN 299

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G FKI RG++E  IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECSIE 323


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGV-KYWLC 58
           +QLEI   G +VA+   + D I Y  GVY+     ++ GGHAV+IIGWG+E+G   YWL 
Sbjct: 236 IQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLV 295

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NSW E WGD GLFKI RG +E  IE  +++AG
Sbjct: 296 SNSWNERWGDQGLFKIWRGKNECGIEE-EITAG 327


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G F IRRGT+E  IE   V+AG
Sbjct: 305 NQWNRSWGDDGYFMIRRGTNECGIEE-DVTAG 335


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 56/90 (62%), Gaps = 3/90 (3%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE--DGVKYWLCVNS 61
           EI   G + A      D + YK GVY++  G + GGHA++IIGWGV   +   YWLC NS
Sbjct: 255 EILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANS 314

Query: 62  WGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           W + WGD G FKI RG++E  IES  V+AG
Sbjct: 315 WNKQWGDKGYFKILRGSNECGIESM-VTAG 343


>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
          Length = 316

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 213 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 271

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 272 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 315


>gi|449485032|ref|XP_002188357.2| PREDICTED: dipeptidyl peptidase 1 [Taeniopygia guttata]
          Length = 667

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 64/104 (61%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           M+LE+ H G +  A E + D ++YK+G+Y HT         E++  HAV ++G+G   E 
Sbjct: 564 MKLELVHHGPMAVAFEVYNDFMLYKEGIYHHTGLQDDLNPFELTN-HAVLLVGYGKDPES 622

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G K+W+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 623 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 666


>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
 gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
          Length = 464

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/92 (48%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G + A+     D + YK GVY +    +  GGH+VKIIGWGVE G  YWL  
Sbjct: 246 IQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPYWLIA 305

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW E WG+ GLFK+ RG +E  IE+ +V AG
Sbjct: 306 NSWNEDWGENGLFKMLRGKNECGIEA-EVVAG 336


>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
 gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
 gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
 gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
          Length = 464

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 343 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 401

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 402 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 445


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 54/84 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ E+   G +  +   ++D + YK G+YQH  G+  GGHAVK++GWGVEDG++YW   N
Sbjct: 227 IKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGIEYWKIAN 286

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WG+ G F+I  G  E  IE
Sbjct: 287 SWNEDWGENGYFRIVAGKGECGIE 310


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G +  A   + D  +YK GVY+    +  GGHA++I+GWGVE+ V YWL  N
Sbjct: 247 IQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVENDVPYWLVAN 306

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG++E  IE
Sbjct: 307 SWNTEWGDKGYFKILRGSNECGIE 330


>gi|1582221|prf||2118248A prepro-cathepsin C
          Length = 463

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           I   G +  A   ++D   Y  G+Y H  GE +GGHAVK +GWGVE+G KYW   NSW  
Sbjct: 200 IAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVENGTKYWKVANSWNP 259

Query: 65  LWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            WG+ G F+I RG++E  IE  QV+    D
Sbjct: 260 YWGEAGYFRILRGSNEGGIED-QVTGSHAD 288


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G M GGHAVK+IGWG  D G  YWL  
Sbjct: 246 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 305

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RGT+E  IE
Sbjct: 306 NQWNRGWGDDGYFKIIRGTNECGIE 330


>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
 gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
 gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
 gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
 gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
 gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
 gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 62/95 (65%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
           +Q EI   G + A +  +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 227 IQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 286

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSW   WG+ GLFKI RG +   IE   V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGIVD 320


>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 51/85 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A+   H D + YK G+Y+H  G   G H V+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKETPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ G F++ RG DE  IES
Sbjct: 309 SWNEDWGEKGYFRMLRGKDECGIES 333


>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY++  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G FKIRRGT+E  IE   V+
Sbjct: 305 NQWNRSWGDDGYFKIRRGTNECGIEQSVVA 334


>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
          Length = 463

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDPAT 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           GV YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GVDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A    +D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 247 IMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI+RGT+E  IE   V+AG
Sbjct: 307 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 337


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q+EI   G ++A+   + D   YK G+Y HT G+  GG   KIIGWGV++GV YWLCV+
Sbjct: 259 IQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 318

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
            WG  +G+ G  +  RG +E  IE  QV A   D ++
Sbjct: 319 QWGTDFGENGFVRFLRGVNEVNIE-HQVLAALPDSEK 354


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A    +D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 247 IMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI+RGT+E  IE   V+AG
Sbjct: 307 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 337


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 51/85 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +  A   H D + YK G+Y++  G   G HAV+IIGWGVE    YWL  N
Sbjct: 249 IKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGVEKKTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WG+ G F++ RG DE  IES
Sbjct: 309 SWNEDWGEKGYFRMLRGKDECGIES 333


>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
          Length = 69

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/67 (62%), Positives = 47/67 (70%), Gaps = 1/67 (1%)

Query: 25 KKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIE 84
          K GVYQH  G   GGHA+KI+GWG EDGV YWLC NSW   WGD G FK  RG+D  RIE
Sbjct: 1  KFGVYQHVYGSAVGGHAIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIE 60

Query: 85 SFQVSAG 91
          S ++ AG
Sbjct: 61 S-EIVAG 66


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 50/81 (61%), Gaps = 1/81 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDE 80
           N W   WGD G FKIRRGT+E
Sbjct: 305 NQWNREWGDDGYFKIRRGTNE 325


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 61/95 (64%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
           ++ +I   GS+VA    ++D   Y+ G+Y+HT G  +GG HAVK+IGWG ++G  YWL  
Sbjct: 251 IRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKDNGTDYWLIA 310

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSW + WG+ G F++ RG +   IE  QV AG VD
Sbjct: 311 NSWHDDWGENGFFRMIRGINNCGIEE-QVDAGIVD 344


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 51/84 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EIF  G + A    + D + YK GVYQ    +  G HA++I+GWG E+G  YWL  N
Sbjct: 243 IQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGTENGTPYWLAAN 302

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WGD G FKI R T+E  IE
Sbjct: 303 SWNENWGDKGYFKILRRTNECGIE 326


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY++  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 267 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 326

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G FKIRRGT+E  IE   V+
Sbjct: 327 NQWNRSWGDDGYFKIRRGTNECGIEQSVVA 356


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 53/90 (58%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  EI+  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 248 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGEDYWLLA 307

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G F IRRGT+E  IE   V+
Sbjct: 308 NQWNRSWGDDGYFMIRRGTNECGIEDEPVA 337


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A    +D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI+RGT+E  IE   V+AG
Sbjct: 305 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 335


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 52/84 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ +I+  G + +A   + D   YK GVYQ  + +  G HA+KI+GWG EDGV YWL  N
Sbjct: 237 IKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIKILGWGTEDGVPYWLVAN 296

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G FKI RG DE  IE
Sbjct: 297 SWNVGWGDKGYFKILRGKDECGIE 320


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 56/89 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +E+   G +   ++ + D + YK G Y+H  G++ GGHAVK++GWG + GV YW   N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WGD G F I+RG++E  IES  V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 37/69 (53%), Positives = 45/69 (65%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +  A   ++D + YK GVY+H  G   GGHA+KIIGWG E+G +YW  VNSW   WGD
Sbjct: 453 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 512

Query: 69  GGLFKIRRG 77
           GG FKI  G
Sbjct: 513 GGQFKIAMG 521


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 37/69 (53%), Positives = 45/69 (65%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +  A   ++D + YK GVY+H  G   GGHA+KIIGWG E+G +YW  VNSW   WGD
Sbjct: 456 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 515

Query: 69  GGLFKIRRG 77
           GG FKI  G
Sbjct: 516 GGQFKIAMG 524


>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
 gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 60/94 (63%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  GVY+ T    G  +G H+VKI+GWG E DGVKYW
Sbjct: 325 IMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGPWWGEHGYFRILRGSNECGIEEYVLAS 418


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY++  G   GGHAVK+IGWG  +DG  YWL  
Sbjct: 71  IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 130

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           N W   WGD G FKIRRGT+E  IE   V+ 
Sbjct: 131 NQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 161


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 37/69 (53%), Positives = 45/69 (65%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +  A   ++D + YK GVY+H  G   GGHA+KIIGWG E+G +YW  VNSW   WGD
Sbjct: 453 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 512

Query: 69  GGLFKIRRG 77
           GG FKI  G
Sbjct: 513 GGQFKIAMG 521


>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
          Length = 463

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 59/91 (64%), Gaps = 8/91 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVEDG-----VKY 55
           EI   G + A +  +QD  +YK GVY+H+   E+  SG H+++IIGWG E       +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKY 403

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           WL  NSWG  WG+ GLF+I+RGT+E  IES+
Sbjct: 404 WLVANSWGRHWGENGLFRIQRGTNECEIESY 434


>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
 gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
 gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
 gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
 gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
 gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
          Length = 463

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + YKKG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 1/113 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK++GWG   +G  YWL  
Sbjct: 244 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLA 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTI 112
           N W   WGD G FKI+RGT+E  IE+   +     ++   ++ + + D D + 
Sbjct: 304 NQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVSF 356


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 1/113 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK++GWG   +G  YWL  
Sbjct: 239 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLA 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTI 112
           N W   WGD G FKI+RGT+E  IE+   +     ++   ++ + + D D + 
Sbjct: 299 NQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVSF 351


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 55/90 (61%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E+F  G I  A +  +D   YK GVY+H  G   GGHAVK++GWG  +DGV YW  V
Sbjct: 244 LMAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMV 303

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           NSW   WG+ G F+I RG DE  IES  V+
Sbjct: 304 NSWNTNWGEDGTFRILRGKDECGIESNAVA 333


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ ++  +G + A+ + + D  +YK G+Y+ T   +  GGH++KIIGWG E+G  YWL V
Sbjct: 239 IEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WGD G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGRNECGIER-AVTAG 329


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 56/86 (65%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G I A+ + + D + YK GVY +       GGHAVK+IGWG E G  YWL +
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTPYWLMM 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E  +++
Sbjct: 305 NSWNADWGDEGLFKIRRGTNECGVDN 330


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/70 (55%), Positives = 47/70 (67%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A+   ++D + Y+ GVY+HT G   GGHAVKIIGWG + G  YWL VNSW E WGD
Sbjct: 74  GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 133

Query: 69  GGLFKIRRGT 78
            GLFKI  G 
Sbjct: 134 HGLFKIALGN 143


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 48/81 (59%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F  G    A + ++D I Y  GVY H  G+  GGHAV+++GWG  +GV YW   NSW 
Sbjct: 246 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 305

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F IRRG+ E  IE
Sbjct: 306 TEWGMDGYFLIRRGSSECGIE 326


>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
          Length = 460

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y KG+Y HT         E++  HAV ++G+G +   
Sbjct: 357 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 415

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 416 GIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459


>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
          Length = 463

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDLAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 149

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 51/74 (68%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G +  A + ++D  +YK GVY HT G++ G H +KIIGWGVE G +YWL +N
Sbjct: 50  IKQEIFEHGPVFCAFDMYKDFGLYKSGVYVHTTGDLVGSHTLKIIGWGVESGQEYWLAMN 109

Query: 61  SWGELWGDGGLFKI 74
           SW E WGD GL K+
Sbjct: 110 SWNEEWGDHGLIKM 123


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 97  IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIA 156

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G F IRRGT+E  IE   V+AG
Sbjct: 157 NQWNRSWGDDGYFMIRRGTNECGIEE-DVTAG 187


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 48/81 (59%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F  G    A + ++D I Y  GVY H  G+  GGHAV+++GWG  +GV YW   NSW 
Sbjct: 224 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 283

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F IRRG+ E  IE
Sbjct: 284 TEWGMDGYFLIRRGSSECGIE 304


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           ++ +I  +G + A+ + + D I YK G+YQ T   +  GGH+VK+IGWG EDG+ YWL V
Sbjct: 240 IEQDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLV 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSW + WG+ G F+I +G +E  IE
Sbjct: 300 NSWSKFWGEQGTFRIIKGRNECGIE 324


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 48/81 (59%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F  G    A + ++D I Y  GVY H  G+  GGHAV+++GWG  +GV YW   NSW 
Sbjct: 246 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 305

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F IRRG+ E  IE
Sbjct: 306 TEWGMDGYFLIRRGSSECGIE 326


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 48/81 (59%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E+F  G    A + ++D I Y  GVY H  G+  GGHAV+++GWG  +GV YW   NSW 
Sbjct: 223 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 282

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F IRRG+ E  IE
Sbjct: 283 TEWGMDGYFLIRRGSSECGIE 303


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G M GGHAVK+IGWG  D G  YWL  
Sbjct: 122 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 181

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RGT+E  IE
Sbjct: 182 NQWNRGWGDDGYFKIIRGTNECGIE 206


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 54/97 (55%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG  D G  YWL  
Sbjct: 250 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLLA 309

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           N W   WGD G F IRRGT+E  IE   V+     R+
Sbjct: 310 NQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRN 346


>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
          Length = 561

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  EI+  G I  A++A  +L+ YK G+++   G  S  HA+ ++GWG EDG KYW+  N
Sbjct: 184 MMKEIYARGPITCALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRN 243

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G F+I RGT+   IES
Sbjct: 244 SWGTYWGENGWFRIVRGTNNLGIES 268



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           M+ EIF  G I   +   Q+ + Y  GV+      M GGH +++ GWGV EDG +YW+  
Sbjct: 465 MKAEIFARGPISCYVSVSQEFLDYTGGVFVEHDHSMLGGHIIEVAGWGVTEDGQEYWIGR 524

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWGE WG+ G F+I+   D   IES
Sbjct: 525 NSWGEYWGENGWFRIQTDKDNLEIES 550


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 57/85 (67%), Gaps = 2/85 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G+++  I  ++D  +YK GVY HT G + G H++KIIGWGVE G  YWL VN
Sbjct: 66  IKQEIFDNGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVN 125

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW E WGD G+ K+  G  E+ IE+
Sbjct: 126 SWNEEWGDHGMIKLAVG--ETGIEN 148


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 11/100 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---GGHAVKIIGWGVEDG----- 52
           +  EI   G + A ++ +QD   Y+ G+Y+HT        G H+V+IIGWG +       
Sbjct: 340 IMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYR 399

Query: 53  ---VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
              +KYWL VNSWG+ WG+ GLF+I+RGT+E  IESF V+
Sbjct: 400 NLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 2/96 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           ++ +I  +G + A+ + + D I YK G+YQ T      GGH+VK+IGWG EDG+ YWL V
Sbjct: 240 IEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLV 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           NSW + WG+ G F+I +G +E  IE    +AG   R
Sbjct: 300 NSWSKFWGEQGTFRIIKGRNECGIER-SATAGVPSR 334


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 58/99 (58%), Gaps = 1/99 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G ++A+   + D   YK G+Y HT G+  GG   KIIGWGV+ GV YWLCV+
Sbjct: 261 IQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDSGVPYWLCVH 320

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
            WG  +G+ G  +  RG +E  IE  QV A   D D+ +
Sbjct: 321 QWGTDFGENGFVRFLRGVNEVNIE-HQVLAALPDIDKHN 358


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 40/96 (41%), Positives = 60/96 (62%), Gaps = 9/96 (9%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----DGVK 54
           EI   G++ A +  ++D   Y+ G+Y+H+       E S  H+V++IGWG E    D VK
Sbjct: 436 EIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVK 495

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           YW+ VNSWG  WG+ G F+I RGT+E  IES+ +++
Sbjct: 496 YWIAVNSWGTWWGENGRFRILRGTNECEIESYVLAS 531


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++ ++G I A+ + + D   YK G+Y +       GGH+VK+IGWG E GV YWL V
Sbjct: 182 IQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 241

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKIRRGT+E  +++
Sbjct: 242 NSWNADWGDKGLFKIRRGTNECGVDN 267


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 58/94 (61%), Gaps = 3/94 (3%)

Query: 1   MQLEIFHFGSIVAAIEAH-QDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           +Q E+ ++G +  A +    D  +YK GVY+ T   E       K+IGWGVE+GV YWL 
Sbjct: 238 IQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLL 297

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
           VN WG  WG  GLFKI+RGTDE  IE+F V AG 
Sbjct: 298 VNFWGYEWGQNGLFKIKRGTDECNIETF-VHAGE 330


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 11/100 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---GGHAVKIIGWGVEDG----- 52
           +  EI   G + A ++ +QD   Y+ G+Y+HT        G H+V+IIGWG +       
Sbjct: 340 IMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHH 399

Query: 53  ---VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
              +KYWL VNSWG+ WG+ GLF+I+RGT+E  IESF V+
Sbjct: 400 NLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439


>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 109

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/72 (56%), Positives = 49/72 (68%)

Query: 9  GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
          G + A+   ++D + Y+ GVY+HT G+  GGHAVKIIGWG E G  YWL VNSW E WGD
Sbjct: 26 GPVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDWGD 85

Query: 69 GGLFKIRRGTDE 80
           GLFKI  G  E
Sbjct: 86 NGLFKIALGNCE 97


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 39/69 (56%), Positives = 46/69 (66%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A    ++D + YK GVY+HT G + G HAVKIIGWG + G  YWL VNSW E WGD
Sbjct: 315 GPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWGEDGGEAYWLVVNSWNEGWGD 374

Query: 69  GGLFKIRRG 77
            GLFKI  G
Sbjct: 375 HGLFKIALG 383


>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
 gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 57/94 (60%), Gaps = 6/94 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
           MQ  I   GSI+  ++ +QD + Y  GVYQH+    +      V+IIGWGVE+GVKYW+ 
Sbjct: 167 MQQAILQGGSIMTELDMYQDFLYYSSGVYQHSANLRQPIAKFVVRIIGWGVENGVKYWIV 226

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIE----SFQV 88
            N WG+ WG  G   IRRG +ES IE    +FQV
Sbjct: 227 PNIWGKTWGMQGYIWIRRGNNESNIEKDAFAFQV 260


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 56/91 (61%), Gaps = 3/91 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ +    ++D + Y+ GVY+H      GGHAV +IGWGVE G  YWL VN
Sbjct: 263 IKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVESGTNYWLAVN 322

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SWG  WG  G FKI +G  E  IE+ QV AG
Sbjct: 323 SWGSNWGMSGYFKIAQG--ECGIEN-QVYAG 350


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 51/84 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G + A      D + YK GVYQ T G+  G HAVKIIGWG E+GV YW  +N
Sbjct: 241 IKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTENGVPYWEAIN 300

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW + WG  G FKI RG +   IE
Sbjct: 301 SWNDGWGINGKFKILRGFNHLDIE 324


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G M GGHAVK+IGWG  D G  YWL  
Sbjct: 30  IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 89

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RGT+E  IE
Sbjct: 90  NQWNRGWGDDGYFKIIRGTNECGIE 114


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q ++  +G + A+ + + D   YK GVY +       GGHA K+IGWG E GV YWL V
Sbjct: 245 IQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVPYWLMV 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSW   WGD GLFKI+RGT+E  I++
Sbjct: 305 NSWNADWGDNGLFKIQRGTNECGIDN 330


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ +I  +G + A+ + + D  +YK G+Y+ T   +   GH+VKIIGWG E+G  YWL V
Sbjct: 239 IEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWGQENGTPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WGD G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGKNECGIER-AVTAG 329


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A  E H+D   YK G+Y H  G   GGHA++I+GWG E+GV YWL  NSW 
Sbjct: 247 EILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGVPYWLIANSWN 306

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           E WG+ G  +  RG +E  IE  + +AG  D
Sbjct: 307 EDWGEKGYLRFLRGHNECGIEE-EATAGLPD 336


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/115 (41%), Positives = 63/115 (54%), Gaps = 4/115 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G    +   ++D   YK GVY+H  G   GGHAVK+IGWG  EDG  YWL  
Sbjct: 240 IMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLA 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIES 114
           N W   WG  G FKI RGT+E  IE   V+AG     ++ D+E    D D+ + S
Sbjct: 300 NQWNRSWGGDGYFKIIRGTNECGIE--DVTAG-TPSTKNLDIESGVRDDDSLVAS 351


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ +I  +G + A+ + + DL  YK G+Y+ T   +  GGH++KIIGWG ++G  YWL V
Sbjct: 239 IERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WG+ G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 7/94 (7%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE----DGVKYW 56
           EI   G + A +  H D  +Y+ GVY+++     + SG H+V+I+GWGV+    +  KYW
Sbjct: 341 EILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYW 400

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           L  NSWG LWG+ G F+I RG +ES IE F ++A
Sbjct: 401 LVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 61/105 (58%), Gaps = 5/105 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV---DRDRSSDL 101
           N W   WGD G FKI RG +E  IE   V+AG     + DR++D+
Sbjct: 305 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAGMPSTKNMDRNNDV 348


>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
          Length = 462

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y+KG+YQHT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVHHGPMAVAFEVYDDFLHYQKGIYQHTGLRDPFNPFELTN-HAVLLVGYGTDLAS 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRG DE  IES  ++A  + +
Sbjct: 418 GMDYWIVKNSWGTSWGEDGFFRIRRGIDECSIESIAMAATPIPK 461


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 48/77 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + AA   ++D   Y KG+Y HT G   G HAVK++GWGVE+G KYW   N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WG+ G F+I RG
Sbjct: 315 SWSTDWGENGYFRILRG 331


>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
          Length = 458

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y+KGVY HT         E++  HAV ++G+G +   
Sbjct: 355 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 413

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 414 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 457


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 58/93 (62%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQ---HTVGEMSGGHAVKIIGWGVEDGVKYWL 57
           +  +I   G + A +  +QD   Y+ GVY+   H   E+ G H+V+IIGWG + G +YW+
Sbjct: 327 IMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDRGDRYWV 386

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
             NSWG  WG+ G F+I RG++E+ IESF V+ 
Sbjct: 387 VANSWGRQWGENGYFRIARGSNEADIESFVVTG 419


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 48/77 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + AA   ++D   Y KG+Y HT G   G HAVK++GWGVE+G KYW   N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WG+ G F+I RG
Sbjct: 315 SWSTDWGENGYFRILRG 331


>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 105

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 39/70 (55%), Positives = 47/70 (67%)

Query: 9  GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
          G + A+   ++D + Y+ GVY+HT G   GGHAVKIIGWG + G  YWL VNSW E WGD
Sbjct: 22 GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 81

Query: 69 GGLFKIRRGT 78
           GLFKI  G 
Sbjct: 82 HGLFKIALGN 91


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y+HT   +         G H+VKI GWG E   DG
Sbjct: 355 ELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDG 414

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             VKYW   NSWG  WG+ G F+I RG +E  IESF V   GRV
Sbjct: 415 QKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVVGVWGRV 458


>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
 gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
          Length = 463

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y+KGVY HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 419 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462


>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
           leucogenys]
          Length = 463

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYEKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
 gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
          Length = 463

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y+KGVY HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 419 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 57/95 (60%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G +      + D  +YK GVY  T   +    H  K+IGWGVE+GV YWL V
Sbjct: 235 IQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLV 294

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           NSWG  WG  GLFKI+RGT+E  +E + V AG  +
Sbjct: 295 NSWGNEWGQNGLFKIKRGTNEVHVEDY-VYAGEPE 328


>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
          Length = 412

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M+LE+   G +  A E + D I YK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 309 MKLELVLHGPMAVAFEVYNDFIHYKEGIYHHTGLRDDFNPFELTN-HAVLLVGYGTDPQS 367

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G K+W+  NSWG LWG+ G F+IRRGTDE  IES  VSA  + +
Sbjct: 368 GEKFWIVKNSWGILWGENGYFRIRRGTDECAIESIAVSATPIAK 411


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G +VA    ++D   Y+ G+Y+H  G  +G HAVK+IGWG E G  YW+  N
Sbjct: 249 IQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW + WG+ G F++ RG+++   E  +++AG V 
Sbjct: 309 SWHDDWGENGFFRMHRGSNDCGFEE-RMAAGSVQ 341


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ ++   G++  A   ++D + YK GVY+H  G   GGHA+KIIGWG EDG +YW  VN
Sbjct: 339 VKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVN 398

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WGD G FKI  G
Sbjct: 399 SWNTYWGDSGHFKIEMG 415


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score = 85.5 bits (210), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ A    ++DL  YK GVY+H V  + GGHAV +IG+GVE G  YWL  N
Sbjct: 219 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 278

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG  G FKI +G  E  IE+ QV AG  
Sbjct: 279 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 308


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score = 85.5 bits (210), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ A    ++DL  YK GVY+H V  + GGHAV +IG+GVE G  YWL  N
Sbjct: 234 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 293

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG  G FKI +G  E  IE+ QV AG  
Sbjct: 294 SWGANWGMSGYFKIAQG--EGGIEN-QVYAGEA 323


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + +    + D + YK G+Y+H  G   G H V+I+GWGVE G  YWL  N
Sbjct: 249 IKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVGWGVEKGTPYWLIAN 308

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQV 88
           SW E WG+ G F+I RG DE  IES  +
Sbjct: 309 SWNEGWGEKGYFRILRGKDECDIESLVI 336


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 10/93 (10%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEM--SGGHAVKIIGWGVEDG-------V 53
           EI   G + A +  +QD   Y+ GVY+H+V  E+  S  H+V+IIGWG E         +
Sbjct: 341 EILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPL 400

Query: 54  KYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           KYWL  NSWG+ WG+ GLF+I++GT+E  IESF
Sbjct: 401 KYWLVANSWGQQWGENGLFRIQKGTNECEIESF 433


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 50/77 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   A    ++D + YK GVY+HT G + G H+V+IIGWG E GV YWL +N
Sbjct: 225 IKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVMN 284

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G FKI +G
Sbjct: 285 SWNEGWGDHGTFKIAQG 301


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 50/77 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G +  A   + D   Y+ GVY  T  +++GGHA+KI+G+GVE+G  YWLC N
Sbjct: 206 IKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGVENGTPYWLCAN 265

Query: 61  SWGELWGDGGLFKIRRG 77
           SWG  WG  G FKI++G
Sbjct: 266 SWGPSWGMQGFFKIKQG 282


>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
          Length = 463

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDPVS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGIGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 60/99 (60%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           MQLE+   G +  A E + D + YK GVY+HT         E++  HAV ++G+G   E 
Sbjct: 366 MQLELVKNGPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITN-HAVLLVGYGRDPET 424

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G K+W   NSWGE WG+ G F+IRRGTDE  IES  V+A
Sbjct: 425 GAKFWTVKNSWGEKWGEEGFFRIRRGTDECAIESIAVAA 463


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 48/77 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + AA   ++D   Y KG+Y HT G   G HAVK++GWGVE+G KYW   N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WG+ G F+I RG
Sbjct: 315 SWSTDWGEDGYFRILRG 331


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           M+ ++  +G I A+     DL  YK G+YQ T   +   GH++KIIGWG E+GV YWL V
Sbjct: 239 MEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSW + WG+ G F+I +G +E  IE
Sbjct: 299 NSWSKFWGEQGTFRIIKGRNECGIE 323


>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
          Length = 463

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y++G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVHNGPMAVAFEVYNDFLHYQEGIYHHTGLTDPFNPFELTN-HAVLLVGYGTDPAT 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTAWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 59/91 (64%), Gaps = 8/91 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVEDG-----VKY 55
           EI   G + A +  +QD  +Y+ GVY+H+   E+  SG H+V+IIGWG E       +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKY 403

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           WL  NSWG  WG+ GLF+I++GT+E  IES+
Sbjct: 404 WLVANSWGHNWGENGLFRIQKGTNECEIESY 434


>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 88

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 40/69 (57%), Positives = 47/69 (68%)

Query: 9  GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
          G   AA   ++D + YK GVY+HT G   GGHAV+IIGWG E GV YWL +NSW E WGD
Sbjct: 4  GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 63

Query: 69 GGLFKIRRG 77
           G FKI +G
Sbjct: 64 HGTFKIVQG 72


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 1/97 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG   +G  YWL  
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLMA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
           N W   WGD G F IRRGT+E  IE   V+     R+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSRN 343


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 49/117 (41%), Positives = 61/117 (52%), Gaps = 17/117 (14%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVE----- 50
           EI   G + A +E H+D  +YK G+Y+HT             G H+VKI GWG E     
Sbjct: 210 EIMENGPVQALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNG 269

Query: 51  DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFEY 106
             VK+W   NSWG  WG+GG F+I RG +E  IESF V   GRV    S D+    Y
Sbjct: 270 RRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVGVWGRVG---SEDMNHRRY 323


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 60/94 (63%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  G+Y+ T    G  +G H+VK++GWG E DGVKYW
Sbjct: 326 IMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYW 385

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 386 IAANSWGPWWGEHGYFRILRGSNECGIEEYVLAS 419


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 35/74 (47%), Positives = 48/74 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A +  ++D   YK GVY H  G+M   H +K+IGWGVE G +YWL VN
Sbjct: 318 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVN 377

Query: 61  SWGELWGDGGLFKI 74
           +W E WGD G+ K+
Sbjct: 378 AWNEEWGDHGMIKL 391


>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
          Length = 460

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y KG+Y HT         E++  HAV ++G+G +   
Sbjct: 357 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLKDPFNPFELTN-HAVLLVGYGTDPAS 415

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW   NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 416 GLNYWTVKNSWGTSWGENGYFRIRRGTDECAIESIAMAATPIPK 459


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 3/91 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ A    ++DL  YK GVY+H V  + GGHAV +IG+GVE G  YWL  N
Sbjct: 219 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGVEGGSNYWLAAN 278

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SWG  WG  G FKI +G  E  IE+ QV AG
Sbjct: 279 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAG 306


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 59/105 (56%), Gaps = 14/105 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG---------HAVKIIGWGVE---- 50
           EI   G + A +    D   Y++GVY+H+                H+V+IIGWG +    
Sbjct: 360 EIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYTGD 419

Query: 51  DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVD 94
           D +KYWL  N+WG  WG+GG F+I RG+DES IESF V   G+VD
Sbjct: 420 DPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVGVWGKVD 464


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 53/84 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G ++A+   + D   YK G+Y HT G+  GG   KIIGWGV++GV YWLCV+
Sbjct: 110 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 169

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
            WG  +G+ G  +I RG +E  IE
Sbjct: 170 QWGTDFGENGFMRILRGVNEVHIE 193


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ A    ++DL  YK GVY+H V  + GGHAV +IG+GVE G  YWL  N
Sbjct: 134 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 193

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG  G FKI +G  E  IE+ QV AG  
Sbjct: 194 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 223


>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
          Length = 487

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/110 (40%), Positives = 63/110 (57%), Gaps = 12/110 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-----------HAVKIIGWGV 49
           + LE+F+ G ++   E  QD + Y  G+Y H+V +               H+V   GWG 
Sbjct: 365 IMLELFNNGPVIMNFEPGQDFMYYSSGIY-HSVAQHDWSSSDRPEWEKVDHSVLCYGWGE 423

Query: 50  EDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           E+GVK+WL  NSWGE WG+ G F+++RGTDES IES   +A  V   +SS
Sbjct: 424 ENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAADPVIYSKSS 473


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 52/84 (61%), Gaps = 1/84 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G + A    + DL +Y  GVY  T   +     + K+IGWGVE+GV YWL V
Sbjct: 230 IQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYWLLV 289

Query: 60  NSWGELWGDGGLFKIRRGTDESRI 83
           NSWG  WG  GLFKI+RGTDE + 
Sbjct: 290 NSWGNEWGQNGLFKIKRGTDECQF 313


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG   +G  YWL  
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G F IRRGT+E  IE   V+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVA 336


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/88 (46%), Positives = 55/88 (62%), Gaps = 2/88 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
           G +  A   H D + Y+ GVYQHT G M GGHAV+++G+G  +DGV YW+  NSWG  WG
Sbjct: 214 GPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWG 273

Query: 68  DGGLFKIRRGTDESRIESFQVSAGRVDR 95
           + G F++ RG ++  IE  Q  AG  D 
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFDE 300


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + AA   ++D   Y++G+Y HT G   G HAVK++GWGVE+G KYW   N
Sbjct: 255 IQREMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314

Query: 61  SWGELWGDGGLFKIRRG 77
           SW   WG+ G F+I RG
Sbjct: 315 SWSTDWGEDGYFRILRG 331


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 47/77 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A    + D  +YK GVY+H  G   G HAVKIIGWG E+GV YWL  N
Sbjct: 143 IQKEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGTENGVDYWLIAN 202

Query: 61  SWGELWGDGGLFKIRRG 77
           SWG  +G  G FKI RG
Sbjct: 203 SWGTTFGLQGFFKIVRG 219


>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
          Length = 469

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E + D + Y++GVY HT         E++  HAV ++G+G +   
Sbjct: 366 MKLELVRHGPMAVAFEVYNDFLHYREGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDPAT 424

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRG+DE  IES  V+A  + R
Sbjct: 425 GLDYWIVKNSWGTAWGEDGYFRIRRGSDECAIESIAVAATPIPR 468


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+H  G   GGHAVK+IGWG   +G  YWL  
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMA 306

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
           N W   WGD G F IRRGT+E  IE   V+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVA 336


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 39/82 (47%), Positives = 50/82 (60%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V A + + D + YK GVY+H  G++ GGHAV+I+GWG  +G  YW   NSW 
Sbjct: 241 ELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 300

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WG  G F I RG DE  IES
Sbjct: 301 TDWGMNGHFLILRGKDECGIES 322


>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
          Length = 455

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G +  A E + D + Y++GVY HT         E++  HAV ++G+G +   
Sbjct: 352 MKLELVHRGPMAVAFEVYNDFLHYRQGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 410

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 411 GLDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIALAATPIPK 454


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 55/92 (59%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E+++ G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 201 IMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 260

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI RG +E  IE   V+AG
Sbjct: 261 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAG 291


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 54/94 (57%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           M  +I+  G I       QD + YK GVY+  +     GGHA+KI+G+G EDG  YWL  
Sbjct: 153 MAADIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVA 212

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           NSW E WGD G FKI RG +  +IE   ++ G V
Sbjct: 213 NSWNEDWGDDGYFKIIRGKNACQIEDPVINGGPV 246


>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 118

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 50/77 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++ A+  ++D+ +YK GVY H  G   G H +KIIGWGVE G  YWL VN
Sbjct: 28  IKQEIFTNGPVIGALTIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 87

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G+ K+  G
Sbjct: 88  SWNEEWGDHGMIKLAVG 104


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score = 84.3 bits (207), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 2/114 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  A   ++D   Y+ GVY++T G++ GGHAVK+IGWG  +DG  YW+  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS-SDLEEFEYDTDTTI 112
           N W   WGD G F IRRG +E  IE   V+     ++    D E  + D   +I
Sbjct: 305 NQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLMIGDFESVDADRHVSI 358


>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 234

 Score = 84.3 bits (207), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 35/74 (47%), Positives = 48/74 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A +  ++D   YK GVY H  G+M   H +K+IGWGVE G +YWL VN
Sbjct: 105 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVN 164

Query: 61  SWGELWGDGGLFKI 74
           +W E WGD G+ K+
Sbjct: 165 AWNEEWGDHGMIKL 178


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score = 84.3 bits (207), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 2/114 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +  E++  G +  A   ++D   Y+ GVY++T G++ GGHAVK+IGWG  +DG  YW+  
Sbjct: 279 IMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILA 338

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS-SDLEEFEYDTDTTI 112
           N W   WGD G F IRRG +E  IE   V+     ++    D E  + D   +I
Sbjct: 339 NQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLMIGDFESVDADRHVSI 392


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score = 84.3 bits (207), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/86 (44%), Positives = 50/86 (58%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G I A+   +QD + Y+ GVY +  G+     AVKI+GWG E+G  YWL  NS+ 
Sbjct: 213 EIYENGPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEENGTPYWLAANSFN 272

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
             WGD G  KI RG +E  IE F  +
Sbjct: 273 TYWGDNGFVKILRGANECYIEEFMYA 298


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 57/100 (57%), Gaps = 14/100 (14%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---------SGGHAVKIIGWGVE- 50
           +  EI+  G + A      D  +Y +GVY++   E          +G H+VKI+GWG++ 
Sbjct: 338 IMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDR 397

Query: 51  ----DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
               + +KYWLC NSWG  WG+ G+F+I RG +E  IESF
Sbjct: 398 SDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESF 437


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++  +  ++D+ +YK GVY H  G   G H +KIIGWGVE G  YWL VN
Sbjct: 198 IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 257

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G+ K+  G
Sbjct: 258 SWNEEWGDHGMIKLAVG 274


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V   + H D + YK GVYQH  G   GG AV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKMNGTPYWKVANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F I RG +E  IE
Sbjct: 302 TDWGMNGYFLILRGNNECNIE 322


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/82 (48%), Positives = 49/82 (59%), Gaps = 1/82 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSW 62
           E++  G +  A   ++D   YK GVY+H  G   GGHAVK+ GWG  D G  YWL  N W
Sbjct: 80  EVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKLNGWGTSDEGEDYWLLANQW 139

Query: 63  GELWGDGGLFKIRRGTDESRIE 84
              WGD G FKI+RGT+E  IE
Sbjct: 140 NTNWGDDGYFKIKRGTNECGIE 161


>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
 gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
          Length = 461

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+ H G I  A + + D   Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 358 MKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDTAS 416

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  V +
Sbjct: 417 GMDYWIVKNSWGAGWGENGYFRIRRGTDECAIESIAVAATPVPK 460


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ ++  +G + A+ + + D  +YK G+Y+ T   +  GGH++KIIGWG ++G  YWL V
Sbjct: 239 IEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WG+ G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329


>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
          Length = 191

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 88  MELELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 146

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 147 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 190


>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
          Length = 349

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 56/102 (54%), Gaps = 12/102 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
           +Q E+   G + A    H+D  +Y  GVYQH+             G H+V+++GWGV+  
Sbjct: 224 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 283

Query: 53  ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +KYWLC NSWG  WG+ G FKI RG +   IESF V A
Sbjct: 284 TGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVVGA 325


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y+HT   +         G H+VKI GWG E   DG
Sbjct: 354 ELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDG 413

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF V   GRV
Sbjct: 414 QRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVVGVWGRV 457


>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 97

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 50/77 (64%)

Query: 1  MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
          ++ EI   G   A +  + D + Y+ GVY+HT G   G H+V+IIGWG+E GV YWL +N
Sbjct: 5  IKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMGVHSVEIIGWGIEKGVDYWLVMN 64

Query: 61 SWGELWGDGGLFKIRRG 77
          SW E WGD G FKI +G
Sbjct: 65 SWNEDWGDNGTFKIAQG 81


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 51/81 (62%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G  V  +  ++D + YK GVY H  G+  G  +V++IGWG+E G  +WL  NSWG  WGD
Sbjct: 270 GPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLEGGQAFWLLANSWGTSWGD 329

Query: 69  GGLFKIRRGTDESRIESFQVS 89
            G FKIRR  +E  IE+F+ +
Sbjct: 330 KGFFKIRRFVNECWIENFRYA 350


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 1/94 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G +VA    ++D   Y+ G+Y+H  G  +G HAVK+IGWG E G  YW+  N
Sbjct: 13  IQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVAN 72

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           SW + WG+ G F++ RG+++   E  +++AG V 
Sbjct: 73  SWHDDWGENGFFRMHRGSNDCGFEE-RMAAGSVQ 105


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/86 (44%), Positives = 49/86 (56%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G I A+   +QD + Y+ GVY    G+     AVKI+GWG E+G  YWL  NS+ 
Sbjct: 56  EIYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWGEENGTPYWLAANSFN 115

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVS 89
             WGD G  KI RG +E  IE F  +
Sbjct: 116 TYWGDNGFVKILRGANECYIEEFMYA 141


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   A+   ++D   YK GVY+HT G   G H+V+IIGWG E GV YWL +N
Sbjct: 252 IKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVMN 311

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G FKI +G
Sbjct: 312 SWNEGWGDHGTFKIAQG 328


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 50/85 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E++  G  V A + + D + YK GVY+H  G+  GGHAV+I+GWG  +G  YW   N
Sbjct: 239 FKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SW   WG  G F I RG +E  IES
Sbjct: 299 SWDTDWGMNGHFLILRGNNECGIES 323


>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
          Length = 466

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ H G +  A E + D + Y +G+Y HT         E++  HAV ++G+G +   
Sbjct: 363 MKLELVHHGPMAVAFEVYDDFLHYNQGIYHHTGLKDPFNPFELTN-HAVLLVGYGTDPKT 421

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 422 GLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIESIAMAATPIPK 465


>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 159

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++  +  ++D+ +YK GVY H  G   G H +KIIGWGVE G  YWL VN
Sbjct: 69  IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 128

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G+ K+  G
Sbjct: 129 SWNEEWGDHGMIKLAVG 145


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 59/93 (63%), Gaps = 7/93 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE---DGVK 54
           +  EI H G + A ++ + D   YK+G+Y+H+     + +G H+V+I+GWG E   +G+K
Sbjct: 343 IMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLK 402

Query: 55  -YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
            YW   NSWG  WG+ G F+I RG++E  IESF
Sbjct: 403 KYWKVANSWGPEWGENGYFRILRGSNECEIESF 435


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 8/101 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
           M+LE+   G +  A E + D I Y+ GVY HT  +          HAV ++G+G +   G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
            KYW+  NSWGE WG+ G F+IRRG+DE  IES  VSA  +
Sbjct: 415 EKYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANPI 455


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 56/93 (60%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++  I  +GS+ A    ++DL  YK GVY+H    + GGHAV +IG+GVE G  YWL  N
Sbjct: 271 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVALIGFGVEGGSNYWLAAN 330

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           SWG  WG  G FKI +G  E  IE+ QV AG  
Sbjct: 331 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 360


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 240 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 299

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI RG +E  IE   V+AG
Sbjct: 300 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAG 330


>gi|145513975|ref|XP_001442898.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410259|emb|CAK75501.1| unnamed protein product [Paramecium tetraurelia]
          Length = 358

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 60/91 (65%), Gaps = 2/91 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
           ++ EI + G IVA I+  +D ++YK GVY+   G  +   GHAVK+IGWG +DGV YW+ 
Sbjct: 256 IKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
            NSWG+ WG  GL  +  G ++ ++E++ V+
Sbjct: 316 ENSWGDTWGLKGLAYVAVGQNQLQLEAYSVA 346


>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
          Length = 460

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           M+LE+   G +  A E + D + YK+G+Y HT         E++  HAV ++G+G   E 
Sbjct: 357 MKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLKDEFNPFELTN-HAVLLVGYGKDPES 415

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G K+W+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 416 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 50/85 (58%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 237 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 296

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RG +E  IE
Sbjct: 297 NQWNRGWGDDGYFKIIRGKNECGIE 321


>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 131

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 50/78 (64%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   A+   ++D   YK GVY+HT G   G H+V+IIGWG E GV YWL +N
Sbjct: 31  IKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVMN 90

Query: 61  SWGELWGDGGLFKIRRGT 78
           SW E WGD G FKI +G+
Sbjct: 91  SWNEGWGDHGTFKIAQGS 108


>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 163

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G  V  +  ++D + YK GVY H  G+  G  +V++IGWG+E G  +WL  NSWG  WGD
Sbjct: 78  GPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLEGGQAFWLFANSWGTSWGD 137

Query: 69  GGLFKIRRGTDESRIESFQVSA 90
            G FKIRR  +E  IE+F+ + 
Sbjct: 138 KGFFKIRRFVNERWIENFRYAG 159


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 50/85 (58%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 237 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 296

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           N W   WGD G FKI RG +E  IE
Sbjct: 297 NQWNRGWGDDGYFKIIRGKNECGIE 321


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/92 (46%), Positives = 54/92 (58%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 238 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLLA 297

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI RG +E  IE  +V AG
Sbjct: 298 NQWNRGWGDDGYFKIIRGKNECGIEE-EVVAG 328


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 2/95 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G +      + D  +YK GVY  T   +    H  K+IGWGVE+GV YWL V
Sbjct: 235 IQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLV 294

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           N WG  WG  GLFKI+RGT+E  +E + V AG  +
Sbjct: 295 NFWGNEWGQNGLFKIKRGTNEVHVEDY-VYAGEPE 328


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G + A+ +   D + YK GVY ++   +  GGH+VKIIGWG E    YWL  
Sbjct: 248 IQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIA 307

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW E WG+ GLF++ RG +E  IE+ Q+ AG
Sbjct: 308 NSWNEDWGEKGLFRMLRGRNECGIEA-QIVAG 338


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 58/104 (55%), Gaps = 1/104 (0%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 241 IMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEE 103
           N W   WGD G FKI RG +E  IE   V+     ++ + + ++
Sbjct: 301 NQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDD 344


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 59/93 (63%), Gaps = 7/93 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE---DGVK 54
           +  EI H G + A ++ + D   YK+G+Y+H+     + +G H+V+I+GWG E   +G+K
Sbjct: 217 IMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLK 276

Query: 55  -YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
            YW   NSWG  WG+ G F+I RG++E  IESF
Sbjct: 277 KYWKVANSWGPEWGENGYFRILRGSNECEIESF 309


>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain 1; AltName: Full=Dipeptidyl peptidase I
           heavy chain 1; Contains: RecName: Full=Dipeptidyl
           peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
           peptidase I heavy chain 2; Contains: RecName:
           Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
           Full=Dipeptidyl peptidase I heavy chain 3; Contains:
           RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
           AltName: Full=Dipeptidyl peptidase I heavy chain 4;
           Contains: RecName: Full=Dipeptidyl peptidase 1 light
           chain; AltName: Full=Dipeptidyl peptidase I light chain;
           Flags: Precursor
 gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
          Length = 435

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E + D   Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 332 MKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 390

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 391 GMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 55/90 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q ++   G I A +E + D + Y  G+Y H  G   G  +V+I+GWG+ +GV YWL  N
Sbjct: 280 IQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 339

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SWG+ WG+ G F++ RG +E  +E+  VS 
Sbjct: 340 SWGKQWGENGTFRVLRGVNECGLEANCVSG 369


>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
          Length = 459

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E + D   Y+KG+Y HT         E++  HAV ++G+G +   
Sbjct: 356 MKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 414

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 415 GMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 458


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 52/85 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  EI+  G I  +I    DL+ YK G+Y+ T G  +  HA+ ++GWG EDG KYW+  N
Sbjct: 184 MMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWGEEDGQKYWIARN 243

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G F+I RG +   IE+
Sbjct: 244 SWGTFWGEKGWFRIVRGENNLGIEA 268



 Score = 65.5 bits (158), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 50/87 (57%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           ++ EIF  G +  ++   ++ + Y+ G++    G + G HAV++ GWG  EDG KYW+  
Sbjct: 467 IKAEIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIAR 526

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSWG  WG+ G F++  G  +  I  +
Sbjct: 527 NSWGPYWGEHGWFRMIVGVSKGLITGY 553


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 60/105 (57%), Gaps = 5/105 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 304

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV---DRDRSSDL 101
           N W   WG  G FKI RG +E  IE   V+AG     + DR++D+
Sbjct: 305 NQWNRGWGGDGYFKIIRGKNECGIEE-DVTAGMPSTKNMDRNNDV 348


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE---MSGGHAVKIIGWGVE-DGVKYW 56
           +  EIFH G + A +  ++D   Y  GVY+ T      ++G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEDYVLAS 418


>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
          Length = 447

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 344 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 402

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 403 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 446


>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
 gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
 gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
 gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
 gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
 gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
 gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 461


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 63/110 (57%), Gaps = 9/110 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-----GGHAVKIIGWGVE----D 51
           + LEI   G + A +  H+D   YK G+Y+H+    S     G H+V++IGWG E    +
Sbjct: 322 IMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYE 381

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
             KYW+ VNSWG  WG+ G F+I RG++E  IES+ +++      +  DL
Sbjct: 382 VTKYWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLPYVHQQVKDL 431


>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 461


>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 456


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-----VGEMS---GGHAVKIIGWGVEDG 52
           +Q E+   G + A    H+D  +Y  GVYQH+      G  S   G H+V+++GWGV+  
Sbjct: 345 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 404

Query: 53  ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +KYWLC NSWG  WG+ G FKI RG +   IESF + A
Sbjct: 405 TGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 446


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 54/85 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A     +D I YK+G+Y+ T G+  G HA+K+IGWG E+G  YWL  N
Sbjct: 246 IRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGTENGTDYWLVAN 305

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           S+   WG+ G F+I RGT+   IES
Sbjct: 306 SYNYDWGENGTFRILRGTNHCLIES 330


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 12/102 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
           +Q E+   G + A    H+D  +Y  GVYQH+             G H+V+++GWGV+  
Sbjct: 401 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 460

Query: 53  ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +KYWLC NSWG  WG+ G FKI RG +   IESF + A
Sbjct: 461 TGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502


>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 456


>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
          Length = 462

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 59/103 (57%), Gaps = 8/103 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVED--G 52
           M+LE+   G +  A E + D I Y+KGVY HT    S        HAV ++G+G ++  G
Sbjct: 359 MKLELVENGPMAVAFEVYNDFIHYQKGVYHHTGLRDSFNPFEITNHAVLLVGYGTDEKTG 418

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
             YW+  NSWG  WG+ G F+I RGTDE  IES  VSA  + +
Sbjct: 419 EHYWIVKNSWGSYWGEDGYFRILRGTDECGIESIAVSATPIPK 461


>gi|12832450|dbj|BAB22112.1| unnamed protein product [Mus musculus]
          Length = 461

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 358 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 416

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G++YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A
Sbjct: 417 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 455


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 60/94 (63%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           ++ EIFH G + A +  ++D   Y  G+Y+ T    G  +G H+VK++GWG E +G KYW
Sbjct: 326 IKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 385

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 386 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 419


>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
 gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
          Length = 804

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           M  +I+  G I  ++    D     KKG+Y      ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGGHAVMIVGWGEENGVPYWDC 251

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            N++G  WGD G FKI+RG++E +IE++  SA  +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287


>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
 gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
          Length = 804

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           M  +I+  G I  ++    D     KKG+Y      ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGGHAVMIVGWGEENGVPYWDC 251

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            N++G  WGD G FKI+RG++E +IE++  SA  +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287


>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
          Length = 358

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 60/91 (65%), Gaps = 2/91 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
           ++ EI + G IVA I+  +D ++YK GVY+   G  +   GHAVK+IGWG +DGV YW+ 
Sbjct: 256 IKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
            NSWG+ WG  GL  +  G ++ ++E++ V+
Sbjct: 316 ENSWGDSWGLKGLAYVAVGQNQLQLEAYSVA 346


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 52/91 (57%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  A   ++D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL  
Sbjct: 98  IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 157

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           N W   WGD G FKI RG +E  IE   V+ 
Sbjct: 158 NQWNRGWGDDGYFKIIRGKNECGIEEGVVAG 188


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 423 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 482

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 483 RTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 526


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G + AA+  + D+ ++K GVY  T   +      VK+IGWGVE+GV YWL V
Sbjct: 236 IQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLV 295

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           NSWG  WG  GL KI+RG     +ESF  +A
Sbjct: 296 NSWGNEWGQNGLLKIKRGKYGCAVESFVYAA 326


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 60/99 (60%), Gaps = 9/99 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----D 51
           + +EI   G + A +  H+D   YK G+Y+H+       E +G H+V++IGWG E    +
Sbjct: 323 IMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYE 382

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
             KYW+ VNSWG  WG+ G F+I RG +E  IES+ +++
Sbjct: 383 TTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLAS 421


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y+HT          +  G H+VKI GWG E   DG
Sbjct: 353 ELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDG 412

Query: 53  V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW   NSWG  WG+ G F+I RG +E  +ESF V   GRV
Sbjct: 413 QVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVVGVWGRV 456


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 58/98 (59%), Gaps = 8/98 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
           M+LE+   G +  A E + D + Y+ GVY HT  +          HAV ++G+G +   G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
            KYW+  NSWGE WG+ G F+IRRGTDE  IES  VSA
Sbjct: 415 EKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSA 452


>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++  +  ++D+ +YK GVY H  G   G H +KIIGWGVE G  YWL VN
Sbjct: 40  IKQEIFTNGPVIGMLSLYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 99

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G+ K+  G
Sbjct: 100 SWNEEWGDHGMIKLAVG 116


>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
          Length = 463

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
          Length = 804

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           M  +I+  G I  ++    D     KKG+Y      ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVGWGEENGVPYWDC 251

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            N++G  WGD G FKI+RG++E +IE++  SA  +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287


>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 49/77 (63%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++  +  ++D+ +YK GVY H  G   G H +KIIGWGVE G  YWL VN
Sbjct: 40  IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 99

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WGD G+ K+  G
Sbjct: 100 SWNEEWGDHGMIKLAVG 116


>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
 gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
          Length = 463

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
          Length = 463

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 53/85 (62%), Gaps = 1/85 (1%)

Query: 2   QLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           + +I+  G I+A  + + D+  YK GVY +         HA ++IGWGVEDGV+YWL  N
Sbjct: 165 KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGVEDGVQYWLAAN 224

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG  GLFKIR GT+E   E+
Sbjct: 225 SWGTGWGQQGLFKIRSGTNEVGFEA 249


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V   + H D + YK GVYQH  G   GG AV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F I RG +E  IE
Sbjct: 302 TDWGMNGYFLILRGDNECNIE 322


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 40/93 (43%), Positives = 57/93 (61%), Gaps = 3/93 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
           ++ EIFH G + A + A++D   Y+ G+Y H  G     HAVKIIGWG +      YWL 
Sbjct: 210 IRTEIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLV 269

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            NS+   WG+ G FKI+RG +E  IE+ +++AG
Sbjct: 270 ANSFNTDWGEYGFFKIKRGVNECGIEN-KITAG 301


>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           P15]
          Length = 627

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           M  +I+  G I  ++    D     KKG+Y      ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVGWGEENGVPYWDC 251

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
            N++G  WGD G FKI+RG++E +IE++  SA  +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score = 82.4 bits (202), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 50/81 (61%), Gaps = 1/81 (1%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
           +   G +VA     QD + YK GVYQH  G   GGHAV+IIG+GV D G+ YW   NSWG
Sbjct: 270 LLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDSGLDYWTVRNSWG 329

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+I RG DE  IE
Sbjct: 330 PDWGEDGYFRIVRGGDECGIE 350


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  GVY+ T    G  +G H+VK++GWG E +G KYW
Sbjct: 327 IMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 386

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 387 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 420


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 12/102 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
           +Q E+   G + A    H+D  +Y  GVYQH+             G H+V+++GWGV+  
Sbjct: 341 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 400

Query: 53  ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +KYWLC NSWG  WG+ G FKI RG +   IESF + A
Sbjct: 401 TGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIGA 442


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V   + H D + YK GVYQH  G   GG AV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F I RG +E  IE
Sbjct: 302 TDWGMNGYFLILRGDNECNIE 322


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 12/102 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
           +Q E+   G + A    H+D  +Y  GVYQH+             G H+V+++GWGV+  
Sbjct: 327 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 386

Query: 53  ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +KYWLC NSWG  WG+ G FK+ RG +   IESF + A
Sbjct: 387 TGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIGA 428


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  GVY+ T    G  +G H+VK++GWG E +G KYW
Sbjct: 327 IMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 386

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 387 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 420


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 54/90 (60%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q ++   G I    E + D + Y  G+Y H  G   G  +V+I+GWG+ +GV YWL  N
Sbjct: 238 IQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SWG+ WG+ G F+  RGT+E  +E+  VSA
Sbjct: 298 SWGKEWGENGTFRALRGTNECGLEANCVSA 327


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 48/81 (59%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V A + + D   YK GVY+H  G++ GGHAV+I+GWG  +G  YW   NSW 
Sbjct: 241 ELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 300

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G F I RG DE  IE
Sbjct: 301 TDWGMNGHFLILRGKDECGIE 321


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
           +   G +VA     QD + YK GVYQH  G   GGHAV+++G+GV D G+ YW   NSWG
Sbjct: 206 LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDSGLDYWTVRNSWG 265

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+I RG+DE  IE
Sbjct: 266 PDWGEDGYFRIVRGSDECGIE 286


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EIF  G + A +  ++D   Y  G+Y+HT    G   G H+VK+IGWG E DG KYW
Sbjct: 280 IMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYW 339

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + ++A
Sbjct: 340 IATNSWGTWWGEHGNFRILRGSNECGIEEYVLAA 373


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/88 (46%), Positives = 51/88 (57%), Gaps = 2/88 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
           +Q EI   G + A +  ++D + YK GVY H  GE  G HAV+I+GWGV     V YWL 
Sbjct: 246 IQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLV 305

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESF 86
            NSWG  WGD G F I RG +   IE +
Sbjct: 306 ANSWGSDWGDNGFFHIFRGENHCDIEGY 333


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/89 (46%), Positives = 55/89 (61%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+ +G + +    ++D + YK+G+Y +T G+  G H+VKIIGWG E G+KYWL  NS+ 
Sbjct: 262 EIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGWGEERGIKYWLAANSFN 321

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
             WGD G FKI R    S   S  V AGR
Sbjct: 322 TDWGDKGFFKIIREGVGSCGISDNVVAGR 350


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/87 (45%), Positives = 55/87 (63%), Gaps = 2/87 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
           G +  A   + D + Y+ GVYQHT G M GGHAV+++G+G  +DGV YW+  NSWG  WG
Sbjct: 214 GPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWG 273

Query: 68  DGGLFKIRRGTDESRIESFQVSAGRVD 94
           + G F++ RG ++  IE  Q  AG  D
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFD 299


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 36/65 (55%), Positives = 45/65 (69%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G + A+   ++D + Y+ GVY+HT G   GGHAVKIIGWG + G  YWL VNSW E WGD
Sbjct: 136 GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 195

Query: 69  GGLFK 73
            GLF+
Sbjct: 196 HGLFR 200


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/88 (45%), Positives = 55/88 (62%), Gaps = 2/88 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
           G +  A   + D + Y+ GVYQHT G M GGHAV+++G+G  +DGV YW+  NSWG  WG
Sbjct: 214 GPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWG 273

Query: 68  DGGLFKIRRGTDESRIESFQVSAGRVDR 95
           + G F++ RG ++  IE  Q  AG  D 
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFDE 300


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G + AA+  + D+ ++K GVY  T   +      VK+IGWGVE+GV YWL V
Sbjct: 181 IQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLV 240

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           NSWG  WG  GL KI+RG     +ESF  +A
Sbjct: 241 NSWGNEWGQNGLLKIKRGKYGCAVESFVYAA 271


>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
          Length = 357

 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
           ++ EI + G +VA I+  +D ++YK G+Y+   G  +   GHAVK+IGWG +DGV YW+ 
Sbjct: 256 IKREILNNGPVVAVIQVFKDFLVYKGGIYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
            NSWG+ WG  GL  +  G ++ ++E++ V+ 
Sbjct: 316 ENSWGDSWGLKGLAYVAVGQNQLQLEAYSVAP 347


>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
          Length = 420

 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           M+LE+   G +  A E + D + YK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 317 MKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLKDNFNPFELTN-HAVLLVGYGKDPKS 375

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G K+W+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 376 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 419


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
           +  EIFH G + A +  ++D   Y  GVY+ T       +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 418


>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
 gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
          Length = 218

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 4/88 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV----EDGVKYW 56
           MQL I + GS+ A+++ ++D + Y+ GVY+H VG     H+V+I+GWG+    +  + YW
Sbjct: 120 MQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHSVRIVGWGITSPQQGSIPYW 179

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIE 84
           +C N+W E WG  G F I RG++E  IE
Sbjct: 180 ICGNNWTEEWGMQGWFWILRGSNECNIE 207


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGV--KYWLCVNSWGELW 66
           G + AA   ++D + YK GVY H  G + GGHA+K++GWGV+      YW+  NSWG  W
Sbjct: 149 GPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWIIANSWGPSW 208

Query: 67  GDGGLFKIRRGTDESRIE 84
           G  G F I RG+DE  IE
Sbjct: 209 GLNGFFWILRGSDECGIE 226


>gi|343459017|gb|AEM37667.1| cathepsin C subunit [Epinephelus bruneus]
          Length = 106

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M LE+   G +  A E + D +IYK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 3   MMLELVKNGPMAVAFEVYPDFMIYKEGIYHHTGLADSFNPFELTN-HAVLLVGYGRCHKT 61

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G KYW+  NSWG  WG+ G F+IRRG+DE  IES  V+A  + +
Sbjct: 62  GQKYWIVKNSWGTDWGEDGYFRIRRGSDECSIESIAVAANPIPK 105


>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
          Length = 458

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 56/96 (58%), Gaps = 6/96 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVEDGVK 54
           M++ +   G +   IE + DL  Y+ G+Y HT  +          H V ++G+G EDG K
Sbjct: 354 MRVALNKIGPLAVNIEVYPDLQFYRSGIYHHTELDFKFNPFEITNHVVVVVGYGEEDGQK 413

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           YW+  NSWGE WG+ G F+IRRGTDE  IES  V A
Sbjct: 414 YWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYA 449


>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
          Length = 463

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLSVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score = 82.0 bits (201), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
           +  EIFH G + A +  ++D   Y  GVY+ T       +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 418


>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
          Length = 463

 Score = 82.0 bits (201), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +V + E + D I Y KG+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSW   WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 419 GLDYWIVKNSWSATWGEDGYFRIRRGTDECGIESIALTATPIPK 462


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score = 82.0 bits (201), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 43/110 (39%), Positives = 56/110 (50%), Gaps = 9/110 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM--SGGHAVKIIGWGVEDGVKYWLC 58
           MQ EI   G I  A   ++  + YK GVY     E+   GGHAVKI+GWG E G  YWL 
Sbjct: 467 MQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWLV 526

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIE-------SFQVSAGRVDRDRSSDL 101
            NSW   WGD G FKI  G +   ++       +F V       +R+++L
Sbjct: 527 ANSWNTSWGDEGYFKIAVGAESISLDVVKRVFAAFDVDLAETRNERTNEL 576


>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
          Length = 559

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 37/85 (43%), Positives = 50/85 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  EI+  G I   I   QD + YK G+Y+   G +   HA+ ++GWG E+G KYW+  N
Sbjct: 185 MMKEIYARGPITCGIAVPQDFVDYKGGIYKDESGAVEKVHAISVVGWGEENGEKYWIGRN 244

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G F+I RG +   IES
Sbjct: 245 SWGNYWGEEGWFRIARGINNLAIES 269



 Score = 71.2 bits (173), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/79 (40%), Positives = 45/79 (56%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G +  ++   +  + Y  GVY+     M  GH V+I GWGVE+G  YW+  N
Sbjct: 465 IKAEIFARGPVSCSMTVRESFLDYHGGVYESDSSPMVAGHIVEIAGWGVENGRPYWIGRN 524

Query: 61  SWGELWGDGGLFKIRRGTD 79
           SWGE WG+ G F+I    D
Sbjct: 525 SWGEYWGEEGWFRIDMEKD 543


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 56/92 (60%), Gaps = 3/92 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVEDGVKYWL 57
           +  +I   G + A +  HQD   Y  G+Y+ +      + G H+V+I+GWG + G KYW+
Sbjct: 323 IMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDRGDKYWV 382

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
             NSWG  WG+ G F+I RG++ES IESF V+
Sbjct: 383 VANSWGCDWGENGYFRIARGSNESGIESFVVT 414


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 39/87 (44%), Positives = 52/87 (59%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G ++A+I  + D ++YK GVY  T    + G   ++IIGWG E  + YWLC 
Sbjct: 246 IQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLCA 305

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSW E WGD G  KI+RG     IES+
Sbjct: 306 NSWNEEWGDNGYVKIQRGVQAGYIESY 332


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
           +  EIFH G + A +  ++D   Y  GVY+ T       +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 418


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
           +  EIFH G + A +  ++D   Y  GVY+ T       +G H+VK++GWG E +G KYW
Sbjct: 324 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 383

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 384 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 417


>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
          Length = 464

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 69/126 (54%), Gaps = 17/126 (13%)

Query: 3   LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGVEDGVKY 55
           LEI   G +V + E   D + Y+ G+Y H+  E S         H+V   GWG E+GVK+
Sbjct: 329 LEIMKNGPVVLSFEPSYDFMYYESGIY-HSKAETSDYSEWEKVDHSVLCYGWGEEEGVKF 387

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGR--VDRDRSSDLEE-------FEY 106
           W+  NSWG+ WG+ G F+++RG DES IES   ++    ++++ S    E       F+Y
Sbjct: 388 WMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDPYVINQNSSKSFSETKSNESDFDY 447

Query: 107 DTDTTI 112
           + D +I
Sbjct: 448 EDDDSI 453


>gi|327269233|ref|XP_003219399.1| PREDICTED: dipeptidyl peptidase 1-like [Anolis carolinensis]
          Length = 467

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGV--ED 51
           M+LE+   G +  A E + D + Y+ G+Y HT G M          HAV ++G+G   E 
Sbjct: 364 MKLELVKHGPMAVAFEVYSDFMHYRGGIYHHT-GLMDPFNPFELTNHAVLLVGYGTDPET 422

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  +W+  NSWG  WG+ G F+IRRGTDE  IES  V++  + +
Sbjct: 423 GEPFWIVKNSWGPAWGEQGYFRIRRGTDECAIESIAVASTPIPK 466


>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
           Cysteine Protease Of The Papain Family
          Length = 438

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 335 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 393

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 394 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 437


>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
 gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
          Length = 455

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M LE+   G +  A+E + D + YK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 352 MMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLRDANNPFELTN-HAVLLVGYGQCHKT 410

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G KYW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 411 GEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATPIPK 454


>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
 gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
 gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
          Length = 462

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 418 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
           ++ ++  +G + A+ + + D  +YK G+Y+ T   +  G H++KIIGWG E+G  YWL V
Sbjct: 239 IEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAV 298

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           NSW + WG+ G FKI +G +E  IE   V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 50/81 (61%), Gaps = 1/81 (1%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
           +   G +VA     QD + YK GVYQH  G   GGHAV+I+G+GV D G+ YW   NSWG
Sbjct: 273 LLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDSGLDYWTVRNSWG 332

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+I RG DE  IE
Sbjct: 333 PDWGEDGYFRIVRGGDECGIE 353


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 53/90 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A +  ++D   YK GVY H  G++   H +K+IGWGVE G +YWL +N
Sbjct: 137 IKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGVESGQEYWLAMN 196

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +W E WGD G+ K+  G      + + + A
Sbjct: 197 AWNEEWGDHGMIKLAVGKTGLEHQVYHIEA 226


>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
           Peptide, 462 aa]
          Length = 462

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+   G +  A E H D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 418 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 2   QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNS 61
           + E++  G    A   ++D + Y+ GVY+H  G   GGHAV+++GWG  +GV YW   NS
Sbjct: 240 KRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWGERNGVPYWKIANS 299

Query: 62  WGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           W   WG+ G     RG DE  IES Q SAG
Sbjct: 300 WNTDWGENGYLYFYRGKDECGIES-QGSAG 328


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 50/80 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q EI   G + A  E ++D + YK GVY H  G+  GGH +KI+G+GV +G  YW+C N
Sbjct: 213 IQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVSNGTPYWICNN 272

Query: 61  SWGELWGDGGLFKIRRGTDE 80
           SW   WG+ G+F I  G +E
Sbjct: 273 SWTTSWGNNGIFWIEAGKNE 292


>gi|37905530|gb|AAO64478.1| cathepsin C precursor [Fundulus heteroclitus]
          Length = 450

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 61/103 (59%), Gaps = 8/103 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGV--EDG 52
           M+LE+   G +  A+E + D + YK+G+Y HT    S        HAV ++G+G   + G
Sbjct: 347 MKLELVKNGPMAVALEVYPDFMHYKEGIYHHTGFRDSVNPFELTNHAVLLVGYGRCHKTG 406

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
            KYW+  NSWG  WG+ G F+IRRG+DE  IES  V+A  + +
Sbjct: 407 QKYWIVKNSWGSGWGEDGYFRIRRGSDECAIESIAVAAKPIPK 449


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           +  E++  G +  +   ++D   YK GVY+   G M GGHA K+IGWG  D G  YWL  
Sbjct: 241 IMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLLA 300

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           N W   WGD G FKI RGT+E  IE   V+AG
Sbjct: 301 NQWNRGWGDDGYFKIIRGTNECGIEG-DVNAG 331


>gi|73696355|gb|AAZ80953.1| cathepsin C [Macaca mulatta]
          Length = 118

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 15  MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 73

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 74  GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 117


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
           +  EI+H G + A +  ++D   Y  G+Y+ T    G   G H+VK++GWG E +G KYW
Sbjct: 324 IMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYW 383

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 384 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 417


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
           EI   G + A +E H+D  +YK G+Y+HT    +        G H+VKI GWG E  V  
Sbjct: 357 EIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDG 416

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW+  NSWG+ WG+ G F+I RG +E  IE+F +   GR+
Sbjct: 417 AKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIGVWGRI 460


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 60/104 (57%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   M+        G H+VKI GWG E   DG
Sbjct: 357 ELMENGPVQALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDG 416

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG++E  IESF +   GRV
Sbjct: 417 RTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLGVWGRV 460


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 60/101 (59%), Gaps = 11/101 (10%)

Query: 9   GSIVAAIEAHQDLIIYK---------KGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           G +VAA + + D  IY+         +GVY +T G + G  AVKIIGWG E+G  YWL  
Sbjct: 228 GPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAA 287

Query: 60  NSWGELWGD-GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           NSWG+ WG  GG FKIRRGT+E   E   + AG+V    S+
Sbjct: 288 NSWGKDWGALGGFFKIRRGTNECGFEE-SIIAGQVREGGST 327


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDGV-- 53
           E++  G + A +E H+D  +YK G+Y+ T             G H+VKI GWG E G   
Sbjct: 372 ELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDG 431

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYWL  NSWG  WG+ G F+I RG +E  IE+F V   GRV
Sbjct: 432 QTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRV 475


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVE---DG 52
           EI   G + A +E H+D  +YK G+Y+HT             G H+V+I GWG +   DG
Sbjct: 354 EIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDYDG 413

Query: 53  V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW+  NSWG+ WG+ G F+I RG +E  IE+F +   GR+
Sbjct: 414 TPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFVIGVWGRI 457


>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
 gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
          Length = 238

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 12/95 (12%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVED---- 51
           EI+  G + A +   +D  +Y  GVY+HT          + S  H+V+I+GWGV+     
Sbjct: 112 EIYANGPVQALMLVKEDFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRTQYR 171

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
             KYWLC NSWG  WG+ G F+I RG DES+IESF
Sbjct: 172 PQKYWLCANSWGSGWGENGYFRIVRGEDESQIESF 206


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 48/77 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G   AA   + D + Y+ GVY+HT G + G H V+IIGWG + GV YWL +N
Sbjct: 252 IKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGWGTKQGVDYWLVMN 311

Query: 61  SWGELWGDGGLFKIRRG 77
           SW E WG  G FKI +G
Sbjct: 312 SWNEGWGVHGTFKIAQG 328


>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
          Length = 562

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 52/85 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M  EI+  G I   I   ++L+ YK G+Y+ T G  S  H++ ++GWG EDG KYW+  N
Sbjct: 184 MMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWGEEDGQKYWIARN 243

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  WG+ G F+I RG +   IE+
Sbjct: 244 SWGTFWGEKGWFRIVRGENNLGIEA 268



 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           ++ EIF  G +   I   Q+ + Y+ G+++    E  G H+V++ GWG  EDG KYW+  
Sbjct: 467 IKAEIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETEDGTKYWIGR 526

Query: 60  NSWGELWGDGGLFKI 74
           NSWG  WG+ G F+I
Sbjct: 527 NSWGTYWGEHGWFRI 541


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 59/97 (60%), Gaps = 4/97 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVED--GVKYWL 57
           +Q  +F +G I   ++ +QD + Y  GVY  T G ++ GGHA+KI+GWG +   G+ YW+
Sbjct: 212 IQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWI 271

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
             NSWG  WG  G F I+RGT+   I+    SAG+ D
Sbjct: 272 VQNSWGSDWGMNGFFWIQRGTNMCGIDR-DASAGQAD 307


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 53/90 (58%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q ++   G I    E + D + Y  G+Y H  G   G  +V+I+GWG+ +GV YWL  N
Sbjct: 193 IQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 252

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SWG+ WG+ G F+  RGT+E  +E+  VS 
Sbjct: 253 SWGKEWGENGTFRALRGTNECGLEANCVSG 282


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 61/94 (64%), Gaps = 7/94 (7%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVEDG----VKYW 56
           EI ++GS+ A ++  ++  +Y+ GVY+ +   +G  +G H V+I+GWG E      VKYW
Sbjct: 363 EIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYW 422

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I +GT+E +IE F V+A
Sbjct: 423 IVSNSWGLWWGESGYFRILKGTNECQIEDFVVAA 456


>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
          Length = 463

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+LE+ + G +  A E + D + Y+ G+Y HT         E++  HAV ++G+G +   
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+I RGTDE  IES  V+A  + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIESIAVAATPIPK 462


>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 185

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 56/89 (62%), Gaps = 2/89 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G ++++ + ++D   YK GVY  T  E S  H++KIIGWG   G +YWL VN
Sbjct: 94  IKQEIFDNGPVLSSFKMYEDFRYYKSGVYVPTTKESSTSHSIKIIGWGGASGREYWLAVN 153

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW E WGD GL K+  G  ++R+E   +S
Sbjct: 154 SWNEEWGDHGLIKMAFG--KNRLEKIVLS 180


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
           +QLE+   G ++  +  ++DL+ YK+GVY++T G   GGHA+KIIGWG  E G  +W C 
Sbjct: 253 IQLELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQ 312

Query: 60  NSWGELWGDGGLFKIRRG 77
           N WG+ WG GG   I+ G
Sbjct: 313 NQWGKDWGMGGYINIKAG 330


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 59/94 (62%), Gaps = 4/94 (4%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
           +  EIF+ G + A +  ++D   Y +GVY+ T       +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFNSGPVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYW 384

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I RG++E  IE + +++
Sbjct: 385 IAANSWGSWWGEKGYFRILRGSNECGIEEYVLAS 418


>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
          Length = 356

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           ++LE+ H G +  A E + D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 253 IKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 311

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 312 GQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 355


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 47/84 (55%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E++  G   A  +   DL  YK GVY+H  G   G HAV+I+GWG + GV YW   N
Sbjct: 238 FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIAN 297

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WGD G F + RG +E  IE
Sbjct: 298 SWNAEWGDRGYFFMLRGDNECGIE 321


>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 487

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 61/94 (64%), Gaps = 7/94 (7%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVEDG----VKYW 56
           EI ++GS+ A ++  ++  +Y+ GVY+ +   +G  +G H V+I+GWG E      VKYW
Sbjct: 363 EIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQNGRTVKYW 422

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I +GT+E +IE F V+A
Sbjct: 423 IVSNSWGLWWGESGYFRILKGTNECQIEDFVVAA 456


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G   A  +   DL  YK GVY+H  G   G HAV+I+GWG + GV YW   NSW 
Sbjct: 241 ELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSWN 300

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WGD G F + RG +E  IE
Sbjct: 301 AEWGDRGYFFMLRGDNECGIE 321


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
           EI   G + A +E H+D  +YK G+Y+HT    +        G H+V+I GWG +  V  
Sbjct: 379 EIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDG 438

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW+  NSWG+ WG+ G F+I RG +E  IE+F +   GR+
Sbjct: 439 TSRKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIGVWGRI 482


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y+HT             G H+VKI GWG E   DG
Sbjct: 407 ELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDG 466

Query: 53  V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW+  NSWG+ WG+ G F+I RG +E  IE+F V   GRV
Sbjct: 467 SNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVVGVWGRV 510


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 61/96 (63%), Gaps = 9/96 (9%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----DGVK 54
           EI   G++ A +  ++D   Y+ G+Y+H+       E S  H+V++IGWG E    D VK
Sbjct: 327 EIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVK 386

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           YW+ +NSWG+ WG+ G F+I RG++E  IES+ +++
Sbjct: 387 YWIAINSWGQWWGENGRFRILRGSNECDIESYVLAS 422


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 54/86 (62%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           +Q E+  +G +    +   D ++YK GVY +    ++      K+IGWGVE+GV YWL +
Sbjct: 218 IQKEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNAKVIRTQYAKLIGWGVENGVDYWLVI 277

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWG  WG  GLFKI+RGT++  +ES
Sbjct: 278 NSWGHEWGQKGLFKIKRGTNQCGVES 303


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 55/90 (61%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q ++   G + A +E + D + Y  G+Y H  G   G  +V+I+GWG+ +GV YWL  N
Sbjct: 248 IQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRILGWGMFEGVPYWLLAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           SWG+ WG+ G F++ RG +E  +E+  +S 
Sbjct: 308 SWGKEWGENGTFRVLRGVNECGLEANCISG 337


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
           G +  A   + D + Y+ GVYQHT G + GGHAV ++G+G  +DGV YW+  NSWG  WG
Sbjct: 213 GPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVDYWIIKNSWGPDWG 272

Query: 68  DGGLFKIRRGTDESRIE 84
           + G F+I R T+E  IE
Sbjct: 273 EDGYFRIIRMTNECGIE 289


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A +  ++DL+ YK GVY H  G   G H ++I+GWG EDGV YWL  NSW 
Sbjct: 857 EIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVANSWN 916

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
           E WG+ G  ++ R  +E  I   QV+AG        DL  F Y
Sbjct: 917 EDWGEKGYMRVLRWRNECGIVD-QVTAGL------PDLSNFPY 952



 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/91 (41%), Positives = 52/91 (57%), Gaps = 1/91 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A+   + D + Y  GVY H  G     HA++I+GWG +DGV YWL  NSW 
Sbjct: 212 EIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWGEDDGVPYWLIANSWN 271

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           E WG+ G  +  RG +E  IE  +V+A  +D
Sbjct: 272 EDWGEKGYVRFLRGHNECGIEE-EVTAVPID 301


>gi|355572434|ref|ZP_09043578.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
 gi|354824808|gb|EHF09050.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
          Length = 685

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 43/69 (62%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G I+     +QD   Y  G+Y+HT G + G HA+ ++GWG ++   YW+C NSWG  WG+
Sbjct: 369 GPIIGTFAVYQDFSYYSGGIYEHTWGSLRGYHAIVVVGWGQDERGTYWICKNSWGTGWGE 428

Query: 69  GGLFKIRRG 77
            G FKIR G
Sbjct: 429 AGWFKIRSG 437


>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
          Length = 478

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           ++LE+ H G +  A E + D + Y  G+Y HT         E++  HAV ++G+G +   
Sbjct: 375 IKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 433

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW+  NSWG  WG+ G F+IRRGTDE  IES  ++A  + +
Sbjct: 434 GQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 477


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 51/91 (56%), Gaps = 1/91 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E++  G  V A +   D + YK GVY+H  G+  GGHAV+I+GWG  +G  YW   N
Sbjct: 239 FKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
           SW   WG  G F   RG +E  IE F+  AG
Sbjct: 299 SWDTDWGMNGHFLFLRGNNECGIE-FEGYAG 328


>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 44/126 (34%), Positives = 68/126 (53%), Gaps = 19/126 (15%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGVEDGVKYW 56
           E+   G +V + E   D + Y+ G+Y H+  + +         H+V   GWG EDGVK+W
Sbjct: 356 EVMKNGPVVLSFEPSYDFMYYESGIY-HSKAQTNDYAEWEKVDHSVLCYGWGEEDGVKFW 414

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESF----------QVSAGRVDRDRSSDLEEFEY 106
           +  NSWG  WG+GG F+++RG DES IES           Q S+      +S++  +F+Y
Sbjct: 415 MLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDPYVITQNSSTSFSETKSNE-SDFDY 473

Query: 107 DTDTTI 112
           + D +I
Sbjct: 474 EDDDSI 479


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 50/89 (56%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E+   G    A E + D + Y  GVY+H  G++ GGHAV+++GWG  +G  YW   N
Sbjct: 239 FKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGELNGEPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG  G F I RG +E  IES  V+
Sbjct: 299 SWNHEWGMNGYFLIARGVNECGIESNGVA 327


>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 470

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 56/96 (58%), Gaps = 8/96 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
           M+LE+   G +  A E + D + Y+ GVY HT  +          HAV ++G+G +   G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
            KYW+  NSWGE WG+ G F+IRRGTDE  IES  V
Sbjct: 415 EKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAV 450


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 48/82 (58%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  V     + D + YK GVY+H  G++ GGHAV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIES 85
             WG  G F I RG +E  IES
Sbjct: 302 TDWGMNGHFLILRGNNECGIES 323


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 60/104 (57%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E ++D  +YK G+Y HT+  M         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG++E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLGVWGRV 459


>gi|118380384|ref|XP_001023356.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89305123|gb|EAS03111.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 590

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 65/131 (49%), Gaps = 19/131 (14%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE-------------MSGGHAVKIIGW 47
           M  EI+  G IV + E   D + Y KG+Y H+V                   H+V   GW
Sbjct: 454 MMEEIYKNGPIVVSFEPKMDFMYYNKGIY-HSVDANQWIQNNEENPVWQKVDHSVLCYGW 512

Query: 48  GVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS-DLEEFEY 106
           G ++  K+WL  NSWGE WG+ G F++RRGTDES IES    A  V   R S +  EF  
Sbjct: 513 GEDENGKFWLLQNSWGEEWGENGNFRMRRGTDESNIESMGERANIVKTARKSPNTTEF-- 570

Query: 107 DTDTTIESSSD 117
              +T  S SD
Sbjct: 571 --SSTYSSHSD 579


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A +  ++DL+ YK GVY H  G   G H ++I+GWG EDGV YWL  NSW 
Sbjct: 247 EIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVANSWN 306

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
           E WG+ G  ++ R  +E  I   QV+AG        DL  F Y
Sbjct: 307 EDWGEKGYMRVLRWRNECGIVD-QVTAGL------PDLSNFPY 342


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 32/62 (51%), Positives = 42/62 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +  EI+  G +  A   + D ++YK GVYQH  GEM GGHA++I+GWGVE+G  YWL  N
Sbjct: 133 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 192

Query: 61  SW 62
           SW
Sbjct: 193 SW 194


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 41/101 (40%), Positives = 50/101 (49%), Gaps = 16/101 (15%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG----------------HAVKI 44
           M  EI+  G +    E + DL  YK GVY+H   E                    HAV +
Sbjct: 321 MMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLM 380

Query: 45  IGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
           +GWGVE+G  YW   NSW   WGD G FKI RG+DE  +ES
Sbjct: 381 VGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVES 421


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 57/94 (60%), Gaps = 6/94 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG---HAVKIIGWGVEDGVKYWL 57
           +Q E+  +G +V       D  +YK GVY  +  + + G      K+IGWGVE+GV YWL
Sbjct: 213 IQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKS--DKAKGIRTQYAKLIGWGVENGVDYWL 270

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
            +NSWG  WG  GLFKI+ GT++  +ESF V AG
Sbjct: 271 VINSWGHEWGQKGLFKIKSGTNQCGVESF-VYAG 303


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLGVWGRV 459


>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
          Length = 453

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M  E+ H G +  A E + D + Y  G+Y HT         E++  HAV ++G+G   + 
Sbjct: 350 MMKELVHHGPMAVAFEVYPDFMHYAGGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 408

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G KYW+  NSWG  WG+ G F+IRRG+DE  IES  V+A  + +
Sbjct: 409 GEKYWIVKNSWGTSWGENGFFRIRRGSDECSIESIAVAATPIPK 452


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY+H  G+  GG AVK++GWG  +G  YW   NSW 
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG GG   I RG +E  IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY+H  G+  GG AVK++GWG  +G  YW   NSW 
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG GG   I RG +E  IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323


>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
          CCMP2712]
          Length = 85

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 54/85 (63%), Gaps = 1/85 (1%)

Query: 1  MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
          MQLE+   G  V   + + D   YK GVY  +   +  GGHAV ++GWG E+GV YWL  
Sbjct: 1  MQLELMQNGPGVVVFDVYDDFYSYKSGVYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQ 60

Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
          NSWG+  GD G++K+R+G++E  IE
Sbjct: 61 NSWGKSSGDEGMWKVRKGSNECGIE 85


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score = 79.0 bits (193), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           EI   G + A +E H+D  +Y  G+Y+HT    +        G H+VKI GWG E   DG
Sbjct: 160 EIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDG 219

Query: 53  V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
              KYW+  NSWG+ WG+ G F+I RG +E  IE+F +   GR+
Sbjct: 220 TTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIGVWGRI 263


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLCVNSWGELW 66
           G I  A+  ++D   YK GVY H  G   GGHAVKI+GWG +   K  YW+C NSWGE W
Sbjct: 263 GPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWGEDW 322

Query: 67  GDGGLFKIRRGTDESRI 83
           G  G F I RG  E  I
Sbjct: 323 GIKGYFWILRGRGECGI 339


>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
 gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
          Length = 467

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y++G+Y HT             G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IE+F +   GRV
Sbjct: 416 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 459


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y++G+Y HT             G H+VKI GWG E   DG
Sbjct: 304 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 363

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IE+F +   GRV
Sbjct: 364 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 407


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 38/87 (43%), Positives = 51/87 (58%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G ++A+I  + D ++YK GVY  T    + G   ++IIGWG E  + YWLC 
Sbjct: 185 IQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLCA 244

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSW E WG  G  KI+RG     IES+
Sbjct: 245 NSWNEEWGANGYVKIQRGVQAGYIESY 271


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLGVWGRV 459


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 45/111 (40%), Positives = 58/111 (52%), Gaps = 4/111 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQ--DLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWL 57
           +  E++  G +  A    Q  D   YK GVY+H  G + GGHAVK+IGWG  D G  YWL
Sbjct: 241 IMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWL 300

Query: 58  CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDT 108
             N W   WGD G FKI RG +E  IE   V+AG      ++   +  + T
Sbjct: 301 LANQWNRGWGDDGYFKIIRGENECGIEG-DVTAGMPSTKNTARNNDVAFGT 350


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY+H  G+  GG AVK++GWG  +G  YW   NSW 
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG GG   I RG +E  IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 56/106 (52%), Gaps = 14/106 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE-MSGGHAVKIIGWGVEDG-------VKY 55
           EI+  G + A    +   + YK GVY H + + M GGHA+KI+GWGVE          KY
Sbjct: 225 EIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQKPTKY 284

Query: 56  WLCVNSWGELWGDGGLFKIRRGTD-----ESRIESFQVSAGRVDRD 96
           W+C NSW   WG  G FKIRRG +     E  IE  QV AG    D
Sbjct: 285 WICANSWTADWGMNGFFKIRRGKNRFGQSECGIED-QVFAGHPKLD 329


>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
          Length = 224

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 52/92 (56%), Gaps = 6/92 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS----GGHAVKIIGWGV--EDGVK 54
           +Q EI   G + AA   + D + Y  GVY  +   ++    GGHAV ++GWG   E G  
Sbjct: 132 IQSEILSNGPVFAAFWVYSDFMAYTGGVYSASKEALAQGKTGGHAVMMVGWGTDKETGQD 191

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           YWL  NSW E WGD G FKI+RG DE  IES 
Sbjct: 192 YWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y++G+Y HT             G H+VKI GWG E   DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 414

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IE+F +   GRV
Sbjct: 415 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 458


>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
          Length = 500

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 12/108 (11%)

Query: 3   LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-----------HAVKIIGWGVED 51
           +E++  G ++   E   D + Y+ G+Y H+V E               H+V   GWG ED
Sbjct: 379 MELYTNGPVIMNFEPSYDFMYYESGIY-HSVAEHDWSTQERPEWEKVDHSVLCYGWGEED 437

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
           GVK+WL  NSWG  WG+ G F+++RG DES IES   +A  V   +S+
Sbjct: 438 GVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAADPVIYSKSN 485


>gi|290980380|ref|XP_002672910.1| predicted protein [Naegleria gruberi]
 gi|284086490|gb|EFC40166.1| predicted protein [Naegleria gruberi]
          Length = 302

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 54/94 (57%), Gaps = 6/94 (6%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMSGGHAVKIIGWGVEDGVKYWLC 58
           MQ  I   GSI+  ++ +QD I Y  GVY+H  +  +       +I+GWG  +GV YW+ 
Sbjct: 208 MQQAILQGGSIMTEMDVYQDFIYYSSGVYEHDPSFTQPIAKTVARIVGWGSLNGVNYWIV 267

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIE----SFQV 88
            N WG+ WG  G   +RRGT+ES IE    +FQV
Sbjct: 268 ANVWGKTWGLDGYVLVRRGTNESNIEKDAYAFQV 301


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 351 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 410

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 411 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 454


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM--------SGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             VKYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 354


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
           griseus]
 gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
          Length = 465

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT             G H+VKI GWG E   DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQSGIYSHTPISQGRPEQYRRHGTHSVKITGWGEEKLPDG 414

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RGT+E  IESF +   GRV
Sbjct: 415 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIESFVLGVWGRV 458


>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
          Length = 483

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 55/99 (55%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           M+LE+   G +  A E   D + Y  G+Y HT         E++  HAV ++G+G   + 
Sbjct: 380 MKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTN-HAVLLVGYGRDPDT 438

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G  YW   NSWG  WG+ G F+IRRGTDE  IES  V+A
Sbjct: 439 GTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAA 477


>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
          Length = 301

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 52/85 (61%), Gaps = 1/85 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           M   + + G +  A   + D   Y  GVYQH  G M GGHAV+++G+G+ E G+KYW+  
Sbjct: 207 MMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESGLKYWIIR 266

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSWG  WG+GG F+I R  +E  IE
Sbjct: 267 NSWGPDWGEGGYFRIIRRVNECGIE 291


>gi|294888968|ref|XP_002772645.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239877055|gb|EER04461.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 419

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 64/111 (57%), Gaps = 9/111 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------GHAVKIIGWGVEDGVKY 55
           E+   G +V +I+   D++ Y+ GVY+  +   S         GH+V +IG+GV++G  Y
Sbjct: 304 ELVDDGPLVVSIKPAHDMMYYRSGVYRSDLERDSYHRPEWEEVGHSVLLIGYGVDNGEDY 363

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL-EEFE 105
           WL  NSWG  WG+ G  ++ RG DES +ES  V+A  V+  R  D+ + FE
Sbjct: 364 WLIQNSWGPEWGEDGYLRLARGMDESGVESIAVAADVVEDQRPLDMFKNFE 414


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
           [Cricetulus griseus]
          Length = 470

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 57/104 (54%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
           M+LE+   G +  A E   D + Y  G+Y HT         E++  HAV ++G+G   + 
Sbjct: 367 MKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTN-HAVLLVGYGRDPDT 425

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW   NSWG  WG+ G F+IRRGTDE  IES  V+A  + +
Sbjct: 426 GTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAAIPIPK 469


>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
 gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
          Length = 158

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
           M  ++   G + A +  ++D   YK GVY H  G M G HA+KI+GWGV+   K  YW+C
Sbjct: 63  MMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVGWGVDSASKLPYWIC 122

Query: 59  VNSWGELWGDGGLFKIRRGTDE 80
            NSWGE WG  G F I RG  E
Sbjct: 123 ANSWGEDWGLDGYFWIARGRGE 144


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Nomascus leucogenys]
          Length = 467

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 361 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 420

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 421 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 464


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 317 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 376

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 377 RTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 420


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 7/94 (7%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE----DGVKYW 56
           EI   G + A ++  +D  +YK GVY+ +    G  +G H+V+I+GWG E      VKYW
Sbjct: 341 EILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYW 400

Query: 57  LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           +  NSWG  WG+ G F+I +G DE  IE F ++A
Sbjct: 401 IASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 211 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 270

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 271 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 314


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428


>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Nomascus leucogenys]
          Length = 436

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428


>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
          Length = 455

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M +E+   G +  A+E + D + YK G+Y HT         E++  HAV ++G+G     
Sbjct: 352 MMVELVKNGPMAVALEVYSDFMSYKGGIYHHTGLTDHVNPFELTN-HAVLLVGYGRCHMT 410

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G KYW+  NSWG  WG+ G F+IRRG+DE  IES  V+A  + +
Sbjct: 411 GQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPK 454


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 1/84 (1%)

Query: 2   QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVN 60
           Q EI   G ++A    + D   +K  VY  +       HAV+++GWG   DGV YW+  N
Sbjct: 185 QYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIAAN 244

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SWG  WGD G FKIRRG+DE+  E
Sbjct: 245 SWGTGWGDKGYFKIRRGSDEAAFE 268


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDG 414

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 415 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 458


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 428


>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
           niloticus]
          Length = 461

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M LE+   G +  A E + D + YK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 358 MMLELVKNGPMAVAFEVYPDFMNYKEGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 416

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW+  NSWG  WG+ G F+IRRG DE  IES  V+A  + +
Sbjct: 417 GQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 460


>gi|442758365|gb|JAA71341.1| Hypothetical protein [Ixodes ricinus]
          Length = 353

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/98 (42%), Positives = 56/98 (57%), Gaps = 17/98 (17%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVED- 51
           MQ+ +F    +V      +D  +Y  GVY+HT          + S  H+V+I+GWGV+  
Sbjct: 229 MQMALFKHSMLVK-----EDFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRT 283

Query: 52  ---GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
                KYWLC NSWG  WG+ G F+I RG DES+IESF
Sbjct: 284 QYRPQKYWLCANSWGSGWGENGYFRIVRGEDESQIESF 321


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 235 ELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDG 294

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 295 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 338


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 343 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 402

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 403 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 446


>gi|23344736|gb|AAN28681.1| cathepsin B [Theromyzon tessulatum]
          Length = 65

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/64 (54%), Positives = 43/64 (67%)

Query: 4  EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
          E+   G + AA+  + D + YK GVY H  G+  GGHAVK+IGWGVE+ V YWL VNSWG
Sbjct: 2  ELMKHGPVEAALTVYSDFLQYKSGVYHHVAGDELGGHAVKLIGWGVENKVPYWLVVNSWG 61

Query: 64 ELWG 67
            WG
Sbjct: 62 TTWG 65


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 41/69 (59%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +      +QD   Y  GVY H  G+  GGHAVKI+GWG +    YW+  NSWGE WG+
Sbjct: 216 GPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGLENYWIVANSWGEDWGE 275

Query: 69  GGLFKIRRG 77
            G F IR+G
Sbjct: 276 KGYFNIRQG 284


>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
          Length = 218

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +YK G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 107 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 166

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 167 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 210


>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
          Length = 462

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
           M+LE+   G +  A E   D + Y KG+Y HT         E++  HAV ++G+G +  +
Sbjct: 359 MKLELVQHGPMAVAFEVCDDFMHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAN 417

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G+ YW+  NSWG  WG+ G F+I RGTDE  IES  ++A  + +
Sbjct: 418 GMDYWIVKNSWGTSWGEKGYFRILRGTDECAIESIAMAATPIPK 461


>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
           niloticus]
          Length = 455

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
           M LE+   G +  A E + D + YK+G+Y HT         E++  HAV ++G+G   + 
Sbjct: 352 MMLELVKNGPMAVAFEVYPDFMNYKEGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 410

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW+  NSWG  WG+ G F+IRRG DE  IES  V+A  + +
Sbjct: 411 GQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 454


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 41/73 (56%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           I   G +      + D   YK G+Y H  G   GGHAVKI+GWG +    YW+  NSWGE
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271

Query: 65  LWGDGGLFKIRRG 77
            WG+ G F IR+G
Sbjct: 272 SWGEKGFFNIRQG 284


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 41/73 (56%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           I   G +      + D   YK G+Y H  G   GGHAVKI+GWG +    YW+  NSWGE
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271

Query: 65  LWGDGGLFKIRRG 77
            WG+ G F IR+G
Sbjct: 272 SWGEKGFFNIRQG 284


>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
          Length = 807

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 38/92 (41%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
           M  +I+  G I  ++    D     KK +Y      ++SGGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPPKDKKSIYVSGPNTKLSGGHAVMIVGWGEENGVPYWDC 251

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
            N++G  WGD G F+I+RG++E +IE++  +A
Sbjct: 252 ANTYGTNWGDHGYFRIKRGSNELKIETWPGAA 283


>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
          Length = 599

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 43/99 (43%), Positives = 53/99 (53%), Gaps = 15/99 (15%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------------GHAVKIIG 46
           M  EI   G IV + E   D + Y++G+Y H+V                    H+V  +G
Sbjct: 479 MMEEIHKNGPIVVSFEPAMDFMYYQEGIY-HSVDANDWILGDEDKLPQWEKVDHSVLCVG 537

Query: 47  WGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
           WG  +  KYWL  NSWGE WG+ G FKIRRGTDES IES
Sbjct: 538 WGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIES 576


>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
          Length = 476

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 49/81 (60%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI+  G +  +I+   DL+ YK G+Y+   G    GH + ++GWG E+G+ YW+  NSWG
Sbjct: 99  EIYAHGPVTCSIDVPDDLLEYKGGIYEDKTGIAGDGHDISVVGWGEENGIPYWIVRNSWG 158

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG+ G F+I RG +   IE
Sbjct: 159 TYWGEEGFFRIVRGKNNLGIE 179



 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 48/88 (54%), Gaps = 2/88 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
           MQ EI+  G I   ++  Q  + Y  GV+    G+  G HAV++ GWGV++  +  YW+ 
Sbjct: 379 MQAEIYARGPISCVMDVTQTFLDYTGGVFTSREGKWLGKHAVEVTGWGVDEETRTPYWIV 438

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESF 86
            NSWG  WG+ G F+I  G +   IE  
Sbjct: 439 RNSWGTYWGENGWFRIAMGQNLLNIEQM 466


>gi|339235557|ref|XP_003379333.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
 gi|316978004|gb|EFV61033.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
          Length = 448

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 60/108 (55%), Gaps = 16/108 (14%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------------VGEMSGGHAVKIIGW 47
           M+L + + G +   IEA  DLI Y+ G+YQHT               E++  HAV I+G+
Sbjct: 339 MRLALVNNGPLAVGIEAFDDLIHYRGGIYQHTKIHDDFNFPTKWNPFELTN-HAVLIVGY 397

Query: 48  GVE--DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
           GV+    + YW+  NSWG  WG+ G F+I+RG DE  IES  V A  +
Sbjct: 398 GVDKKSNIPYWIVKNSWGTNWGEHGYFRIKRGVDECGIESLAVQATPI 445


>gi|218139209|gb|ACK57788.1| cathepsin C [Litopenaeus vannamei]
          Length = 451

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+L +   G ++  +E + D + YK G+Y HT         E++  HAV ++G+G ++  
Sbjct: 350 MKLALIKGGPLIVGLEVYDDFLHYKSGIYHHTGLQDRFNPLELTN-HAVLLVGYGEDEAT 408

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G KYW   NSWGE WG+ G F+IRRG DE  IES  V A
Sbjct: 409 GEKYWSVKNSWGEEWGEDGYFRIRRGVDECAIESMAVEA 447


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 62/106 (58%), Gaps = 8/106 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED----GVKYW 56
           ++LEI   G + A+   + D   Y+KGVY  + G   GGHA+KIIGWG E      + YW
Sbjct: 246 IKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYW 305

Query: 57  LCVNSWGELWG-DGGLFKIRRGTDESRIESFQVSAG--RVDRDRSS 99
           L  NSWG  WG + G F+I RG +  +IE  +V AG  +V + +S+
Sbjct: 306 LIANSWGTDWGENNGYFRILRGQNHCQIEQ-KVIAGMIKVPQPKSA 350


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%), Gaps = 2/85 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
           +Q  I   G +++  + ++D   Y+ G Y+H  G + GGHA+K++GWGV +  V YW+  
Sbjct: 256 VQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWIVA 314

Query: 60  NSWGELWGDGGLFKIRRGTDESRIE 84
           NSW + WG  G F I RGT+E  IE
Sbjct: 315 NSWSDEWGMNGYFWILRGTNECSIE 339


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 54/99 (54%), Gaps = 9/99 (9%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDGVKY 55
           E++  G + A +E H+D  +YK G+Y+HT             G H+VKI G       KY
Sbjct: 387 ELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITGGRDGQTHKY 446

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
           WL  NSWG  WG+ G F+I RG +E  IE+F V   GRV
Sbjct: 447 WLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRV 485


>gi|387015548|gb|AFJ49893.1| Dipeptidyl peptidase 1-like [Crotalus adamanteus]
          Length = 464

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWG--VED 51
           M+LE+   G +  A E + D + Y  G+Y HT G M          HAV ++G+G   + 
Sbjct: 361 MKLELIKHGPMAVAFEVYNDFMYYSGGIYHHT-GLMDPFNPFELTNHAVLLVGYGSDPQT 419

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  +W+  NSWG  WG+ G F+IRRG+DE  IES  V++  + +
Sbjct: 420 GQPFWIVKNSWGSSWGEEGYFRIRRGSDECAIESIAVASTPIPK 463


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 59/104 (56%), Gaps = 10/104 (9%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+ E+   G +  A E + D I Y+ G+Y HT         E++  HAV ++G+G ++  
Sbjct: 465 MKHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTN-HAVLLVGYGTDEKT 523

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
           G  YW+  NSWG  WG+ G F+I RGTDE  IES  V+A  + +
Sbjct: 524 GEDYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATPIPQ 567


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSG-----GHAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H   T GE         HAVK++GWG   G   
Sbjct: 355 EIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQG 414

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 415 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 14/101 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSGG------HAVKIIGWGVEDGV- 53
           EI   G + A ++ H+D   YK G+Y+H      E SG       HAVK+ GWG   G  
Sbjct: 364 EIMQNGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQ 423

Query: 54  ----KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 424 GRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMS------GGHAVKIIGWGVEDGV-- 53
           EI H G + A ++ H+D   YK G+Y+H  +  E S        HAVK+ GWG   G   
Sbjct: 316 EIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQG 375

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG  WG+ G F+I RG +ES IE   ++A
Sbjct: 376 RKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415


>gi|294898698|ref|XP_002776344.1| cathepsin C, putative [Perkinsus marinus ATCC 50983]
 gi|239883254|gb|EER08160.1| cathepsin C, putative [Perkinsus marinus ATCC 50983]
          Length = 301

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 63/111 (56%), Gaps = 9/111 (8%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------GHAVKIIGWGVEDGVKY 55
           E+   G +V +I+   D++ Y+ GVY+  +   S         GH+V +IG+GV++G  Y
Sbjct: 186 ELVDDGPLVVSIKPAHDMMYYRSGVYRSDLERDSYHRPEWEEVGHSVLLIGYGVDNGEDY 245

Query: 56  WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSD-LEEFE 105
           WL  NSWG  WG+ G  ++ RG DES +ES  V+A  V+  R  D  + FE
Sbjct: 246 WLIQNSWGPEWGEDGYLRLARGMDESGVESIAVAADVVEDRRPLDTFKNFE 296


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y++G+Y HT             G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IE+F +   GRV
Sbjct: 416 RTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGVWGRV 459


>gi|241861813|ref|XP_002416350.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
 gi|215510564|gb|EEC20017.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
          Length = 127

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------------GGHAVKIIGWG 48
           M+L + H G +    E + D  +Y+ GVY+HT    S              HAV + G+G
Sbjct: 20  MRLALVHGGPVAVGFEVYPDFQMYQGGVYRHTGVHRSLNLGSPFDPFELTNHAVLVTGYG 79

Query: 49  V--EDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
           V  E G+KYW   NSWG  WG+ G F+I RGTDE  IES  V A  + 
Sbjct: 80  VDKETGLKYWSVKNSWGPGWGENGYFRILRGTDECGIESLAVEASPIP 127


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ GVY HT             G H+VKI GWG E   DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 384

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 428


>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
 gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
          Length = 457

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/103 (39%), Positives = 58/103 (56%), Gaps = 8/103 (7%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGV--EDG 52
           M LE+   G +  A E + D + YK+G+Y HT    S        HAV ++G+G     G
Sbjct: 354 MMLELVKNGPMGVAFEVYPDFMHYKEGIYHHTGLHDSYNPFELTNHAVLLVGYGQCHVTG 413

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
            K+W+  NSWG  WG+ G FK+RRG+DE  IES  V+A  + +
Sbjct: 414 QKFWVVKNSWGTKWGEEGFFKVRRGSDECAIESIAVAAKPIPK 456


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ GVY HT             G H+VKI GWG E   DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 415

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459


>gi|28804799|dbj|BAC57943.1| cathepsin C [Marsupenaeus japonicus]
          Length = 449

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 56/98 (57%), Gaps = 8/98 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVED--G 52
           M++ +   G ++  +E + D + YK G+Y HT    S        HAV ++G+G ++  G
Sbjct: 348 MKIALIKGGPLIVGLEVYDDFLHYKSGIYHHTGLRDSFNPLELTNHAVLLVGYGEDETTG 407

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
            KYW   NSWGE WG+ G F+IRRG DE  IES  V A
Sbjct: 408 EKYWSVKNSWGEGWGEDGYFRIRRGVDECAIESMAVEA 445


>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
           [Loxodonta africana]
          Length = 468

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT             G H+VKI GWG E   DG
Sbjct: 357 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDG 416

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 417 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 460


>gi|226470090|emb|CAX70326.1| hypotherical protein [Schistosoma japonicum]
          Length = 456

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
           M+LE+ H G      E ++D   YK GVY HT  +             HAV ++G+GV+ 
Sbjct: 352 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 411

Query: 51  -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
             G  YW   NSWG  WG+ G F+IRRGTDE  +ES  V
Sbjct: 412 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 450


>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
 gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
          Length = 68

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 43/61 (70%)

Query: 24 YKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRI 83
          YK GVY++  G++ GGHAVK++GWG E G  YWL  NSW   WG+ G FKI RG++E  I
Sbjct: 4  YKSGVYKYIKGDLMGGHAVKLVGWGTEGGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 63

Query: 84 E 84
          E
Sbjct: 64 E 64


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 62/115 (53%), Gaps = 13/115 (11%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH--------TVGEMSGGHAVKIIGWGVE----D 51
           EI   G + A    ++D  +Y  GVYQH           ++ G H+V+IIGWG +     
Sbjct: 330 EIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGP 389

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFE 105
            VKYWL  NSWG  WG+ GLF+I RG +   IESF + A G+  + R   +++ +
Sbjct: 390 QVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGAWGKGAKKRRFKVQKLQ 444


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H V              HAVK+ GWG   G   
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARG 425

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|159488843|ref|XP_001702410.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
 gi|158271078|gb|EDO96905.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
          Length = 382

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLI-IYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           M  EI+H G I       +D    Y  G+Y+ T G+    H V+++GWG EDG KYW+  
Sbjct: 220 MMSEIYHRGPITCGQVCPEDFTWHYNGGIYKDTSGDTELDHDVEVVGWGEEDGEKYWIVR 279

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWG  WG+ G F++RRG +  ++ES
Sbjct: 280 NSWGTYWGERGFFRVRRGDNSLQLES 305


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H V              HAVK+ GWG   G   
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARG 425

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|226470084|emb|CAX70323.1| hypotherical protein [Schistosoma japonicum]
          Length = 462

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
           M+LE+ H G      E ++D   YK GVY HT  +             HAV ++G+GV+ 
Sbjct: 358 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 417

Query: 51  -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
             G  YW   NSWG  WG+ G F+IRRGTDE  +ES  V
Sbjct: 418 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 456


>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
           [Loxodonta africana]
          Length = 437

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT             G H+VKI GWG E   DG
Sbjct: 326 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDG 385

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 386 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 429


>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
          Length = 218

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT   +         G H+VKI GWG E   DG
Sbjct: 107 ELMENGPVQALMEVHEDFFLYEGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 166

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 167 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 210


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H V              HAVK+ GWG   G   
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQG 425

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|340509247|gb|EGR34799.1| hypothetical protein IMG5_001760 [Ichthyophthirius multifiliis]
          Length = 527

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 45/99 (45%), Positives = 54/99 (54%), Gaps = 15/99 (15%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE----MSGG---------HAVKIIGW 47
           M +EI   G IVA+I      + YK GVY H+V      ++G          HA    GW
Sbjct: 405 MMIEIMKNGPIVASINPDYQFMYYKSGVY-HSVEAAEWILNGQNAPEWRNVEHAALCYGW 463

Query: 48  G-VEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
           G  E   KYWL  NSWG+ WG+ G FKIRRGTDES +ES
Sbjct: 464 GESEKDGKYWLMQNSWGKEWGENGFFKIRRGTDESSVES 502


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY+H  G+  GG AVK++GWG  +G  YW   N+W 
Sbjct: 242 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKVANTWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G   I RG +E  IE
Sbjct: 302 TDWGMDGYLLILRGNNECNIE 322


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 46/89 (51%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E+   G    +   + D + Y  GVY+H  G   GGHAV+I+GWG  +G  YW   N
Sbjct: 239 FKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG  G F I RG DE  IE   V+
Sbjct: 299 SWNREWGMNGYFLIARGVDECGIEGSGVA 327


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 50/72 (69%), Gaps = 1/72 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
           +Q EI  +G + A +  +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 248

Query: 60  NSWGELWGDGGL 71
           NSW   WG+ GL
Sbjct: 249 NSWNSNWGNDGL 260


>gi|56754987|gb|AAW25676.1| SJCHGC01753 protein [Schistosoma japonicum]
          Length = 462

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
           M+LE+ H G      E ++D   YK GVY HT  +             HAV ++G+GV+ 
Sbjct: 358 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 417

Query: 51  -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
             G  YW   NSWG  WG+ G F+IRRGTDE  +ES  V
Sbjct: 418 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 456


>gi|386001804|ref|YP_005920103.1| Integrins alpha chain [Methanosaeta harundinacea 6Ac]
 gi|357209860|gb|AET64480.1| Integrins alpha chain [Methanosaeta harundinacea 6Ac]
          Length = 882

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 28/77 (36%), Positives = 44/77 (57%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + + +  ++D  +Y   +Y+H  G + G   V I+GWG  +   YW+C N
Sbjct: 253 IQQEVLFGGPVSSKMAVYEDFYLYDDDIYEHAAGALVGSQWVDILGWGTNNSTDYWICKN 312

Query: 61  SWGELWGDGGLFKIRRG 77
           SWG  WGD G F+I+ G
Sbjct: 313 SWGAAWGDSGWFRIKMG 329


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H +              HAVK+ GWG   G   
Sbjct: 366 EIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGG 425

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|320162754|gb|EFW39653.1| papain family cysteine protease [Capsaspora owczarzaki ATCC 30864]
          Length = 589

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 37/86 (43%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
           M+ EIF  G +   I    DLI Y  GV+  T G +   H+V + GWGV++ G  YW  V
Sbjct: 494 MKAEIFARGPVAVTIAVTTDLINYTGGVFHDTTGAIGDDHSVMLTGWGVDNSGTPYWTIV 553

Query: 60  NSWGELWGDGGLFKIRRGTDESRIES 85
           NSWG  WG+ G  +I RG +   IES
Sbjct: 554 NSWGTYWGETGAARIVRGVNNLGIES 579



 Score = 62.8 bits (151), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 30/85 (35%), Positives = 42/85 (49%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           M+ EIF  G I   I+A   L  Y  GV+          H + ++GWG      YW+  N
Sbjct: 194 MKAEIFARGPISCGIDATAALEAYTGGVFSEFSVLPIINHEISVVGWGTNGSTSYWIVRN 253

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           SWG  +G+ G F+I+ G D   IE+
Sbjct: 254 SWGSFYGEDGFFRIKMGGDNLAIET 278


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H +              HAVK+ GWG   G   
Sbjct: 365 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARG 424

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 425 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 46/89 (51%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E+   G    +   + D + Y  GVY+H  G   GGHAV+I+GWG  +G  YW   N
Sbjct: 239 FKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG  G F I RG DE  IE   V+
Sbjct: 299 SWNHEWGMNGYFLIARGVDECGIEGSGVA 327


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGVK- 54
           EI   G + A ++ H+D   YK G+Y+H    +           HAVK+ GWG   G K 
Sbjct: 366 EIMKNGPVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKG 425

Query: 55  ----YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 426 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465


>gi|158605299|gb|ABW74905.1| cathepsin C [Penaeus monodon]
          Length = 449

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 10/99 (10%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
           M+L +   G ++  +E + D + YK G+Y HT         E++  HAV ++G+G ++  
Sbjct: 348 MKLALIKGGLLIVGLEVYDDFLHYKGGIYHHTGLQDRFNPLELTN-HAVLLVGYGEDEAT 406

Query: 52  GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           G KYW   NSWGE WG+ G F+IRRG DE  IES  V A
Sbjct: 407 GEKYWSVKNSWGEDWGEDGYFRIRRGVDECAIESMAVEA 445


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWGELWG 67
           G +  A   + D + Y+ GVYQHT G + GGHAV+++G+G ++  V YW+  NSWG  WG
Sbjct: 212 GPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVDYWIIRNSWGPDWG 271

Query: 68  DGGLFKIRRGTDESRIE 84
           + G F+I R T+E  IE
Sbjct: 272 EDGYFRIIRMTNECGIE 288


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A    ++D + Y  GVY H +G    GHAV+I+GWG    V YWL  NSW 
Sbjct: 247 EIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIANSWN 306

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
           E WG+ G  K  RG +E  IE
Sbjct: 307 EDWGEEGYMKFLRGYNECGIE 327


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
           E+   G + A +E H+D  +Y+ G+Y HT             G H+VKI GWG E   DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 310

Query: 53  --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
             +KYW   NSWG  WG+ G F+I RG +E  IESF +   GRV
Sbjct: 311 RMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLGVWGRV 354


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 366 EIMQNGPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARG 425

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G FKI RG +ES IE   ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465


>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL+ Y  GVY+HT G +S G HA++++G+G  +DG  YW   NSWG  W
Sbjct: 217 GPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNSWGSDW 276

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 277 GEDGYFRIVRGVNECRIE 294


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 53/92 (57%), Gaps = 2/92 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVY--QHTVGEMSGGHAVKIIGWGVEDGVKYWLC 58
           +Q +I   G ++A+ E  +D   Y  GVY       +  G HAV I+GWGVED   YWL 
Sbjct: 250 IQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWGVEDNTPYWLV 309

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
            NSWG  +G  G FKI RGT+E  IES  V++
Sbjct: 310 QNSWGTGFGIDGYFKIARGTNECNIESRLVTS 341


>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 204

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 50/87 (57%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G ++A+I    D ++YK GVY  T    + G   ++IIGWG E    YWLC 
Sbjct: 110 IQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCA 169

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSW + WG+ G  K+RRG     IES+
Sbjct: 170 NSWSKEWGENGYVKVRRGVQAGYIESY 196


>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 229

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 50/87 (57%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
           +Q EI   G ++A+I    D ++YK GVY  T    + G   ++IIGWG E    YWLC 
Sbjct: 135 IQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCA 194

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSW + WG+ G  K+RRG     IES+
Sbjct: 195 NSWSKEWGENGYVKVRRGVQAGYIESY 221


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 49/88 (55%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A    ++D + Y  GVY H +G    GHAV+I+GWG    V YWL  NSW 
Sbjct: 247 EIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIANSWN 306

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           E WG+ G  K  RG +E  IE   V+AG
Sbjct: 307 EDWGEEGYMKFLRGYNECGIED-DVTAG 333


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 48/77 (62%), Gaps = 4/77 (5%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI   G + A+   + D + YK GVY+ T     GGHAVKIIGWG ED   YWL VN
Sbjct: 306 IKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWG-ED---YWLVVN 361

Query: 61  SWGELWGDGGLFKIRRG 77
           SW + WGD G+FKI  G
Sbjct: 362 SWNKNWGDNGMFKIGCG 378


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL+ Y  GVY+HT G +S G HA++++G+G  +DG  YW   NSWG  W
Sbjct: 217 GPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNSWGSDW 276

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 277 GEDGYFRIVRGVNECRIE 294


>gi|226470086|emb|CAX70324.1| hypotherical protein [Schistosoma japonicum]
          Length = 456

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 52/99 (52%), Gaps = 11/99 (11%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
           M+LE+ H G      E  +D   YK GVY HT  +             HAV ++G+GV+ 
Sbjct: 352 MRLELVHNGPFPVGFEVFEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 411

Query: 51  -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
             G  YW   NSWG  WG+ G F+IRRGTDE  +ES  V
Sbjct: 412 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 450


>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 203

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 2/84 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + +A + ++D   YK GVY  T  E+   H VKIIGWG +   +YWL +N
Sbjct: 112 IKQEIFDNGPVFSAFKMYEDFRYYKSGVYVPTTKEVLSFHLVKIIGWGADSVQEYWLAMN 171

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW E WGD GL K+  G  ++R+E
Sbjct: 172 SWNEEWGDHGLIKMAFG--KNRLE 193


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 33/62 (53%), Positives = 41/62 (66%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q +I   G +VA    ++D   YK G+Y+HT G M+GGHAVKIIGWG E G  YWL  N
Sbjct: 133 IQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIAN 192

Query: 61  SW 62
           SW
Sbjct: 193 SW 194


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score = 75.5 bits (184), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H D   YKKG+Y+H                HA+K+ GWG   G   
Sbjct: 371 EIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQG 430

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 431 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 53/85 (62%), Gaps = 2/85 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + +A E ++D   YK GVY  T  E+   H +KIIGWG +   +YWL +N
Sbjct: 208 IKQEIFDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWGADSVREYWLAMN 267

Query: 61  SWGELWGDGGLFKIRRGTDESRIES 85
           +W E WGD GL K+  G  ++R+E+
Sbjct: 268 AWNEEWGDHGLIKMAFG--KNRLEN 290


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSG-----GHAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H   T  E S       HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVEDGVK- 54
           EI   G + A ++ H+D   YK G+Y+H +         +    HAVK+ GWG+  G K 
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKG 426

Query: 55  ----YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
               +W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 RKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466


>gi|402588459|gb|EJW82392.1| papain family cysteine protease containing protein [Wuchereria
           bancrofti]
          Length = 323

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 23  IYKKGVYQHT--VGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDE 80
            YK G+   T     M   HA ++IG+G E+G KYWL  NSWGE WGD G FKI RG + 
Sbjct: 252 FYKSGILPDTDECSTMEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKIERGINA 311

Query: 81  SRIESFQVSAG 91
            ++E++  SAG
Sbjct: 312 CKVETYVASAG 322


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 46/89 (51%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
            + E+   G    +   + D + Y  GVY+H  G   GGHAV+I+GWG  +G  YW   N
Sbjct: 239 FKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIAN 298

Query: 61  SWGELWGDGGLFKIRRGTDESRIESFQVS 89
           SW   WG  G F I RG DE  IE   V+
Sbjct: 299 SWNREWGMNGYFLIARGVDECGIEGSGVA 327


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 9/92 (9%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG---EMSGGHAVKIIGWGVEDG------VK 54
           EI   G + A +  H+D   Y+ G+Y H+       SG H+V+I+GWG E        +K
Sbjct: 377 EILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIK 436

Query: 55  YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
           +W   NSWG  WG+ G F+I RG +E  IESF
Sbjct: 437 FWRVANSWGRDWGEDGYFRIVRGNNECEIESF 468


>gi|405968896|gb|EKC33922.1| Dipeptidyl-peptidase 1, partial [Crassostrea gigas]
          Length = 392

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 8/98 (8%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVE--DG 52
           M++ +   G +  + E + D   YK GVY HT  +          HAV ++G+GV+   G
Sbjct: 291 MKINLVKNGPLSVSFEVYNDFFHYKGGVYVHTGLQEKFNPFEITNHAVLLVGYGVDAATG 350

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           VK+W   NSWG  WG+ G F+IRRGTDE  IES  V +
Sbjct: 351 VKFWTVKNSWGTQWGEDGYFRIRRGTDECSIESIAVQS 388


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMSGG------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H  +  E S        HAVK+ GWG   G   
Sbjct: 259 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQG 318

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 319 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 49/72 (68%), Gaps = 1/72 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
           +Q EI  +G + A +  +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWLAM 248

Query: 60  NSWGELWGDGGL 71
           NSW   WG  GL
Sbjct: 249 NSWNSNWGTNGL 260


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 41/66 (62%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           +Q E+   G + AA   ++D   Y+KG+Y H+ G   G HAVK++GWGVE+G KYW   N
Sbjct: 254 IQREMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGVENGTKYWNVAN 313

Query: 61  SWGELW 66
           SW   W
Sbjct: 314 SWSTDW 319


>gi|170579333|ref|XP_001894785.1| Papain family cysteine protease containing protein [Brugia malayi]
 gi|158598509|gb|EDP36387.1| Papain family cysteine protease containing protein [Brugia malayi]
          Length = 324

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 23  IYKKGVYQHT--VGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDE 80
            YK GV   T     M   HA ++IG+G E+G KYWL  NSWGE WGD G FK+ RG + 
Sbjct: 253 FYKSGVLPDTDECSTMEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKMERGVNA 312

Query: 81  SRIESFQVSAG 91
            ++E++  SAG
Sbjct: 313 CKVETYVASAG 323


>gi|145529217|ref|XP_001450397.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418008|emb|CAK83000.1| unnamed protein product [Paramecium tetraurelia]
          Length = 512

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 1/87 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKG-VYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
           M++EIF+ G IV  + A Q+L  Y+ G ++     +    H V ++GWGVEDGV+YW+  
Sbjct: 418 MKIEIFNRGPIVCGVYATQELDDYEGGYIFSQKTNKTILNHYVSVVGWGVEDGVEYWIVR 477

Query: 60  NSWGELWGDGGLFKIRRGTDESRIESF 86
           NSWG  WGD G  K++  +D   +E +
Sbjct: 478 NSWGSYWGDMGYAKMKMHSDNLLLEHY 504


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 50/84 (59%), Gaps = 1/84 (1%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EI+  G +VAA + ++D      G+Y H  G  +G HA K+IGWG E+G  YWL  N
Sbjct: 249 IRQEIYKNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAHADKVIGWGRENGTDYWLIAN 307

Query: 61  SWGELWGDGGLFKIRRGTDESRIE 84
           SW   WG+ G ++I R TD   IE
Sbjct: 308 SWNTDWGEDGYYRIVRETDNCEIE 331


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--GEMSGG------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H     E S        HA+K+ GWG   G   
Sbjct: 363 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARG 422

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 423 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL  Y+ GVY+HT G ++ G HA++I+G+G  +DG  YW+  NSWG  W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 46/77 (59%), Gaps = 3/77 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDG--VKYWL 57
           +Q +I   G + AA   +QD   YK GVY+H  G ++GGHA+KI+GWGV  DG    YW+
Sbjct: 213 IQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWI 272

Query: 58  CVNSWGELWGDGGLFKI 74
             NSW   WG  G F I
Sbjct: 273 VANSWNTNWGQEGFFWI 289


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL  Y+ GVY+HT G ++ G HA++I+G+G  +DG  YW+  NSWG  W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294


>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 305

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 45/77 (58%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
           G +      H+D + Y  G+Y  T G   GGHAV I+G+G  +   YW+  NSWG  WG+
Sbjct: 221 GPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNNHDYWIVRNSWGSDWGE 280

Query: 69  GGLFKIRRGTDESRIES 85
            G F+I RGT+E  IE+
Sbjct: 281 NGYFRILRGTNECGIEN 297


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/62 (51%), Positives = 42/62 (67%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
           ++ EIF  G + A+   ++D   YK G+Y HT G+  GGHAVKIIGWGVE+G K W+  N
Sbjct: 135 IRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIGWGVENGTKXWIVAN 194

Query: 61  SW 62
           SW
Sbjct: 195 SW 196


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H           +    HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY++  G+  GG AV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G   I RG +E  IE
Sbjct: 302 TDWGMNGYMLILRGNNECNIE 322


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL  Y+ GVY+HT G ++ G HA++I+G+G  +DG  YW+  NSWG  W
Sbjct: 183 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 242

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 243 GENGYFRIVRGVNECRIE 260


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 44/81 (54%)

Query: 5   IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           I+    +  A   + D ++YK   YQ   GEM GGHA+ I+G  VE+   YWL  N W  
Sbjct: 254 IYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTSYWLVANXWNR 313

Query: 65  LWGDGGLFKIRRGTDESRIES 85
            WGD G FKI RG D   IES
Sbjct: 314 DWGDNGFFKILRGQDHYGIES 334


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 9   GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
           G +   I  + DL  Y+ GVY+HT G ++ G HA++I+G+G  +DG  YW+  NSWG  W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276

Query: 67  GDGGLFKIRRGTDESRIE 84
           G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 45/81 (55%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           E++  G  VA    + DL  YK GVY++  G+  GG AV+I+GWG  +G  YW   NSW 
Sbjct: 242 ELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWD 301

Query: 64  ELWGDGGLFKIRRGTDESRIE 84
             WG  G   I RG +E  IE
Sbjct: 302 TDWGMNGYMLILRGNNECNIE 322


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWGV  G   
Sbjct: 359 EIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQG 418

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 419 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
           EI   G + A    ++D + YK GVY H+ G     HA++I+GWG E  V YWL  NSW 
Sbjct: 247 EIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGDVPYWLIANSWN 306

Query: 64  ELWGDGGLFKIRRGTDESRIESFQVSAG 91
           E WG+ G  K  RG +E  IE   V+AG
Sbjct: 307 EDWGEKGYMKFLRGLNECGIED-DVTAG 333


>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 56/98 (57%), Gaps = 11/98 (11%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE--MSG---------GHAVKIIGWGVEDG 52
           EI++ G +V   E   D + Y  G++  T  +  ++G          H+V   GWG E+G
Sbjct: 385 EIYNNGPVVLNFEPSFDFMFYVGGIFHSTTPDWIINGLAKPEWEKVDHSVLCYGWGEENG 444

Query: 53  VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
           VKYWL  NSWG+ WG+ G F+++RG DES IES   +A
Sbjct: 445 VKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 482


>gi|324518532|gb|ADY47133.1| Cysteine proteinase [Ascaris suum]
          Length = 334

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 2/86 (2%)

Query: 7   HFGSIVAAIEAHQDLIIYKKGVYQ--HTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
           ++G +   +    +   YK G+ +  +   +M   HA +++G+GVEDG++YW+  NSWG 
Sbjct: 247 NYGPVALNVAIPPNYKFYKSGIMRDSYECWQMQPNHAAEVVGFGVEDGIEYWIMKNSWGS 306

Query: 65  LWGDGGLFKIRRGTDESRIESFQVSA 90
            WG+ G F+I RG +  ++E+F  SA
Sbjct: 307 WWGENGFFRIERGKNACQVETFATSA 332


>gi|145517168|ref|XP_001444467.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411889|emb|CAK77070.1| unnamed protein product [Paramecium tetraurelia]
          Length = 339

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 51/76 (67%), Gaps = 2/76 (2%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
           ++ +I + G +VA ++ ++D ++Y+ GVYQ   G     GGHA+KIIGWG ++G +YW+ 
Sbjct: 247 IKRDIINRGPVVAIMQVYKDFLVYRDGVYQVLEGTPRFHGGHAIKIIGWGEQNGYQYWII 306

Query: 59  VNSWGELWGDGGLFKI 74
            N+WG  WG  GL K+
Sbjct: 307 ENTWGTSWGTEGLAKL 322


>gi|321476446|gb|EFX87407.1| hypothetical protein DAPPUDRAFT_312322 [Daphnia pulex]
          Length = 334

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 55/88 (62%), Gaps = 3/88 (3%)

Query: 1   MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--GEMSGGHAVKIIGWGVEDGVKYWLC 58
           MQ  + +FG +VAA+   Q  + Y  GVY   +  G++   HAV ++GWG ++G+ YW+ 
Sbjct: 242 MQYALTNFGPLVAAMTVVQSFMDYASGVYDDKICDGKLVN-HAVVLVGWGNQNGIDYWIG 300

Query: 59  VNSWGELWGDGGLFKIRRGTDESRIESF 86
            NSWG  WG  G F I+RG ++ +IE++
Sbjct: 301 RNSWGPGWGKEGYFLIQRGVNKCQIETY 328


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 4   EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
           EI   G + A ++ H+D   YK G+Y+H                HAVK+ GWG   G   
Sbjct: 367 EIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQG 426

Query: 54  ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
              K+W+  NSWG+ WG+ G F+I RG +ES IE   ++A
Sbjct: 427 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.138    0.435 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,878,089,180
Number of Sequences: 23463169
Number of extensions: 119982342
Number of successful extensions: 285999
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5033
Number of HSP's successfully gapped in prelim test: 1051
Number of HSP's that attempted gapping in prelim test: 279065
Number of HSP's gapped (non-prelim): 6236
length of query: 175
length of database: 8,064,228,071
effective HSP length: 132
effective length of query: 43
effective length of database: 9,262,057,059
effective search space: 398268453537
effective search space used: 398268453537
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)