BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10826
(175 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/84 (57%), Positives = 64/84 (76%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+H+G + A+ + ++D YK GVY +T G++ GGHAVKIIGWGVE+GV YWL N
Sbjct: 199 IQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIAN 258
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG +G+ G FKIRRGT+E +IE
Sbjct: 259 SWGTSFGEKGFFKIRRGTNECQIE 282
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/84 (57%), Positives = 64/84 (76%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+H+G + A+ + ++D YK GVY +T G++ GGHAVKIIGWGVE+GV YWL N
Sbjct: 245 IQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIAN 304
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG +G+ G FKIRRGT+E +IE
Sbjct: 305 SWGTSFGEKGFFKIRRGTNECQIE 328
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 68/104 (65%), Gaps = 4/104 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+H G + A+ + ++D YK GVYQ+T G++ GGHAVKIIGWG E+GV YWL N
Sbjct: 249 IQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGTENGVDYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA----GRVDRDRSSD 100
SWG +GD G FK+RRGT+E IE V+ G D R D
Sbjct: 309 SWGTTFGDSGFFKMRRGTNEVGIEGNVVAGTAKLGTHDEKREDD 352
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A ++D +YK GVYQH G + GGHA+KI+GWGVEDGV YWLC N
Sbjct: 238 IQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGVEDGVPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG+D IES +V AG
Sbjct: 298 SWNTDWGDNGYFKILRGSDHCGIES-EVVAG 327
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 52/103 (50%), Positives = 67/103 (65%), Gaps = 1/103 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G + AA + + D YK GVY+H G + GGHA+KI+GWGVE+G KYWLC NSWG
Sbjct: 252 EIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNSWG 311
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
E WGD G FKI RG + IE+ V AG + ++ E++Y
Sbjct: 312 EDWGDHGFFKIVRGENHLGIET-DVHAGLPHYRKHKEMFEYDY 353
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 108 bits (270), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A ++D ++YK GVYQH G GGHA+KI+GWGVEDGV YWLC N
Sbjct: 238 IQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGVEDGVPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FK RG+D IES +V AG
Sbjct: 298 SWNTDWGDNGFFKFLRGSDHCGIES-EVVAG 327
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 51/90 (56%), Positives = 60/90 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + + D + YK GVYQH GE GGHAV+I+GWG E GV YWL N
Sbjct: 238 IQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEESGVPYWLVAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WGD GLFKIRRG +ES E V+A
Sbjct: 298 SWNEDWGDKGLFKIRRGNNESGFEDSIVAA 327
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 61/82 (74%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F G +VAA++ + D + YK G+YQ+T G + G HAVKI+GWG +DG+ YWLC N+WG
Sbjct: 247 EVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWGEDDGIDYWLCANTWG 306
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WG GG+FKIRRG +E IE+
Sbjct: 307 NSWGMGGMFKIRRGRNECGIEN 328
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 64/96 (66%), Gaps = 1/96 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V ++D+ YK GVY+HT G + GGHA+KIIGWG ++G+ YWL N
Sbjct: 124 IQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 183
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
SWG WG+ G FKIRRG +E IE+ V AG+ D D
Sbjct: 184 SWGTKWGENGFFKIRRGVNECGIEN-NVVAGKADVD 218
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/85 (56%), Positives = 59/85 (69%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+H G + A+ +D YK GVY H G + GGHAVKIIGWG E+GV YWL N
Sbjct: 250 IQYEIYHNGPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTENGVDYWLVAN 309
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG +G+ G FKIRRGT+E +IES
Sbjct: 310 SWGTSFGEKGFFKIRRGTNECQIES 334
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/90 (52%), Positives = 59/90 (65%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G + AA + D ++YK GVYQH G+M GGHAVKI+GWG E+G YWL NSW
Sbjct: 240 ELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKILGWGEENGTPYWLVANSWN 299
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
WGD G FKI+RG DE IES V+ +
Sbjct: 300 SDWGDKGFFKIKRGNDECGIESEMVAGAPL 329
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A+ ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 211 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIAN 270
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG DE IES +V+AGR++
Sbjct: 271 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 303
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 60/89 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ +GS A+ + D + Y GVYQ+T G GGHA+K++GWGVE+G YWLC N
Sbjct: 255 IKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGVENGTPYWLCAN 314
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG+ G FKI RG++E IES V+
Sbjct: 315 SWNSSWGENGFFKILRGSNECGIESGMVA 343
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 64/89 (71%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+H G + A+ + ++D YK GVY + G++ GGHAVKIIGWG E+ V YWL N
Sbjct: 248 IQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTENDVDYWLVAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG +G+GG FKIRRGT+E +IES V+
Sbjct: 308 SWGIKFGEGGFFKIRRGTNECQIESNVVA 336
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI+ G + A ++D ++YK GVYQH G GGHA+K++GWG E+G YWLC N
Sbjct: 238 IQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWGEENGTPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG+D IES
Sbjct: 298 SWNTDWGDNGFFKILRGSDHCGIES 322
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA + D ++YK GVYQH GEM GGHAV+I+GWGVEDG YWL N
Sbjct: 263 IMAEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGN 322
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 323 SWNTDWGDSGFFKILRGQDHCGIES-EIVAG 352
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVYQ T GGHAVK+IGWGVE+G YWL V
Sbjct: 244 IQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGTDE RI+S
Sbjct: 304 NSWNAQWGDNGLFKIRRGTDECRIDS 329
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 49/86 (56%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVYQ T GGHAVK+IGWGVE+G YWL V
Sbjct: 34 IQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 93
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGTDE RI+S
Sbjct: 94 NSWNAQWGDNGLFKIRRGTDECRIDS 119
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 51/91 (56%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A+ + + D YK GVY+H G M GGHAVK+IGWGVE+G KYWLC N
Sbjct: 240 IKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGVENGTKYWLCSN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SWGE WG+ G FKI RG + IES V AG
Sbjct: 300 SWGEDWGERGFFKIVRGENHCGIES-DVHAG 329
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA + D ++YK GVYQH GEM GGHAV+I+GWGVEDG YWL N
Sbjct: 239 IMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGRDHCGIES-EIVAG 328
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G + A ++DL+ YK GVYQHT G++ GGHA+KIIGWGVE GV YW N
Sbjct: 248 IATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVESGVDYWWVAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI++G DE IES Q+ AG
Sbjct: 308 SWNNDWGDNGFFKIKKGVDECGIES-QIVAG 337
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 248 IQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTPYWLIAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG DE IES +V+AGR++
Sbjct: 308 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 340
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 53/94 (56%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA +QD + Y+ GVY+H G GGHA+KI+GWGVE G KYWL N
Sbjct: 159 IQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVEAGNKYWLVAN 218
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WGD G FKI RG DE IES V AG VD
Sbjct: 219 SWNEDWGDKGTFKIARGDDECGIES-SVVAGMVD 251
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 105 bits (261), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 64/96 (66%), Gaps = 1/96 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A+ + ++D YK GVY++ G+M GGHA+KIIGWG E+G YWL N
Sbjct: 236 IQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGTENGTAYWLIAN 295
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
SWG WG+ G FKIRRG +E IE+ V AG+ D D
Sbjct: 296 SWGTKWGENGFFKIRRGVNECGIEN-NVVAGKADVD 330
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 162 IQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG DE IES +V+AGR++
Sbjct: 222 SWNEDWGENGYFRIVRGRDECSIES-EVTAGRIN 254
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 64/94 (68%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A+ ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 248 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG DE IES +V AGR++
Sbjct: 308 SWNEDWGENGYFRIVRGRDECSIES-EVIAGRIN 340
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +EI+ G + AA + D ++YK GVYQH GEM GGHAV+I+GWGVE+G YWL N
Sbjct: 240 IMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGVENGTPYWLVGN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 300 SWNTDWGDNGFFKILRGRDHCGIES 324
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 173 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 232
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 233 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 268
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/87 (55%), Positives = 58/87 (66%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI + G + A ++D + YK GVYQH GE GGHAVK+IGWGVE+ YWL VNSW
Sbjct: 260 EIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTPYWLIVNSWN 319
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSA 90
E WGD G FKI RG++E IE V+A
Sbjct: 320 ETWGDQGTFKILRGSNECGIEDEVVTA 346
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 251 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGVENGTPYWLVAN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 311 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 346
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D ++YK GVYQHT G GGHA+K++GWG EDGV YWLC N
Sbjct: 238 IQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWGEEDGVPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G FKI RG+D IES
Sbjct: 298 SWNTDWGENGFFKILRGSDHCGIES 322
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A ++D ++YK GVYQH G GGHA+K++GWG E+GV YWLC N
Sbjct: 238 IQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWGEENGVPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FK RG+D IES
Sbjct: 298 SWNTDWGDNGFFKFLRGSDHCGIES 322
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G IVA+ ++DL YK+GVYQH GE GGH +KI GWG+E+G YWL N
Sbjct: 240 IQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGIENGTPYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG DE IE VSAG
Sbjct: 300 SWNTDWGNNGFFKIPRGKDECGIE-IDVSAG 329
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 223 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 282
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 283 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 312
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 70/108 (64%), Gaps = 3/108 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M +I +G IVA + ++D + YK+GVY G GGHAV+IIGWG +D + YWL N
Sbjct: 439 MMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDNIPYWLVAN 498
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV--DRDRSSDLEEFEY 106
SW +G+ GLFKIRRG DE IES+ VSAGR ++ S++ F+Y
Sbjct: 499 SWNTTFGEDGLFKIRRGFDECGIESY-VSAGRAKCKQNISNNFNSFKY 545
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DL+ YK+GVYQH G+M GGHA++I+GWGVE+G KYWL N
Sbjct: 244 IQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENGTKYWLIAN 303
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +SAG
Sbjct: 304 SWNSDWGDNGFFKILRGEDHLGIES-SISAG 333
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+F+FG A + D + YK GVY+HT G G H+VK++GWGVE+ VKYWLC N
Sbjct: 281 IMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVKYWLCAN 340
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WGDGG FKI RG D E+
Sbjct: 341 SWGAQWGDGGFFKIVRGEDHLSFET 365
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 145 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 204
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 205 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 240
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 61/96 (63%), Gaps = 1/96 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + DL+ YK GVY+HT GE GGHA+KI+GWGVE+G KYWL N
Sbjct: 240 IKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGVENGNKYWLIAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
SW WGD G FKI RG D IES + AG D
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIES-SIVAGEPSYD 334
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/97 (51%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA + D ++YK GVYQH GEM GGHAV+I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPCTDQ 334
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/82 (54%), Positives = 57/82 (69%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F+FG + A+ + D I YK GVY+HT G G H+VKI+GWGVE+G K+WLC NSWG
Sbjct: 290 ELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHSVKIVGWGVENGTKFWLCANSWG 349
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WG+ G FKI RG D +ES
Sbjct: 350 AEWGENGFFKIIRGEDHLSVES 371
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 1/103 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G + AA +QDL YK GVY+H G M+GGHAVK++GWGVE+G+KYWL NSWG
Sbjct: 241 EIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVENGLKYWLVANSWG 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
+ WGD G FKI RG + IE V AG ++ +L +
Sbjct: 301 DDWGDNGFFKIVRGENHCGIEK-DVHAGLPSFNKHKELAGIHF 342
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/98 (52%), Positives = 65/98 (66%), Gaps = 1/98 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G + AA +QDL YK GVY+H G M+GGHAVK++GWGVE+G+KYWL NSWG
Sbjct: 241 EIYINGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGVENGLKYWLVANSWG 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
+ WGD G FKI RG + IE V AG ++ +L
Sbjct: 301 DDWGDNGFFKIVRGENHCGIEK-DVHAGLPSFNKHKEL 337
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVYQ T GGHAVK+IGWGVE+G+ YWL V
Sbjct: 74 IQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMV 133
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGTDE I+S
Sbjct: 134 NSWSAQWGDNGLFKIRRGTDECGIDS 159
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 166 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 225
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 226 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 261
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + ++D + YK GVYQ T G +GGHA+KI+GWGVEDG YWL N
Sbjct: 242 IQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTPYWLAAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WGD G FKI RG +E IES
Sbjct: 302 SWNEDWGDKGYFKILRGQNECGIES 326
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 161 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 220
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 221 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 256
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++DL+ YK+GVYQH GE GGHA+KI+GWGVE+ YWL N
Sbjct: 238 IQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTPYWLVAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG+DE IE Q+ AG
Sbjct: 298 SWNTDWGNNGFFKILRGSDECGIED-QIVAG 327
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/91 (54%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G IVA+I ++DL YK GVYQH GE+ GGH +KI+GWGVE+ YWL N
Sbjct: 239 IQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG+DE IE Q+ AG
Sbjct: 299 SWNTDWGNNGFFKILRGSDECGIED-QIVAG 328
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 109 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 168
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 169 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 204
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 60/86 (69%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVYQ T GGHAVK+IGWGVE+G+ YWL V
Sbjct: 244 IQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMV 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGTDE I+S
Sbjct: 304 NSWSAQWGDNGLFKIRRGTDECGIDS 329
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 60/89 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVY+HT+G+ GGHAVKI+GWGVE+G KYWL N
Sbjct: 242 IRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG D IES V+
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVA 330
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA + D ++YK GVYQH GEM GGHAV+I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGRDHCGIES-EIVAG 328
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/95 (50%), Positives = 64/95 (67%), Gaps = 1/95 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A I H D + YK GVY+H G++ H+V+IIGWG+E+ + YWLC N
Sbjct: 238 IRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
SW E WG G FKI RG++E IESF V+AG+VD
Sbjct: 298 SWNEDWGLNGYFKILRGSNECEIESF-VNAGKVDN 331
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 60/90 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVY+HT+G+ GGHAVKI+GWGVE+G KYWL N
Sbjct: 242 IRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNKYWLIAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG D IES V+
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 331
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 64/94 (68%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A I H D + YK GVY+H G++ H+V+IIGWG+E+ + YWLC N
Sbjct: 155 IRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIENDIPYWLCAN 214
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG G FKI RG++E IESF V+AG+VD
Sbjct: 215 SWNEDWGLNGYFKILRGSNECEIESF-VNAGKVD 247
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/89 (55%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVYQHT G GGHAVKI+GWGVE+G KYWL N
Sbjct: 242 IKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGVENGSKYWLIAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG D IES V+
Sbjct: 302 SWNSDWGDNGFFKILRGEDHCGIESSIVT 330
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 219
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 249
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 162 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 222 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 251
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 53 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 112
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 113 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 148
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 176 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 235
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 236 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 271
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 42/81 (51%), Positives = 57/81 (70%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
+++ G + AA + D + YK GVY +T G++ GGHA+KI+GWGV+DG KYWLC NSW
Sbjct: 251 DLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDGTKYWLCANSWS 310
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ GLF+I RG +E IE
Sbjct: 311 RSWGENGLFRILRGNNECHIE 331
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 111 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 170
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 171 SWNTDWGDNGFFKILRGQDHCGIES-EVVAG 200
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 60/90 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G I A+ ++D + YK GVYQH G+ GGHAVK++GWGVE+G YW VN
Sbjct: 253 IMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGVENGTPYWTIVN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WGD G FKI RG +E IES V+A
Sbjct: 313 SWNESWGDKGTFKILRGKNECGIESSCVTA 342
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G + AA + D ++YK GVYQH GE+ GGHA+KI+GWG E G YWL NSW
Sbjct: 238 ELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAIKILGWGEESGTPYWLAANSWN 297
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
WGD G FKI+RG DE IES V+ ++
Sbjct: 298 GDWGDKGFFKIKRGNDECGIESEMVAGTPLN 328
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 95 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 154
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 155 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 190
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A + D I YK GVYQH GE GGHA++++GWG E+ V YWLC N
Sbjct: 154 IQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVLGWGYENDVPYWLCAN 213
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG+DE IES V+
Sbjct: 214 SWNTDWGDKGYFKILRGSDECGIESSIVAG 243
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I++ G +VAA ++D YK G+Y+H G GGHAVK+IGWG E G YWL VN
Sbjct: 234 IQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGTERGTPYWLAVN 293
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G F+I RG DE IES
Sbjct: 294 SWGSQWGESGTFRILRGVDECGIES 318
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 1/92 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + A ++D ++YK GVYQH G GGHA+K++GWG E+GV YWLC N
Sbjct: 259 IKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWGEENGVPYWLCAN 318
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
SW WGD G FKI RG D IES ++ AG
Sbjct: 319 SWNTDWGDNGFFKILRGADHCGIES-EIVAGN 349
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D+ YK GVY+HT G + GGHA+KIIGWG ++G+ YWL N
Sbjct: 230 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 289
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG WG+ G K+RRG +E IE V+
Sbjct: 290 SWGANWGENGFLKMRRGVNECGIERAVVA 318
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++DL+ YK+GVY+H G GGHA+KI+GWGVE+G YWL N
Sbjct: 239 IQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGVENGTPYWLIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG+DE IE VSAG
Sbjct: 299 SWNTDWGNNGFFKILRGSDECGIE-IDVSAG 328
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 251 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVGN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 311 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 346
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D+ YK GVY+HT G + GGHA+KIIGWG ++G+ YWL N
Sbjct: 230 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 289
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG WG+ G K+RRG +E IE V+
Sbjct: 290 SWGANWGENGFLKMRRGVNECGIERAVVA 318
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/89 (52%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E++ G + A + DL+ YK GVY+HTVG GGHA+KI+GWGVE+G KYWL N
Sbjct: 245 IKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYWLIAN 304
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG D IES V+
Sbjct: 305 SWNSDWGDNGFFKILRGEDHCGIESSIVA 333
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 230 IMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVGN 289
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 290 SWNTDWGDNGFFKILRGQDHCGIES-EVVAGIPRTDQ 325
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A ++D ++YK GVYQH G GGHA+K++GWG E+G YWLC N
Sbjct: 238 IQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWGEENGTPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG+D IES
Sbjct: 298 SWNTDWGDNGYFKILRGSDHCGIES 322
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + AA ++D YK GVY+HT G+ GGHA+KIIGWG E G YWL N
Sbjct: 236 IQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVAN 295
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG+ G FKI RG D+ IES V AG+
Sbjct: 296 SWGVNWGESGFFKIYRGDDQCGIES-AVVAGKA 327
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++DL+ YK+GVYQH G + GGHA+KI+GWGVE+ YWL N
Sbjct: 239 IQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG DE IE VSAG
Sbjct: 299 SWNTDWGNNGFFKILRGKDECGIE-IDVSAG 328
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI +G + A+I + D + YK GVY+H G + +V+IIGWG+E+G+ YWLC N
Sbjct: 243 IRREIMLYGPVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGIENGIPYWLCAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG G FKI RG++E IE+F V+AGRVD
Sbjct: 303 SWNEEWGLNGFFKILRGSNECEIEAF-VNAGRVD 335
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH G+M GGHA++I+GWG E+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWGEENGVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGDGG F+I RG D IES +V AG D+
Sbjct: 299 SWNTDWGDGGFFRILRGQDHCGIES-EVVAGIPRTDQ 334
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 64/91 (70%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+F G + A ++DL+ YK+GVYQHT G+M GGHA++I+GWGVE+ K+WL N
Sbjct: 241 IQKELFTNGPVEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTKFWLIAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG+D IES ++AG
Sbjct: 301 SWNSDWGDNGYFKILRGSDHLGIES-SIAAG 330
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/89 (55%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A + D YK GVY H G+ +GGHAVKIIGWG E GV YWL N
Sbjct: 239 IQNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG +GD G FKIRRGT+E IES V+
Sbjct: 299 SWGTSFGDKGFFKIRRGTNECGIESNVVA 327
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVYQ T GGHAVK+IGWGVE+G YWL V
Sbjct: 245 IQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGTDE I+S
Sbjct: 305 NSWNAQWGDNGLFKIRRGTDECGIDS 330
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 61/89 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE G YWL N
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKGKPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIESHVVA 337
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 60/89 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D+ YK GVY+HT G + GGHA+KIIGWG ++G+ YWL N
Sbjct: 187 IQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGTQNGIPYWLIAN 246
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG WG+ G K+RRG +E IES V+
Sbjct: 247 SWGADWGENGFLKMRRGVNECGIESAVVA 275
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 40/60 (66%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D IYKKGVYQ+T G++ G HA+KI+GWG E G YWL NSWG G
Sbjct: 4 GPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQCGS 63
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWG E+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGTENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG D+
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPRTDQ 334
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/85 (56%), Positives = 58/85 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++ + D Y+ G+YQ T E GGHAVKI+GWGVEDGVKYWL N
Sbjct: 232 IQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGVEDGVKYWLVAN 291
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ GLF+I RG DE IES
Sbjct: 292 SWNERWGENGLFRIIRGRDEVGIES 316
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G + AA ++D ++YK GVYQH G+M GGHA+KI+GWG E+ YWL N
Sbjct: 237 IMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWGKENNTPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG DE IES
Sbjct: 297 SWNTDWGDNGFFKILRGKDECGIES 321
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D ++YK GVYQH G + GGHA+K++GWG EDG+ YWLC N
Sbjct: 238 IQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWGEEDGIPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG++ IES
Sbjct: 298 SWNTDWGDNGFFKILRGSNHCGIES 322
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 61/98 (62%), Gaps = 2/98 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++DL+ YKKGVYQH GE GGHA++I+GWG E G YWL N
Sbjct: 239 IQKEIMTNGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGTEKGTPYWLIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS 98
SW WGD G FKI RG D IES V+ + +D S
Sbjct: 299 SWNSDWGDNGTFKILRGEDHCGIESSIVAG--IPKDSS 334
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DL+ YK GVYQH G+M GGHA++I+GWGVE+ V YWL N
Sbjct: 76 IQKEIMTNGPVEGAFTVYEDLVQYKDGVYQHVTGKMLGGHAIRILGWGVENDVPYWLIAN 135
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG+D IES Q+SAG
Sbjct: 136 SWNTDWGNNGFFKILRGSDHCGIES-QISAG 165
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/84 (55%), Positives = 56/84 (66%)
Query: 2 QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNS 61
Q EI G + A ++D YK GVYQHT G + GGHA+KI+GWGVE+G KYWL NS
Sbjct: 263 QTEIMTNGPVEADFTVYEDFPTYKSGVYQHTTGGVLGGHAIKILGWGVEEGTKYWLVANS 322
Query: 62 WGELWGDGGLFKIRRGTDESRIES 85
W WGD G FKI RG++E IES
Sbjct: 323 WNNEWGDNGFFKILRGSNECGIES 346
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G + AA + D + YK GVYQHT G+ GGHA+KIIGWGV+DG YW+ N
Sbjct: 248 IATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQDGTDYWIVAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW + WG+ G F I++GTDE IES QV AG
Sbjct: 308 SWNDSWGNDGFFWIKKGTDECGIES-QVVAG 337
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 46/90 (51%), Positives = 57/90 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D + YK GVYQH G+M GGHA++++GWGVEDGV YWL N
Sbjct: 242 IMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGVEDGVPYWLAAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG D IES V+
Sbjct: 302 SWNTDWGDNGFFKILRGKDHCGIESEMVAG 331
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D +YK GVYQH GE GGHA+KI+GWGVE+G YWLC N
Sbjct: 240 IMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES 324
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D +YK GVYQH GE GGHA+KI+GWGVE+G YWLC N
Sbjct: 240 IMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES 324
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EIF G + AA + D + YK GVYQH G+M GGHAV+I+GWGVE+G YWL N
Sbjct: 240 IMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 300 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 329
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 62/93 (66%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI FG + A+ ++D + YK GVYQ+ G GGHAVKIIGWGVE V YWL VN
Sbjct: 233 IQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVPYWLVVN 292
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW E WG+ GLFKI RG++ IE + AGR+
Sbjct: 293 SWNEGWGENGLFKILRGSNHVGIEG-GIYAGRL 324
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY+HT G+ GGHA+KIIGWG E G YWL N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG+ G FKI RG D+ IES V AG+
Sbjct: 297 SWGTSWGESGFFKIFRGDDQCGIES-AVVAGKA 328
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY+HT G+ GGHA+KIIGWG E G YWL N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG+ G FKI RG D+ IES V AG+
Sbjct: 297 SWGTSWGESGFFKIFRGDDQCGIES-AVVAGKA 328
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 64/94 (68%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A+ ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 253 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG DE IES +V AG+++
Sbjct: 313 SWNEDWGENGYFRIVRGRDECFIES-EVIAGQIN 345
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 56/84 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M+ EI+ G IV + + + D Y+ GVY+H G G HAV++IGWGVE+GVKYWLC N
Sbjct: 195 MKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGVENGVKYWLCAN 254
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WG+ G FKI RG + +E
Sbjct: 255 SWNERWGENGFFKIVRGENHVGVE 278
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DL+ YK+GVY H G+M GGHA++I+GWGVEDG KYWL N
Sbjct: 247 IQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGVEDGTKYWLIAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++AG
Sbjct: 307 SWNSDWGDNGFFKILRGEDHLGIES-SIAAG 336
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +I+ G + A + D +YK GVYQH GE GGHA+KI+GWGVE+G YWLC N
Sbjct: 240 IMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELGGHAIKILGWGVENGTPYWLCAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 300 SWNTDWGDNGFFKILRGKDHCGIES-EVVAG 329
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 52/85 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D Y GVYQHT G GGHA+KI+GWG E+GV YWL N
Sbjct: 246 IQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGTENGVPYWLVAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG DE IES
Sbjct: 306 SWNPSWGDSGFFKIIRGKDECGIES 330
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 59/85 (69%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+++EI G + AA + D++ YK GVY HT G GGHAVK++GWGVED +YWL N
Sbjct: 176 IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDEEEYWLVAN 235
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WGD G FKI+RG+DE IES
Sbjct: 236 SWGPDWGDQGFFKIKRGSDECGIES 260
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G + AA + DL YK G+Y+H G +SGGHAVK++GWGVE+GVKYWL NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
WG+ G FKI RG + IE + AG + R + ++
Sbjct: 339 REWGENGFFKIVRGENHCGIEE-NIHAGLPNFHRQGEAGKY 378
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE GGHA++I+GWGVE+G YWL N
Sbjct: 240 IMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGVENGTPYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG D+
Sbjct: 300 SWNTDWGDNGFFKILRGQDHCGIES-EIVAGIPRTDQ 335
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 60/90 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE YWL N
Sbjct: 211 IQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 270
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WG+ GLF+I RG DE IES V+
Sbjct: 271 SWNEDWGEKGLFRIVRGRDECSIESHVVAG 300
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G + AA + DL YK G+Y+H G +SGGHAVK++GWGVE+GVKYWL NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
WG+ G FK+ RG + IE + AG + R + ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 63/97 (64%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + A + D ++YK GVYQHT G++ GGHA++I+GWG E+GV YWL N
Sbjct: 239 IKAEIYKNGPVEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWGEENGVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG D+
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES-EIVAGIPRTDQ 334
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G + AA + DL YK G+Y+H G +SGGHAVK++GWGVE+GVKYWL NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
WG+ G FK+ RG + IE + AG + R + ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/87 (51%), Positives = 56/87 (64%)
Query: 3 LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSW 62
+E+ +G A + + D + YK GVY HT GE GGHAVK++GWGV++G YW NSW
Sbjct: 248 IELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGVQNGTPYWKIANSW 307
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVS 89
WGD G F IRRGTDE IES V+
Sbjct: 308 NSDWGDNGYFLIRRGTDECGIESTGVA 334
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G + AA + DL YK G+Y+H G +SGGHAVK++GWGVE+GVKYWL NSWG
Sbjct: 279 EIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWG 338
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
WG+ G FK+ RG + IE + AG + R + ++
Sbjct: 339 REWGENGFFKMVRGENHCGIEE-NIHAGLPNFHRQGEAAKY 378
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVAN 219
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES 244
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/87 (58%), Positives = 61/87 (70%), Gaps = 3/87 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--GGHAVKIIGWGVEDGVKYWLC 58
MQ + +G I A+ + + D + Y+ GVYQ T G S GGHAVK+IGWGVE+G YWL
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFMNYESGVYQRT-GNASYLGGHAVKMIGWGVEEGTPYWLM 299
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIES 85
VNSWGE WGD G+FKI RGTDE IES
Sbjct: 300 VNSWGEQWGDKGMFKILRGTDECGIES 326
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY+HT G+ GGHA+KIIGWG E G YWL N
Sbjct: 273 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 332
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG+ G F+I RG D+ IES V AG+
Sbjct: 333 SWGNSWGESGFFRIFRGDDQCGIES-AVVAGKA 364
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/87 (58%), Positives = 61/87 (70%), Gaps = 3/87 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--GGHAVKIIGWGVEDGVKYWLC 58
MQ + +G I A+ + + D + Y+ GVYQ T G S GGHAVK+IGWGVE+G YWL
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFMNYESGVYQRT-GNASYLGGHAVKMIGWGVEEGTPYWLM 299
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIES 85
VNSWGE WGD G+FKI RGTDE IES
Sbjct: 300 VNSWGEQWGDKGMFKILRGTDECGIES 326
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + ++D Y KG+Y+HT G GGHAVK+IGWG E+G+ YW+C N
Sbjct: 253 IQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIGWGTENGIPYWICSN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RGTDE IES
Sbjct: 313 SWNSDWGENGFFRILRGTDECGIES 337
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI + G + A+ ++D YK GVYQH G+ GGHA++I+GWGVE+GV YWL N
Sbjct: 247 IQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGVEEGVPYWLIAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G K+ RG D IES Q++AG
Sbjct: 307 SWNTDWGDNGYIKLLRGKDHCGIES-QITAG 336
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+F G + AA ++D ++YK GVYQH G GGHA+KI+GWG E+GV YWL N
Sbjct: 238 IMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWGEENGVPYWLAAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 298 SWNTDWGDNGYFKILRGEDHCGIES-EIVAG 327
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ ++D + YK GVYQH GE +GGHA+KI+GWGVE+ YWL N
Sbjct: 242 IQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WGD G FKI RG++E IE
Sbjct: 302 SWNEDWGDKGYFKILRGSNECGIE 325
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A ++D YK GVYQH G GGHA+K+IGWG E+GV YWLC N
Sbjct: 238 IQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWGEENGVPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG++ IES +V AG
Sbjct: 298 SWNTDWGDNGFFKILRGSNHCGIES-EVVAG 327
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG DE IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I + G + A QD Y+ G+Y H G+ GGHA+KI+GWG ED V YWLC N
Sbjct: 218 IQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGWGTEDNVDYWLCAN 277
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG WG G FKIRRGTDE IE
Sbjct: 278 SWGANWGIQGYFKIRRGTDECGIE 301
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G I A ++D + YK GVYQH G GGHAVK++GWGVE+G YW+ VN
Sbjct: 253 IMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGVENGTPYWIIVN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WGD G FKI RG +E IES V+A
Sbjct: 313 SWNESWGDKGTFKILRGQNECGIESECVTA 342
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D ++YK GVYQH G + GGHAV+++GWG E+GV YWL N
Sbjct: 261 LQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEENGVPYWLLAN 320
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG +E IES V+
Sbjct: 321 SWNTEWGDKGFFKIYRGRNECGIESEAVA 349
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+ G + A E + D YK GVYQH G + GGHA+K++GWG EDGV YWLC N
Sbjct: 258 IKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWGEEDGVPYWLCAN 317
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+GG FKI RG + IES
Sbjct: 318 SWNTDWGEGGFFKILRGKNHCGIES 342
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 61/92 (66%), Gaps = 1/92 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + A+ ++DL +Y+ GVYQH G G HA+K++GWG+ DGVKYW VN
Sbjct: 218 IQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGLHAIKVVGWGILDGVKYWTIVN 277
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
SW E WG GL IRRG DE IES V AG+
Sbjct: 278 SWAEDWGFDGLLLIRRGVDECGIES-DVVAGQ 308
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 56/86 (65%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + D +YK G+Y H G +G HA++IIGWGVE+GVKYWL NSW
Sbjct: 239 EIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGVENGVKYWLTANSWN 298
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
WG+ G F+I RGTDE RIES V+
Sbjct: 299 VGWGENGYFRILRGTDECRIESIVVA 324
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A ++D + YK GVYQH GEM GGHA++I+GWGVE+G++YWL N
Sbjct: 241 IMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVENGIRYWLAAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FK RG + IES ++ AG D+
Sbjct: 301 SWNIDWGDNGFFKFLRGKNHCGIES-EIIAGIPRTDQ 336
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVYQHT G GGHA+KI+GWGVE+G KYWL N
Sbjct: 202 IKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGVENGSKYWLIAN 261
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G KI RG D IES
Sbjct: 262 SWNSDWGDNGFLKILRGEDHCGIES 286
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES 323
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 58/93 (62%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY+HT G+ GGHA+KIIGWG E G YWL N
Sbjct: 237 IQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGTESGSPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG+ G FKI RG D+ IE V AG+
Sbjct: 297 SWGTNWGESGFFKILRGDDQCGIEG-AVVAGKA 328
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A ++D ++YK GVYQH GE GGHA++I+GWGVE+G YWL N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 300 SWNTDWGDNGFFKILRGEDHCGIES-EIVAG 329
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK+IGWG+EDG+ YW C N
Sbjct: 267 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTCAN 326
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 327 SWNTDWGEDGFFRILRGVDECGIES 351
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + A + D YK GVYQ EM GGHA++I+GWG EDGV YWL N
Sbjct: 96 IKTEIYKNGPVEADFSVYADFPSYKSGVYQRHSEEMLGGHAIRILGWGTEDGVPYWLVAN 155
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKIRRG DE IE ++AG
Sbjct: 156 SWNEDWGDKGYFKIRRGNDECGIED-DINAG 185
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G + AA ++D ++YK GVYQH G+M GGHA+KI+GWG E+ YWL NSW
Sbjct: 239 ELYKNGPVEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENNTPYWLAANSWN 298
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WG+ G FKI RG DE IES +V AG
Sbjct: 299 TDWGNQGFFKILRGGDECGIES-EVVAG 325
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
MQ E++ G AA ++D YK GVY H G+M GGHAV ++GWGVEDG YWL N
Sbjct: 188 MQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGVEDGTPYWLIQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G FKI RG +E IE+
Sbjct: 248 SWGTTWGEQGFFKILRGKNECGIET 272
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + A ++D ++YK GVYQH G GGHA+KI+GWG E+G+ YWLC N
Sbjct: 154 IKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWGEENGIPYWLCAN 213
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G FKI RG++ IES
Sbjct: 214 SWNTDWGNNGFFKILRGSNHCGIES 238
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/90 (52%), Positives = 59/90 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + AA + DL+ YK GVY+H G+ GGHA+KIIGWGVE+G KYWL N
Sbjct: 121 IKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNKYWLIAN 180
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WG+ G FKI RG D IES V+
Sbjct: 181 SWNTDWGNNGFFKILRGEDHCGIESSIVAG 210
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 58/88 (65%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI +G + A + D + YK GVYQH G GGHAVKI+GWG E+GV YWLC NSW
Sbjct: 246 EILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCANSWN 305
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGDGG FKI RG + +IE+ ++AG
Sbjct: 306 TDWGDGGFFKILRGYNHCKIEA-DINAG 332
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 61/86 (70%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A++E + D YK GVY+ + GGHAVK+IGWG EDGV YWL V
Sbjct: 247 IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPYWLMV 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW E+WGD GLFKIRRGT+E +++
Sbjct: 307 NSWSEMWGDKGLFKIRRGTNECSVDN 332
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 50/88 (56%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI +G + A + D YK GVYQH GE GGHA+KI+GWG EDG YWL NSW
Sbjct: 3 EIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWGNEDGHDYWLVANSWN 62
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WGD G FKI RG DE IES Q++AG
Sbjct: 63 EDWGDQGFFKILRGVDECGIES-QITAG 89
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 61/86 (70%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A++E + D YK GVY+ + GGHAVK+IGWG EDGV YWL V
Sbjct: 247 IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPYWLMV 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW E+WGD GLFKIRRGT+E +++
Sbjct: 307 NSWSEMWGDKGLFKIRRGTNECSVDN 332
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 112 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 171
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 172 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 201
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 160 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 219
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 220 SWNTDWGDNGFFKILRGQDHCGIES 244
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A+ ++D + YK G+Y+H GE GGHA++IIGWGVE+ YWL N
Sbjct: 248 IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLIAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ G F+I RG DE IES
Sbjct: 308 SWNEDWGENGYFRIVRGRDECFIES 332
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/94 (53%), Positives = 59/94 (62%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI H+G + AA + D YK GVY+HT G GGHA+KIIGWG E G YWL N
Sbjct: 253 IQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGTEGGDDYWLINN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW WGD G FKI RG++E IE +V A VD
Sbjct: 313 SWNSDWGDKGTFKILRGSNECGIEG-EVVAATVD 345
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/95 (50%), Positives = 59/95 (62%), Gaps = 1/95 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D YK GVYQHT G M GGHA++I+GWG E+G YWL N
Sbjct: 235 IQTEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGTENGTPYWLVAN 294
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
SW E WG G FKI RG D+ IES Q++AG +
Sbjct: 295 SWNEDWGAMGYFKIIRGKDDCGIES-QITAGMPKK 328
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A + D + YK GVYQH G++ GGHA++I+GWG E+G YWL N
Sbjct: 243 IQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGTENGTPYWLVAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKI RG DE IE ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIED-DINAG 332
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 65/93 (69%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A ++ ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTSYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG DE IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/89 (52%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVY+HTVG GGHA+KI+GWGVE+G KY L N
Sbjct: 245 IKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNKYRLIAN 304
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG D IES V+
Sbjct: 305 SWNSDWGDNGFFKILRGEDHCGIESSIVA 333
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + ++D Y G+Y+HT G+ GGHAVK++GWG E+G YW+C N
Sbjct: 255 IQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICAN 314
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG+ G F+I RG DE +IES V+
Sbjct: 315 SWNSDWGENGFFRILRGVDECQIESSVVA 343
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
MQ + +G I A+ + + D Y+ GVYQ T GGHAVK+IGWGVE+G YWL V
Sbjct: 241 MQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVEEGTPYWLMV 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWGE WGD G+FKI RGTDE +ES
Sbjct: 301 NSWGEQWGDKGMFKILRGTDECGVES 326
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A + ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTSYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG DE IESF V AG++
Sbjct: 309 TWNEDWGEKGYFRIVRGRDECLIESFIV-AGQI 340
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E++ G + A + DL+ YK GVY+H G+ GGHA+KI+GWGVE+G KYWL N
Sbjct: 240 IKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNKYWLIAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG D IES V+
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 329
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI + G + AA + D Y+ GVY+H G++ GGHAVKIIGWG+++G YWL N
Sbjct: 226 IQTEITN-GPVEAAFIVYDDFNHYRSGVYRHVAGKLVGGHAVKIIGWGIQNGAPYWLMAN 284
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SWG WG+ G FK+ RG DE IES + AG+ D
Sbjct: 285 SWGPYWGENGFFKMLRGVDECGIES-TIVAGKPD 317
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 53/85 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D YK GVYQHT G GGHA+KI+GWG E+G YWL N
Sbjct: 162 IQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAIKILGWGEENGTPYWLVAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI+RG DE IES
Sbjct: 222 SWNSDWGDEGFFKIKRGNDECGIES 246
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A + D + YK GVYQH G++ GGHA++I+GWG E+G YWL N
Sbjct: 243 IQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGTENGTPYWLVAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKI RG DE IE ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIED-DINAG 332
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 63/91 (69%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +EI G + +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL N
Sbjct: 246 IMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F++RRG +E IE+ +++AG
Sbjct: 306 SWNEGWGEKGYFRMRRGNNECGIEA-RINAG 335
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/90 (52%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + A + DL+ YK GVY+HT G+ GGHAVKI+GWGVE+ KYWL N
Sbjct: 241 IRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNKYWLIAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG D IES V+
Sbjct: 301 SWNSDWGDNGFFKILRGEDHCGIESSIVTG 330
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A ++D ++YK GVYQH GE GGHA++++GWGV++G YWL N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGVDNGTPYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG +R
Sbjct: 300 SWNTDWGDNGFFKILRGEDHCGIES-EIVAGIPSTER 335
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E++ G + A + DL+ YK GVY+H G+ GGHA+KI+GWGVE+G KYWL N
Sbjct: 240 IKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGVENGNKYWLIAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG D IES V+
Sbjct: 300 SWNSDWGDNGFFKILRGEDHCGIESSIVAG 329
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A ++DL+ YK+GVYQH G+M GGHA++I+GWGVE+ KYWL N
Sbjct: 242 IRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENNTKYWLIAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++AG
Sbjct: 302 SWNSDWGDNGFFKILRGEDHLGIES-SIAAG 331
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A + D YK GVYQ EM GGHA++I+GWG EDGV YWL N
Sbjct: 121 IKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGTEDGVPYWLVAN 180
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKIRRG DE IE+ ++AG
Sbjct: 181 SWNEDWGDKGYFKIRRGNDECGIEN-DINAG 210
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 48/86 (55%), Positives = 55/86 (63%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
MQLEI G I AA + D + YK GVYQ T + S GGHA+K++GWGVE+G KYWL
Sbjct: 245 MQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKYWLIA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD G FK RG D IES
Sbjct: 305 NSWNTDWGDNGYFKFLRGVDHCGIES 330
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 55/90 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + D + YK GVYQH G + G HAV++IGWG E+G YWL N
Sbjct: 234 IQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIGWGEEEGTPYWLVAN 293
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD GLFKI RG+DE E +A
Sbjct: 294 SWNTDWGDNGLFKILRGSDECEFEGDMAAA 323
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWG E+GV YWL N
Sbjct: 162 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKENGVPYWLIAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I RG++ IE V AG V
Sbjct: 222 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 253
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 63/92 (68%), Gaps = 2/92 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G ++A + ++D +Y++GVY HT G + G HAVKIIGWG E+G YWL NSWG+ WG
Sbjct: 230 GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGA 289
Query: 69 -GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
GG+FKIRRGT+E +IE + G V +D S
Sbjct: 290 LGGVFKIRRGTNECKIEQ-SIITGHVRKDEKS 320
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 60/90 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE YWL N
Sbjct: 186 IQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 245
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WG+ GLF+I RG DE IES V+
Sbjct: 246 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 275
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 62/89 (69%), Gaps = 1/89 (1%)
Query: 3 LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSW 62
+EI G + +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL NSW
Sbjct: 248 MEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIANSW 307
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G F++RRG +E IE+ +++AG
Sbjct: 308 NEGWGEKGYFRMRRGNNECGIEA-RINAG 335
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+F G + A ++D ++YK GVYQH G GGHA+KI+GWG E+GV YWL N
Sbjct: 238 IMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWGEENGVPYWLAAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 298 SWNTDWGDNGYFKILRGEDHCGIES-EIVAG 327
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWGVE+ V YWL N
Sbjct: 253 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F++ RG +E IE +V AG V
Sbjct: 313 SWHNDWGEEGYFRMIRGINECGIEQ-EVVAGHV 344
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWGVE+ V YWL N
Sbjct: 253 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVENDVPYWLIAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F++ RG +E IE +V AG V
Sbjct: 313 SWHNDWGEEGYFRMIRGINECGIEQ-EVVAGHV 344
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 56/84 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI + G + AA + D + YK GVY+H G + GGHA++I+GWGVE+G YWL N
Sbjct: 241 IQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGVENGTPYWLVAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI +G+D IE
Sbjct: 301 SWNTDWGDNGTFKILKGSDHCGIE 324
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
+IF G + A + ++D++ Y GVY+H G + GGHAVK+IGWGVEDG KYWL NSWG
Sbjct: 220 DIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWGVEDGTKYWLVANSWG 279
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
+WGD G FK+ RG + IE V AG
Sbjct: 280 RVWGDDGFFKMVRGENHCGIEE-NVHAG 306
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA D + YK GVYQH GEM GGHAV+I+GWGVE+ YWL N
Sbjct: 239 IMAEIYKNGPVEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVENDTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES +V AG
Sbjct: 299 SWNTDWGDHGFFKILRGRDHCGIES-EVVAG 328
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+ YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES 323
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 56/89 (62%), Gaps = 1/89 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + D YK GVY+HT G+ GGHA+KI+GWG E+G YWL NSW
Sbjct: 250 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDDYWLVANSWN 309
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
WGD G FKI RG DE IES Q+SAG
Sbjct: 310 PDWGDQGFFKILRGQDECGIES-QISAGE 337
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 59/90 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE YWL N
Sbjct: 88 IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 147
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WG+ GLF+I RG DE IES V+
Sbjct: 148 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 177
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+ YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVENDTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 299 SWNTDWGDKGFFKILRGQDHCGIES 323
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 60/99 (60%), Gaps = 1/99 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + + D YK G+Y+H G GGHAVKI+GWGVE+G YWL N
Sbjct: 240 IQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWGVENGTPYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
SW WG+ G F+IRRGT+E IES V AG D R+S
Sbjct: 300 SWNVNWGEKGYFRIRRGTNECGIES-SVVAGIPDLKRNS 337
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 65/93 (69%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGRI 340
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWG E GV YWL N
Sbjct: 250 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 309
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I RG++ IE V AG V
Sbjct: 310 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 341
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE+ GGHA++I+GWGVE+G YWL N
Sbjct: 26 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGN 85
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG D IES
Sbjct: 86 SWNTDWGDNGFFKILRGQDHCGIES 110
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 58/85 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I +G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE YWL N
Sbjct: 249 IQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ GLF++ RG DE IES
Sbjct: 309 SWNEDWGEKGLFRMVRGRDECSIES 333
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK++GWG+E+G+ YW C N
Sbjct: 266 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGIENGIPYWTCAN 325
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 326 SWNTDWGEDGFFRILRGVDECGIES 350
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 56/89 (62%), Gaps = 1/89 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + D YK GVY+HT G+ GGHA+KI+GWG E+G YWL NSW
Sbjct: 250 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGTENGDDYWLVANSWN 309
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
WGD G FKI RG DE IES Q+SAG
Sbjct: 310 PDWGDQGFFKILRGQDECGIES-QISAGE 337
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ +I G + A + D + YK GVYQHT G GGHA++++GWG EDG YWLC N
Sbjct: 237 IKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWGEEDGTPYWLCAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG+D IES ++SAG
Sbjct: 297 SWNTDWGDNGYFKILRGSDHCGIES-EISAG 326
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 59/90 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + ++D + YK G+Y+H G + GGHA++IIGWGVE YWL N
Sbjct: 123 IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLIAN 182
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WG+ GLF+I RG DE IES V+
Sbjct: 183 SWNEDWGEKGLFRIVRGRDECSIESHVVAG 212
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/85 (55%), Positives = 52/85 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + D YK GVYQHT G GGHAVKI+GWG E+ YWL N
Sbjct: 240 IQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENNTPYWLIAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG DE IES
Sbjct: 300 SWNPTWGDKGYFKIIRGKDECGIES 324
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 59/90 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G + AA ++D + YK GVY+H G+ GGHA+KI+GWGVE+ YW+ VN
Sbjct: 248 IMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTPYWIVVN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW + WGD G FKI RG +E IE+ V+A
Sbjct: 308 SWNQTWGDNGTFKILRGKNECGIEAQVVTA 337
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A + D + YK GVYQH G GGHA++I+GWG E+G YWLC N
Sbjct: 238 IKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWGEENGTPYWLCAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD GLFKI RG+D IES ++SAG
Sbjct: 298 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 327
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 65/110 (59%), Gaps = 1/110 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++D ++Y G+Y HT G++ GGHAVK++GWGVE GV YWL N
Sbjct: 282 IQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVAN 341
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFEYDTD 109
SW WG+ G F+I RG DE IES V +++R + D D
Sbjct: 342 SWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKKYHRRYRLDND 391
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI + G + A + D + YK GVYQH G GGHA++++GWG E+G YWLC N
Sbjct: 237 IKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD GLFKI RG+D IES ++SAG
Sbjct: 297 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 326
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 56/90 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G + AA ++D +YK GVYQH G GGHAVKI+GWG E+G +WL N
Sbjct: 233 IMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWGEENGTPFWLVAN 292
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG DE IES V+
Sbjct: 293 SWNSDWGDNGYFKILRGHDECGIESEMVAG 322
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D + YK GVYQH G++ GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++IIGWGV E V YWL
Sbjct: 245 IQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES Q+SAG
Sbjct: 305 ANSWNTDWGDNGFFRILRGKDHCGIES-QISAG 336
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 55/86 (63%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + AA + + D + YK GVY+HT G GGHAVKIIG+G E G YWL NSW
Sbjct: 245 ELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGTESGQDYWLVANSWN 304
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
E WGD G FKI +G DE IES V+
Sbjct: 305 EDWGDKGFFKIAKGKDECGIESSIVA 330
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++D ++Y G+Y HT G++ GGHAVK++GWGVE GV YWL N
Sbjct: 241 IQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVPYWLVAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 301 SWNTDWGEDGFFRIIRGIDECGIES 325
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWG E GV YWL N
Sbjct: 162 IQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I RG++ IE V AG V
Sbjct: 222 SWHNDWGENGYFRILRGSNHCGIEE-NVVAGHV 253
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + ++D YK GVY++ G GGHAVKIIGWGVE+ V YWL N
Sbjct: 231 IQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGVEENVNYWLIAN 290
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG +G+ G FK+RRG +E IE++ V+
Sbjct: 291 SWGTGFGENGFFKMRRGNNECGIENYVVA 319
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D + YK GVYQH G++ GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG D IES ++ AG
Sbjct: 299 SWNTDWGDNGFFKILRGQDHCGIES-EIVAG 328
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 75/123 (60%), Gaps = 4/123 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G ++A +D +K GVY + G+ G H+VK+IGWG E+G+ YWL N
Sbjct: 204 IQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGTEEGIPYWLIAN 263
Query: 61 SWGELWGD-GGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIESSSDTK 119
SWG WG+ GG FK+RRGT+E IE +++AG+V + + EE T+ TI+ S
Sbjct: 264 SWGSEWGELGGFFKMRRGTNECWIEQ-EMTAGKVHIEGNERTEEM--TTNATIQGSGQKG 320
Query: 120 RAF 122
++
Sbjct: 321 QSL 323
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 53/85 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D YK GVYQH VG GGHA++I+GWG E+GV YWL N
Sbjct: 238 IQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTENGVPYWLIAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FK+ RG D+ IES
Sbjct: 298 SWNPSWGDKGYFKMIRGKDDCGIES 322
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ E + D + Y G+Y+H G M GGHAVK++GWG++ GV YWL N
Sbjct: 283 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAAN 342
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG +E IES
Sbjct: 343 SWNTDWGEDGYFRILRGVNECGIES 367
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/83 (54%), Positives = 54/83 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+F G + AA + DL+ YK GVY+HT G GGHA+KIIGWGVE+ KYWL N
Sbjct: 201 IKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNKYWLIAN 260
Query: 61 SWGELWGDGGLFKIRRGTDESRI 83
SW WGD G FKI RG D I
Sbjct: 261 SWNSDWGDNGFFKILRGEDHCGI 283
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A D YK G+YQHT G ++G HAV+I+GWGVE+G KYWL N
Sbjct: 240 IRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGVENGTKYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G FKI RG++ IES V+
Sbjct: 300 SWNSDWGDNGYFKILRGSNHVDIESAIVA 328
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A + D + YK GVYQH G GGHA++++GWG E+G YWLC N
Sbjct: 237 IKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTPYWLCAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD GLFKI RG+D IES ++SAG
Sbjct: 297 SWNTDWGDNGLFKILRGSDHCGIES-EISAG 326
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + ++D Y G+Y+HT G+ GGHAVK++GWG E+G YW+C N
Sbjct: 140 IQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICAN 199
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 200 SWNSDWGENGFFRILRGVDECEIES 224
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G+M GGHA++I+GWGVE+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG + IES ++ AG D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ + G + A++ + D + YK GVY+H G+ GGHAVK++GWGV+DG+ YW N
Sbjct: 246 LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG DE IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNDECGIESSGVA 334
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 56/87 (64%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI + G +V ++ ++D YK GVYQH G GGHAVKIIGWG E GV YWL N
Sbjct: 230 IQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIAN 289
Query: 61 SWGELWGD-GGLFKIRRGTDESRIESF 86
SWG W D G +KI RG + +IE++
Sbjct: 290 SWGAKWADLDGFYKILRGKNHCKIETY 316
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G+M GGHA++I+GWGVE+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG + IES ++ AG D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/89 (49%), Positives = 55/89 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G I+ E QD Y GVY+H GE G H VKI+GWGVE+GV YWL N
Sbjct: 228 IQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGVENGVPYWLIAN 287
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG WGD G FK+ RG +E IE++ +
Sbjct: 288 SWGSSWGDHGFFKMLRGQNECGIENYPYA 316
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G +VAA + D YK G+Y+H G +GGHAV+I+GWG + GV YWL N
Sbjct: 225 IQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWGQQGGVPYWLVAN 284
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I RG+DE IE V AG+V
Sbjct: 285 SWNTDWGENGYFRILRGSDECGIED-GVVAGQV 316
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 63/91 (69%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +EI G + +D ++YK G+Y +T G + GGHA+++IGWGVE+GVKYWL N
Sbjct: 21 IMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVENGVKYWLIAN 80
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F++RRG +E IE+ +++AG
Sbjct: 81 SWNEGWGEKGYFRMRRGNNECGIEA-RINAG 110
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/90 (51%), Positives = 56/90 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA + D + YK GVY+H GE GGHAV+I+GWG E G YWL N
Sbjct: 241 IKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGTESGTPYWLVAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG+DE IES V+
Sbjct: 301 SWNTDWGDKGYFKILRGSDECGIESSIVAG 330
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 60/101 (59%), Gaps = 1/101 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI++ G + A + + D YK GVY H G+ GHAVKIIGWG E V YWL N
Sbjct: 236 IQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKPSGHAVKIIGWGTEKKVDYWLVAN 295
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
SW +G+ G FKIRRGT+E IE V AG R++ L
Sbjct: 296 SWSTTFGENGFFKIRRGTNECGIEE-NVVAGLPKSKRNARL 335
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 54/82 (65%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + + +QD + YK GVY H G GGHA+KI+GWGVE+ VKYWL NSWG
Sbjct: 142 EIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVENNVKYWLVANSWG 201
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WG GLFKI+RG +E IE+
Sbjct: 202 PDWGLNGLFKIKRGDNECGIEA 223
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DLI+YK GVY+H G+ GGHA++IIGWGVE + YWL N
Sbjct: 247 IQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDIPYWLVAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG D IES +SAG
Sbjct: 307 SWNTDWGNNGFFKILRGKDHCGIES-SISAG 336
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 56/90 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D YK GVY+H G + GGHA++I+GWG E+GV YWL N
Sbjct: 271 IQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWGSENGVAYWLVAN 330
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG+DE IES V+
Sbjct: 331 SWNTDWGDKGYFKILRGSDECGIESSVVAG 360
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/88 (54%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA D YK GVYQH GE GGHA+KI+GWGVE+ YWL NSW
Sbjct: 240 EIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENNTPYWLVANSWN 299
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG+DE IE +V AG
Sbjct: 300 PSWGDNGFFKILRGSDECGIED-EVVAG 326
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
MQ+ I G + +++ + DL+ YK G+Y HT GE G HAV+IIGWG ++G+ YW+ N
Sbjct: 196 MQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTKNGIDYWIISN 255
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG GLF I+RG +E IE + V AG+V
Sbjct: 256 SWNTTWGMNGLFLIKRGVNECHIEDY-VCAGKV 287
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + AA D + YK GVY+H GE+ GGHA++I+GWG E+GV YWL N
Sbjct: 240 IMAEIYKNGPVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWGKENGVPYWLVGN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES +V AG D+
Sbjct: 300 SWNVDWGDNGFFKILRGEDHCGIES-EVVAGIPRTDQ 335
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 57/86 (66%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G +V A ++D + Y KGVY+H G+ GGHAVK+IGWG+E+ KYWL NSW
Sbjct: 235 ELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNSWN 294
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
WG+ G FKI RG + IES+ V+
Sbjct: 295 TTWGENGFFKIIRGKNCCAIESYVVA 320
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK+IGWG+EDG+ YW N
Sbjct: 281 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIEDGIPYWTVAN 340
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 341 SWNTDWGEDGFFRILRGVDECGIES 365
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ E + D + Y G+Y+H G M GGHAVK++GWG++ GV YWL N
Sbjct: 149 IQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVPYWLAAN 208
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG +E IES
Sbjct: 209 SWNTDWGEDGYFRILRGVNECGIES 233
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
MQ E++ G + A + D + YK GVY H G ++GGHAV IGWGVED YWLC N
Sbjct: 188 MQQELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDNTPYWLCQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G FKI RG++ IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 53/82 (64%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
++ +G + E + D Y GVY+H G + GGHAV+++GWGVEDG YWL NSW
Sbjct: 249 DLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANSWN 308
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WGDGG FKIRRG +E IES
Sbjct: 309 TDWGDGGYFKIRRGVNECGIES 330
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/82 (52%), Positives = 52/82 (63%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G + A ++D + YK GVYQH G GGHA+KI+GWG E+GV YWL NSW
Sbjct: 241 ELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAIKILGWGEENGVPYWLAANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WGD G FKI RG D IES
Sbjct: 301 TDWGDNGYFKILRGEDHCGIES 322
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D ++YK GVY+H GEM GGHA++I+GWG E+GV YWL N
Sbjct: 240 IMAEIYKNGPVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG D IES ++ AG D+
Sbjct: 300 SWNVDWGDSGFFKIVRGEDHCGIES-EIVAGIPRTDQ 335
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+++ G I ++D Y KG+Y+H G GGHAV ++GWG+EDGVKYWL N
Sbjct: 189 LQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGIEDGVKYWLVQN 248
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SWG WG+ G F+I RG++E IES AG VD
Sbjct: 249 SWGYEWGEQGYFRILRGSNECGIES-SAYAGDVD 281
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + AA + D YK GVY++T G GGHA+KI+GWGVE+ V YWL N
Sbjct: 237 IKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENNVPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WGD G FKI RG++E IE+ V AG V
Sbjct: 297 SWNPDWGDKGFFKILRGSNECGIEA-SVVAGMV 328
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + ++D +YK G+Y H G GGHAVK++GWGV++G YWL N
Sbjct: 238 IQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGVDNGTPYWLAAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW +WG+ G F+I RG DE IES V AG D +R
Sbjct: 298 SWNTVWGEKGYFRILRGVDECGIESAAV-AGMPDLNR 333
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 52/86 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G +V E D YK GVY+H G G HAV++IGWGVE+GVKYWL N
Sbjct: 227 IMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGVENGVKYWLVAN 286
Query: 61 SWGELWGDGGLFKIRRGTDESRIESF 86
SWG WGD G FK RG + IE F
Sbjct: 287 SWGVRWGDKGFFKFVRGENHLGIEDF 312
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 52/85 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D + YK GVYQH G+ GGHA++I GWGVE+ YWL N
Sbjct: 236 IQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIAN 295
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG+D IES
Sbjct: 296 SWNTDWGDSGTFKILRGSDHCGIES 320
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 59/90 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + ++D + YK G+Y+H G + GGHA++IIGWGV+ YWL N
Sbjct: 123 IQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKRTPYWLIAN 182
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW E WG+ GLF+I RG DE IES V+
Sbjct: 183 SWNEDWGEKGLFRIVRGRDECSIESNVVAG 212
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/85 (54%), Positives = 57/85 (67%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
MQ + +G I A+ + + D + Y+ GVYQ T + GGHAVK+IGWG EDG YWL V
Sbjct: 241 MQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPYWLMV 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSWGE WG G+FKI RGT+E IE
Sbjct: 301 NSWGEQWGANGMFKILRGTNECGIE 325
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/77 (57%), Positives = 51/77 (66%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + AA + D + YK GVYQH G+M GGHAV+I+GWGVE+G YWL NSW WGD
Sbjct: 42 GPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGD 101
Query: 69 GGLFKIRRGTDESRIES 85
G FKI RG D IES
Sbjct: 102 NGFFKILRGQDHCGIES 118
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 56/97 (57%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ G + +QD + YK GVY H G GGHA+KIIGWGVE GV YWL N
Sbjct: 223 IQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGVEGGVDYWLVAN 282
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG G FKI RG +E IE V AG D R
Sbjct: 283 SWSTDWGIDGTFKILRGHNECGIED-DVYAGPADLSR 318
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 57/86 (66%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ + G + A++ + D + YK GVY+H G+ GGHAVK++GWGV+DG+ YW NSW
Sbjct: 249 ELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWN 308
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
WGD G F I+RG DE IES V+
Sbjct: 309 TDWGDKGYFLIQRGNDECGIESSGVA 334
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 61/91 (67%), Gaps = 3/91 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +EI G + +D +YK G+YQ+T G + GGH IIGWGVE+GVKYWL N
Sbjct: 246 IMMEIITNGPVSTIYYIFEDFTVYKSGIYQYTSGSLMGGHG--IIGWGVENGVKYWLAAN 303
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F+IRRGT+E IES +++AG
Sbjct: 304 SWNEGWGENGYFRIRRGTNECGIES-RINAG 333
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A ++D ++YK GVYQH GE GGHA++I+GWGVE+G YWL N
Sbjct: 240 IMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG G FKI RG D IES
Sbjct: 300 SWNTDWGITGFFKILRGEDHCGIES 324
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI+ +G + + D + YK GVY H G + GGHAV I+GWGVED V YWL N
Sbjct: 188 IQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDEVPYWLVQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SWG WG+ G FKI RG+D ES V+AG
Sbjct: 248 SWGTDWGENGFFKILRGSDHCECES-NVTAG 277
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK GVY+H +GE GGHAVK++GWG +DGV YW N
Sbjct: 246 LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGTQDGVPYWKVAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG +E +IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNNECKIESGGVA 334
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 56/88 (63%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA ++D + YK GVYQH G+ GGHAVKI+GWG ++G YW+ NSW
Sbjct: 256 EIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWGEDNGTPYWIVANSWN 315
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WG+ G F I RG DE IES Q+ AG
Sbjct: 316 PDWGNQGFFNILRGKDECGIES-QIVAG 342
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWGVE+ V YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGVENSVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD GLFKI RG D IES ++ AG
Sbjct: 299 SWNVDWGDNGLFKILRGEDHCGIES-EIVAG 328
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK+IGWG++DG+ YW N
Sbjct: 265 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 324
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 325 SWNTDWGEDGFFRILRGVDECGIES 349
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK+IGWG++DG+ YW N
Sbjct: 266 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 325
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 326 SWNTDWGEDGFFRILRGVDECGIES 350
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 62/94 (65%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A ++D + YK G+Y+H G++ HA++IIGWGVE+ YWL N
Sbjct: 226 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGVENNTPYWLIPN 285
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG E IES +V+AGR++
Sbjct: 286 SWNEDWGENGNFRILRGRHECSIES-EVTAGRIN 318
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A E ++D + Y GVY HT G++ GGHAVK+IGWG++DG+ YW N
Sbjct: 256 IQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTVAN 315
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 316 SWNTDWGEDGFFRILRGVDECGIES 340
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+Q EI+ G + A + + D Y GVY+HT GE+ GGHA++++GWGV EDG YWL
Sbjct: 248 IQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAA 307
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WG+ G F+I RG+D IES VSAG
Sbjct: 308 NSWNPSWGEKGFFRILRGSDHCGIES-DVSAG 338
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G AA+ + D + Y+ GVYQH G GGHAV+++GWGVEDG YWL N
Sbjct: 250 IQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGVEDGTPYWLLAN 309
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G F+I RG DE IES
Sbjct: 310 SWNYDWGDNGYFRILRGQDECGIES 334
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+Q EI+ G + A + + D Y GVY+HT GE+ GGHA++++GWGV EDG YWL
Sbjct: 248 IQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAA 307
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WG+ G F+I RG+D IES VSAG
Sbjct: 308 NSWNPSWGEKGFFRILRGSDHCGIES-DVSAG 338
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + + +QDL +YK GVYQH VG G HAV++IGWG E GV YWL N
Sbjct: 239 IQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVPYWLIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
S+GE WG+ G FK RG++ IES ++
Sbjct: 299 SYGEDWGEHGYFKFLRGSNHLGIESVVIAG 328
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSW 62
E++ G + + ++D YK GVY+H G++ GGHAVK+IGWG EDG YWL N W
Sbjct: 247 ELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQW 306
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLE 102
WGD G FKIRRGTDE IE V+ R+ + +L+
Sbjct: 307 NRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELD 346
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + + +QDL +YK GVYQH VG G HAV++IGWG E GV YWL N
Sbjct: 239 IQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVPYWLIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
S+GE WG+ G FK RG++ IES ++
Sbjct: 299 SYGEDWGEHGYFKFLRGSNHLGIESVVIAG 328
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 61/91 (67%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G +VA ++D + YKKG+Y++T G GGHAV+I+GWGVE+ VKYW+ N
Sbjct: 251 IQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVKYWIIAN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G F++ RG ++ IE VSAG
Sbjct: 311 SWNTDWGEDGFFRMVRGINDCGIEE-SVSAG 340
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 222 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 281
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG + IES
Sbjct: 282 SWNADWGDNGFFKILRGENHCGIES 306
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG + IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DLI+YK GVYQH G GGHA++I+GWGVE+ YWL N
Sbjct: 251 IQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRILGWGVENKTPYWLIAN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FK+ RG D IES ++AG
Sbjct: 311 SWNTDWGNNGFFKMLRGEDHCGIES-AIAAG 340
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 54/82 (65%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G +VA ++D Y G+YQ T GGHA++IIGWG E+G+ YWL NSW
Sbjct: 423 EIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIGWGEENGIPYWLIANSWN 482
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
+G+ G F+IRRGT+E RIES
Sbjct: 483 TTFGEKGFFRIRRGTNECRIES 504
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 56/85 (65%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
MQ +I +G I ++ + + D I YK GVY + GGH+VK IGWGVE V YWL +
Sbjct: 237 MQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMM 296
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSW WGDGG FKIRRGT+E ++E
Sbjct: 297 NSWNSTWGDGGYFKIRRGTNECQVE 321
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D + YK GVYQ GGHA++++GWGVE+GV YWL N
Sbjct: 280 IQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGVENGVPYWLAAN 339
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG+DE IES
Sbjct: 340 SWNTEWGDKGFFKILRGSDECGIES 364
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D Y GVY HT G GGHAVK++GWGV++G YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ G F+I RG +E IES
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIES 342
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 53/78 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +I+ G + AA + D + YK GVY +T G++ GGHA+KI+GWGV+D KYWLC N
Sbjct: 133 IMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDNTKYWLCAN 192
Query: 61 SWGELWGDGGLFKIRRGT 78
SW WG+ GLF+I RG
Sbjct: 193 SWSRSWGENGLFRILRGN 210
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D + YK GVYQ E GGHA++++GWGVE+GV YWL N
Sbjct: 253 IQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVENGVPYWLAAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG+DE IE
Sbjct: 313 SWNTEWGDKGFFKILRGSDECGIE 336
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 59/91 (64%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A A+ D + YK GVYQH ++ G HA++I+GWG ED YWL N
Sbjct: 129 IRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNPYWLLAN 188
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FK+ RG +E IESF V+AG
Sbjct: 189 SWNEDWGDHGYFKMLRGVNECDIESF-VNAG 218
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 236 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 295
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 296 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 327
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D Y GVY HT G GGHAVK++GWGV++G YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ G F+I RG +E IES
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIES 342
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A+ + + D + YK GVYQH G+ GGHAVKIIGWGV+ YW+ N
Sbjct: 373 IMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVAN 432
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F I RG+DE IE
Sbjct: 433 SWNNDWGNNGFFNILRGSDECGIE 456
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 166 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 225
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG + IES
Sbjct: 226 SWNADWGDNGFFKILRGENHCGIES 250
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY+ T GGHAVK+IGWGVE+G YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNAQWGDKGLFKIRRGTNECGIDN 330
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A IE ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + D + YK GVYQ + GGHA++I+GWG E+GV YWL N
Sbjct: 242 IKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGTENGVPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKI RG DE IE ++AG
Sbjct: 302 SWNEDWGDKGYFKILRGNDECGIED-DINAG 331
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 54/85 (63%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G + A ++D YK GVY H G GGHA++++GWG E+G KYWLC NSW
Sbjct: 257 EIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWGEENGEKYWLCGNSWN 316
Query: 64 ELWGDGGLFKIRRGTDESRIESFQV 88
WG+ G FKI+RG +E IES V
Sbjct: 317 TDWGNNGFFKIKRGVNECGIESEMV 341
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY HT GE GGHA++I+GWG ++G YWL N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGTDNGTPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F+I RGT+E IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 40/77 (51%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI + G + AA + D Y+ GVY+HT G + GGHA+ I+GWG E G YWL N
Sbjct: 240 IQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGTESGSPYWLVKN 299
Query: 61 SWGELWGDGGLFKIRRG 77
SW WGDGG FKI RG
Sbjct: 300 SWNPSWGDGGFFKILRG 316
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q++I G + AA + D + YK GVY+H G + GGHA++I+GWG+E G YWL N
Sbjct: 239 IQMDIMTNGPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGMEKGTPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG+D IE
Sbjct: 299 SWNTDWGDNGTFKILRGSDHCGIE 322
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 160 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 219
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG + IES
Sbjct: 220 SWNADWGDNGFFKILRGENHCGIES 244
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 59/98 (60%), Gaps = 1/98 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D YK G+Y H G+ GGHAVKI+GWGVE+G KYWL N
Sbjct: 240 IQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKILGWGVENGTKYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS 98
SW WG+ G F+I RG +E IES V AG D R+
Sbjct: 300 SWNINWGEKGYFRILRGRNECGIES-AVVAGIPDLTRN 336
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + A+ ++D +YK G+Y+HT GE+ G HAVK+IGWG E+ YWL N
Sbjct: 245 IQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTDYWLIAN 304
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW + WG+ G F+I RG ++ IE V+AG +D
Sbjct: 305 SWHDDWGENGYFRIIRGINDCGIEE-NVAAGLID 337
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 59/88 (67%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA E ++D YK+GVY H+ GE GGHA++I+GWG E+G YWL NSW
Sbjct: 241 EIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWGEENGTPYWLIANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G FK+ RG +E IE +V+AG
Sbjct: 301 EGWGEDGYFKMLRGKNECGIED-EVTAG 327
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++D + Y G+Y HT G++ GGHAVK+IGWG++ G YWL N
Sbjct: 282 IQKEILTHGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGIDQGTPYWLIAN 341
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 342 SWNTDWGEEGFFRILRGVDECGIES 366
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG + IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG + IES ++ AG
Sbjct: 299 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 328
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 65/93 (69%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G++ A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 216 IQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 275
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 276 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 307
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++DLI+YK GVY+H G+ GGHA++IIGWGVE YWL N
Sbjct: 247 IQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKDTPYWLIAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G FKI RG D IES +SAG
Sbjct: 307 SWNTDWGNNGFFKILRGKDHCGIES-SISAG 336
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 56/85 (65%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
MQ +I +G I ++ + + D I YK GVY + GGH+VK IGWGVE V YWL +
Sbjct: 237 MQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMM 296
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSW WGDGG FKIRRGT+E ++E
Sbjct: 297 NSWNNTWGDGGNFKIRRGTNECQVE 321
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 53/85 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M E++ G I A + D + YK GVY H G ++GGHAV +GWGVED YWLC N
Sbjct: 188 MMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G FKI RG++ IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 58/89 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK GVY+H +G+ GGHAVK++GWG +DGV YW N
Sbjct: 246 LMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGTQDGVPYWKVAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG +E +IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGNNECKIESGGVA 334
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
Q EI G +V A ++D YKKG+Y+HT G+ GGHA+KIIGWG E GV YWL N
Sbjct: 162 TQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKEGGVPYWLIAN 221
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I G++ IE V AG V
Sbjct: 222 SWHNDWGENGYFRILCGSNHCGIEE-NVVAGHV 253
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 53/85 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M E++ G I A + D + YK GVY H G ++GGHAV +GWGVED YWLC N
Sbjct: 188 MMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGVEDNTPYWLCQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G FKI RG++ IE+
Sbjct: 248 SWGPAWGEKGHFKILRGSNHCGIEN 272
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 58/92 (63%), Gaps = 1/92 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI + G + + ++D Y G+Y+H GE G HAVK++GWG E+GV YW+C N
Sbjct: 254 IQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGTENGVDYWICAN 313
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
SW WG+ G F+I RG +E IES V AG+
Sbjct: 314 SWNSDWGENGFFRILRGENECGIES-NVVAGK 344
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 58/95 (61%), Gaps = 1/95 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA H D + YK G+Y++ G GGHAV+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
SW E WG+ G F+I RG DE IES +V+ G R
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGGLPHR 342
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 216 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 275
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 276 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 307
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A D + YK GVYQH ++ GGHA++I+GWG E+G YWL N
Sbjct: 243 IQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGTENGTPYWLAAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKI RG DE IE ++AG
Sbjct: 303 SWNEDWGDHGYFKILRGKDECGIEE-DINAG 332
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 63/93 (67%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 FQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIES-EIAAGLI 340
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 59/89 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A +E ++D Y G+YQHT G GGHAVKIIGWG E+ V YW+ N
Sbjct: 228 IQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAAN 287
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SWG +G+ G F+I RG++ + IES+ V+
Sbjct: 288 SWGTGFGEDGFFRILRGSNCAGIESYIVA 316
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 58/97 (59%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G I A ++D Y GVY HT G+ GGHAVKI+GWGV++G YWL N
Sbjct: 107 IQTEILAHGPIEVAFTVYEDFYQYTTGVYVHTAGKSLGGHAVKILGWGVDNGTPYWLVAN 166
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG+ G F+I RG +E IE V AG D DR
Sbjct: 167 SWNVNWGEKGYFRIIRGLNECGIEHSAV-AGLPDLDR 202
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 58/93 (62%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + AA ++D Y +G+Y+H G GGHAV+I+GWG E G YWL N
Sbjct: 253 IQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWGEEKGTAYWLVAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F+I RG++E IE V AGRV
Sbjct: 313 SWNTDWGENGYFRILRGSNECGIEE-NVVAGRV 344
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSW 62
E++ G + + ++D YK GVY+H G++ GGHAVK+IGWG EDG YWL N W
Sbjct: 247 EVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQW 306
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLE 102
WGD G FKIRRGT+E IE V+ R+ + +L+
Sbjct: 307 NRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELD 346
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY HT G+ GGHA++I+GWG ++G YWL N
Sbjct: 242 IQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F+I RGT+E IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 56/84 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI++ G + + +D YK GVY + G+++G HAVKIIGWG E+ V YWL N
Sbjct: 260 IQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGTENKVDYWLVAN 319
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG +G+ G FKIRRGT+E IE
Sbjct: 320 SWGTDFGEKGFFKIRRGTNECGIE 343
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 171 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 230
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG + IES
Sbjct: 231 SWNVDWGDNGFFKILRGENHCGIES 255
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GE GGHA++I+GWGV++G YWL N
Sbjct: 230 IMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWGVDNGTPYWLAAN 289
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG+ G F+I RG D IES ++ AG
Sbjct: 290 SWNTDWGEDGFFRILRGQDHCGIES-EIVAG 319
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G+M GGHA++I+ WGVE+GV YWL N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG + IES ++ AG D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
Q +I +G + AA + ++D + K G+ +H G + GGH ++IIGWGVE G YWL N
Sbjct: 249 FQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ GLF++ RG DE IES V+
Sbjct: 309 SWNEDWGENGLFRMVRGRDECSIESHVVA 337
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/80 (48%), Positives = 52/80 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ E + D + Y G+Y+H G + GGHAVKI+GWG++ GV YWL N
Sbjct: 237 IQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 296
Query: 61 SWGELWGDGGLFKIRRGTDE 80
SW WG+ G F+I RG DE
Sbjct: 297 SWNNDWGEDGYFRILRGADE 316
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D +Y GVY HT G GGHAVK++GWGV++G YWLC N
Sbjct: 258 IQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WG+ G F+I RG +E IE
Sbjct: 318 SWNEDWGENGYFRIIRGVNECGIE 341
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 93.6 bits (231), Expect = 3e-17, Method: Composition-based stats.
Identities = 47/107 (43%), Positives = 61/107 (57%), Gaps = 13/107 (12%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH--------TVGEMSGGHAVKIIGWGVEDG--- 52
EI G + A H+D +YK GVYQH SG H+V+I+GWGV+
Sbjct: 450 EIMANGPVQATFLVHEDFFMYKSGVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGV 509
Query: 53 -VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDR 97
+KYWLC NSWGE WG+ GLF+I RG + IESF + A G+ + R
Sbjct: 510 PIKYWLCANSWGEEWGENGLFRILRGENHCDIESFIIGAWGKGSKKR 556
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 55/90 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + AA + D + K GVY+H G + GGHA++I+GWGVE G YWL N
Sbjct: 240 IQLEIMDNGPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGVEKGTPYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WGD G FKI RG+D IE V+
Sbjct: 300 SWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 58/99 (58%), Gaps = 1/99 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + + D YK GVY H G GGHAVK++GWGV++G YWL N
Sbjct: 246 IQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGVDNGTPYWLAAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
SW WG+ G F+I RG +E IES QV AG D +R +
Sbjct: 306 SWNTNWGENGYFRILRGVNECGIES-QVVAGMPDLERHN 343
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ EI+ G + A ++D YK GVY+H G M GGHAVK+IGWG EDG YWL
Sbjct: 245 IMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKIRRGT+E IE
Sbjct: 305 NQWNRGWGDDGYFKIRRGTNECGIE 329
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY HT G+ GGHA++I+GWG ++G YWL N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F+I RGT+E IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 51/84 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G + AA + D YK GVYQH G GGHAVK+IGWG+E YWL N
Sbjct: 238 IQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGMEGSTPYWLIAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG DE IE
Sbjct: 298 SWNSDWGDMGFFKILRGQDECGIE 321
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA ++D YK GVY HT G+ GGHA++I+GWG ++G YWL N
Sbjct: 242 IQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F+I RGT+E IE
Sbjct: 302 SWNVNWGENGYFRIIRGTNECGIE 325
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+ G + A ++D ++YK GVYQH G GGHA+K++GWG E G YWL N
Sbjct: 238 IMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWGEEGGTPYWLAAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WG+ G FKI RG D IES V+
Sbjct: 298 SWNTDWGENGFFKILRGKDHCGIESEMVAG 327
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA H D + YK G+Y++ G GGHAV+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F+I RG DE IES +V+ G
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGG 338
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 57/85 (67%), Gaps = 1/85 (1%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G +VA ++D YKKG+Y H G+ G HA+KIIGWGVE+G+ YWL NSW + WG+
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGVENGLPYWLIANSWHDDWGE 319
Query: 69 GGLFKIRRGTDESRIESFQVSAGRV 93
GLF+I RG +E IE +V AG V
Sbjct: 320 QGLFRIVRGINECGIEQ-EVVAGHV 343
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 63/93 (67%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G +VA ++D YK G+Y+HT G M+GGHAVKIIGWG E G YWL N
Sbjct: 83 IQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIAN 142
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW + WG+ G +++ RG + RIE
Sbjct: 143 SWHDDWGEKGFYRMIRGINNCRIE 166
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/90 (47%), Positives = 58/90 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++D + YK GVYQHT G+ GGH VK+IGWG ++ YW+C N
Sbjct: 213 IQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGTQNNELYWICNN 272
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SW WG+ G+F I+ G +E IES V+A
Sbjct: 273 SWTTYWGNQGVFWIKAGVNECGIESDVVAA 302
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 59/102 (57%), Gaps = 1/102 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 247 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLLA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
N W WGD G FKIRRGT+E IE V+ DR+ D+
Sbjct: 307 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDV 348
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 54/87 (62%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G + A ++D ++YK GVY+H G GGHA+K++GWG E G+ YWL NSW
Sbjct: 241 ELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWGEEGGIPYWLAANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSA 90
WG+ G FKI RG D IES V+
Sbjct: 301 TDWGENGFFKIVRGEDHCGIESEMVAG 327
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 56/88 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G +VAA +QD YKKG+Y H G +G HAVK++GWG E+ YWL N
Sbjct: 250 IRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWGRENATDYWLIAN 309
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQV 88
SW WG+ G F+I RGT+E IE+ V
Sbjct: 310 SWNTDWGESGYFRIVRGTNECGIEAQMV 337
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IGWGVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E I+S +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIDS-EIAAGLI 340
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA H D + YK G+Y++ G GGHAV+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F+I RG DE IES +V+ G
Sbjct: 309 SWNEDWGEKGYFRILRGKDECGIES-EVTGG 338
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/89 (52%), Positives = 56/89 (62%), Gaps = 1/89 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + AA + D + Y GVY+HT G GGHAVKI+G+GVE+G KYWL NSW
Sbjct: 119 ELVTRGPVEAAFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGVENGDKYWLVANSWN 178
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
WGD G FKI RG DE IE Q+ AG
Sbjct: 179 PDWGDQGFFKILRGVDECGIEG-QIVAGE 206
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 55/88 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G +VA + ++D YKKGVY H GE++G HAVKIIGWG + V YWL N
Sbjct: 255 IKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGNDVPYWLVAN 314
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQV 88
SW WGD G F+I RGTD IE V
Sbjct: 315 SWNTDWGDNGYFRIVRGTDNCEIERQMV 342
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 50/84 (59%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G + A + D YK GVYQH G GGHA++++GWGVEDG YWL N
Sbjct: 280 IMTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGVEDGTPYWLVAN 339
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG +E IE
Sbjct: 340 SWNSDWGDNGFFKILRGQNECGIE 363
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +VA+ ++D YK G+Y+HT GE+ G HAVK+IGWG E+ +WL N
Sbjct: 246 IQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW WG+ G F+I RGT++ IE ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIIRGTNDCGIEG-TIAAGIVD 338
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +VA+ ++D YK G+Y+HT GE+ G HAVK+IGWG E+ +WL N
Sbjct: 246 IQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW WG+ G F+I RGT++ IE ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIIRGTNDCGIEG-TIAAGIVD 338
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G + A + D +YK GVYQ + GGHA++++GWGVE GV YWL N
Sbjct: 248 IQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGVEKGVPYWLAAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG+DE IE
Sbjct: 308 SWNTEWGDKGFFKILRGSDECGIE 331
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + E +D Y GVY+H VG+ G HA++I+GWG E+G YWL N
Sbjct: 246 IQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGTENGTPYWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
S+G+ WGD G FK+ RG++ IES
Sbjct: 306 SYGDTWGDKGFFKMLRGSNHLGIES 330
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ LE+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL N
Sbjct: 252 IMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIAN 311
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG +E IES V+AG
Sbjct: 312 SWNSDWGDKGYFKIVRGKNECGIES-DVNAG 341
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ E + D + Y G+Y+H G + GGHAVKI+GWG++ GV YWL N
Sbjct: 281 IQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 340
Query: 61 SWGELWGD---GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
SW WG+ G F+I RG DE IES + AG +D S
Sbjct: 341 SWNNDWGEDVFSGYFRILRGADECGIES-GIVAGIPRKDARS 381
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G++ GGHA++I+GWG+E+GV YWL N
Sbjct: 30 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVAN 89
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WGD G FKI RG + IES ++ AG
Sbjct: 90 SWNVDWGDNGFFKILRGENHCGIES-EIVAG 119
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E++ G + A + DL+ YK GVY+H G+ GGHA+KI+GWGVE+ KYWL N
Sbjct: 241 IRAELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNKYWLVAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG + IE
Sbjct: 301 SWNTDWGDNGFFKILRGENHCGIE 324
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY T GGHAVK+IGWG E GV YWL V
Sbjct: 240 IQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPYWLMV 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 300 NSWNDQWGDRGLFKIRRGTNECGIDN 325
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 55/90 (61%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G A + ++D + YK G+YQH G++ G VK+IGWGV GV+YWL NSWG
Sbjct: 251 ELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYWLAANSWG 310
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
WGD G FKIRRG +E E + +S V
Sbjct: 311 TSWGDKGFFKIRRGYNECLFEDYFISGRPV 340
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 50/85 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI G + AA + D YK GVY+H G GGHA+K +GWG EDG YWL N
Sbjct: 243 IMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWGNEDGKDYWLVAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG DE IES
Sbjct: 303 SWNPDWGDNGFFKILRGRDECGIES 327
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY T GGHAVK+IGWG E GV YWL V
Sbjct: 241 IQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWGEEYGVPYWLLV 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 301 NSWNDQWGDQGLFKIRRGTNECGIDN 326
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G I A ++D Y GVY HT G GGHAVKI+GWGV++G YWL N
Sbjct: 247 IQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG+ G F+I RG +E IE V AG D DR
Sbjct: 307 SWNINWGEKGYFRIIRGLNECGIEHSAV-AGIPDLDR 342
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/95 (50%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+Q EI G + AA ++D Y+ G+Y HT G GGHAVK+IGWGV +DG KYWL
Sbjct: 256 IQREIMTNGPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWLAA 315
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSW WG+ G F+I RG D IES V AG D
Sbjct: 316 NSWNSDWGENGYFRIVRGVDHCGIES-AVVAGMPD 349
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G I A ++D Y GVY HT G GGHAVKI+GWGV++G YWL N
Sbjct: 247 IQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG+ G F+I RG +E IE V AG D DR
Sbjct: 307 SWNINWGEKGYFRIIRGLNECGIEHSAV-AGIPDLDR 342
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G +VAA + +QD Y+ G+Y H G +G HAVK++GWG E+G YWL N
Sbjct: 247 IREEIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWGRENGTDYWLIAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG+ G F+I RG++E IE VS
Sbjct: 307 SWNTDWGENGYFRIARGSNECGIEGQMVS 335
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWGEEHGVLYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW E WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNEGWGDNGLFKIRRGTNECGIDN 330
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 64/103 (62%), Gaps = 9/103 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDG 52
+Q+EI G + A+ E +D +Y GVY+HT S H+VK++GWGVE+G
Sbjct: 318 IQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGVENG 377
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVD 94
+KYWL NSWG WG+ G FKI RG +E IES+ V+ G+VD
Sbjct: 378 IKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVAVWGKVD 420
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 57/92 (61%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G+ GGHAVK+IGWG EDG YWL
Sbjct: 10 IMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSEDGEDYWLLA 69
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKIRRGT+E IE +V AG
Sbjct: 70 NQWNRGWGDDGYFKIRRGTNECDIED-EVVAG 100
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 58/95 (61%), Gaps = 3/95 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
+Q ++ +GS+ AA D + Y GVY H G GGHAVK+IGWG ++ G YWL
Sbjct: 464 IQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLI 523
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
NSW WG+GGLF+I RG +E IE Q+ AG V
Sbjct: 524 ANSWNPSWGEGGLFRILRGVNECGIEG-QIVAGEV 557
>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
Length = 125
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 54/81 (66%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
+I+ G +VA + D + YK GVYQH G+ G HAV+++GWGVE+ + YWL NSW
Sbjct: 32 QIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 91
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
+ WGD G FKI RG +E+ IE
Sbjct: 92 DHWGDHGTFKILRGENEADIE 112
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 55/85 (64%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + A ++D Y+ GVY HT G M GGH++KIIGWGV+ GVKYWL N
Sbjct: 256 IQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315
Query: 61 SWGELWG-DGGLFKIRRGTDESRIE 84
SW WG DGG F++ RG + IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
++ EI+ G + A ++D I Y+ GVY+H G+ GGHA++I+GWGV++G + YWL
Sbjct: 247 IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WG G FKI RG+DE IE Q++AG
Sbjct: 307 NSWNSDWGSDGFFKILRGSDECGIEG-QINAG 337
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+GV YWL NSW
Sbjct: 253 EVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANSWN 312
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 313 SDWGDNGYFKIIRGRNECGIES-DVNAG 339
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF+ G +VA ++D YK G+Y +G +G HAVKIIGWG E+GVKYWL N
Sbjct: 254 IKREIFNNGPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWGEENGVKYWLIAN 313
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G F++ RGT+ IE
Sbjct: 314 SWNTDWGENGFFRMLRGTNLCDIE 337
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q +I +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNDQWGDQGLFKIRRGTNECGIDN 330
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/88 (52%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + D YK GVY+HT G GGHA+KI+GWG E G YWL NSW
Sbjct: 275 EIMTNGPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGTEGGDDYWLVANSWN 334
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WG+ G FKI RG DE IES Q++AG
Sbjct: 335 PDWGNQGTFKILRGRDECGIES-QIAAG 361
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+ G I ++D + YK G+YQH G+ GGHAVK++GWGVEDGV+YW N
Sbjct: 227 IKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGVEYWKIAN 286
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WG+ G F+I G +E IES V+
Sbjct: 287 SWNEDWGENGYFRIIAGKNECGIESDGVA 315
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 62/91 (68%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++L+I G + AA + ++D +YK+G+Y+H G +GGHAVKIIGWG ++G YWL N
Sbjct: 246 IRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTDYWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW + WG+ G F++ RG ++ IE ++AG
Sbjct: 306 SWSKDWGESGFFRMVRGENDCEIEDM-ITAG 335
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G + + ++D YK GVY++T G+ GGHAVK++GWG EDG YWL N
Sbjct: 240 IMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G FKI RG++E IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECGIE 323
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
++ EI+ G + A ++D I Y+ GVY+H G+ GGHA++I+GWGV++G + YWL
Sbjct: 249 IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 308
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WG G FKI RG+DE IE Q++AG
Sbjct: 309 NSWNTDWGSDGFFKILRGSDECGIEG-QINAG 339
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G + + ++D YK GVY++T G+ GGHAVK++GWG EDG YWL N
Sbjct: 240 IMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGTEDGTDYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G FKI RG++E IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECGIE 323
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q +I +G I A+ E + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNDQWGDQGLFKIRRGTNECGIDN 330
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 41/72 (56%), Positives = 49/72 (68%)
Query: 7 HFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELW 66
+F + A+ ++D + YK GVY+HT GE GGHAVKIIGWG E G YW+ VNSW E W
Sbjct: 745 NFDQVSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWNEDW 804
Query: 67 GDGGLFKIRRGT 78
GD GLFKI G
Sbjct: 805 GDHGLFKIALGN 816
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q +I +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 44/95 (46%), Positives = 59/95 (62%), Gaps = 2/95 (2%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCVNSW 62
EI G + ++D + YK GVY+H G GGHA++IIGWG++ + + YWLC NSW
Sbjct: 178 EILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQNHIPYWLCANSW 237
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
WGD G FKI RGT+E IES V+AG + +
Sbjct: 238 NNQWGDQGYFKILRGTNECGIESM-VTAGLPNLHK 271
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + A ++D Y GVY HT G M GGH++KIIGWGV+ GVKYWL N
Sbjct: 256 IQLEIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315
Query: 61 SWGELWG-DGGLFKIRRGTDESRIE 84
SW WG DGG F++ RG + IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q +I +G I A+ E + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 250 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 309
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
N W WGD G FKIRRGT+E IE V+ DR+
Sbjct: 310 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 346
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 1/102 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G I + E +D YK GVY+H G GGHAVK+IGWG +DGV YW V
Sbjct: 238 LMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTIV 297
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
NSW WG+ GLF+I RG +E IES+ V+ D+ S +
Sbjct: 298 NSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDKGLHSAM 339
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 56/90 (62%), Gaps = 1/90 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A+ + ++D + YK G+Y H G+ H VKIIGWG E+G YW VNSW
Sbjct: 231 EIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWGEENGEAYWKAVNSWN 290
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
WG+ GLF+IR GT+E IES QV G V
Sbjct: 291 SEWGENGLFRIRLGTNECTIES-QVEGGLV 319
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 56/85 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +VA+ ++D YK G+Y+HT GE+ G HAVKIIGWG E+ +WL N
Sbjct: 253 IQREIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWGKENNTDFWLIAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW + WG+ G F+I RG +E IE+
Sbjct: 313 SWHQDWGEKGYFRIVRGKNECGIET 337
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 38/73 (52%), Positives = 48/73 (65%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 239 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 298
Query: 61 SWGELWGDGGLFK 73
SW WGD G FK
Sbjct: 299 SWNTDWGDNGFFK 311
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
+ +G + + D + Y+ GVYQH G GGHAV + GWGVE+G+ YWL NSWG
Sbjct: 192 LMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGVENGLPYWLVQNSWGP 251
Query: 65 LWGDGGLFKIRRGTDESRIESF 86
WG+ G FKI RG++ IES+
Sbjct: 252 AWGEKGFFKILRGSNHCEIESY 273
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 248 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 307
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
N W WGD G FKIRRGT+E IE V+ DR+
Sbjct: 308 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 344
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV ++ V YWL
Sbjct: 246 IQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEEVPYWLI 305
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WGD G F+I RG D IES +SAG
Sbjct: 306 ANSWNDDWGDKGFFRILRGEDHCGIES-SISAG 337
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 58/97 (59%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ + A D + YK GVY+H G+M GGHA++I+GWGV +GV YWL N
Sbjct: 239 IMAEIYKNDPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVGNGVPYWLAAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG + IES ++ AG D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 56/92 (60%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG EDG YWL
Sbjct: 244 IMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLA 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKIRRGT+E IE V+AG
Sbjct: 304 NQWNREWGDDGYFKIRRGTNECGIEE-DVTAG 334
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 58/97 (59%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A D + YK GVY+H G+M GGHA++I+ WGVE+GV YW N
Sbjct: 239 IMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WGD G FKI RG + IES ++ AG D+
Sbjct: 299 SWNLDWGDNGFFKILRGENHCGIES-EIVAGIPRTDQ 334
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + A ++D Y GVY HT G M GGH++KIIGWGV+ GVKYWL N
Sbjct: 256 IQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGVDKGVKYWLIAN 315
Query: 61 SWGELWG-DGGLFKIRRGTDESRIE 84
SW WG DGG F++ RG + IE
Sbjct: 316 SWSTDWGEDGGYFRVVRGINNCDIE 340
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 181 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLA 240
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
N W WGD G FKIRRGT+E IE V+ DR+
Sbjct: 241 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRN 277
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + A E ++D YK GVY+H G GGHAVK+IGWG +DGV YW V
Sbjct: 246 LMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIV 305
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSW WG+ GLF+I RG DE IES V+
Sbjct: 306 NSWNTNWGEDGLFRIVRGNDECGIESNAVA 335
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV E V YWL
Sbjct: 243 IQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLI 302
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 303 GNSWNTDWGDNGFFRILRGQDHCGIES-SISAG 334
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 49/84 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + D Y GVY H G + GGHAVKI+GWGV+ YW+ N
Sbjct: 213 IQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGVDGTTPYWIVAN 272
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG WG G F I+RG DE IE
Sbjct: 273 SWGTSWGQAGFFWIKRGNDECGIE 296
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +VA+ ++D YK G+Y+HT GE+ G HAVK+IGWG E+ +WL N
Sbjct: 246 IQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNENNTDFWLIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW WG+ G F+I RG+++ IE ++AG VD
Sbjct: 306 SWHNDWGEKGYFRIVRGSNDCGIEG-TIAAGIVD 338
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY T GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL NSW
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA + D + YK G+Y+H G + GGHAV+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WG+ G F+I RG D IES V+AG
Sbjct: 309 SWNEDWGEKGYFRILRGKDVCGIES-AVTAG 338
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+GV YWL NSW
Sbjct: 253 EVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWGEENGVPYWLIANSWN 312
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 313 SDWGDNGYFKIIRGRNECGIES-DVNAG 339
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/79 (54%), Positives = 53/79 (67%), Gaps = 1/79 (1%)
Query: 2 QLEIFHFGSIVAAIEAHQDLIIYKKGVYQ-HTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
QLEIF G IVAA + ++D +YK GVY+ H G HAVK+IGWG ++G+ YWL N
Sbjct: 253 QLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLVQN 312
Query: 61 SWGELWGDGGLFKIRRGTD 79
SW WGD GLFKI RG +
Sbjct: 313 SWDYDWGDKGLFKIARGNE 331
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL NSW
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 50/84 (59%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G + AA + D YK GVYQH G GGHAVK+IGWG E YWL N
Sbjct: 238 IQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGTEGSTPYWLIAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G FKI RG DE IE
Sbjct: 298 SWNTDWGNMGFFKILRGQDECGIE 321
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI + G + A ++DLI+YK GVY+H G+ GGHA++I+GWGV ++ + YWL
Sbjct: 245 IQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 ANSWNTDWGDNGFFRIVRGKDHCGIES-SISAG 336
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 57/92 (61%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + + H DL +YK GVY T H K+IGWGVE+GV YWL V
Sbjct: 234 IQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVENGVDYWLLV 293
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSWG WG GLFKI+RGTDE +ES V AG
Sbjct: 294 NSWGYEWGQNGLFKIKRGTDECSVES-HVYAG 324
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNECGIDN 327
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G ++A+ + D YK G+Y HT G+ GG KIIGWGV++GV YWLCV+
Sbjct: 261 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 320
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
WG +G+ G +I RG +E IE QV A + D D+
Sbjct: 321 QWGTDFGENGFVRILRGVNEVNIEH-QVLAAQPDLDK 356
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/91 (48%), Positives = 55/91 (60%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + D YK GVYQ G HA++I+GWG E+GV YWL N
Sbjct: 242 IKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGTENGVPYWLAAN 301
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW E WGD G FKIRRG +E IE ++AG
Sbjct: 302 SWTEHWGDKGYFKIRRGNNECGIEE-DINAG 331
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A ++D + YK G+Y+H G++ HA++IIGWG E+ YWL N
Sbjct: 3 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWGEENNTPYWLIPN 62
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW E WG+ G F+I RG E IES +V+AGR++
Sbjct: 63 SWNEDWGENGNFRILRGRHECSIES-EVTAGRIN 95
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL NSW
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+QLEI G + A + ++D + YK GVY+H G+ HAVKI GWG E G YWL N
Sbjct: 274 IQLEIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAAN 333
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEF 104
W + WG+GG FKI RG++ + IE V AG D + E F
Sbjct: 334 PWSKRWGNGGFFKILRGSNHAEIED-HVMAGIPKLDLVDEEEHF 376
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/70 (60%), Positives = 48/70 (68%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + YK GVY+HT GE GGHAVKIIGWG E G YWL VNSW E WGD
Sbjct: 195 GPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGD 254
Query: 69 GGLFKIRRGT 78
GLFKI G
Sbjct: 255 HGLFKIALGN 264
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
+Q EI G +VA+ ++D +YK GVY+HT G + G HAVK++GWGV+ KYWL
Sbjct: 244 IQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLI 303
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
NSW WG+ G F+ RG ++ IE V+AG VD D
Sbjct: 304 ANSWHNDWGENGYFRFIRGINDCEIED-TVAAGIVDVD 340
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK G+Y+ T GGHAVK+IGWG + G+ YWL V
Sbjct: 101 IQKDVMTYGPIEASFDVYSDFPSYKSGIYERTENATYLGGHAVKLIGWGEQYGIPYWLMV 160
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW E WGD GLFKIRRGT+E +++
Sbjct: 161 NSWNEDWGDNGLFKIRRGTNECGVDN 186
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL NSW
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 315
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 316 TDWGDNGYFKIIRGKNECGIES-DVNAG 342
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A + +D + YK G+Y++T G G H V+IIGWG+E+G YWL N
Sbjct: 249 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/72 (59%), Positives = 49/72 (68%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G I A+ ++D + YK GVY+HT G GGHAVKIIGWG E+G YWL VNSW E WGD
Sbjct: 444 GPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 503
Query: 69 GGLFKIRRGTDE 80
GLFKI G E
Sbjct: 504 QGLFKIALGNCE 515
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY T GGHAVK+IGWG E GV YWL V
Sbjct: 240 IQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPYWLMV 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKIRRGT+E I++
Sbjct: 300 NSWNDQWGDKGLFKIRRGTNECGIDN 325
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+Q EI+ G + + ++D YK GVY+H GE+ GGHAVK IGWG +DG YW+
Sbjct: 241 IQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVA 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
NSW WG+ G F+I RG++E IES V+ + + SD+
Sbjct: 301 NSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 342
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 56/92 (60%), Gaps = 1/92 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +VAA D Y+KG+Y H G GGHAVKIIGWG E GV YW+ N
Sbjct: 251 IQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGTEHGVPYWIIAN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
SW WG+ G F++ RG ++ IE+ V AG+
Sbjct: 311 SWHSDWGEDGYFRMVRGINDCGIET-NVVAGK 341
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 51/85 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + AA + D YK GVY+ GGHAVK+IGWG EDG+ YWL N
Sbjct: 241 IQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIPYWLIAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WGD G FKI RG DE IES
Sbjct: 301 SWNSDWGDHGYFKIVRGQDECGIES 325
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDG-VKYWLCV 59
++ EI+ G + A ++D I Y+ GVY+H G+ GGHA++I+GWGV++G + YWL
Sbjct: 89 IRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVA 148
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WG G FKI RG+DE IE Q++AG
Sbjct: 149 NSWNTDWGSDGFFKILRGSDECGIEG-QINAG 179
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 51/80 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D Y GVY HT G GGHAVK++GWGV++G YWLC N
Sbjct: 258 IQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCAN 317
Query: 61 SWGELWGDGGLFKIRRGTDE 80
SW E WG+ G F+I RG +E
Sbjct: 318 SWNEDWGENGYFRIIRGVNE 337
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/92 (48%), Positives = 56/92 (60%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKIRRGT+E IE V+AG
Sbjct: 305 NQWNREWGDDGYFKIRRGTNECGIEE-DVTAG 335
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 54/87 (62%), Gaps = 1/87 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSW 62
E++ G + + ++D YK GVY+H GE+ GGHAVK+IGWG D G YWL N W
Sbjct: 265 EVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQW 324
Query: 63 GELWGDGGLFKIRRGTDESRIESFQVS 89
WGD G FKIRRGT+E IE V+
Sbjct: 325 NRGWGDDGYFKIRRGTNECGIEDDAVA 351
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 51/84 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + + D + YK GVYQH G GGHAV +IGWGVEDGV YWL N
Sbjct: 188 IQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLIGWGVEDGVPYWLLQN 247
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG WG+ G FKI RG +E E
Sbjct: 248 SWGPAWGEKGHFKIIRGKNECGCE 271
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 51/77 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G AA ++D + YK GVY+HT G GGHAV+IIGWG E GV YWL +N
Sbjct: 273 IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 332
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 333 SWNEEWGDHGTFKIVQG 349
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV E+ + YWL
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGVWGEEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 GNSWNTDWGDNGFFRILRGQDHCGIES-SISAG 336
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/121 (42%), Positives = 70/121 (57%), Gaps = 17/121 (14%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGE-------MSGGHAVKIIGWGVEDG--- 52
EI+ G + A I +D +Y+ GVY+HT + E SG H+V+I+GWGV+
Sbjct: 345 EIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYR 404
Query: 53 -VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV-----SAGRVDRDRSSDLEEFEY 106
+KYWLC NSWG WG+ G F+I RG DES+IESF + S R +++ E EY
Sbjct: 405 PIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLAVWGRSYASYYRQQAAQQREREY 464
Query: 107 D 107
D
Sbjct: 465 D 465
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A + +D + YK G+Y++T G G H V+IIGWG+E+G YWL N
Sbjct: 249 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G GGHA++I+GWGV ++ V YWL
Sbjct: 243 IQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWLI 302
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 303 GNSWNTDWGDNGFFRILRGEDHCGIES-AISAG 334
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 70/124 (56%), Gaps = 8/124 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + H+D +YK GVY++ G M G H +K+IGWGVE G +YWL VN
Sbjct: 210 IKQEIFDNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVN 269
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIESSSDTKR 120
SW E WGD G K+ G + ES Q +V R ++L+E ES + T++
Sbjct: 270 SWNEEWGDQGKIKLAVGKNALDEESRQ----QVPRRAVNELDE----DAMMAESGAKTQK 321
Query: 121 AFCR 124
A +
Sbjct: 322 AMAQ 325
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 53/81 (65%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ + G + A E +D + YK G+YQH G+ G H V ++GWG E+GV YWL NSWG
Sbjct: 185 EVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQNSWG 244
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+IRRGT++ I+
Sbjct: 245 SGWGEKGFFRIRRGTNDCHID 265
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 55/86 (63%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY + GGHAVK+IGWG EDG YWL V
Sbjct: 242 IQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWGEEDGTPYWLMV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD G FKIRRGT+E +++
Sbjct: 302 NSWNTQWGDNGFFKIRRGTNECGVDN 327
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/97 (48%), Positives = 56/97 (57%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G I A ++D Y GVY HT G GGHAVKI+GWGV++G YWL N
Sbjct: 247 IQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG+ G F+I RG +E IE V AG D R
Sbjct: 307 SWNVAWGEKGYFRIIRGLNECGIEHSAV-AGIPDLAR 342
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL NSW
Sbjct: 241 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
+ WG+ G FK+ RG +E IE +V+AG
Sbjct: 301 DGWGEDGYFKMLRGKNECGIED-EVTAG 327
>gi|302848309|ref|XP_002955687.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
nagariensis]
gi|300259096|gb|EFJ43327.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
nagariensis]
Length = 846
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLII-YKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
M EI+H G I I D YK G+Y+ T G+ H V+++GWGVEDGVKYW+
Sbjct: 695 MMSEIYHRGPITCGIACPDDFTWHYKGGIYKDTSGDTELDHDVEVVGWGVEDGVKYWVVR 754
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWG WG+ G F++ RG + +IES
Sbjct: 755 NSWGTYWGEMGFFRVERGVNALQIES 780
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A + +D + YK G+Y++T G G H V+IIGWG+E+G YWL N
Sbjct: 249 IQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES V AGR+
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECSIESV-VVAGRL 340
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL NSW
Sbjct: 241 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
+ WG+ G FK+ RG +E IE +V+AG
Sbjct: 301 DGWGEDGCFKMLRGKNECGIED-EVTAG 327
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 60/88 (68%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + AA + ++D + YK GVY H+ G + GGHA++I+GWG E+GV YWL NSW
Sbjct: 132 EIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWGEENGVAYWLIANSWN 191
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
+ WG+ G FK+ RG +E IE +V+AG
Sbjct: 192 DGWGEDGYFKMLRGKNECGIED-EVTAG 218
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 66/105 (62%), Gaps = 3/105 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA ++D + YK GVY+H G GGHAVKIIGWG + +YWL +N
Sbjct: 289 IKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQNEQYWLVMN 348
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFE 105
SW WGD G+FKI G E I+S +V+AG +R+S +E+ E
Sbjct: 349 SWNVNWGDQGIFKIAIG--ECGIDS-EVTAGIPKYERTSGVEQSE 390
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV E+ + YWL
Sbjct: 235 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLI 294
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 295 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 326
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV E+ + YWL
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 51/77 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G AA ++D + YK GVY+HT G GGHAV+IIGWG E GV YWL +N
Sbjct: 42 IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 101
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 102 SWNEEWGDHGTFKIVQG 118
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 53/81 (65%), Gaps = 1/81 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDE 80
NSW + WGD GLFKIRRGT+E
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNE 322
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G ++A+ ++D YK G+Y HT G+ GG KIIGWGV++GV YWLCV+
Sbjct: 258 IQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 317
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
WG +G+ G +I RG +E IE QV A D D+
Sbjct: 318 QWGTDFGENGFVRILRGVNEVNIE-HQVLAALPDVDK 353
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI +G + A + +D + YK G+Y++T G G H V+IIGWG+E+G YWL N
Sbjct: 223 IQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENGTAYWLAAN 282
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E +ES V AGR+
Sbjct: 283 TWNEDWGEKGYFRIVRGRNECSVESV-VVAGRL 314
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 53/81 (65%), Gaps = 1/81 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ E + D YK GVY GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDE 80
NSW + WGD GLFKIRRGT+E
Sbjct: 302 NSWNDQWGDQGLFKIRRGTNE 322
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+Q EI+ G + + ++D YK GVY+H G++ GGHAVK IGWG +DG YW+
Sbjct: 230 IQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVA 289
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
NSW WG+ G F+I RG++E IES V+ + + SD+
Sbjct: 290 NSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 331
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK GVY+H G++ GGHAVK++GWG + GV YW N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG++E IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK GVY+H G++ GGHAVK++GWG + GV YW N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG++E IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 3/88 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ E + D + Y G+Y+H G + GGHAVKI+GWG++ GV YWL N
Sbjct: 296 IQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGIDQGVSYWLAAN 355
Query: 61 SWGELWGD---GGLFKIRRGTDESRIES 85
SW WG+ G F+I RG DE IES
Sbjct: 356 SWNTDWGEDVFSGYFRILRGVDECGIES 383
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 63/95 (66%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+Q EI +G + A + +++ + YK+GVY+ T GE+ G H VK+IGWGV E G++YWL +
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAM 286
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSW WG+ GLFKI RG + IE V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGLVD 320
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 57/89 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK GVY+H G++ GGHAVK++GWG + GV YW N
Sbjct: 251 LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 310
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG++E IES V+
Sbjct: 311 SWNTDWGDKGYFLIQRGSNECGIESGGVA 339
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A ++D + YK GVYQHT G+ GGH VKI G+G +GV YW NSW
Sbjct: 224 EISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTLNGVNYWSVANSWT 283
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G+F I+RG+DE IE +V AG
Sbjct: 284 TSWGDNGIFLIKRGSDECGIED-EVVAG 310
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 51/77 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G AA ++D + YK GVY+HT G GGHAV+IIGWG E GV YWL +N
Sbjct: 152 IKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMN 211
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 212 SWNEEWGDHGTFKIVQG 228
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G++ GGHAVK+IGWG +DG YWL
Sbjct: 245 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKIRRGT+E IE
Sbjct: 305 NQWNRGWGDDGYFKIRRGTNECGIE 329
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/91 (48%), Positives = 58/91 (63%), Gaps = 8/91 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-----DGVKY 55
EI G + A + +QD +YK G+Y+H+ SG H+V+IIGWG E +KY
Sbjct: 402 EILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKY 461
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
WL VNSWG WG+ GLFKI+RGT+E IES+
Sbjct: 462 WLVVNSWGYNWGENGLFKIQRGTNECEIESY 492
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 54/90 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q ++ G I A E + D + Y G+Y H G G +V+IIGWGV GV YWLC N
Sbjct: 241 IQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVWQGVPYWLCAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SWG WG+ G F++ RGT+E +ES VS
Sbjct: 301 SWGRQWGENGTFRVLRGTNECGLESNCVSG 330
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 89.7 bits (221), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G++ GGHAVK+IGWG +DG YWL
Sbjct: 246 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLA 305
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKIRRGT+E IE
Sbjct: 306 NQWNRGWGDDGYFKIRRGTNECGIE 330
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 56/90 (62%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E+F+ G + + ++D Y+ GVY+H G GGHAVK+IGWG +DG+ YWL
Sbjct: 238 IMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLIA 297
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSW WG+GG FKI RG +E IE V+
Sbjct: 298 NSWNTAWGEGGYFKIARGVNECGIERDPVA 327
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNADWGDNGLFKIRRGTNECGIDN 330
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E I++
Sbjct: 305 NSWNADWGDNGLFKIRRGTNECGIDN 330
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 57/85 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A + ++D+++YK GVY+H GE G HAV+IIGWG + G+ YWL N
Sbjct: 241 IRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWGRDGGIPYWLIAN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
S+G+ WGD G FK RG++ IES
Sbjct: 301 SYGDDWGDHGYFKFVRGSNHLGIES 325
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 55/85 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + + ++D+ +YK GVY+H GE G HAV+IIGWG E G+ YWL N
Sbjct: 246 IRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGREGGIPYWLISN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
S+GE WGD G FKI RG + IES
Sbjct: 306 SYGEDWGDHGYFKIVRGINHLGIES 330
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 63/95 (66%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
+Q EI +G + A + +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 286
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSW WG+ GLFKI RG + IE V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGIVD 320
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 61/94 (64%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y GVYQHT G +G H+VK++GWG E +GVKYW
Sbjct: 326 IMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYW 385
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 386 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 419
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ +I +G + A+ + + DL +YK G+Y+ + + GGH++KIIGWG EDG YWL V
Sbjct: 239 IEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WGD G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGRNECGIER-AVTAG 329
>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
Length = 488
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 55/99 (55%), Gaps = 9/99 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---------GEMSGGHAVKIIGWGVED 51
M E++H G + A E + D YK GVY H+ G HAV ++GWG E+
Sbjct: 385 MMYELYHGGPLAIAFEVYDDFFNYKGGVYTHSTALKTKIAEPGWEETNHAVLLVGWGEEN 444
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
GV YWL NSWG WG G FKI+RGTDE ES VSA
Sbjct: 445 GVPYWLVKNSWGTSWGINGFFKIKRGTDECDCESEAVSA 483
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 63/93 (67%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G + A +E ++D + YK G+Y++T G+ GHAV++IG GVE+G YWL N
Sbjct: 249 IQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGVENGTAYWLAAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
+W E WG+ G F+I RG +E IES +++AG +
Sbjct: 309 TWNEDWGEKGYFRIVRGRNECLIES-EIAAGLI 340
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+ G++ A ++D ++YK+GVY H G GGHAVK+IG+G EDG YWL VN
Sbjct: 406 IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVN 465
Query: 61 SWGELWGDGGLFKIRRGTDESRIES-FQVSAGRVDRDRSSDL 101
SW E WGD G FKI G E+ I+ F +V D+++ L
Sbjct: 466 SWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPNDKNASL 505
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 56/93 (60%), Gaps = 1/93 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G +V++ + D Y KG+Y+HT G+ G HA+KIIGWG E V YW+ N
Sbjct: 253 IQREIMRSGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGTEKNVPYWIIAN 312
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SW WG+ G F++ RGT+ IE V AG V
Sbjct: 313 SWHNDWGEKGFFRMVRGTNHCGIEE-DVVAGHV 344
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + QD +Y+ G+Y H G+ G HAV++IGWGVE+GV YWL NSW
Sbjct: 249 EIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWN 308
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G F++ RG +E IES +V AG
Sbjct: 309 EEWGENGYFRMVRGRNECGIES-EVVAG 335
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+ G++ A ++D ++YK+GVY H G GGHAVK+IG+G EDG YWL VN
Sbjct: 406 IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVN 465
Query: 61 SWGELWGDGGLFKIRRGTDESRIES-FQVSAGRVDRDRSSDL 101
SW E WGD G FKI G E+ I+ F +V D+++ L
Sbjct: 466 SWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPNDKNASL 505
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 245 IQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEESGVPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKI+RGT+E +++
Sbjct: 305 NSWNTDWGDKGLFKIQRGTNECGVDN 330
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 65/115 (56%), Gaps = 4/115 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + ++D YK GVY+H G GGHAVK+IGWG EDG YWL
Sbjct: 240 IMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLA 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIES 114
N W WGD G FKI RGT+E IE V+AG + ++ D+E D D+ + S
Sbjct: 300 NQWNRSWGDDGYFKIIRGTNECGIE--DVTAG-MPSTKNLDIESGVRDDDSLVAS 351
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 3/97 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY---QHTVGEMSGGHAVKIIGWGVEDGVKYWL 57
+Q EI G +VAA+E + + YK GVY + G HAVK+IGWG + + YWL
Sbjct: 247 IQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWGEQKRIPYWL 306
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
VNSW +G+ GLFKIRRGT+E IE+ V+AG +
Sbjct: 307 VVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGLAE 343
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/74 (55%), Positives = 50/74 (67%)
Query: 7 HFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELW 66
+F + A+ ++D + Y+ GVY+HT G+ GGHAVKIIGWG E G YWL VNSW E W
Sbjct: 329 NFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDW 388
Query: 67 GDGGLFKIRRGTDE 80
GD GLFKI G E
Sbjct: 389 GDNGLFKIALGNCE 402
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/88 (50%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + E + D YK GVYQH G + GGHAV+++GWG E+ V YWL NSW
Sbjct: 29 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWGEENNVPYWLIANSWN 88
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
WGD G FKI RG +E IES V+AG
Sbjct: 89 TDWGDNGYFKIIRGKNECGIES-DVNAG 115
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 56/97 (57%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A ++D Y GVY HT G GGHAVKI+GWGV++G YWL N
Sbjct: 248 IQTEILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGVDNGTPYWLVAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
SW WG+ G F+I RG +E IE V AG D R
Sbjct: 308 SWNVNWGEKGYFRIIRGLNECGIEHSAV-AGIPDLTR 343
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G + A + D +YK GVY+ + GGHA++I+GWGVE+GV +WL N
Sbjct: 246 IQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVENGVPFWLVAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG++E IE
Sbjct: 306 SWNTEWGDKGYFKILRGSNECGIE 329
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%), Gaps = 2/91 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
+Q EI + G + A + + D + YK GVYQ GGHAV+I+GWGV+ K YWL
Sbjct: 259 IQYEIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLV 318
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSW WG+ G F+IRRGTDES IES+ V+
Sbjct: 319 ANSWNTDWGEDGYFRIRRGTDESYIESWGVN 349
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 66/105 (62%), Gaps = 3/105 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + AA ++D YK GVY+H G GGHAVKIIGWG++ +YWL +N
Sbjct: 284 IKREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQNEQYWLVMN 343
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFE 105
SW WGD G+FKI G E I+S +V+AG +++S +E+ E
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDS-EVTAGIPKYEKTSGVEQSE 385
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/70 (57%), Positives = 47/70 (67%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + YK GVY+HT G GGHAVKIIGWG + G YWL VNSW E WGD
Sbjct: 299 GPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 358
Query: 69 GGLFKIRRGT 78
GLFKI G
Sbjct: 359 KGLFKIALGN 368
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 51/81 (62%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ + G + A E +D Y+ GVYQH G G H V ++GWG E+GV YWL NSWG
Sbjct: 183 EVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQNSWG 242
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+IRRGT++ I+
Sbjct: 243 SGWGEKGFFRIRRGTNDCHID 263
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G GGHAVK+IGWG E G YWL V
Sbjct: 244 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIV 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
NSW WG+ G FKIRRGT+E IE V+ R+ + +L
Sbjct: 304 NSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL 345
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV ++ + YWL
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 GNSWNTDWGDQGFFRILRGQDHCGIES-SISAG 336
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/107 (42%), Positives = 61/107 (57%), Gaps = 9/107 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS----GGHAVKIIGWGVE-----DGVK 54
EI G + A ++ +QD YK GVY + E G H+VKI+GWG E +K
Sbjct: 333 EIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIK 392
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
YWL NSWG+ WG+ G FKIRRGT+E IE F ++A D S ++
Sbjct: 393 YWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWAETNDPSREI 439
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY T GGHAVK+IGWG E GV YWL V
Sbjct: 242 IQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPYWLLV 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW + WGD GLFKI RGT+E I++
Sbjct: 302 NSWNDQWGDQGLFKILRGTNECGIDN 327
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV ++ + YWL
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I + + + D YK G+Y + GGH+VK+IGWG E GV YWL V
Sbjct: 243 IQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 302
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E R+++
Sbjct: 303 NSWNADWGDKGLFKIRRGTNECRVDN 328
>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
Length = 575
Score = 89.0 bits (219), Expect = 7e-16, Method: Composition-based stats.
Identities = 42/99 (42%), Positives = 61/99 (61%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y +G+Y HT E++ HAV ++G+G +
Sbjct: 472 MKLELVHHGPMAVAFEVYDDFLHYHRGIYHHTGLTDPFNPFELTN-HAVLLVGYGTDSAT 530
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A
Sbjct: 531 GIQYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAA 569
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 43/90 (47%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E+ G + A ++D YK GVY+H G+ GGHAVK+IGWG EDG YWL
Sbjct: 227 IMAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLA 286
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G FKI+RGT+E IE V+
Sbjct: 287 NQWNRGWGDDGYFKIKRGTNECGIEGAVVA 316
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 41/70 (58%), Positives = 48/70 (68%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + YK GVY+HT G GGHAVKIIGWG E+G YWL VNSW E WGD
Sbjct: 234 GPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 293
Query: 69 GGLFKIRRGT 78
GLFKI G
Sbjct: 294 HGLFKIALGN 303
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 59/91 (64%), Gaps = 8/91 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVE-----DGVKY 55
EI G + A + +QD IYK G+Y+H+ E+ SG H+V+IIGWG E +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKY 403
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
WL NSWG WGD GLFKI++GT+E IES+
Sbjct: 404 WLVANSWGYNWGDNGLFKIQKGTNECEIESY 434
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E+ G + A ++D YK GVY+H G++ GGHAVK+IGWG +DG YWL
Sbjct: 213 IMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLLA 272
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKIRRGT+E IE
Sbjct: 273 NQWNRGWGDDGYFKIRRGTNECGIE 297
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
+ E++ G + A ++D YK GVY+H GEM GGHAVK+IGWG DG YWL
Sbjct: 242 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLLA 301
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RG +E IE
Sbjct: 302 NQWNRGWGDDGYFKIIRGKNECGIE 326
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 59/93 (63%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVYQH G+ GGHA++I+GWGV + + YWL
Sbjct: 245 IQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGNEKIPYWLI 304
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 305 GNSWNTDWGDHGFFRILRGQDHCGIES-SISAG 336
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/96 (48%), Positives = 63/96 (65%), Gaps = 11/96 (11%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVED-------GV 53
EI G + A ++ +QD Y+ G+Y HT + E+ SG H+V+IIGWG ED +
Sbjct: 338 EILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWG-EDISTDSGLPI 396
Query: 54 KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
KYWL VNSWG+ WG+ GLF+IRRG +E IESF V+
Sbjct: 397 KYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 50/77 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D Y+ GVY T +++GGHA+KI+G+GVE+G YWLC N
Sbjct: 203 IQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGVENGTPYWLCAN 262
Query: 61 SWGELWGDGGLFKIRRG 77
SWG WG G FKI++G
Sbjct: 263 SWGPAWGMSGFFKIKQG 279
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 58/93 (62%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEA-HQDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
+Q E+ ++G + A D +YK GVY+ T E K+IGWGVE+GV YWL
Sbjct: 238 IQREVQNYGPVSMAFRVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLL 297
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
VNSWG WG GLFKI+RGTDE IE+F V AG
Sbjct: 298 VNSWGYEWGQNGLFKIKRGTDECNIETF-VHAG 329
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/82 (51%), Positives = 50/82 (60%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EIF G +V + D IYKKGVY + +G HAVKIIGWGV+DG+KYWL NSW
Sbjct: 252 EIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWN 311
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WGD G + RG + IES
Sbjct: 312 NDWGDEGYVRFLRGDNHCGIES 333
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/88 (47%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + QD +Y+ G+Y H G+ G HAV++IGWGVE+GV YWL NSW
Sbjct: 157 EIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWN 216
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G F++ RG +E IES +V AG
Sbjct: 217 EEWGENGYFRMVRGRNECGIES-EVVAG 243
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 59/93 (63%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A ++DLI+YK GVY+H G+ GGHA++I+GWGV + V YWL
Sbjct: 247 IQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWLI 306
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW WGD G F+I RG D IES +SAG
Sbjct: 307 GNSWNTDWGDNGFFRIVRGEDHCGIES-AISAG 338
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 2/92 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G +VAA + + D IY+ GVY +T G + G AVKIIGWG E+G YWL NSWG+ WG
Sbjct: 228 GPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGA 287
Query: 69 -GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
GG FKIRRGT+E E + AG+V S+
Sbjct: 288 LGGFFKIRRGTNECGFEE-SIIAGQVREGGST 318
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D YK GVY + GGHAVK+IGWG E GV YWL V
Sbjct: 243 IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWGEEYGVPYWLMV 302
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW E WGD G FKI+RGT+E +++
Sbjct: 303 NSWNEDWGDHGFFKIQRGTNECGVDN 328
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 53/86 (61%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A+ + D Y+ GVYQH +G +SG H+VKI+GWG E+G YWL N
Sbjct: 226 IQAEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWGRENGTDYWLVAN 285
Query: 61 SWGELWGD-GGLFKIRRGTDESRIES 85
SWG WG GG FK RG + IES
Sbjct: 286 SWGRDWGRLGGFFKFLRGENHCDIES 311
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 50/85 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G I+ + + +QD Y GVY H G +G H VKI+GWG E YWL N
Sbjct: 226 IQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKEQDYWLIAN 285
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G FKI RG +E IE+
Sbjct: 286 SWGSSWGEHGFFKILRGKNECGIEN 310
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G M GGHAVK+IGWG D G YWL
Sbjct: 291 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 350
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RGT+E IE
Sbjct: 351 NQWNRGWGDDGYFKIIRGTNECGIE 375
>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
C
Length = 441
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 336 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 394
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 395 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 438
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGV-KYWLC 58
+QLEI G +VA+ + D I Y GVY+ ++ GGHAV+IIGWG+E+G YWL
Sbjct: 166 IQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLV 225
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW E WGD GLFKI RG +E IE +++AG
Sbjct: 226 SNSWNERWGDQGLFKIWRGKNECGIEE-EITAG 257
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G + ++D YK GVY++ G+ GGHAVK+IGWG E+G YWL N
Sbjct: 240 IMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVAN 299
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G FKI RG++E IE
Sbjct: 300 SWNTAWGEDGYFKIARGSNECSIE 323
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGV-KYWLC 58
+QLEI G +VA+ + D I Y GVY+ ++ GGHAV+IIGWG+E+G YWL
Sbjct: 236 IQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLV 295
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW E WGD GLFKI RG +E IE +++AG
Sbjct: 296 SNSWNERWGDQGLFKIWRGKNECGIEE-EITAG 327
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G F IRRGT+E IE V+AG
Sbjct: 305 NQWNRSWGDDGYFMIRRGTNECGIEE-DVTAG 335
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 56/90 (62%), Gaps = 3/90 (3%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE--DGVKYWLCVNS 61
EI G + A D + YK GVY++ G + GGHA++IIGWGV + YWLC NS
Sbjct: 255 EILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANS 314
Query: 62 WGELWGDGGLFKIRRGTDESRIESFQVSAG 91
W + WGD G FKI RG++E IES V+AG
Sbjct: 315 WNKQWGDKGYFKILRGSNECGIESM-VTAG 343
>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
Length = 316
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 213 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 271
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 272 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 315
>gi|449485032|ref|XP_002188357.2| PREDICTED: dipeptidyl peptidase 1 [Taeniopygia guttata]
Length = 667
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 64/104 (61%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
M+LE+ H G + A E + D ++YK+G+Y HT E++ HAV ++G+G E
Sbjct: 564 MKLELVHHGPMAVAFEVYNDFMLYKEGIYHHTGLQDDLNPFELTN-HAVLLVGYGKDPES 622
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G K+W+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 623 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 666
>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
Length = 464
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/92 (48%), Positives = 57/92 (61%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q EI G + A+ D + YK GVY + + GGH+VKIIGWGVE G YWL
Sbjct: 246 IQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPYWLIA 305
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW E WG+ GLFK+ RG +E IE+ +V AG
Sbjct: 306 NSWNEDWGENGLFKMLRGKNECGIEA-EVVAG 336
>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
Length = 464
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 343 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 401
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 402 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 445
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 54/84 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ E+ G + + ++D + YK G+YQH G+ GGHAVK++GWGVEDG++YW N
Sbjct: 227 IKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGIEYWKIAN 286
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WG+ G F+I G E IE
Sbjct: 287 SWNEDWGENGYFRIVAGKGECGIE 310
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G + A + D +YK GVY+ + GGHA++I+GWGVE+ V YWL N
Sbjct: 247 IQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVENDVPYWLVAN 306
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG++E IE
Sbjct: 307 SWNTEWGDKGYFKILRGSNECGIE 330
>gi|1582221|prf||2118248A prepro-cathepsin C
Length = 463
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
I G + A ++D Y G+Y H GE +GGHAVK +GWGVE+G KYW NSW
Sbjct: 200 IAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGVENGTKYWKVANSWNP 259
Query: 65 LWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
WG+ G F+I RG++E IE QV+ D
Sbjct: 260 YWGEAGYFRILRGSNEGGIED-QVTGSHAD 288
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G M GGHAVK+IGWG D G YWL
Sbjct: 246 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 305
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RGT+E IE
Sbjct: 306 NQWNRGWGDDGYFKIIRGTNECGIE 330
>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 62/95 (65%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
+Q EI G + A + +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 227 IQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 286
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSW WG+ GLFKI RG + IE V AG VD
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIE-LLVMAGIVD 320
>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 51/85 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A+ H D + YK G+Y+H G G H V+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKETPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ G F++ RG DE IES
Sbjct: 309 SWNEDWGEKGYFRMLRGKDECGIES 333
>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY++ G GGHAVK+IGWG +DG YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G FKIRRGT+E IE V+
Sbjct: 305 NQWNRSWGDDGYFKIRRGTNECGIEQSVVA 334
>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
Length = 463
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
Length = 463
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDPAT 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
GV YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GVDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A +D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 247 IMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI+RGT+E IE V+AG
Sbjct: 307 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 337
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 59/97 (60%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q+EI G ++A+ + D YK G+Y HT G+ GG KIIGWGV++GV YWLCV+
Sbjct: 259 IQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 318
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDR 97
WG +G+ G + RG +E IE QV A D ++
Sbjct: 319 QWGTDFGENGFVRFLRGVNEVNIE-HQVLAALPDSEK 354
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A +D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 247 IMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI+RGT+E IE V+AG
Sbjct: 307 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 337
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 51/85 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A H D + YK G+Y++ G G HAV+IIGWGVE YWL N
Sbjct: 249 IKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGVEKKTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WG+ G F++ RG DE IES
Sbjct: 309 SWNEDWGEKGYFRMLRGKDECGIES 333
>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
Length = 69
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/67 (62%), Positives = 47/67 (70%), Gaps = 1/67 (1%)
Query: 25 KKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIE 84
K GVYQH G GGHA+KI+GWG EDGV YWLC NSW WGD G FK RG+D RIE
Sbjct: 1 KFGVYQHVYGSAVGGHAIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIE 60
Query: 85 SFQVSAG 91
S ++ AG
Sbjct: 61 S-EIVAG 66
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDE 80
N W WGD G FKIRRGT+E
Sbjct: 305 NQWNREWGDDGYFKIRRGTNE 325
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 61/95 (64%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
++ +I GS+VA ++D Y+ G+Y+HT G +GG HAVK+IGWG ++G YWL
Sbjct: 251 IRRDIKERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKDNGTDYWLIA 310
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSW + WG+ G F++ RG + IE QV AG VD
Sbjct: 311 NSWHDDWGENGFFRMIRGINNCGIEE-QVDAGIVD 344
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 51/84 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EIF G + A + D + YK GVYQ + G HA++I+GWG E+G YWL N
Sbjct: 243 IQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGTENGTPYWLAAN 302
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WGD G FKI R T+E IE
Sbjct: 303 SWNENWGDKGYFKILRRTNECGIE 326
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY++ G GGHAVK+IGWG +DG YWL
Sbjct: 267 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 326
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G FKIRRGT+E IE V+
Sbjct: 327 NQWNRSWGDDGYFKIRRGTNECGIEQSVVA 356
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ EI+ G + + ++D YK GVY+H G GGHAVK+IGWG +DG YWL
Sbjct: 248 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGEDYWLLA 307
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G F IRRGT+E IE V+
Sbjct: 308 NQWNRSWGDDGYFMIRRGTNECGIEDEPVA 337
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A +D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI+RGT+E IE V+AG
Sbjct: 305 NQWNTNWGDDGYFKIKRGTNECGIED-DVTAG 335
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 52/84 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ +I+ G + +A + D YK GVYQ + + G HA+KI+GWG EDGV YWL N
Sbjct: 237 IKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIKILGWGTEDGVPYWLVAN 296
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G FKI RG DE IE
Sbjct: 297 SWNVGWGDKGYFKILRGKDECGIE 320
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 56/89 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +E+ G + ++ + D + YK G Y+H G++ GGHAVK++GWG + GV YW N
Sbjct: 246 LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGTQGGVPYWKIAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WGD G F I+RG++E IES V+
Sbjct: 306 SWNTDWGDKGYFLIQRGSNECGIESGGVA 334
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 87.4 bits (215), Expect = 2e-15, Method: Composition-based stats.
Identities = 37/69 (53%), Positives = 45/69 (65%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A ++D + YK GVY+H G GGHA+KIIGWG E+G +YW VNSW WGD
Sbjct: 453 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 512
Query: 69 GGLFKIRRG 77
GG FKI G
Sbjct: 513 GGQFKIAMG 521
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 87.4 bits (215), Expect = 2e-15, Method: Composition-based stats.
Identities = 37/69 (53%), Positives = 45/69 (65%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A ++D + YK GVY+H G GGHA+KIIGWG E+G +YW VNSW WGD
Sbjct: 456 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 515
Query: 69 GGLFKIRRG 77
GG FKI G
Sbjct: 516 GGQFKIAMG 524
>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
Length = 463
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 60/94 (63%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y GVY+ T G +G H+VKI+GWG E DGVKYW
Sbjct: 325 IMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGPWWGEHGYFRILRGSNECGIEEYVLAS 418
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY++ G GGHAVK+IGWG +DG YWL
Sbjct: 71 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLA 130
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
N W WGD G FKIRRGT+E IE V+
Sbjct: 131 NQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 161
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 87.4 bits (215), Expect = 2e-15, Method: Composition-based stats.
Identities = 37/69 (53%), Positives = 45/69 (65%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A ++D + YK GVY+H G GGHA+KIIGWG E+G +YW VNSW WGD
Sbjct: 453 GPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTENGEEYWHAVNSWNTYWGD 512
Query: 69 GGLFKIRRG 77
GG FKI G
Sbjct: 513 GGQFKIAMG 521
>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
Length = 463
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 59/91 (64%), Gaps = 8/91 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVEDG-----VKY 55
EI G + A + +QD +YK GVY+H+ E+ SG H+++IIGWG E +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKY 403
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
WL NSWG WG+ GLF+I+RGT+E IES+
Sbjct: 404 WLVANSWGRHWGENGLFRIQRGTNECEIESY 434
>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
Length = 463
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + YKKG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 1/113 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK++GWG +G YWL
Sbjct: 244 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLA 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTI 112
N W WGD G FKI+RGT+E IE+ + ++ ++ + + D D +
Sbjct: 304 NQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVSF 356
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 62/113 (54%), Gaps = 1/113 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK++GWG +G YWL
Sbjct: 239 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLA 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTI 112
N W WGD G FKI+RGT+E IE+ + ++ ++ + + D D +
Sbjct: 299 NQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVSF 351
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E+F G I A + +D YK GVY+H G GGHAVK++GWG +DGV YW V
Sbjct: 244 LMAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMV 303
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSW WG+ G F+I RG DE IES V+
Sbjct: 304 NSWNTNWGEDGTFRILRGKDECGIESNAVA 333
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ ++ +G + A+ + + D +YK G+Y+ T + GGH++KIIGWG E+G YWL V
Sbjct: 239 IEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WGD G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGRNECGIER-AVTAG 329
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 56/86 (65%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G I A+ + + D + YK GVY + GGHAVK+IGWG E G YWL +
Sbjct: 245 IQKDVMTYGPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTPYWLMM 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E +++
Sbjct: 305 NSWNADWGDEGLFKIRRGTNECGVDN 330
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 47/70 (67%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + Y+ GVY+HT G GGHAVKIIGWG + G YWL VNSW E WGD
Sbjct: 74 GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 133
Query: 69 GGLFKIRRGT 78
GLFKI G
Sbjct: 134 HGLFKIALGN 143
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 48/81 (59%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F G A + ++D I Y GVY H G+ GGHAV+++GWG +GV YW NSW
Sbjct: 246 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 305
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F IRRG+ E IE
Sbjct: 306 TEWGMDGYFLIRRGSSECGIE 326
>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
Length = 460
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y KG+Y HT E++ HAV ++G+G +
Sbjct: 357 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 415
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 416 GIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459
>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
Length = 463
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDLAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 149
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 51/74 (68%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + ++D +YK GVY HT G++ G H +KIIGWGVE G +YWL +N
Sbjct: 50 IKQEIFEHGPVFCAFDMYKDFGLYKSGVYVHTTGDLVGSHTLKIIGWGVESGQEYWLAMN 109
Query: 61 SWGELWGDGGLFKI 74
SW E WGD GL K+
Sbjct: 110 SWNEEWGDHGLIKM 123
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 97 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIA 156
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G F IRRGT+E IE V+AG
Sbjct: 157 NQWNRSWGDDGYFMIRRGTNECGIEE-DVTAG 187
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 48/81 (59%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F G A + ++D I Y GVY H G+ GGHAV+++GWG +GV YW NSW
Sbjct: 224 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 283
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F IRRG+ E IE
Sbjct: 284 TEWGMDGYFLIRRGSSECGIE 304
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
++ +I +G + A+ + + D I YK G+YQ T + GGH+VK+IGWG EDG+ YWL V
Sbjct: 240 IEQDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLV 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSW + WG+ G F+I +G +E IE
Sbjct: 300 NSWSKFWGEQGTFRIIKGRNECGIE 324
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 48/81 (59%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F G A + ++D I Y GVY H G+ GGHAV+++GWG +GV YW NSW
Sbjct: 246 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 305
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F IRRG+ E IE
Sbjct: 306 TEWGMDGYFLIRRGSSECGIE 326
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 48/81 (59%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+F G A + ++D I Y GVY H G+ GGHAV+++GWG +GV YW NSW
Sbjct: 223 ELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWN 282
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F IRRG+ E IE
Sbjct: 283 TEWGMDGYFLIRRGSSECGIE 303
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G M GGHAVK+IGWG D G YWL
Sbjct: 122 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 181
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RGT+E IE
Sbjct: 182 NQWNRGWGDDGYFKIIRGTNECGIE 206
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + + ++D YK GVY+H G GGHAVK+IGWG D G YWL
Sbjct: 250 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLLA 309
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
N W WGD G F IRRGT+E IE V+ R+
Sbjct: 310 NQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRN 346
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M EI+ G I A++A +L+ YK G+++ G S HA+ ++GWG EDG KYW+ N
Sbjct: 184 MMKEIYARGPITCALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRN 243
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G F+I RGT+ IES
Sbjct: 244 SWGTYWGENGWFRIVRGTNNLGIES 268
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
M+ EIF G I + Q+ + Y GV+ M GGH +++ GWGV EDG +YW+
Sbjct: 465 MKAEIFARGPISCYVSVSQEFLDYTGGVFVEHDHSMLGGHIIEVAGWGVTEDGQEYWIGR 524
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWGE WG+ G F+I+ D IES
Sbjct: 525 NSWGEYWGENGWFRIQTDKDNLEIES 550
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 57/85 (67%), Gaps = 2/85 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G+++ I ++D +YK GVY HT G + G H++KIIGWGVE G YWL VN
Sbjct: 66 IKQEIFDNGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKIIGWGVESGQDYWLAVN 125
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW E WGD G+ K+ G E+ IE+
Sbjct: 126 SWNEEWGDHGMIKLAVG--ETGIEN 148
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 11/100 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---GGHAVKIIGWGVEDG----- 52
+ EI G + A ++ +QD Y+ G+Y+HT G H+V+IIGWG +
Sbjct: 340 IMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYR 399
Query: 53 ---VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
+KYWL VNSWG+ WG+ GLF+I+RGT+E IESF V+
Sbjct: 400 NLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
++ +I +G + A+ + + D I YK G+YQ T GGH+VK+IGWG EDG+ YWL V
Sbjct: 240 IEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLV 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
NSW + WG+ G F+I +G +E IE +AG R
Sbjct: 300 NSWSKFWGEQGTFRIIKGRNECGIER-SATAGVPSR 334
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 58/99 (58%), Gaps = 1/99 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G ++A+ + D YK G+Y HT G+ GG KIIGWGV+ GV YWLCV+
Sbjct: 261 IQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDSGVPYWLCVH 320
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
WG +G+ G + RG +E IE QV A D D+ +
Sbjct: 321 QWGTDFGENGFVRFLRGVNEVNIE-HQVLAALPDIDKHN 358
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 86.7 bits (213), Expect = 3e-15, Method: Composition-based stats.
Identities = 40/96 (41%), Positives = 60/96 (62%), Gaps = 9/96 (9%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----DGVK 54
EI G++ A + ++D Y+ G+Y+H+ E S H+V++IGWG E D VK
Sbjct: 436 EIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVK 495
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
YW+ VNSWG WG+ G F+I RGT+E IES+ +++
Sbjct: 496 YWIAVNSWGTWWGENGRFRILRGTNECEIESYVLAS 531
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ ++G I A+ + + D YK G+Y + GGH+VK+IGWG E GV YWL V
Sbjct: 182 IQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 241
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKIRRGT+E +++
Sbjct: 242 NSWNADWGDKGLFKIRRGTNECGVDN 267
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 58/94 (61%), Gaps = 3/94 (3%)
Query: 1 MQLEIFHFGSIVAAIEAH-QDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
+Q E+ ++G + A + D +YK GVY+ T E K+IGWGVE+GV YWL
Sbjct: 238 IQREVQNYGPVSMAFKVFDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLL 297
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGR 92
VN WG WG GLFKI+RGTDE IE+F V AG
Sbjct: 298 VNFWGYEWGQNGLFKIKRGTDECNIETF-VHAGE 330
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 61/100 (61%), Gaps = 11/100 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---GGHAVKIIGWGVEDG----- 52
+ EI G + A ++ +QD Y+ G+Y+HT G H+V+IIGWG +
Sbjct: 340 IMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHH 399
Query: 53 ---VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
+KYWL VNSWG+ WG+ GLF+I+RGT+E IESF V+
Sbjct: 400 NLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 109
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/72 (56%), Positives = 49/72 (68%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + Y+ GVY+HT G+ GGHAVKIIGWG E G YWL VNSW E WGD
Sbjct: 26 GPVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEETGQAYWLVVNSWNEDWGD 85
Query: 69 GGLFKIRRGTDE 80
GLFKI G E
Sbjct: 86 NGLFKIALGNCE 97
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/69 (56%), Positives = 46/69 (66%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A ++D + YK GVY+HT G + G HAVKIIGWG + G YWL VNSW E WGD
Sbjct: 315 GPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWGEDGGEAYWLVVNSWNEGWGD 374
Query: 69 GGLFKIRRG 77
GLFKI G
Sbjct: 375 HGLFKIALG 383
>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 57/94 (60%), Gaps = 6/94 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
MQ I GSI+ ++ +QD + Y GVYQH+ + V+IIGWGVE+GVKYW+
Sbjct: 167 MQQAILQGGSIMTELDMYQDFLYYSSGVYQHSANLRQPIAKFVVRIIGWGVENGVKYWIV 226
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIE----SFQV 88
N WG+ WG G IRRG +ES IE +FQV
Sbjct: 227 PNIWGKTWGMQGYIWIRRGNNESNIEKDAFAFQV 260
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 56/91 (61%), Gaps = 3/91 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ + ++D + Y+ GVY+H GGHAV +IGWGVE G YWL VN
Sbjct: 263 IKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGVESGTNYWLAVN 322
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SWG WG G FKI +G E IE+ QV AG
Sbjct: 323 SWGSNWGMSGYFKIAQG--ECGIEN-QVYAG 350
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 51/84 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G + A D + YK GVYQ T G+ G HAVKIIGWG E+GV YW +N
Sbjct: 241 IKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTENGVPYWEAIN 300
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW + WG G FKI RG + IE
Sbjct: 301 SWNDGWGINGKFKILRGFNHLDIE 324
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 51/85 (60%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G M GGHAVK+IGWG D G YWL
Sbjct: 30 IMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLA 89
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RGT+E IE
Sbjct: 90 NQWNRGWGDDGYFKIIRGTNECGIE 114
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 55/86 (63%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q ++ +G + A+ + + D YK GVY + GGHA K+IGWG E GV YWL V
Sbjct: 245 IQKDVLTYGPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVPYWLMV 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSW WGD GLFKI+RGT+E I++
Sbjct: 305 NSWNADWGDNGLFKIQRGTNECGIDN 330
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 59/92 (64%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ +I +G + A+ + + D +YK G+Y+ T + GH+VKIIGWG E+G YWL V
Sbjct: 239 IEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWGQENGTPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WGD G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGDHGTFKIIKGKNECGIER-AVTAG 329
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A E H+D YK G+Y H G GGHA++I+GWG E+GV YWL NSW
Sbjct: 247 EILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEENGVPYWLIANSWN 306
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
E WG+ G + RG +E IE + +AG D
Sbjct: 307 EDWGEKGYLRFLRGHNECGIEE-EATAGLPD 336
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 63/115 (54%), Gaps = 4/115 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + ++D YK GVY+H G GGHAVK+IGWG EDG YWL
Sbjct: 240 IMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLA 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDTDTTIES 114
N W WG G FKI RGT+E IE V+AG ++ D+E D D+ + S
Sbjct: 300 NQWNRSWGGDGYFKIIRGTNECGIE--DVTAG-TPSTKNLDIESGVRDDDSLVAS 351
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ +I +G + A+ + + DL YK G+Y+ T + GGH++KIIGWG ++G YWL V
Sbjct: 239 IERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WG+ G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 7/94 (7%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE----DGVKYW 56
EI G + A + H D +Y+ GVY+++ + SG H+V+I+GWGV+ + KYW
Sbjct: 341 EILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYW 400
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
L NSWG LWG+ G F+I RG +ES IE F ++A
Sbjct: 401 LVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV---DRDRSSDL 101
N W WGD G FKI RG +E IE V+AG + DR++D+
Sbjct: 305 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAGMPSTKNMDRNNDV 348
>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
Length = 462
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y+KG+YQHT E++ HAV ++G+G +
Sbjct: 359 MKLELVHHGPMAVAFEVYDDFLHYQKGIYQHTGLRDPFNPFELTN-HAVLLVGYGTDLAS 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRG DE IES ++A + +
Sbjct: 418 GMDYWIVKNSWGTSWGEDGFFRIRRGIDECSIESIAMAATPIPK 461
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 48/77 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + AA ++D Y KG+Y HT G G HAVK++GWGVE+G KYW N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314
Query: 61 SWGELWGDGGLFKIRRG 77
SW WG+ G F+I RG
Sbjct: 315 SWSTDWGENGYFRILRG 331
>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
Length = 458
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y+KGVY HT E++ HAV ++G+G +
Sbjct: 355 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 413
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 414 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 457
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 58/93 (62%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQ---HTVGEMSGGHAVKIIGWGVEDGVKYWL 57
+ +I G + A + +QD Y+ GVY+ H E+ G H+V+IIGWG + G +YW+
Sbjct: 327 IMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDRGDRYWV 386
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
NSWG WG+ G F+I RG++E+ IESF V+
Sbjct: 387 VANSWGRQWGENGYFRIARGSNEADIESFVVTG 419
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 48/77 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + AA ++D Y KG+Y HT G G HAVK++GWGVE+G KYW N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314
Query: 61 SWGELWGDGGLFKIRRG 77
SW WG+ G F+I RG
Sbjct: 315 SWSTDWGENGYFRILRG 331
>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 105
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 47/70 (67%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + Y+ GVY+HT G GGHAVKIIGWG + G YWL VNSW E WGD
Sbjct: 22 GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 81
Query: 69 GGLFKIRRGT 78
GLFKI G
Sbjct: 82 HGLFKIALGN 91
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y+HT + G H+VKI GWG E DG
Sbjct: 355 ELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDG 414
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
VKYW NSWG WG+ G F+I RG +E IESF V GRV
Sbjct: 415 QKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVVGVWGRV 458
>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
Length = 463
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y+KGVY HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 419 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462
>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
leucogenys]
Length = 463
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYDDFLHYEKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
Length = 463
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y+KGVY HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 419 GLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + + D +YK GVY T + H K+IGWGVE+GV YWL V
Sbjct: 235 IQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLV 294
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSWG WG GLFKI+RGT+E +E + V AG +
Sbjct: 295 NSWGNEWGQNGLFKIKRGTNEVHVEDY-VYAGEPE 328
>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
Length = 412
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M+LE+ G + A E + D I YK+G+Y HT E++ HAV ++G+G +
Sbjct: 309 MKLELVLHGPMAVAFEVYNDFIHYKEGIYHHTGLRDDFNPFELTN-HAVLLVGYGTDPQS 367
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G K+W+ NSWG LWG+ G F+IRRGTDE IES VSA + +
Sbjct: 368 GEKFWIVKNSWGILWGENGYFRIRRGTDECAIESIAVSATPIAK 411
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G +VA ++D Y+ G+Y+H G +G HAVK+IGWG E G YW+ N
Sbjct: 249 IQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW + WG+ G F++ RG+++ E +++AG V
Sbjct: 309 SWHDDWGENGFFRMHRGSNDCGFEE-RMAAGSVQ 341
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ ++ G++ A ++D + YK GVY+H G GGHA+KIIGWG EDG +YW VN
Sbjct: 339 VKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVN 398
Query: 61 SWGELWGDGGLFKIRRG 77
SW WGD G FKI G
Sbjct: 399 SWNTYWGDSGHFKIEMG 415
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ A ++DL YK GVY+H V + GGHAV +IG+GVE G YWL N
Sbjct: 219 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 278
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG G FKI +G E IE+ QV AG
Sbjct: 279 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 308
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ A ++DL YK GVY+H V + GGHAV +IG+GVE G YWL N
Sbjct: 234 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 293
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG G FKI +G E IE+ QV AG
Sbjct: 294 SWGANWGMSGYFKIAQG--EGGIEN-QVYAGEA 323
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + + + D + YK G+Y+H G G H V+I+GWGVE G YWL N
Sbjct: 249 IKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVGWGVEKGTPYWLIAN 308
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQV 88
SW E WG+ G F+I RG DE IES +
Sbjct: 309 SWNEGWGEKGYFRILRGKDECDIESLVI 336
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 10/93 (10%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEM--SGGHAVKIIGWGVEDG-------V 53
EI G + A + +QD Y+ GVY+H+V E+ S H+V+IIGWG E +
Sbjct: 341 EILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPL 400
Query: 54 KYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
KYWL NSWG+ WG+ GLF+I++GT+E IESF
Sbjct: 401 KYWLVANSWGQQWGENGLFRIQKGTNECEIESF 433
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 50/77 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G A ++D + YK GVY+HT G + G H+V+IIGWG E GV YWL +N
Sbjct: 225 IKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVMN 284
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 285 SWNEGWGDHGTFKIAQG 301
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 50/77 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A + D Y+ GVY T +++GGHA+KI+G+GVE+G YWLC N
Sbjct: 206 IKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGVENGTPYWLCAN 265
Query: 61 SWGELWGDGGLFKIRRG 77
SWG WG G FKI++G
Sbjct: 266 SWGPSWGMQGFFKIKQG 282
>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
Length = 463
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDPVS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGIGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 60/99 (60%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
MQLE+ G + A E + D + YK GVY+HT E++ HAV ++G+G E
Sbjct: 366 MQLELVKNGPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITN-HAVLLVGYGRDPET 424
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G K+W NSWGE WG+ G F+IRRGTDE IES V+A
Sbjct: 425 GAKFWTVKNSWGEKWGEEGFFRIRRGTDECAIESIAVAA 463
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 48/77 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + AA ++D Y KG+Y HT G G HAVK++GWGVE+G KYW N
Sbjct: 255 IQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314
Query: 61 SWGELWGDGGLFKIRRG 77
SW WG+ G F+I RG
Sbjct: 315 SWSTDWGEDGYFRILRG 331
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
M+ ++ +G I A+ DL YK G+YQ T + GH++KIIGWG E+GV YWL V
Sbjct: 239 MEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSW + WG+ G F+I +G +E IE
Sbjct: 299 NSWSKFWGEQGTFRIIKGRNECGIE 323
>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
Length = 463
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y++G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVHNGPMAVAFEVYNDFLHYQEGIYHHTGLTDPFNPFELTN-HAVLLVGYGTDPAT 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTAWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 59/91 (64%), Gaps = 8/91 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEM--SGGHAVKIIGWGVEDG-----VKY 55
EI G + A + +QD +Y+ GVY+H+ E+ SG H+V+IIGWG E +KY
Sbjct: 344 EILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKY 403
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
WL NSWG WG+ GLF+I++GT+E IES+
Sbjct: 404 WLVANSWGHNWGENGLFRIQKGTNECEIESY 434
>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 88
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 40/69 (57%), Positives = 47/69 (68%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G AA ++D + YK GVY+HT G GGHAV+IIGWG E GV YWL +NSW E WGD
Sbjct: 4 GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 63
Query: 69 GGLFKIRRG 77
G FKI +G
Sbjct: 64 HGTFKIVQG 72
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 54/97 (55%), Gaps = 1/97 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G GGHAVK+IGWG +G YWL
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLMA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRD 96
N W WGD G F IRRGT+E IE V+ R+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSRN 343
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 61/117 (52%), Gaps = 17/117 (14%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVE----- 50
EI G + A +E H+D +YK G+Y+HT G H+VKI GWG E
Sbjct: 210 EIMENGPVQALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNG 269
Query: 51 DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFEY 106
VK+W NSWG WG+GG F+I RG +E IESF V GRV S D+ Y
Sbjct: 270 RRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVGVWGRVG---SEDMNHRRY 323
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 60/94 (63%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y G+Y+ T G +G H+VK++GWG E DGVKYW
Sbjct: 326 IMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYW 385
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 386 IAANSWGPWWGEHGYFRILRGSNECGIEEYVLAS 419
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 35/74 (47%), Positives = 48/74 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + ++D YK GVY H G+M H +K+IGWGVE G +YWL VN
Sbjct: 318 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVN 377
Query: 61 SWGELWGDGGLFKI 74
+W E WGD G+ K+
Sbjct: 378 AWNEEWGDHGMIKL 391
>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
Length = 460
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y KG+Y HT E++ HAV ++G+G +
Sbjct: 357 MKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLKDPFNPFELTN-HAVLLVGYGTDPAS 415
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 416 GLNYWTVKNSWGTSWGENGYFRIRRGTDECAIESIAMAATPIPK 459
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 57/91 (62%), Gaps = 3/91 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ A ++DL YK GVY+H V + GGHAV +IG+GVE G YWL N
Sbjct: 219 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGVEGGSNYWLAAN 278
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SWG WG G FKI +G E IE+ QV AG
Sbjct: 279 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAG 306
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 59/105 (56%), Gaps = 14/105 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG---------HAVKIIGWGVE---- 50
EI G + A + D Y++GVY+H+ H+V+IIGWG +
Sbjct: 360 EIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYTGD 419
Query: 51 DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVD 94
D +KYWL N+WG WG+GG F+I RG+DES IESF V G+VD
Sbjct: 420 DPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVGVWGKVD 464
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 53/84 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G ++A+ + D YK G+Y HT G+ GG KIIGWGV++GV YWLCV+
Sbjct: 110 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH 169
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
WG +G+ G +I RG +E IE
Sbjct: 170 QWGTDFGENGFMRILRGVNEVHIE 193
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ A ++DL YK GVY+H V + GGHAV +IG+GVE G YWL N
Sbjct: 134 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSNYWLAAN 193
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG G FKI +G E IE+ QV AG
Sbjct: 194 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 223
>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
Length = 487
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 63/110 (57%), Gaps = 12/110 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-----------HAVKIIGWGV 49
+ LE+F+ G ++ E QD + Y G+Y H+V + H+V GWG
Sbjct: 365 IMLELFNNGPVIMNFEPGQDFMYYSSGIY-HSVAQHDWSSSDRPEWEKVDHSVLCYGWGE 423
Query: 50 EDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
E+GVK+WL NSWGE WG+ G F+++RGTDES IES +A V +SS
Sbjct: 424 ENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAADPVIYSKSS 473
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 52/84 (61%), Gaps = 1/84 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-VGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + A + DL +Y GVY T + + K+IGWGVE+GV YWL V
Sbjct: 230 IQREVQTYGPVSAYFSLYDDLFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYWLLV 289
Query: 60 NSWGELWGDGGLFKIRRGTDESRI 83
NSWG WG GLFKI+RGTDE +
Sbjct: 290 NSWGNEWGQNGLFKIKRGTDECQF 313
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G GGHAVK+IGWG +G YWL
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G F IRRGT+E IE V+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVA 336
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 55/88 (62%), Gaps = 2/88 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
G + A H D + Y+ GVYQHT G M GGHAV+++G+G +DGV YW+ NSWG WG
Sbjct: 214 GPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIKNSWGPDWG 273
Query: 68 DGGLFKIRRGTDESRIESFQVSAGRVDR 95
+ G F++ RG ++ IE Q AG D
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFDE 300
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + AA ++D Y++G+Y HT G G HAVK++GWGVE+G KYW N
Sbjct: 255 IQREMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVAN 314
Query: 61 SWGELWGDGGLFKIRRG 77
SW WG+ G F+I RG
Sbjct: 315 SWSTDWGEDGYFRILRG 331
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/77 (51%), Positives = 47/77 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A + D +YK GVY+H G G HAVKIIGWG E+GV YWL N
Sbjct: 143 IQKEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGTENGVDYWLIAN 202
Query: 61 SWGELWGDGGLFKIRRG 77
SWG +G G FKI RG
Sbjct: 203 SWGTTFGLQGFFKIVRG 219
>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
Length = 469
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E + D + Y++GVY HT E++ HAV ++G+G +
Sbjct: 366 MKLELVRHGPMAVAFEVYNDFLHYREGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDPAT 424
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRG+DE IES V+A + R
Sbjct: 425 GLDYWIVKNSWGTAWGEDGYFRIRRGSDECAIESIAVAATPIPR 468
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+ E++ G + + ++D YK GVY+H G GGHAVK+IGWG +G YWL
Sbjct: 247 IMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMA 306
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVS 89
N W WGD G F IRRGT+E IE V+
Sbjct: 307 NQWNRGWGDDGYFMIRRGTNECGIEDEPVA 336
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 50/82 (60%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V A + + D + YK GVY+H G++ GGHAV+I+GWG +G YW NSW
Sbjct: 241 ELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 300
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WG G F I RG DE IES
Sbjct: 301 TDWGMNGHFLILRGKDECGIES 322
>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
Length = 455
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 63/104 (60%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G + A E + D + Y++GVY HT E++ HAV ++G+G +
Sbjct: 352 MKLELVHRGPMAVAFEVYNDFLHYRQGVYHHTGLRDPFNPFELTN-HAVLLVGYGTDAAS 410
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 411 GLDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIALAATPIPK 454
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 55/92 (59%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E+++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 201 IMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 260
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI RG +E IE V+AG
Sbjct: 261 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAG 291
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 54/94 (57%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
M +I+ G I QD + YK GVY+ + GGHA+KI+G+G EDG YWL
Sbjct: 153 MAADIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVA 212
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
NSW E WGD G FKI RG + +IE ++ G V
Sbjct: 213 NSWNEDWGDDGYFKIIRGKNACQIEDPVINGGPV 246
>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 118
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 50/77 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++ A+ ++D+ +YK GVY H G G H +KIIGWGVE G YWL VN
Sbjct: 28 IKQEIFTNGPVIGALTIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 87
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G+ K+ G
Sbjct: 88 SWNEEWGDHGMIKLAVG 104
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 2/114 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + A ++D Y+ GVY++T G++ GGHAVK+IGWG +DG YW+
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS-SDLEEFEYDTDTTI 112
N W WGD G F IRRG +E IE V+ ++ D E + D +I
Sbjct: 305 NQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLMIGDFESVDADRHVSI 358
>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 234
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 35/74 (47%), Positives = 48/74 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + ++D YK GVY H G+M H +K+IGWGVE G +YWL VN
Sbjct: 105 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGVESGQEYWLAVN 164
Query: 61 SWGELWGDGGLFKI 74
+W E WGD G+ K+
Sbjct: 165 AWNEEWGDHGMIKL 178
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 63/114 (55%), Gaps = 2/114 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+ E++ G + A ++D Y+ GVY++T G++ GGHAVK+IGWG +DG YW+
Sbjct: 279 IMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILA 338
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRS-SDLEEFEYDTDTTI 112
N W WGD G F IRRG +E IE V+ ++ D E + D +I
Sbjct: 339 NQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLMIGDFESVDADRHVSI 392
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/86 (44%), Positives = 50/86 (58%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G I A+ +QD + Y+ GVY + G+ AVKI+GWG E+G YWL NS+
Sbjct: 213 EIYENGPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWGEENGTPYWLAANSFN 272
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
WGD G KI RG +E IE F +
Sbjct: 273 TYWGDNGFVKILRGANECYIEEFMYA 298
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 57/100 (57%), Gaps = 14/100 (14%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---------SGGHAVKIIGWGVE- 50
+ EI+ G + A D +Y +GVY++ E +G H+VKI+GWG++
Sbjct: 338 IMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDR 397
Query: 51 ----DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
+ +KYWLC NSWG WG+ G+F+I RG +E IESF
Sbjct: 398 SDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESF 437
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++ + ++D+ +YK GVY H G G H +KIIGWGVE G YWL VN
Sbjct: 198 IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 257
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G+ K+ G
Sbjct: 258 SWNEEWGDHGMIKLAVG 274
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V + H D + YK GVYQH G GG AV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKMNGTPYWKVANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F I RG +E IE
Sbjct: 302 TDWGMNGYFLILRGNNECNIE 322
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 49/82 (59%), Gaps = 1/82 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSW 62
E++ G + A ++D YK GVY+H G GGHAVK+ GWG D G YWL N W
Sbjct: 80 EVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKLNGWGTSDEGEDYWLLANQW 139
Query: 63 GELWGDGGLFKIRRGTDESRIE 84
WGD G FKI+RGT+E IE
Sbjct: 140 NTNWGDDGYFKIKRGTNECGIE 161
>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
Length = 461
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ H G I A + + D Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 358 MKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDTAS 416
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A V +
Sbjct: 417 GMDYWIVKNSWGAGWGENGYFRIRRGTDECAIESIAVAATPVPK 460
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ ++ +G + A+ + + D +YK G+Y+ T + GGH++KIIGWG ++G YWL V
Sbjct: 239 IEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WG+ G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329
>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
Length = 191
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 88 MELELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 146
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 147 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 190
>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
Length = 349
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 56/102 (54%), Gaps = 12/102 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
+Q E+ G + A H+D +Y GVYQH+ G H+V+++GWGV+
Sbjct: 224 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 283
Query: 53 ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+KYWLC NSWG WG+ G FKI RG + IESF V A
Sbjct: 284 TGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVVGA 325
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y+HT + G H+VKI GWG E DG
Sbjct: 354 ELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDG 413
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF V GRV
Sbjct: 414 QRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVVGVWGRV 457
>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 97
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 50/77 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G A + + D + Y+ GVY+HT G G H+V+IIGWG+E GV YWL +N
Sbjct: 5 IKKEIMTNGPTSATLSMYNDFLSYESGVYKHTSGTFMGVHSVEIIGWGIEKGVDYWLVMN 64
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 65 SWNEDWGDNGTFKIAQG 81
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 51/81 (62%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G V + ++D + YK GVY H G+ G +V++IGWG+E G +WL NSWG WGD
Sbjct: 270 GPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLEGGQAFWLLANSWGTSWGD 329
Query: 69 GGLFKIRRGTDESRIESFQVS 89
G FKIRR +E IE+F+ +
Sbjct: 330 KGFFKIRRFVNECWIENFRYA 350
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G +VA ++D Y+ G+Y+H G +G HAVK+IGWG E G YW+ N
Sbjct: 13 IQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVAN 72
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
SW + WG+ G F++ RG+++ E +++AG V
Sbjct: 73 SWHDDWGENGFFRMHRGSNDCGFEE-RMAAGSVQ 105
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/86 (44%), Positives = 49/86 (56%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G I A+ +QD + Y+ GVY G+ AVKI+GWG E+G YWL NS+
Sbjct: 56 EIYENGPITASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWGEENGTPYWLAANSFN 115
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVS 89
WGD G KI RG +E IE F +
Sbjct: 116 TYWGDNGFVKILRGANECYIEEFMYA 141
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G A+ ++D YK GVY+HT G G H+V+IIGWG E GV YWL +N
Sbjct: 252 IKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVMN 311
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G FKI +G
Sbjct: 312 SWNEGWGDHGTFKIAQG 328
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 83.6 bits (205), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 50/85 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G V A + + D + YK GVY+H G+ GGHAV+I+GWG +G YW N
Sbjct: 239 FKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SW WG G F I RG +E IES
Sbjct: 299 SWDTDWGMNGHFLILRGNNECGIES 323
>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
Length = 466
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ H G + A E + D + Y +G+Y HT E++ HAV ++G+G +
Sbjct: 363 MKLELVHHGPMAVAFEVYDDFLHYNQGIYHHTGLKDPFNPFELTN-HAVLLVGYGTDPKT 421
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 422 GLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIESIAMAATPIPK 465
>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 159
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++ + ++D+ +YK GVY H G G H +KIIGWGVE G YWL VN
Sbjct: 69 IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 128
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G+ K+ G
Sbjct: 129 SWNEEWGDHGMIKLAVG 145
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 59/93 (63%), Gaps = 7/93 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE---DGVK 54
+ EI H G + A ++ + D YK+G+Y+H+ + +G H+V+I+GWG E +G+K
Sbjct: 343 IMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLK 402
Query: 55 -YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
YW NSWG WG+ G F+I RG++E IESF
Sbjct: 403 KYWKVANSWGPEWGENGYFRILRGSNECEIESF 435
>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
Length = 458
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 59/101 (58%), Gaps = 8/101 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
M+LE+ G + A E + D I Y+ GVY HT + HAV ++G+G + G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
KYW+ NSWGE WG+ G F+IRRG+DE IES VSA +
Sbjct: 415 EKYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANPI 455
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 56/93 (60%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ I +GS+ A ++DL YK GVY+H + GGHAV +IG+GVE G YWL N
Sbjct: 271 IKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVALIGFGVEGGSNYWLAAN 330
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
SWG WG G FKI +G E IE+ QV AG
Sbjct: 331 SWGPNWGMSGYFKIAQG--EGGIEN-QVYAGEA 360
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 240 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 299
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI RG +E IE V+AG
Sbjct: 300 NQWNRGWGDDGYFKIIRGKNECGIEE-DVTAG 330
>gi|145513975|ref|XP_001442898.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410259|emb|CAK75501.1| unnamed protein product [Paramecium tetraurelia]
Length = 358
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 60/91 (65%), Gaps = 2/91 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
++ EI + G IVA I+ +D ++YK GVY+ G + GHAVK+IGWG +DGV YW+
Sbjct: 256 IKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSWG+ WG GL + G ++ ++E++ V+
Sbjct: 316 ENSWGDTWGLKGLAYVAVGQNQLQLEAYSVA 346
>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
Length = 460
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
M+LE+ G + A E + D + YK+G+Y HT E++ HAV ++G+G E
Sbjct: 357 MKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLKDEFNPFELTN-HAVLLVGYGKDPES 415
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G K+W+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 416 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 237 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 296
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RG +E IE
Sbjct: 297 NQWNRGWGDDGYFKIIRGKNECGIE 321
>gi|294931810|ref|XP_002780018.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239889821|gb|EER11813.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 131
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 50/78 (64%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G A+ ++D YK GVY+HT G G H+V+IIGWG E GV YWL +N
Sbjct: 31 IKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGTEKGVDYWLVMN 90
Query: 61 SWGELWGDGGLFKIRRGT 78
SW E WGD G FKI +G+
Sbjct: 91 SWNEGWGDHGTFKIAQGS 108
>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 163
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G V + ++D + YK GVY H G+ G +V++IGWG+E G +WL NSWG WGD
Sbjct: 78 GPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSVRMIGWGLEGGQAFWLFANSWGTSWGD 137
Query: 69 GGLFKIRRGTDESRIESFQVSA 90
G FKIRR +E IE+F+ +
Sbjct: 138 KGFFKIRRFVNERWIENFRYAG 159
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 237 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 296
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
N W WGD G FKI RG +E IE
Sbjct: 297 NQWNRGWGDDGYFKIIRGKNECGIE 321
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 54/92 (58%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 238 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLLA 297
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI RG +E IE +V AG
Sbjct: 298 NQWNRGWGDDGYFKIIRGKNECGIEE-EVVAG 328
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 2/95 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + + D +YK GVY T + H K+IGWGVE+GV YWL V
Sbjct: 235 IQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLV 294
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
N WG WG GLFKI+RGT+E +E + V AG +
Sbjct: 295 NFWGNEWGQNGLFKIKRGTNEVHVEDY-VYAGEPE 328
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 57/92 (61%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q EI G + A+ + D + YK GVY ++ + GGH+VKIIGWG E YWL
Sbjct: 248 IQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIA 307
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW E WG+ GLF++ RG +E IE+ Q+ AG
Sbjct: 308 NSWNEDWGEKGLFRMLRGRNECGIEA-QIVAG 338
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 58/104 (55%), Gaps = 1/104 (0%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 241 IMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEE 103
N W WGD G FKI RG +E IE V+ ++ + + ++
Sbjct: 301 NQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDD 344
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 59/93 (63%), Gaps = 7/93 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE---DGVK 54
+ EI H G + A ++ + D YK+G+Y+H+ + +G H+V+I+GWG E +G+K
Sbjct: 217 IMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLK 276
Query: 55 -YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
YW NSWG WG+ G F+I RG++E IESF
Sbjct: 277 KYWKVANSWGPEWGENGYFRILRGSNECEIESF 309
>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain 1; AltName: Full=Dipeptidyl peptidase I
heavy chain 1; Contains: RecName: Full=Dipeptidyl
peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
peptidase I heavy chain 2; Contains: RecName:
Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
Full=Dipeptidyl peptidase I heavy chain 3; Contains:
RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
AltName: Full=Dipeptidyl peptidase I heavy chain 4;
Contains: RecName: Full=Dipeptidyl peptidase 1 light
chain; AltName: Full=Dipeptidyl peptidase I light chain;
Flags: Precursor
gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
Length = 435
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E + D Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 332 MKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 390
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 391 GMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 55/90 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q ++ G I A +E + D + Y G+Y H G G +V+I+GWG+ +GV YWL N
Sbjct: 280 IQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 339
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SWG+ WG+ G F++ RG +E +E+ VS
Sbjct: 340 SWGKQWGENGTFRVLRGVNECGLEANCVSG 369
>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
Length = 459
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E + D Y+KG+Y HT E++ HAV ++G+G +
Sbjct: 356 MKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 414
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 415 GMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 458
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 52/85 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M EI+ G I +I DL+ YK G+Y+ T G + HA+ ++GWG EDG KYW+ N
Sbjct: 184 MMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWGEEDGQKYWIARN 243
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G F+I RG + IE+
Sbjct: 244 SWGTFWGEKGWFRIVRGENNLGIEA 268
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 50/87 (57%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
++ EIF G + ++ ++ + Y+ G++ G + G HAV++ GWG EDG KYW+
Sbjct: 467 IKAEIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIAR 526
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSWG WG+ G F++ G + I +
Sbjct: 527 NSWGPYWGEHGWFRMIVGVSKGLITGY 553
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 60/105 (57%), Gaps = 5/105 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 245 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLA 304
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV---DRDRSSDL 101
N W WG G FKI RG +E IE V+AG + DR++D+
Sbjct: 305 NQWNRGWGGDGYFKIIRGKNECGIEE-DVTAGMPSTKNMDRNNDV 348
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE---MSGGHAVKIIGWGVE-DGVKYW 56
+ EIFH G + A + ++D Y GVY+ T ++G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEDYVLAS 418
>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
Length = 447
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 344 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 402
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 403 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 446
>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 461
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 63/110 (57%), Gaps = 9/110 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-----GGHAVKIIGWGVE----D 51
+ LEI G + A + H+D YK G+Y+H+ S G H+V++IGWG E +
Sbjct: 322 IMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYE 381
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL 101
KYW+ VNSWG WG+ G F+I RG++E IES+ +++ + DL
Sbjct: 382 VTKYWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLPYVHQQVKDL 431
>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAIPIPK 461
>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 456
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 58/102 (56%), Gaps = 12/102 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-----VGEMS---GGHAVKIIGWGVEDG 52
+Q E+ G + A H+D +Y GVYQH+ G S G H+V+++GWGV+
Sbjct: 345 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 404
Query: 53 ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+KYWLC NSWG WG+ G FKI RG + IESF + A
Sbjct: 405 TGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 446
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 54/85 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A +D I YK+G+Y+ T G+ G HA+K+IGWG E+G YWL N
Sbjct: 246 IRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGTENGTDYWLVAN 305
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
S+ WG+ G F+I RGT+ IES
Sbjct: 306 SYNYDWGENGTFRILRGTNHCLIES 330
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 12/102 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
+Q E+ G + A H+D +Y GVYQH+ G H+V+++GWGV+
Sbjct: 401 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 460
Query: 53 ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+KYWLC NSWG WG+ G FKI RG + IESF + A
Sbjct: 461 TGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502
>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A
Sbjct: 418 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 456
>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
Length = 462
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 59/103 (57%), Gaps = 8/103 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVED--G 52
M+LE+ G + A E + D I Y+KGVY HT S HAV ++G+G ++ G
Sbjct: 359 MKLELVENGPMAVAFEVYNDFIHYQKGVYHHTGLRDSFNPFEITNHAVLLVGYGTDEKTG 418
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
YW+ NSWG WG+ G F+I RGTDE IES VSA + +
Sbjct: 419 EHYWIVKNSWGSYWGEDGYFRILRGTDECGIESIAVSATPIPK 461
>gi|12832450|dbj|BAB22112.1| unnamed protein product [Mus musculus]
Length = 461
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 358 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGRDPVT 416
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G++YW+ NSWG WG+ G F+IRRGTDE IES V+A
Sbjct: 417 GIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIAVAA 455
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 60/94 (63%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
++ EIFH G + A + ++D Y G+Y+ T G +G H+VK++GWG E +G KYW
Sbjct: 326 IKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 385
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 386 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 419
>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
Length = 804
Score = 83.2 bits (204), Expect = 4e-14, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
M +I+ G I ++ D KKG+Y ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGGHAVMIVGWGEENGVPYWDC 251
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
N++G WGD G FKI+RG++E +IE++ SA +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287
>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
Length = 804
Score = 83.2 bits (204), Expect = 4e-14, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
M +I+ G I ++ D KKG+Y ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGGHAVMIVGWGEENGVPYWDC 251
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
N++G WGD G FKI+RG++E +IE++ SA +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287
>gi|145514872|ref|XP_001443341.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410719|emb|CAK75944.1| unnamed protein product [Paramecium tetraurelia]
Length = 358
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 60/91 (65%), Gaps = 2/91 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
++ EI + G IVA I+ +D ++YK GVY+ G + GHAVK+IGWG +DGV YW+
Sbjct: 256 IKREILNNGPIVAVIQVFKDFLVYKGGVYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSWG+ WG GL + G ++ ++E++ V+
Sbjct: 316 ENSWGDSWGLKGLAYVAVGQNQLQLEAYSVA 346
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + A ++D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 98 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLA 157
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
N W WGD G FKI RG +E IE V+
Sbjct: 158 NQWNRGWGDDGYFKIIRGKNECGIEEGVVAG 188
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 82.8 bits (203), Expect = 4e-14, Method: Composition-based stats.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 423 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 482
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 483 RTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 526
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + AA+ + D+ ++K GVY T + VK+IGWGVE+GV YWL V
Sbjct: 236 IQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLV 295
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
NSWG WG GL KI+RG +ESF +A
Sbjct: 296 NSWGNEWGQNGLLKIKRGKYGCAVESFVYAA 326
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 60/99 (60%), Gaps = 9/99 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----D 51
+ +EI G + A + H+D YK G+Y+H+ E +G H+V++IGWG E +
Sbjct: 323 IMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYE 382
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
KYW+ VNSWG WG+ G F+I RG +E IES+ +++
Sbjct: 383 TTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLAS 421
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y+HT + G H+VKI GWG E DG
Sbjct: 353 ELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDG 412
Query: 53 V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW NSWG WG+ G F+I RG +E +ESF V GRV
Sbjct: 413 QVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVVGVWGRV 456
>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
Length = 458
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 58/98 (59%), Gaps = 8/98 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
M+LE+ G + A E + D + Y+ GVY HT + HAV ++G+G + G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
KYW+ NSWGE WG+ G F+IRRGTDE IES VSA
Sbjct: 415 EKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSA 452
>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++ + ++D+ +YK GVY H G G H +KIIGWGVE G YWL VN
Sbjct: 40 IKQEIFTNGPVIGMLSLYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 99
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G+ K+ G
Sbjct: 100 SWNEEWGDHGMIKLAVG 116
>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
Length = 463
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
Length = 804
Score = 82.8 bits (203), Expect = 5e-14, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
M +I+ G I ++ D KKG+Y ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVGWGEENGVPYWDC 251
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
N++G WGD G FKI+RG++E +IE++ SA +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287
>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 49/77 (63%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++ + ++D+ +YK GVY H G G H +KIIGWGVE G YWL VN
Sbjct: 40 IKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVN 99
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WGD G+ K+ G
Sbjct: 100 SWNEEWGDHGMIKLAVG 116
>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
Length = 463
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
Length = 463
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Query: 2 QLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ +I+ G I+A + + D+ YK GVY + HA ++IGWGVEDGV+YWL N
Sbjct: 165 KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGVEDGVQYWLAAN 224
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG GLFKIR GT+E E+
Sbjct: 225 SWGTGWGQQGLFKIRSGTNEVGFEA 249
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V + H D + YK GVYQH G GG AV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F I RG +E IE
Sbjct: 302 TDWGMNGYFLILRGDNECNIE 322
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/93 (43%), Positives = 57/93 (61%), Gaps = 3/93 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED--GVKYWLC 58
++ EIFH G + A + A++D Y+ G+Y H G HAVKIIGWG + YWL
Sbjct: 210 IRTEIFHNGPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLV 269
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NS+ WG+ G FKI+RG +E IE+ +++AG
Sbjct: 270 ANSFNTDWGEYGFFKIKRGVNECGIEN-KITAG 301
>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
P15]
Length = 627
Score = 82.8 bits (203), Expect = 5e-14, Method: Composition-based stats.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
M +I+ G I ++ D KKG+Y ++ GGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVGWGEENGVPYWDC 251
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
N++G WGD G FKI+RG++E +IE++ SA +D
Sbjct: 252 ANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 82.4 bits (202), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 42/81 (51%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
+ G +VA QD + YK GVYQH G GGHAV+IIG+GV D G+ YW NSWG
Sbjct: 270 LLAHGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDSGLDYWTVRNSWG 329
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+I RG DE IE
Sbjct: 330 PDWGEDGYFRIVRGGDECGIE 350
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y GVY+ T G +G H+VK++GWG E +G KYW
Sbjct: 327 IMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 386
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 387 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 420
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 56/102 (54%), Gaps = 12/102 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
+Q E+ G + A H+D +Y GVYQH+ G H+V+++GWGV+
Sbjct: 341 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 400
Query: 53 ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+KYWLC NSWG WG+ G FKI RG + IESF + A
Sbjct: 401 TGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIGA 442
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V + H D + YK GVYQH G GG AV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F I RG +E IE
Sbjct: 302 TDWGMNGYFLILRGDNECNIE 322
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 56/102 (54%), Gaps = 12/102 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVEDG 52
+Q E+ G + A H+D +Y GVYQH+ G H+V+++GWGV+
Sbjct: 327 IQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHS 386
Query: 53 ----VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+KYWLC NSWG WG+ G FK+ RG + IESF + A
Sbjct: 387 TGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIGA 428
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y GVY+ T G +G H+VK++GWG E +G KYW
Sbjct: 327 IMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYW 386
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 387 IAANSWGPWWGERGYFRILRGSNECGIEDYVLAS 420
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 54/90 (60%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q ++ G I E + D + Y G+Y H G G +V+I+GWG+ +GV YWL N
Sbjct: 238 IQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SWG+ WG+ G F+ RGT+E +E+ VSA
Sbjct: 298 SWGKEWGENGTFRALRGTNECGLEANCVSA 327
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 48/81 (59%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V A + + D YK GVY+H G++ GGHAV+I+GWG +G YW NSW
Sbjct: 241 ELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 300
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G F I RG DE IE
Sbjct: 301 TDWGMNGHFLILRGKDECGIE 321
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 51/81 (62%), Gaps = 1/81 (1%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
+ G +VA QD + YK GVYQH G GGHAV+++G+GV D G+ YW NSWG
Sbjct: 206 LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDSGLDYWTVRNSWG 265
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+I RG+DE IE
Sbjct: 266 PDWGEDGYFRIVRGSDECGIE 286
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EIF G + A + ++D Y G+Y+HT G G H+VK+IGWG E DG KYW
Sbjct: 280 IMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYW 339
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + ++A
Sbjct: 340 IATNSWGTWWGEHGNFRILRGSNECGIEEYVLAA 373
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 51/88 (57%), Gaps = 2/88 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV--EDGVKYWLC 58
+Q EI G + A + ++D + YK GVY H GE G HAV+I+GWGV V YWL
Sbjct: 246 IQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLV 305
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESF 86
NSWG WGD G F I RG + IE +
Sbjct: 306 ANSWGSDWGDNGFFHIFRGENHCDIEGY 333
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 55/89 (61%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ +G + + ++D + YK+G+Y +T G+ G H+VKIIGWG E G+KYWL NS+
Sbjct: 262 EIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGWGEERGIKYWLAANSFN 321
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGR 92
WGD G FKI R S S V AGR
Sbjct: 322 TDWGDKGFFKIIREGVGSCGISDNVVAGR 350
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 55/87 (63%), Gaps = 2/87 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
G + A + D + Y+ GVYQHT G M GGHAV+++G+G +DGV YW+ NSWG WG
Sbjct: 214 GPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWG 273
Query: 68 DGGLFKIRRGTDESRIESFQVSAGRVD 94
+ G F++ RG ++ IE Q AG D
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFD 299
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/65 (55%), Positives = 45/65 (69%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + A+ ++D + Y+ GVY+HT G GGHAVKIIGWG + G YWL VNSW E WGD
Sbjct: 136 GPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEKSGQAYWLAVNSWNEDWGD 195
Query: 69 GGLFK 73
GLF+
Sbjct: 196 HGLFR 200
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 55/88 (62%), Gaps = 2/88 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
G + A + D + Y+ GVYQHT G M GGHAV+++G+G +DGV YW+ NSWG WG
Sbjct: 214 GPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVDYWIIRNSWGPDWG 273
Query: 68 DGGLFKIRRGTDESRIESFQVSAGRVDR 95
+ G F++ RG ++ IE Q AG D
Sbjct: 274 EDGYFRMIRGINDCSIEE-QAYAGFFDE 300
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + AA+ + D+ ++K GVY T + VK+IGWGVE+GV YWL V
Sbjct: 181 IQKEVQTYGPVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLV 240
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
NSWG WG GL KI+RG +ESF +A
Sbjct: 241 NSWGNEWGQNGLLKIKRGKYGCAVESFVYAA 271
>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
Length = 357
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
++ EI + G +VA I+ +D ++YK G+Y+ G + GHAVK+IGWG +DGV YW+
Sbjct: 256 IKREILNNGPVVAVIQVFKDFLVYKGGIYEVVEGSSKFQYGHAVKVIGWGKQDGVNYWVI 315
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
NSWG+ WG GL + G ++ ++E++ V+
Sbjct: 316 ENSWGDSWGLKGLAYVAVGQNQLQLEAYSVAP 347
>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
Length = 420
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
M+LE+ G + A E + D + YK+G+Y HT E++ HAV ++G+G +
Sbjct: 317 MKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLKDNFNPFELTN-HAVLLVGYGKDPKS 375
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G K+W+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 376 GEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 419
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
+ EIFH G + A + ++D Y GVY+ T +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 418
>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 57/88 (64%), Gaps = 4/88 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV----EDGVKYW 56
MQL I + GS+ A+++ ++D + Y+ GVY+H VG H+V+I+GWG+ + + YW
Sbjct: 120 MQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHSVRIVGWGITSPQQGSIPYW 179
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIE 84
+C N+W E WG G F I RG++E IE
Sbjct: 180 ICGNNWTEEWGMQGWFWILRGSNECNIE 207
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGV--KYWLCVNSWGELW 66
G + AA ++D + YK GVY H G + GGHA+K++GWGV+ YW+ NSWG W
Sbjct: 149 GPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWIIANSWGPSW 208
Query: 67 GDGGLFKIRRGTDESRIE 84
G G F I RG+DE IE
Sbjct: 209 GLNGFFWILRGSDECGIE 226
>gi|343459017|gb|AEM37667.1| cathepsin C subunit [Epinephelus bruneus]
Length = 106
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M LE+ G + A E + D +IYK+G+Y HT E++ HAV ++G+G +
Sbjct: 3 MMLELVKNGPMAVAFEVYPDFMIYKEGIYHHTGLADSFNPFELTN-HAVLLVGYGRCHKT 61
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G KYW+ NSWG WG+ G F+IRRG+DE IES V+A + +
Sbjct: 62 GQKYWIVKNSWGTDWGEDGYFRIRRGSDECSIESIAVAANPIPK 105
>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
Length = 458
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 56/96 (58%), Gaps = 6/96 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVEDGVK 54
M++ + G + IE + DL Y+ G+Y HT + H V ++G+G EDG K
Sbjct: 354 MRVALNKIGPLAVNIEVYPDLQFYRSGIYHHTELDFKFNPFEITNHVVVVVGYGEEDGQK 413
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
YW+ NSWGE WG+ G F+IRRGTDE IES V A
Sbjct: 414 YWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYA 449
>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
Length = 463
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLSVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
+ EIFH G + A + ++D Y GVY+ T +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 418
>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
Length = 463
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G +V + E + D I Y KG+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSW WG+ G F+IRRGTDE IES ++A + +
Sbjct: 419 GLDYWIVKNSWSATWGEDGYFRIRRGTDECGIESIALTATPIPK 462
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 82.0 bits (201), Expect = 9e-14, Method: Composition-based stats.
Identities = 43/110 (39%), Positives = 56/110 (50%), Gaps = 9/110 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM--SGGHAVKIIGWGVEDGVKYWLC 58
MQ EI G I A ++ + YK GVY E+ GGHAVKI+GWG E G YWL
Sbjct: 467 MQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGTEGGKDYWLV 526
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIE-------SFQVSAGRVDRDRSSDL 101
NSW WGD G FKI G + ++ +F V +R+++L
Sbjct: 527 ANSWNTSWGDEGYFKIAVGAESISLDVVKRVFAAFDVDLAETRNERTNEL 576
>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
Length = 559
Score = 81.6 bits (200), Expect = 9e-14, Method: Composition-based stats.
Identities = 37/85 (43%), Positives = 50/85 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M EI+ G I I QD + YK G+Y+ G + HA+ ++GWG E+G KYW+ N
Sbjct: 185 MMKEIYARGPITCGIAVPQDFVDYKGGIYKDESGAVEKVHAISVVGWGEENGEKYWIGRN 244
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G F+I RG + IES
Sbjct: 245 SWGNYWGEEGWFRIARGINNLAIES 269
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/79 (40%), Positives = 45/79 (56%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + ++ + + Y GVY+ M GH V+I GWGVE+G YW+ N
Sbjct: 465 IKAEIFARGPVSCSMTVRESFLDYHGGVYESDSSPMVAGHIVEIAGWGVENGRPYWIGRN 524
Query: 61 SWGELWGDGGLFKIRRGTD 79
SWGE WG+ G F+I D
Sbjct: 525 SWGEYWGEEGWFRIDMEKD 543
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 56/92 (60%), Gaps = 3/92 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVEDGVKYWL 57
+ +I G + A + HQD Y G+Y+ + + G H+V+I+GWG + G KYW+
Sbjct: 323 IMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDRGDKYWV 382
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVS 89
NSWG WG+ G F+I RG++ES IESF V+
Sbjct: 383 VANSWGCDWGENGYFRIARGSNESGIESFVVT 414
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGVEDGVKYWLCV 59
+Q EI G ++A+I + D ++YK GVY T + G ++IIGWG E + YWLC
Sbjct: 246 IQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLCA 305
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSW E WGD G KI+RG IES+
Sbjct: 306 NSWNEEWGDNGYVKIQRGVQAGYIESY 332
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
+ EIFH G + A + ++D Y GVY+ T +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 418
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
+ EIFH G + A + ++D Y GVY+ T +G H+VK++GWG E +G KYW
Sbjct: 324 IMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYW 383
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 384 IAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 417
>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 69/126 (54%), Gaps = 17/126 (13%)
Query: 3 LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGVEDGVKY 55
LEI G +V + E D + Y+ G+Y H+ E S H+V GWG E+GVK+
Sbjct: 329 LEIMKNGPVVLSFEPSYDFMYYESGIY-HSKAETSDYSEWEKVDHSVLCYGWGEEEGVKF 387
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGR--VDRDRSSDLEE-------FEY 106
W+ NSWG+ WG+ G F+++RG DES IES ++ ++++ S E F+Y
Sbjct: 388 WMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDPYVINQNSSKSFSETKSNESDFDY 447
Query: 107 DTDTTI 112
+ D +I
Sbjct: 448 EDDDSI 453
>gi|327269233|ref|XP_003219399.1| PREDICTED: dipeptidyl peptidase 1-like [Anolis carolinensis]
Length = 467
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGV--ED 51
M+LE+ G + A E + D + Y+ G+Y HT G M HAV ++G+G E
Sbjct: 364 MKLELVKHGPMAVAFEVYSDFMHYRGGIYHHT-GLMDPFNPFELTNHAVLLVGYGTDPET 422
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G +W+ NSWG WG+ G F+IRRGTDE IES V++ + +
Sbjct: 423 GEPFWIVKNSWGPAWGEQGYFRIRRGTDECAIESIAVASTPIPK 466
>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
Cysteine Protease Of The Papain Family
Length = 438
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 335 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 393
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 394 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 437
>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
Length = 455
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M LE+ G + A+E + D + YK+G+Y HT E++ HAV ++G+G +
Sbjct: 352 MMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLRDANNPFELTN-HAVLLVGYGQCHKT 410
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G KYW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 411 GEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATPIPK 454
>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
Length = 462
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 418 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 59/92 (64%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
++ ++ +G + A+ + + D +YK G+Y+ T + G H++KIIGWG E+G YWL V
Sbjct: 239 IEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAV 298
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
NSW + WG+ G FKI +G +E IE V+AG
Sbjct: 299 NSWSKFWGEHGTFKIIKGRNECGIER-AVTAG 329
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWG 63
+ G +VA QD + YK GVYQH G GGHAV+I+G+GV D G+ YW NSWG
Sbjct: 273 LLAHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDSGLDYWTVRNSWG 332
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+I RG DE IE
Sbjct: 333 PDWGEDGYFRIVRGGDECGIE 353
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 53/90 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A + ++D YK GVY H G++ H +K+IGWGVE G +YWL +N
Sbjct: 137 IKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGVESGQEYWLAMN 196
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+W E WGD G+ K+ G + + + A
Sbjct: 197 AWNEEWGDHGMIKLAVGKTGLEHQVYHIEA 226
>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
Peptide, 462 aa]
Length = 462
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ G + A E H D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 359 MKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN-HAVLLVGYGKDPVT 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 418 GLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 2 QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNS 61
+ E++ G A ++D + Y+ GVY+H G GGHAV+++GWG +GV YW NS
Sbjct: 240 KRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWGERNGVPYWKIANS 299
Query: 62 WGELWGDGGLFKIRRGTDESRIESFQVSAG 91
W WG+ G RG DE IES Q SAG
Sbjct: 300 WNTDWGENGYLYFYRGKDECGIES-QGSAG 328
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 50/80 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q EI G + A E ++D + YK GVY H G+ GGH +KI+G+GV +G YW+C N
Sbjct: 213 IQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVSNGTPYWICNN 272
Query: 61 SWGELWGDGGLFKIRRGTDE 80
SW WG+ G+F I G +E
Sbjct: 273 SWTTSWGNNGIFWIEAGKNE 292
>gi|37905530|gb|AAO64478.1| cathepsin C precursor [Fundulus heteroclitus]
Length = 450
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 61/103 (59%), Gaps = 8/103 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGV--EDG 52
M+LE+ G + A+E + D + YK+G+Y HT S HAV ++G+G + G
Sbjct: 347 MKLELVKNGPMAVALEVYPDFMHYKEGIYHHTGFRDSVNPFELTNHAVLLVGYGRCHKTG 406
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
KYW+ NSWG WG+ G F+IRRG+DE IES V+A + +
Sbjct: 407 QKYWIVKNSWGSGWGEDGYFRIRRGSDECAIESIAVAAKPIPK 449
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
+ E++ G + + ++D YK GVY+ G M GGHA K+IGWG D G YWL
Sbjct: 241 IMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLLA 300
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
N W WGD G FKI RGT+E IE V+AG
Sbjct: 301 NQWNRGWGDDGYFKIIRGTNECGIEG-DVNAG 331
>gi|73696355|gb|AAZ80953.1| cathepsin C [Macaca mulatta]
Length = 118
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 62/104 (59%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 15 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 73
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 74 GMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 117
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV---GEMSGGHAVKIIGWGVE-DGVKYW 56
+ EI+H G + A + ++D Y G+Y+ T G G H+VK++GWG E +G KYW
Sbjct: 324 IMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYW 383
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 384 IAANSWGPWWGERGYFRILRGSNECGIEEYVLAS 417
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
EI G + A +E H+D +YK G+Y+HT + G H+VKI GWG E V
Sbjct: 357 EIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDG 416
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW+ NSWG+ WG+ G F+I RG +E IE+F + GR+
Sbjct: 417 AKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIGVWGRI 460
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 60/104 (57%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT M+ G H+VKI GWG E DG
Sbjct: 357 ELMENGPVQALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDG 416
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG++E IESF + GRV
Sbjct: 417 RTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLGVWGRV 460
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 60/101 (59%), Gaps = 11/101 (10%)
Query: 9 GSIVAAIEAHQDLIIYK---------KGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
G +VAA + + D IY+ +GVY +T G + G AVKIIGWG E+G YWL
Sbjct: 228 GPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAA 287
Query: 60 NSWGELWGD-GGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
NSWG+ WG GG FKIRRGT+E E + AG+V S+
Sbjct: 288 NSWGKDWGALGGFFKIRRGTNECGFEE-SIIAGQVREGGST 327
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDGV-- 53
E++ G + A +E H+D +YK G+Y+ T G H+VKI GWG E G
Sbjct: 372 ELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDG 431
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYWL NSWG WG+ G F+I RG +E IE+F V GRV
Sbjct: 432 QTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRV 475
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT--------VGEMSGGHAVKIIGWGVE---DG 52
EI G + A +E H+D +YK G+Y+HT G H+V+I GWG + DG
Sbjct: 354 EIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKPPQYRKHGTHSVRITGWGEDKDYDG 413
Query: 53 V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW+ NSWG+ WG+ G F+I RG +E IE+F + GR+
Sbjct: 414 TPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFVIGVWGRI 457
>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
Length = 238
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 12/95 (12%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVED---- 51
EI+ G + A + +D +Y GVY+HT + S H+V+I+GWGV+
Sbjct: 112 EIYANGPVQALMLVKEDFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRTQYR 171
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
KYWLC NSWG WG+ G F+I RG DES+IESF
Sbjct: 172 PQKYWLCANSWGSGWGENGYFRIVRGEDESQIESF 206
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 48/77 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G AA + D + Y+ GVY+HT G + G H V+IIGWG + GV YWL +N
Sbjct: 252 IKKEIMTNGPTSAAFSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGWGTKQGVDYWLVMN 311
Query: 61 SWGELWGDGGLFKIRRG 77
SW E WG G FKI +G
Sbjct: 312 SWNEGWGVHGTFKIAQG 328
>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
Length = 562
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 52/85 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M EI+ G I I ++L+ YK G+Y+ T G S H++ ++GWG EDG KYW+ N
Sbjct: 184 MMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWGEEDGQKYWIARN 243
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG WG+ G F+I RG + IE+
Sbjct: 244 SWGTFWGEKGWFRIVRGENNLGIEA 268
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
++ EIF G + I Q+ + Y+ G+++ E G H+V++ GWG EDG KYW+
Sbjct: 467 IKAEIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETEDGTKYWIGR 526
Query: 60 NSWGELWGDGGLFKI 74
NSWG WG+ G F+I
Sbjct: 527 NSWGTYWGEHGWFRI 541
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 59/97 (60%), Gaps = 4/97 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-EMSGGHAVKIIGWGVED--GVKYWL 57
+Q +F +G I ++ +QD + Y GVY T G ++ GGHA+KI+GWG + G+ YW+
Sbjct: 212 IQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWI 271
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
NSWG WG G F I+RGT+ I+ SAG+ D
Sbjct: 272 VQNSWGSDWGMNGFFWIQRGTNMCGIDR-DASAGQAD 307
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q ++ G I E + D + Y G+Y H G G +V+I+GWG+ +GV YWL N
Sbjct: 193 IQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMYEGVPYWLLAN 252
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SWG+ WG+ G F+ RGT+E +E+ VS
Sbjct: 253 SWGKEWGENGTFRALRGTNECGLEANCVSG 282
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 61/94 (64%), Gaps = 7/94 (7%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVEDG----VKYW 56
EI ++GS+ A ++ ++ +Y+ GVY+ + +G +G H V+I+GWG E VKYW
Sbjct: 363 EIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYW 422
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I +GT+E +IE F V+A
Sbjct: 423 IVSNSWGLWWGESGYFRILKGTNECQIEDFVVAA 456
>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
Length = 463
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+LE+ + G + A E + D + Y+ G+Y HT E++ HAV ++G+G +
Sbjct: 360 MKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 418
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+I RGTDE IES V+A + +
Sbjct: 419 GMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIESIAVAATPIPK 462
>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 185
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 56/89 (62%), Gaps = 2/89 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G ++++ + ++D YK GVY T E S H++KIIGWG G +YWL VN
Sbjct: 94 IKQEIFDNGPVLSSFKMYEDFRYYKSGVYVPTTKESSTSHSIKIIGWGGASGREYWLAVN 153
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW E WGD GL K+ G ++R+E +S
Sbjct: 154 SWNEEWGDHGLIKMAFG--KNRLEKIVLS 180
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWG-VEDGVKYWLCV 59
+QLE+ G ++ + ++DL+ YK+GVY++T G GGHA+KIIGWG E G +W C
Sbjct: 253 IQLELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQ 312
Query: 60 NSWGELWGDGGLFKIRRG 77
N WG+ WG GG I+ G
Sbjct: 313 NQWGKDWGMGGYINIKAG 330
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 59/94 (62%), Gaps = 4/94 (4%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM---SGGHAVKIIGWGVE-DGVKYW 56
+ EIF+ G + A + ++D Y +GVY+ T +G H+VK++GWG E +G KYW
Sbjct: 325 IMAEIFNSGPVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYW 384
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I RG++E IE + +++
Sbjct: 385 IAANSWGSWWGEKGYFRILRGSNECGIEEYVLAS 418
>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
Length = 356
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
++LE+ H G + A E + D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 253 IKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 311
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 312 GQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 355
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 47/84 (55%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G A + DL YK GVY+H G G HAV+I+GWG + GV YW N
Sbjct: 238 FRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIAN 297
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WGD G F + RG +E IE
Sbjct: 298 SWNAEWGDRGYFFMLRGDNECGIE 321
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 61/94 (64%), Gaps = 7/94 (7%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVEDG----VKYW 56
EI ++GS+ A ++ ++ +Y+ GVY+ + +G +G H V+I+GWG E VKYW
Sbjct: 363 EIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQNGRTVKYW 422
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I +GT+E +IE F V+A
Sbjct: 423 IVSNSWGLWWGESGYFRILKGTNECQIEDFVVAA 456
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G A + DL YK GVY+H G G HAV+I+GWG + GV YW NSW
Sbjct: 241 ELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWGNQSGVPYWKIANSWN 300
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WGD G F + RG +E IE
Sbjct: 301 AEWGDRGYFFMLRGDNECGIE 321
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
EI G + A +E H+D +YK G+Y+HT + G H+V+I GWG + V
Sbjct: 379 EIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDG 438
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW+ NSWG+ WG+ G F+I RG +E IE+F + GR+
Sbjct: 439 TSRKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIGVWGRI 482
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y+HT G H+VKI GWG E DG
Sbjct: 407 ELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDG 466
Query: 53 V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW+ NSWG+ WG+ G F+I RG +E IE+F V GRV
Sbjct: 467 SNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVVGVWGRV 510
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 61/96 (63%), Gaps = 9/96 (9%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG-----EMSGGHAVKIIGWGVE----DGVK 54
EI G++ A + ++D Y+ G+Y+H+ E S H+V++IGWG E D VK
Sbjct: 327 EIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVK 386
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
YW+ +NSWG+ WG+ G F+I RG++E IES+ +++
Sbjct: 387 YWIAINSWGQWWGENGRFRILRGSNECDIESYVLAS 422
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 54/86 (62%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY-QHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
+Q E+ +G + + D ++YK GVY + ++ K+IGWGVE+GV YWL +
Sbjct: 218 IQKEVQTYGPVAVQFKVCDDFLLYKSGVYVKSDNAKVIRTQYAKLIGWGVENGVDYWLVI 277
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWG WG GLFKI+RGT++ +ES
Sbjct: 278 NSWGHEWGQKGLFKIKRGTNQCGVES 303
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 55/90 (61%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q ++ G + A +E + D + Y G+Y H G G +V+I+GWG+ +GV YWL N
Sbjct: 248 IQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRILGWGMFEGVPYWLLAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSA 90
SWG+ WG+ G F++ RG +E +E+ +S
Sbjct: 308 SWGKEWGENGTFRVLRGVNECGLEANCISG 337
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVNSWGELWG 67
G + A + D + Y+ GVYQHT G + GGHAV ++G+G +DGV YW+ NSWG WG
Sbjct: 213 GPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVDYWIIKNSWGPDWG 272
Query: 68 DGGLFKIRRGTDESRIE 84
+ G F+I R T+E IE
Sbjct: 273 EDGYFRIIRMTNECGIE 289
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + ++DL+ YK GVY H G G H ++I+GWG EDGV YWL NSW
Sbjct: 857 EIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVANSWN 916
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
E WG+ G ++ R +E I QV+AG DL F Y
Sbjct: 917 EDWGEKGYMRVLRWRNECGIVD-QVTAGL------PDLSNFPY 952
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A+ + D + Y GVY H G HA++I+GWG +DGV YWL NSW
Sbjct: 212 EIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWGEDDGVPYWLIANSWN 271
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
E WG+ G + RG +E IE +V+A +D
Sbjct: 272 EDWGEKGYVRFLRGHNECGIEE-EVTAVPID 301
>gi|355572434|ref|ZP_09043578.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
gi|354824808|gb|EHF09050.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
Length = 685
Score = 80.1 bits (196), Expect = 3e-13, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 43/69 (62%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G I+ +QD Y G+Y+HT G + G HA+ ++GWG ++ YW+C NSWG WG+
Sbjct: 369 GPIIGTFAVYQDFSYYSGGIYEHTWGSLRGYHAIVVVGWGQDERGTYWICKNSWGTGWGE 428
Query: 69 GGLFKIRRG 77
G FKIR G
Sbjct: 429 AGWFKIRSG 437
>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
Length = 478
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
++LE+ H G + A E + D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 375 IKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAS 433
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW+ NSWG WG+ G F+IRRGTDE IES ++A + +
Sbjct: 434 GQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 477
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 51/91 (56%), Gaps = 1/91 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E++ G V A + D + YK GVY+H G+ GGHAV+I+GWG +G YW N
Sbjct: 239 FKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKLNGTPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
SW WG G F RG +E IE F+ AG
Sbjct: 299 SWDTDWGMNGHFLFLRGNNECGIE-FEGYAG 328
>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 68/126 (53%), Gaps = 19/126 (15%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWGVEDGVKYW 56
E+ G +V + E D + Y+ G+Y H+ + + H+V GWG EDGVK+W
Sbjct: 356 EVMKNGPVVLSFEPSYDFMYYESGIY-HSKAQTNDYAEWEKVDHSVLCYGWGEEDGVKFW 414
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESF----------QVSAGRVDRDRSSDLEEFEY 106
+ NSWG WG+GG F+++RG DES IES Q S+ +S++ +F+Y
Sbjct: 415 MLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDPYVITQNSSTSFSETKSNE-SDFDY 473
Query: 107 DTDTTI 112
+ D +I
Sbjct: 474 EDDDSI 479
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+ G A E + D + Y GVY+H G++ GGHAV+++GWG +G YW N
Sbjct: 239 FKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWGELNGEPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG G F I RG +E IES V+
Sbjct: 299 SWNHEWGMNGYFLIARGVNECGIESNGVA 327
>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
Length = 470
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 56/96 (58%), Gaps = 8/96 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE------MSGGHAVKIIGWGVED--G 52
M+LE+ G + A E + D + Y+ GVY HT + HAV ++G+G + G
Sbjct: 355 MKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTG 414
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
KYW+ NSWGE WG+ G F+IRRGTDE IES V
Sbjct: 415 EKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAV 450
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 48/82 (58%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G V + D + YK GVY+H G++ GGHAV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIES 85
WG G F I RG +E IES
Sbjct: 302 TDWGMNGHFLILRGNNECGIES 323
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 60/104 (57%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E ++D +YK G+Y HT+ M G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG++E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLGVWGRV 459
>gi|118380384|ref|XP_001023356.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89305123|gb|EAS03111.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 590
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 65/131 (49%), Gaps = 19/131 (14%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE-------------MSGGHAVKIIGW 47
M EI+ G IV + E D + Y KG+Y H+V H+V GW
Sbjct: 454 MMEEIYKNGPIVVSFEPKMDFMYYNKGIY-HSVDANQWIQNNEENPVWQKVDHSVLCYGW 512
Query: 48 GVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS-DLEEFEY 106
G ++ K+WL NSWGE WG+ G F++RRGTDES IES A V R S + EF
Sbjct: 513 GEDENGKFWLLQNSWGEEWGENGNFRMRRGTDESNIESMGERANIVKTARKSPNTTEF-- 570
Query: 107 DTDTTIESSSD 117
+T S SD
Sbjct: 571 --SSTYSSHSD 579
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A + ++DL+ YK GVY H G G H ++I+GWG EDGV YWL NSW
Sbjct: 247 EIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWGEEDGVPYWLVANSWN 306
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEY 106
E WG+ G ++ R +E I QV+AG DL F Y
Sbjct: 307 EDWGEKGYMRVLRWRNECGIVD-QVTAGL------PDLSNFPY 342
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 32/62 (51%), Positives = 42/62 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ EI+ G + A + D ++YK GVYQH GEM GGHA++I+GWGVE+G YWL N
Sbjct: 133 IMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVAN 192
Query: 61 SW 62
SW
Sbjct: 193 SW 194
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 50/101 (49%), Gaps = 16/101 (15%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG----------------HAVKI 44
M EI+ G + E + DL YK GVY+H E HAV +
Sbjct: 321 MMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLM 380
Query: 45 IGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
+GWGVE+G YW NSW WGD G FKI RG+DE +ES
Sbjct: 381 VGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVES 421
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 57/94 (60%), Gaps = 6/94 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG---HAVKIIGWGVEDGVKYWL 57
+Q E+ +G +V D +YK GVY + + + G K+IGWGVE+GV YWL
Sbjct: 213 IQKEVQTYGPVVVRFMVCDDFFLYKSGVYAKS--DKAKGIRTQYAKLIGWGVENGVDYWL 270
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAG 91
+NSWG WG GLFKI+ GT++ +ESF V AG
Sbjct: 271 VINSWGHEWGQKGLFKIKSGTNQCGVESF-VYAG 303
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLGVWGRV 459
>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
Length = 453
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M E+ H G + A E + D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 350 MMKELVHHGPMAVAFEVYPDFMHYAGGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 408
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G KYW+ NSWG WG+ G F+IRRG+DE IES V+A + +
Sbjct: 409 GEKYWIVKNSWGTSWGENGFFRIRRGSDECSIESIAVAATPIPK 452
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY+H G+ GG AVK++GWG +G YW NSW
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG GG I RG +E IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY+H G+ GG AVK++GWG +G YW NSW
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG GG I RG +E IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323
>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
CCMP2712]
Length = 85
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 54/85 (63%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV-GEMSGGHAVKIIGWGVEDGVKYWLCV 59
MQLE+ G V + + D YK GVY + + GGHAV ++GWG E+GV YWL
Sbjct: 1 MQLELMQNGPGVVVFDVYDDFYSYKSGVYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQ 60
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSWG+ GD G++K+R+G++E IE
Sbjct: 61 NSWGKSSGDEGMWKVRKGSNECGIE 85
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 79.0 bits (193), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
EI G + A +E H+D +Y G+Y+HT + G H+VKI GWG E DG
Sbjct: 160 EIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDG 219
Query: 53 V--KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
KYW+ NSWG+ WG+ G F+I RG +E IE+F + GR+
Sbjct: 220 TTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIGVWGRI 263
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLCVNSWGELW 66
G I A+ ++D YK GVY H G GGHAVKI+GWG + K YW+C NSWGE W
Sbjct: 263 GPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWGEDW 322
Query: 67 GDGGLFKIRRGTDESRI 83
G G F I RG E I
Sbjct: 323 GIKGYFWILRGRGECGI 339
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y++G+Y HT G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IE+F + GRV
Sbjct: 416 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 459
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y++G+Y HT G H+VKI GWG E DG
Sbjct: 304 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 363
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IE+F + GRV
Sbjct: 364 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 407
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 51/87 (58%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q EI G ++A+I + D ++YK GVY T + G ++IIGWG E + YWLC
Sbjct: 185 IQKEILMNGPVIASISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIPYWLCA 244
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSW E WG G KI+RG IES+
Sbjct: 245 NSWNEEWGANGYVKIQRGVQAGYIESY 271
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLGVWGRV 459
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 58/111 (52%), Gaps = 4/111 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQ--DLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWL 57
+ E++ G + A Q D YK GVY+H G + GGHAVK+IGWG D G YWL
Sbjct: 241 IMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWL 300
Query: 58 CVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDLEEFEYDT 108
N W WGD G FKI RG +E IE V+AG ++ + + T
Sbjct: 301 LANQWNRGWGDDGYFKIIRGENECGIEG-DVTAGMPSTKNTARNNDVAFGT 350
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY+H G+ GG AVK++GWG +G YW NSW
Sbjct: 243 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWD 302
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG GG I RG +E IE
Sbjct: 303 TDWGMGGYLLILRGNNECNIE 323
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 56/106 (52%), Gaps = 14/106 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE-MSGGHAVKIIGWGVEDG-------VKY 55
EI+ G + A + + YK GVY H + + M GGHA+KI+GWGVE KY
Sbjct: 225 EIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQKPTKY 284
Query: 56 WLCVNSWGELWGDGGLFKIRRGTD-----ESRIESFQVSAGRVDRD 96
W+C NSW WG G FKIRRG + E IE QV AG D
Sbjct: 285 WICANSWTADWGMNGFFKIRRGKNRFGQSECGIED-QVFAGHPKLD 329
>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
Length = 224
Score = 78.6 bits (192), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 52/92 (56%), Gaps = 6/92 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS----GGHAVKIIGWGV--EDGVK 54
+Q EI G + AA + D + Y GVY + ++ GGHAV ++GWG E G
Sbjct: 132 IQSEILSNGPVFAAFWVYSDFMAYTGGVYSASKEALAQGKTGGHAVMMVGWGTDKETGQD 191
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
YWL NSW E WGD G FKI+RG DE IES
Sbjct: 192 YWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 78.6 bits (192), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 59/104 (56%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y++G+Y HT G H+VKI GWG E DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 414
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IE+F + GRV
Sbjct: 415 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGVWGRV 458
>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
Length = 500
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 59/108 (54%), Gaps = 12/108 (11%)
Query: 3 LEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-----------HAVKIIGWGVED 51
+E++ G ++ E D + Y+ G+Y H+V E H+V GWG ED
Sbjct: 379 MELYTNGPVIMNFEPSYDFMYYESGIY-HSVAEHDWSTQERPEWEKVDHSVLCYGWGEED 437
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSS 99
GVK+WL NSWG WG+ G F+++RG DES IES +A V +S+
Sbjct: 438 GVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAADPVIYSKSN 485
>gi|290980380|ref|XP_002672910.1| predicted protein [Naegleria gruberi]
gi|284086490|gb|EFC40166.1| predicted protein [Naegleria gruberi]
Length = 302
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 54/94 (57%), Gaps = 6/94 (6%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMSGGHAVKIIGWGVEDGVKYWLC 58
MQ I GSI+ ++ +QD I Y GVY+H + + +I+GWG +GV YW+
Sbjct: 208 MQQAILQGGSIMTEMDVYQDFIYYSSGVYEHDPSFTQPIAKTVARIVGWGSLNGVNYWIV 267
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIE----SFQV 88
N WG+ WG G +RRGT+ES IE +FQV
Sbjct: 268 ANVWGKTWGLDGYVLVRRGTNESNIEKDAYAFQV 301
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 351 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 410
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 411 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 454
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEM--------SGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
VKYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 354
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT G H+VKI GWG E DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQSGIYSHTPISQGRPEQYRRHGTHSVKITGWGEEKLPDG 414
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RGT+E IESF + GRV
Sbjct: 415 RTIKYWTAANSWGPWWGERGHFRIVRGTNECDIESFVLGVWGRV 458
>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
Length = 483
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 55/99 (55%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
M+LE+ G + A E D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 380 MKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTN-HAVLLVGYGRDPDT 438
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G YW NSWG WG+ G F+IRRGTDE IES V+A
Sbjct: 439 GTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAA 477
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
M + + G + A + D Y GVYQH G M GGHAV+++G+G+ E G+KYW+
Sbjct: 207 MMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESGLKYWIIR 266
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSWG WG+GG F+I R +E IE
Sbjct: 267 NSWGPDWGEGGYFRIIRRVNECGIE 291
>gi|294888968|ref|XP_002772645.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239877055|gb|EER04461.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 419
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 64/111 (57%), Gaps = 9/111 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------GHAVKIIGWGVEDGVKY 55
E+ G +V +I+ D++ Y+ GVY+ + S GH+V +IG+GV++G Y
Sbjct: 304 ELVDDGPLVVSIKPAHDMMYYRSGVYRSDLERDSYHRPEWEEVGHSVLLIGYGVDNGEDY 363
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSDL-EEFE 105
WL NSWG WG+ G ++ RG DES +ES V+A V+ R D+ + FE
Sbjct: 364 WLIQNSWGPEWGEDGYLRLARGMDESGVESIAVAADVVEDQRPLDMFKNFE 414
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
[Cricetulus griseus]
Length = 470
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 57/104 (54%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWG--VED 51
M+LE+ G + A E D + Y G+Y HT E++ HAV ++G+G +
Sbjct: 367 MKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTN-HAVLLVGYGRDPDT 425
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW NSWG WG+ G F+IRRGTDE IES V+A + +
Sbjct: 426 GTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAAIPIPK 469
>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
Length = 158
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
M ++ G + A + ++D YK GVY H G M G HA+KI+GWGV+ K YW+C
Sbjct: 63 MMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVGWGVDSASKLPYWIC 122
Query: 59 VNSWGELWGDGGLFKIRRGTDE 80
NSWGE WG G F I RG E
Sbjct: 123 ANSWGEDWGLDGYFWIARGRGE 144
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 361 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 420
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 421 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 464
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 317 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 376
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 377 RTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 420
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHT---VGEMSGGHAVKIIGWGVE----DGVKYW 56
EI G + A ++ +D +YK GVY+ + G +G H+V+I+GWG E VKYW
Sbjct: 341 EILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYW 400
Query: 57 LCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+ NSWG WG+ G F+I +G DE IE F ++A
Sbjct: 401 IASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 459
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 211 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 270
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 271 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 314
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 354
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428
>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Nomascus leucogenys]
Length = 436
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 428
>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
Length = 455
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M +E+ G + A+E + D + YK G+Y HT E++ HAV ++G+G
Sbjct: 352 MMVELVKNGPMAVALEVYSDFMSYKGGIYHHTGLTDHVNPFELTN-HAVLLVGYGRCHMT 410
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G KYW+ NSWG WG+ G F+IRRG+DE IES V+A + +
Sbjct: 411 GQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPK 454
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 1/84 (1%)
Query: 2 QLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCVN 60
Q EI G ++A + D +K VY + HAV+++GWG DGV YW+ N
Sbjct: 185 QYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIAAN 244
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SWG WGD G FKIRRG+DE+ E
Sbjct: 245 SWGTGWGDKGYFKIRRGSDEAAFE 268
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 355 ELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDG 414
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 415 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 458
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 428
>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
niloticus]
Length = 461
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M LE+ G + A E + D + YK+G+Y HT E++ HAV ++G+G +
Sbjct: 358 MMLELVKNGPMAVAFEVYPDFMNYKEGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 416
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW+ NSWG WG+ G F+IRRG DE IES V+A + +
Sbjct: 417 GQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 460
>gi|442758365|gb|JAA71341.1| Hypothetical protein [Ixodes ricinus]
Length = 353
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 56/98 (57%), Gaps = 17/98 (17%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVED- 51
MQ+ +F +V +D +Y GVY+HT + S H+V+I+GWGV+
Sbjct: 229 MQMALFKHSMLVK-----EDFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRT 283
Query: 52 ---GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
KYWLC NSWG WG+ G F+I RG DES+IESF
Sbjct: 284 QYRPQKYWLCANSWGSGWGENGYFRIVRGEDESQIESF 321
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 235 ELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDG 294
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 295 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 338
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 343 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 402
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 403 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 446
>gi|23344736|gb|AAN28681.1| cathepsin B [Theromyzon tessulatum]
Length = 65
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 43/64 (67%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E+ G + AA+ + D + YK GVY H G+ GGHAVK+IGWGVE+ V YWL VNSWG
Sbjct: 2 ELMKHGPVEAALTVYSDFLQYKSGVYHHVAGDELGGHAVKLIGWGVENKVPYWLVVNSWG 61
Query: 64 ELWG 67
WG
Sbjct: 62 TTWG 65
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 41/69 (59%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + +QD Y GVY H G+ GGHAVKI+GWG + YW+ NSWGE WG+
Sbjct: 216 GPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGLENYWIVANSWGEDWGE 275
Query: 69 GGLFKIRRG 77
G F IR+G
Sbjct: 276 KGYFNIRQG 284
>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
Length = 218
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +YK G+Y HT + G H+VKI GWG E DG
Sbjct: 107 ELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 166
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 167 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 210
>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
Length = 462
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 60/104 (57%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVE--D 51
M+LE+ G + A E D + Y KG+Y HT E++ HAV ++G+G + +
Sbjct: 359 MKLELVQHGPMAVAFEVCDDFMHYHKGIYHHTGLRDPFNPFELTN-HAVLLVGYGTDSAN 417
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G+ YW+ NSWG WG+ G F+I RGTDE IES ++A + +
Sbjct: 418 GMDYWIVKNSWGTSWGEKGYFRILRGTDECAIESIAMAATPIPK 461
>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGV--ED 51
M LE+ G + A E + D + YK+G+Y HT E++ HAV ++G+G +
Sbjct: 352 MMLELVKNGPMAVAFEVYPDFMNYKEGIYHHTGLADPFNPFELTN-HAVLLVGYGRCHKT 410
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW+ NSWG WG+ G F+IRRG DE IES V+A + +
Sbjct: 411 GQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 454
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 41/73 (56%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
I G + + D YK G+Y H G GGHAVKI+GWG + YW+ NSWGE
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271
Query: 65 LWGDGGLFKIRRG 77
WG+ G F IR+G
Sbjct: 272 SWGEKGFFNIRQG 284
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 41/73 (56%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
I G + + D YK G+Y H G GGHAVKI+GWG + YW+ NSWGE
Sbjct: 212 IQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGSENYWIVANSWGE 271
Query: 65 LWGDGGLFKIRRG 77
WG+ G F IR+G
Sbjct: 272 SWGEKGFFNIRQG 284
>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
Length = 807
Score = 77.8 bits (190), Expect = 2e-12, Method: Composition-based stats.
Identities = 38/92 (41%), Positives = 57/92 (61%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIY-KKGVYQHTVG-EMSGGHAVKIIGWGVEDGVKYWLC 58
M +I+ G I ++ D KK +Y ++SGGHAV I+GWG E+GV YW C
Sbjct: 192 MMRDIYQNGPIAVSMYLANDFPPKDKKSIYVSGPNTKLSGGHAVMIVGWGEENGVPYWDC 251
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
N++G WGD G F+I+RG++E +IE++ +A
Sbjct: 252 ANTYGTNWGDHGYFRIKRGSNELKIETWPGAA 283
>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
Length = 599
Score = 77.8 bits (190), Expect = 2e-12, Method: Composition-based stats.
Identities = 43/99 (43%), Positives = 53/99 (53%), Gaps = 15/99 (15%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------------GHAVKIIG 46
M EI G IV + E D + Y++G+Y H+V H+V +G
Sbjct: 479 MMEEIHKNGPIVVSFEPAMDFMYYQEGIY-HSVDANDWILGDEDKLPQWEKVDHSVLCVG 537
Query: 47 WGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
WG + KYWL NSWGE WG+ G FKIRRGTDES IES
Sbjct: 538 WGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIES 576
>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
Length = 476
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 49/81 (60%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI+ G + +I+ DL+ YK G+Y+ G GH + ++GWG E+G+ YW+ NSWG
Sbjct: 99 EIYAHGPVTCSIDVPDDLLEYKGGIYEDKTGIAGDGHDISVVGWGEENGIPYWIVRNSWG 158
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG+ G F+I RG + IE
Sbjct: 159 TYWGEEGFFRIVRGKNNLGIE 179
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 48/88 (54%), Gaps = 2/88 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVK--YWLC 58
MQ EI+ G I ++ Q + Y GV+ G+ G HAV++ GWGV++ + YW+
Sbjct: 379 MQAEIYARGPISCVMDVTQTFLDYTGGVFTSREGKWLGKHAVEVTGWGVDEETRTPYWIV 438
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESF 86
NSWG WG+ G F+I G + IE
Sbjct: 439 RNSWGTYWGENGWFRIAMGQNLLNIEQM 466
>gi|339235557|ref|XP_003379333.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
gi|316978004|gb|EFV61033.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
Length = 448
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 60/108 (55%), Gaps = 16/108 (14%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------------VGEMSGGHAVKIIGW 47
M+L + + G + IEA DLI Y+ G+YQHT E++ HAV I+G+
Sbjct: 339 MRLALVNNGPLAVGIEAFDDLIHYRGGIYQHTKIHDDFNFPTKWNPFELTN-HAVLIVGY 397
Query: 48 GVE--DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRV 93
GV+ + YW+ NSWG WG+ G F+I+RG DE IES V A +
Sbjct: 398 GVDKKSNIPYWIVKNSWGTNWGEHGYFRIKRGVDECGIESLAVQATPI 445
>gi|218139209|gb|ACK57788.1| cathepsin C [Litopenaeus vannamei]
Length = 451
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+L + G ++ +E + D + YK G+Y HT E++ HAV ++G+G ++
Sbjct: 350 MKLALIKGGPLIVGLEVYDDFLHYKSGIYHHTGLQDRFNPLELTN-HAVLLVGYGEDEAT 408
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G KYW NSWGE WG+ G F+IRRG DE IES V A
Sbjct: 409 GEKYWSVKNSWGEEWGEDGYFRIRRGVDECAIESMAVEA 447
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 62/106 (58%), Gaps = 8/106 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED----GVKYW 56
++LEI G + A+ + D Y+KGVY + G GGHA+KIIGWG E + YW
Sbjct: 246 IKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYW 305
Query: 57 LCVNSWGELWG-DGGLFKIRRGTDESRIESFQVSAG--RVDRDRSS 99
L NSWG WG + G F+I RG + +IE +V AG +V + +S+
Sbjct: 306 LIANSWGTDWGENNGYFRILRGQNHCQIEQ-KVIAGMIKVPQPKSA 350
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%), Gaps = 2/85 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDGVKYWLCV 59
+Q I G +++ + ++D Y+ G Y+H G + GGHA+K++GWGV + V YW+
Sbjct: 256 VQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWIVA 314
Query: 60 NSWGELWGDGGLFKIRRGTDESRIE 84
NSW + WG G F I RGT+E IE
Sbjct: 315 NSWSDEWGMNGYFWILRGTNECSIE 339
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 54/99 (54%), Gaps = 9/99 (9%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVEDGVKY 55
E++ G + A +E H+D +YK G+Y+HT G H+VKI G KY
Sbjct: 387 ELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITGGRDGQTHKY 446
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
WL NSWG WG+ G F+I RG +E IE+F V GRV
Sbjct: 447 WLAANSWGRDWGEDGYFRIARGENECEIETFIVGVWGRV 485
>gi|387015548|gb|AFJ49893.1| Dipeptidyl peptidase 1-like [Crotalus adamanteus]
Length = 464
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG-------GHAVKIIGWG--VED 51
M+LE+ G + A E + D + Y G+Y HT G M HAV ++G+G +
Sbjct: 361 MKLELIKHGPMAVAFEVYNDFMYYSGGIYHHT-GLMDPFNPFELTNHAVLLVGYGSDPQT 419
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G +W+ NSWG WG+ G F+IRRG+DE IES V++ + +
Sbjct: 420 GQPFWIVKNSWGSSWGEEGYFRIRRGSDECAIESIAVASTPIPK 463
>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
Length = 568
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 59/104 (56%), Gaps = 10/104 (9%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+ E+ G + A E + D I Y+ G+Y HT E++ HAV ++G+G ++
Sbjct: 465 MKHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTN-HAVLLVGYGTDEKT 523
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
G YW+ NSWG WG+ G F+I RGTDE IES V+A + +
Sbjct: 524 GEDYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATPIPQ 567
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 56/100 (56%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSG-----GHAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H T GE HAVK++GWG G
Sbjct: 355 EIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQG 414
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 415 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 14/101 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSGG------HAVKIIGWGVEDGV- 53
EI G + A ++ H+D YK G+Y+H E SG HAVK+ GWG G
Sbjct: 364 EIMQNGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQ 423
Query: 54 ----KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 424 GRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMS------GGHAVKIIGWGVEDGV-- 53
EI H G + A ++ H+D YK G+Y+H + E S HAVK+ GWG G
Sbjct: 316 EIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQG 375
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG WG+ G F+I RG +ES IE ++A
Sbjct: 376 RKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415
>gi|294898698|ref|XP_002776344.1| cathepsin C, putative [Perkinsus marinus ATCC 50983]
gi|239883254|gb|EER08160.1| cathepsin C, putative [Perkinsus marinus ATCC 50983]
Length = 301
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 63/111 (56%), Gaps = 9/111 (8%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSG--------GHAVKIIGWGVEDGVKY 55
E+ G +V +I+ D++ Y+ GVY+ + S GH+V +IG+GV++G Y
Sbjct: 186 ELVDDGPLVVSIKPAHDMMYYRSGVYRSDLERDSYHRPEWEEVGHSVLLIGYGVDNGEDY 245
Query: 56 WLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDRDRSSD-LEEFE 105
WL NSWG WG+ G ++ RG DES +ES V+A V+ R D + FE
Sbjct: 246 WLIQNSWGPEWGEDGYLRLARGMDESGVESIAVAADVVEDRRPLDTFKNFE 296
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y++G+Y HT G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IE+F + GRV
Sbjct: 416 RTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGVWGRV 459
>gi|241861813|ref|XP_002416350.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
gi|215510564|gb|EEC20017.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
Length = 127
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 56/108 (51%), Gaps = 14/108 (12%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------------GGHAVKIIGWG 48
M+L + H G + E + D +Y+ GVY+HT S HAV + G+G
Sbjct: 20 MRLALVHGGPVAVGFEVYPDFQMYQGGVYRHTGVHRSLNLGSPFDPFELTNHAVLVTGYG 79
Query: 49 V--EDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVD 94
V E G+KYW NSWG WG+ G F+I RGTDE IES V A +
Sbjct: 80 VDKETGLKYWSVKNSWGPGWGENGYFRILRGTDECGIESLAVEASPIP 127
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ GVY HT G H+VKI GWG E DG
Sbjct: 325 ELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 384
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 385 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 428
>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
Length = 457
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 58/103 (56%), Gaps = 8/103 (7%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGV--EDG 52
M LE+ G + A E + D + YK+G+Y HT S HAV ++G+G G
Sbjct: 354 MMLELVKNGPMGVAFEVYPDFMHYKEGIYHHTGLHDSYNPFELTNHAVLLVGYGQCHVTG 413
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSAGRVDR 95
K+W+ NSWG WG+ G FK+RRG+DE IES V+A + +
Sbjct: 414 QKFWVVKNSWGTKWGEEGFFKVRRGSDECAIESIAVAAKPIPK 456
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ GVY HT G H+VKI GWG E DG
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 415
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 416 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 459
>gi|28804799|dbj|BAC57943.1| cathepsin C [Marsupenaeus japonicus]
Length = 449
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 56/98 (57%), Gaps = 8/98 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVED--G 52
M++ + G ++ +E + D + YK G+Y HT S HAV ++G+G ++ G
Sbjct: 348 MKIALIKGGPLIVGLEVYDDFLHYKSGIYHHTGLRDSFNPLELTNHAVLLVGYGEDETTG 407
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
KYW NSWGE WG+ G F+IRRG DE IES V A
Sbjct: 408 EKYWSVKNSWGEGWGEDGYFRIRRGVDECAIESMAVEA 445
>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
[Loxodonta africana]
Length = 468
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT G H+VKI GWG E DG
Sbjct: 357 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDG 416
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 417 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 460
>gi|226470090|emb|CAX70326.1| hypotherical protein [Schistosoma japonicum]
Length = 456
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
M+LE+ H G E ++D YK GVY HT + HAV ++G+GV+
Sbjct: 352 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 411
Query: 51 -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
G YW NSWG WG+ G F+IRRGTDE +ES V
Sbjct: 412 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 450
>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
Length = 68
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 43/61 (70%)
Query: 24 YKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRI 83
YK GVY++ G++ GGHAVK++GWG E G YWL NSW WG+ G FKI RG++E I
Sbjct: 4 YKSGVYKYIKGDLMGGHAVKLVGWGTEGGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 63
Query: 84 E 84
E
Sbjct: 64 E 64
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 62/115 (53%), Gaps = 13/115 (11%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH--------TVGEMSGGHAVKIIGWGVE----D 51
EI G + A ++D +Y GVYQH ++ G H+V+IIGWG +
Sbjct: 330 EIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGP 389
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRVDRDRSSDLEEFE 105
VKYWL NSWG WG+ GLF+I RG + IESF + A G+ + R +++ +
Sbjct: 390 QVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGAWGKGAKKRRFKVQKLQ 444
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H V HAVK+ GWG G
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARG 425
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|159488843|ref|XP_001702410.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
gi|158271078|gb|EDO96905.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
Length = 382
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 51/86 (59%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLI-IYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
M EI+H G I +D Y G+Y+ T G+ H V+++GWG EDG KYW+
Sbjct: 220 MMSEIYHRGPITCGQVCPEDFTWHYNGGIYKDTSGDTELDHDVEVVGWGEEDGEKYWIVR 279
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWG WG+ G F++RRG + ++ES
Sbjct: 280 NSWGTYWGERGFFRVRRGDNSLQLES 305
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H V HAVK+ GWG G
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARG 425
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|226470084|emb|CAX70323.1| hypotherical protein [Schistosoma japonicum]
Length = 462
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
M+LE+ H G E ++D YK GVY HT + HAV ++G+GV+
Sbjct: 358 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 417
Query: 51 -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
G YW NSWG WG+ G F+IRRGTDE +ES V
Sbjct: 418 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 456
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT G H+VKI GWG E DG
Sbjct: 326 ELMENGPVQALMEVHEDFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEETLPDG 385
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 386 RTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGVWGRV 429
>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
Length = 218
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 58/104 (55%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT + G H+VKI GWG E DG
Sbjct: 107 ELMENGPVQALMEVHEDFFLYEGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG 166
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 167 RTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGVWGRV 210
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H V HAVK+ GWG G
Sbjct: 366 EIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQG 425
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|340509247|gb|EGR34799.1| hypothetical protein IMG5_001760 [Ichthyophthirius multifiliis]
Length = 527
Score = 76.3 bits (186), Expect = 5e-12, Method: Composition-based stats.
Identities = 45/99 (45%), Positives = 54/99 (54%), Gaps = 15/99 (15%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE----MSGG---------HAVKIIGW 47
M +EI G IVA+I + YK GVY H+V ++G HA GW
Sbjct: 405 MMIEIMKNGPIVASINPDYQFMYYKSGVY-HSVEAAEWILNGQNAPEWRNVEHAALCYGW 463
Query: 48 G-VEDGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIES 85
G E KYWL NSWG+ WG+ G FKIRRGTDES +ES
Sbjct: 464 GESEKDGKYWLMQNSWGKEWGENGFFKIRRGTDESSVES 502
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY+H G+ GG AVK++GWG +G YW N+W
Sbjct: 242 ELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKLNGTPYWKVANTWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G I RG +E IE
Sbjct: 302 TDWGMDGYLLILRGNNECNIE 322
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 46/89 (51%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+ G + + D + Y GVY+H G GGHAV+I+GWG +G YW N
Sbjct: 239 FKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGELNGEPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG G F I RG DE IE V+
Sbjct: 299 SWNREWGMNGYFLIARGVDECGIEGSGVA 327
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 50/72 (69%), Gaps = 1/72 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
+Q EI +G + A + +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAM 248
Query: 60 NSWGELWGDGGL 71
NSW WG+ GL
Sbjct: 249 NSWNSNWGNDGL 260
>gi|56754987|gb|AAW25676.1| SJCHGC01753 protein [Schistosoma japonicum]
Length = 462
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 53/99 (53%), Gaps = 11/99 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
M+LE+ H G E ++D YK GVY HT + HAV ++G+GV+
Sbjct: 358 MRLELVHNGPFPVGFEVYEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 417
Query: 51 -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
G YW NSWG WG+ G F+IRRGTDE +ES V
Sbjct: 418 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 456
>gi|386001804|ref|YP_005920103.1| Integrins alpha chain [Methanosaeta harundinacea 6Ac]
gi|357209860|gb|AET64480.1| Integrins alpha chain [Methanosaeta harundinacea 6Ac]
Length = 882
Score = 75.9 bits (185), Expect = 6e-12, Method: Composition-based stats.
Identities = 28/77 (36%), Positives = 44/77 (57%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + + + ++D +Y +Y+H G + G V I+GWG + YW+C N
Sbjct: 253 IQQEVLFGGPVSSKMAVYEDFYLYDDDIYEHAAGALVGSQWVDILGWGTNNSTDYWICKN 312
Query: 61 SWGELWGDGGLFKIRRG 77
SWG WGD G F+I+ G
Sbjct: 313 SWGAAWGDSGWFRIKMG 329
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS--------GGHAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H + HAVK+ GWG G
Sbjct: 366 EIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGG 425
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|320162754|gb|EFW39653.1| papain family cysteine protease [Capsaspora owczarzaki ATCC 30864]
Length = 589
Score = 75.9 bits (185), Expect = 6e-12, Method: Composition-based stats.
Identities = 37/86 (43%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCV 59
M+ EIF G + I DLI Y GV+ T G + H+V + GWGV++ G YW V
Sbjct: 494 MKAEIFARGPVAVTIAVTTDLINYTGGVFHDTTGAIGDDHSVMLTGWGVDNSGTPYWTIV 553
Query: 60 NSWGELWGDGGLFKIRRGTDESRIES 85
NSWG WG+ G +I RG + IES
Sbjct: 554 NSWGTYWGETGAARIVRGVNNLGIES 579
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 30/85 (35%), Positives = 42/85 (49%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
M+ EIF G I I+A L Y GV+ H + ++GWG YW+ N
Sbjct: 194 MKAEIFARGPISCGIDATAALEAYTGGVFSEFSVLPIINHEISVVGWGTNGSTSYWIVRN 253
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
SWG +G+ G F+I+ G D IE+
Sbjct: 254 SWGSFYGEDGFFRIKMGGDNLAIET 278
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H + HAVK+ GWG G
Sbjct: 365 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARG 424
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 425 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 46/89 (51%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+ G + + D + Y GVY+H G GGHAV+I+GWG +G YW N
Sbjct: 239 FKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGELNGEPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG G F I RG DE IE V+
Sbjct: 299 SWNHEWGMNGYFLIARGVDECGIEGSGVA 327
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGVK- 54
EI G + A ++ H+D YK G+Y+H + HAVK+ GWG G K
Sbjct: 366 EIMKNGPVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKG 425
Query: 55 ----YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 426 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|158605299|gb|ABW74905.1| cathepsin C [Penaeus monodon]
Length = 449
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 10/99 (10%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHT-------VGEMSGGHAVKIIGWGVED-- 51
M+L + G ++ +E + D + YK G+Y HT E++ HAV ++G+G ++
Sbjct: 348 MKLALIKGGLLIVGLEVYDDFLHYKGGIYHHTGLQDRFNPLELTN-HAVLLVGYGEDEAT 406
Query: 52 GVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
G KYW NSWGE WG+ G F+IRRG DE IES V A
Sbjct: 407 GEKYWSVKNSWGEDWGEDGYFRIRRGVDECAIESMAVEA 445
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 50/77 (64%), Gaps = 1/77 (1%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVED-GVKYWLCVNSWGELWG 67
G + A + D + Y+ GVYQHT G + GGHAV+++G+G ++ V YW+ NSWG WG
Sbjct: 212 GPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVDYWIIRNSWGPDWG 271
Query: 68 DGGLFKIRRGTDESRIE 84
+ G F+I R T+E IE
Sbjct: 272 EDGYFRIIRMTNECGIE 288
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A ++D + Y GVY H +G GHAV+I+GWG V YWL NSW
Sbjct: 247 EIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIANSWN 306
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
E WG+ G K RG +E IE
Sbjct: 307 EDWGEEGYMKFLRGYNECGIE 327
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 57/104 (54%), Gaps = 14/104 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--------GEMSGGHAVKIIGWGVE---DG 52
E+ G + A +E H+D +Y+ G+Y HT G H+VKI GWG E DG
Sbjct: 251 ELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDG 310
Query: 53 --VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA-GRV 93
+KYW NSWG WG+ G F+I RG +E IESF + GRV
Sbjct: 311 RMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLGVWGRV 354
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 366 EIMQNGPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARG 425
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G FKI RG +ES IE ++A
Sbjct: 426 KKEKFWIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL+ Y GVY+HT G +S G HA++++G+G +DG YW NSWG W
Sbjct: 217 GPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNSWGSDW 276
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 277 GEDGYFRIVRGVNECRIE 294
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVY--QHTVGEMSGGHAVKIIGWGVEDGVKYWLC 58
+Q +I G ++A+ E +D Y GVY + G HAV I+GWGVED YWL
Sbjct: 250 IQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWGVEDNTPYWLV 309
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
NSWG +G G FKI RGT+E IES V++
Sbjct: 310 QNSWGTGFGIDGYFKIARGTNECNIESRLVTS 341
>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 204
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 50/87 (57%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q EI G ++A+I D ++YK GVY T + G ++IIGWG E YWLC
Sbjct: 110 IQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCA 169
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSW + WG+ G K+RRG IES+
Sbjct: 170 NSWSKEWGENGYVKVRRGVQAGYIESY 196
>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 229
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 50/87 (57%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGVEDGVKYWLCV 59
+Q EI G ++A+I D ++YK GVY T + G ++IIGWG E YWLC
Sbjct: 135 IQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCA 194
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSW + WG+ G K+RRG IES+
Sbjct: 195 NSWSKEWGENGYVKVRRGVQAGYIESY 221
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A ++D + Y GVY H +G GHAV+I+GWG V YWL NSW
Sbjct: 247 EIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGNVPYWLIANSWN 306
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G K RG +E IE V+AG
Sbjct: 307 EDWGEEGYMKFLRGYNECGIED-DVTAG 333
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 48/77 (62%), Gaps = 4/77 (5%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI G + A+ + D + YK GVY+ T GGHAVKIIGWG ED YWL VN
Sbjct: 306 IKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWG-ED---YWLVVN 361
Query: 61 SWGELWGDGGLFKIRRG 77
SW + WGD G+FKI G
Sbjct: 362 SWNKNWGDNGMFKIGCG 378
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG-HAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL+ Y GVY+HT G +S G HA++++G+G +DG YW NSWG W
Sbjct: 217 GPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWTIKNSWGSDW 276
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 277 GEDGYFRIVRGVNECRIE 294
>gi|226470086|emb|CAX70324.1| hypotherical protein [Schistosoma japonicum]
Length = 456
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 52/99 (52%), Gaps = 11/99 (11%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS---------GGHAVKIIGWGVE- 50
M+LE+ H G E +D YK GVY HT + HAV ++G+GV+
Sbjct: 352 MRLELVHNGPFPVGFEVFEDFEFYKDGVYHHTNVQNDRYSFNPFELTNHAVLLVGYGVDK 411
Query: 51 -DGVKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQV 88
G YW NSWG WG+ G F+IRRGTDE +ES V
Sbjct: 412 VSGEPYWKIKNSWGTEWGEKGYFRIRRGTDECGVESLGV 450
>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 203
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 2/84 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + +A + ++D YK GVY T E+ H VKIIGWG + +YWL +N
Sbjct: 112 IKQEIFDNGPVFSAFKMYEDFRYYKSGVYVPTTKEVLSFHLVKIIGWGADSVQEYWLAMN 171
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW E WGD GL K+ G ++R+E
Sbjct: 172 SWNEEWGDHGLIKMAFG--KNRLE 193
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 41/62 (66%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q +I G +VA ++D YK G+Y+HT G M+GGHAVKIIGWG E G YWL N
Sbjct: 133 IQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIAN 192
Query: 61 SW 62
SW
Sbjct: 193 SW 194
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H D YKKG+Y+H HA+K+ GWG G
Sbjct: 371 EIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQG 430
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 431 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 53/85 (62%), Gaps = 2/85 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + +A E ++D YK GVY T E+ H +KIIGWG + +YWL +N
Sbjct: 208 IKQEIFDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWGADSVREYWLAMN 267
Query: 61 SWGELWGDGGLFKIRRGTDESRIES 85
+W E WGD GL K+ G ++R+E+
Sbjct: 268 AWNEEWGDHGLIKMAFG--KNRLEN 290
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH---TVGEMSG-----GHAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H T E S HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVEDGVK- 54
EI G + A ++ H+D YK G+Y+H + + HAVK+ GWG+ G K
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKG 426
Query: 55 ----YWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 RKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466
>gi|402588459|gb|EJW82392.1| papain family cysteine protease containing protein [Wuchereria
bancrofti]
Length = 323
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 23 IYKKGVYQHT--VGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDE 80
YK G+ T M HA ++IG+G E+G KYWL NSWGE WGD G FKI RG +
Sbjct: 252 FYKSGILPDTDECSTMEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKIERGINA 311
Query: 81 SRIESFQVSAG 91
++E++ SAG
Sbjct: 312 CKVETYVASAG 322
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 46/89 (51%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+ E+ G + + D + Y GVY+H G GGHAV+I+GWG +G YW N
Sbjct: 239 FKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGELNGEPYWKIAN 298
Query: 61 SWGELWGDGGLFKIRRGTDESRIESFQVS 89
SW WG G F I RG DE IE V+
Sbjct: 299 SWNREWGMNGYFLIARGVDECGIEGSGVA 327
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 9/92 (9%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG---EMSGGHAVKIIGWGVEDG------VK 54
EI G + A + H+D Y+ G+Y H+ SG H+V+I+GWG E +K
Sbjct: 377 EILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIK 436
Query: 55 YWLCVNSWGELWGDGGLFKIRRGTDESRIESF 86
+W NSWG WG+ G F+I RG +E IESF
Sbjct: 437 FWRVANSWGRDWGEDGYFRIVRGNNECEIESF 468
>gi|405968896|gb|EKC33922.1| Dipeptidyl-peptidase 1, partial [Crassostrea gigas]
Length = 392
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 55/98 (56%), Gaps = 8/98 (8%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMS------GGHAVKIIGWGVE--DG 52
M++ + G + + E + D YK GVY HT + HAV ++G+GV+ G
Sbjct: 291 MKINLVKNGPLSVSFEVYNDFFHYKGGVYVHTGLQEKFNPFEITNHAVLLVGYGVDAATG 350
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
VK+W NSWG WG+ G F+IRRGTDE IES V +
Sbjct: 351 VKFWTVKNSWGTQWGEDGYFRIRRGTDECSIESIAVQS 388
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQH--TVGEMSGG------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H + E S HAVK+ GWG G
Sbjct: 259 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQG 318
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 319 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 49/72 (68%), Gaps = 1/72 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVE-DGVKYWLCV 59
+Q EI +G + A + +++ + YK+G+Y+ T GE+ G H VK+IGWGV+ DG +YWL +
Sbjct: 189 IQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWLAM 248
Query: 60 NSWGELWGDGGL 71
NSW WG GL
Sbjct: 249 NSWNSNWGTNGL 260
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 41/66 (62%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
+Q E+ G + AA ++D Y+KG+Y H+ G G HAVK++GWGVE+G KYW N
Sbjct: 254 IQREMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGVENGTKYWNVAN 313
Query: 61 SWGELW 66
SW W
Sbjct: 314 SWSTDW 319
>gi|170579333|ref|XP_001894785.1| Papain family cysteine protease containing protein [Brugia malayi]
gi|158598509|gb|EDP36387.1| Papain family cysteine protease containing protein [Brugia malayi]
Length = 324
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 23 IYKKGVYQHT--VGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGDGGLFKIRRGTDE 80
YK GV T M HA ++IG+G E+G KYWL NSWGE WGD G FK+ RG +
Sbjct: 253 FYKSGVLPDTDECSTMEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKMERGVNA 312
Query: 81 SRIESFQVSAG 91
++E++ SAG
Sbjct: 313 CKVETYVASAG 323
>gi|145529217|ref|XP_001450397.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418008|emb|CAK83000.1| unnamed protein product [Paramecium tetraurelia]
Length = 512
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 1/87 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKG-VYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCV 59
M++EIF+ G IV + A Q+L Y+ G ++ + H V ++GWGVEDGV+YW+
Sbjct: 418 MKIEIFNRGPIVCGVYATQELDDYEGGYIFSQKTNKTILNHYVSVVGWGVEDGVEYWIVR 477
Query: 60 NSWGELWGDGGLFKIRRGTDESRIESF 86
NSWG WGD G K++ +D +E +
Sbjct: 478 NSWGSYWGDMGYAKMKMHSDNLLLEHY 504
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 50/84 (59%), Gaps = 1/84 (1%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EI+ G +VAA + ++D G+Y H G +G HA K+IGWG E+G YWL N
Sbjct: 249 IRQEIYKNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAHADKVIGWGRENGTDYWLIAN 307
Query: 61 SWGELWGDGGLFKIRRGTDESRIE 84
SW WG+ G ++I R TD IE
Sbjct: 308 SWNTDWGEDGYYRIVRETDNCEIE 331
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--GEMSGG------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H E S HA+K+ GWG G
Sbjct: 363 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARG 422
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 423 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL Y+ GVY+HT G ++ G HA++I+G+G +DG YW+ NSWG W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGV-EDG--VKYWL 57
+Q +I G + AA +QD YK GVY+H G ++GGHA+KI+GWGV DG YW+
Sbjct: 213 IQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWI 272
Query: 58 CVNSWGELWGDGGLFKI 74
NSW WG G F I
Sbjct: 273 VANSWNTNWGQEGFFWI 289
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL Y+ GVY+HT G ++ G HA++I+G+G +DG YW+ NSWG W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294
>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 305
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 45/77 (58%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGELWGD 68
G + H+D + Y G+Y T G GGHAV I+G+G + YW+ NSWG WG+
Sbjct: 221 GPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNNHDYWIVRNSWGSDWGE 280
Query: 69 GGLFKIRRGTDESRIES 85
G F+I RGT+E IE+
Sbjct: 281 NGYFRILRGTNECGIEN 297
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/62 (51%), Positives = 42/62 (67%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVN 60
++ EIF G + A+ ++D YK G+Y HT G+ GGHAVKIIGWGVE+G K W+ N
Sbjct: 135 IRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIGWGVENGTKXWIVAN 194
Query: 61 SW 62
SW
Sbjct: 195 SW 196
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--------EMSGGHAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H + HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 QKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY++ G+ GG AV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G I RG +E IE
Sbjct: 302 TDWGMNGYMLILRGNNECNIE 322
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL Y+ GVY+HT G ++ G HA++I+G+G +DG YW+ NSWG W
Sbjct: 183 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 242
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 243 GENGYFRIVRGVNECRIE 260
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 44/81 (54%)
Query: 5 IFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
I+ + A + D ++YK YQ GEM GGHA+ I+G VE+ YWL N W
Sbjct: 254 IYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTSYWLVANXWNR 313
Query: 65 LWGDGGLFKIRRGTDESRIES 85
WGD G FKI RG D IES
Sbjct: 314 DWGDNGFFKILRGQDHYGIES 334
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 2/78 (2%)
Query: 9 GSIVAAIEAHQDLIIYKKGVYQHTVGEMS-GGHAVKIIGWGV-EDGVKYWLCVNSWGELW 66
G + I + DL Y+ GVY+HT G ++ G HA++I+G+G +DG YW+ NSWG W
Sbjct: 217 GPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDW 276
Query: 67 GDGGLFKIRRGTDESRIE 84
G+ G F+I RG +E RIE
Sbjct: 277 GENGYFRIVRGVNECRIE 294
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 45/81 (55%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
E++ G VA + DL YK GVY++ G+ GG AV+I+GWG +G YW NSW
Sbjct: 242 ELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWD 301
Query: 64 ELWGDGGLFKIRRGTDESRIE 84
WG G I RG +E IE
Sbjct: 302 TDWGMNGYMLILRGNNECNIE 322
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWGV G
Sbjct: 359 EIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQG 418
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 419 KKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWG 63
EI G + A ++D + YK GVY H+ G HA++I+GWG E V YWL NSW
Sbjct: 247 EIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGDVPYWLIANSWN 306
Query: 64 ELWGDGGLFKIRRGTDESRIESFQVSAG 91
E WG+ G K RG +E IE V+AG
Sbjct: 307 EDWGEKGYMKFLRGLNECGIED-DVTAG 333
>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 56/98 (57%), Gaps = 11/98 (11%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGE--MSG---------GHAVKIIGWGVEDG 52
EI++ G +V E D + Y G++ T + ++G H+V GWG E+G
Sbjct: 385 EIYNNGPVVLNFEPSFDFMFYVGGIFHSTTPDWIINGLAKPEWEKVDHSVLCYGWGEENG 444
Query: 53 VKYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
VKYWL NSWG+ WG+ G F+++RG DES IES +A
Sbjct: 445 VKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 482
>gi|324518532|gb|ADY47133.1| Cysteine proteinase [Ascaris suum]
Length = 334
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 2/86 (2%)
Query: 7 HFGSIVAAIEAHQDLIIYKKGVYQ--HTVGEMSGGHAVKIIGWGVEDGVKYWLCVNSWGE 64
++G + + + YK G+ + + +M HA +++G+GVEDG++YW+ NSWG
Sbjct: 247 NYGPVALNVAIPPNYKFYKSGIMRDSYECWQMQPNHAAEVVGFGVEDGIEYWIMKNSWGS 306
Query: 65 LWGDGGLFKIRRGTDESRIESFQVSA 90
WG+ G F+I RG + ++E+F SA
Sbjct: 307 WWGENGFFRIERGKNACQVETFATSA 332
>gi|145517168|ref|XP_001444467.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411889|emb|CAK77070.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTVG--EMSGGHAVKIIGWGVEDGVKYWLC 58
++ +I + G +VA ++ ++D ++Y+ GVYQ G GGHA+KIIGWG ++G +YW+
Sbjct: 247 IKRDIINRGPVVAIMQVYKDFLVYRDGVYQVLEGTPRFHGGHAIKIIGWGEQNGYQYWII 306
Query: 59 VNSWGELWGDGGLFKI 74
N+WG WG GL K+
Sbjct: 307 ENTWGTSWGTEGLAKL 322
>gi|321476446|gb|EFX87407.1| hypothetical protein DAPPUDRAFT_312322 [Daphnia pulex]
Length = 334
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 55/88 (62%), Gaps = 3/88 (3%)
Query: 1 MQLEIFHFGSIVAAIEAHQDLIIYKKGVYQHTV--GEMSGGHAVKIIGWGVEDGVKYWLC 58
MQ + +FG +VAA+ Q + Y GVY + G++ HAV ++GWG ++G+ YW+
Sbjct: 242 MQYALTNFGPLVAAMTVVQSFMDYASGVYDDKICDGKLVN-HAVVLVGWGNQNGIDYWIG 300
Query: 59 VNSWGELWGDGGLFKIRRGTDESRIESF 86
NSWG WG G F I+RG ++ +IE++
Sbjct: 301 RNSWGPGWGKEGYFLIQRGVNKCQIETY 328
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 4 EIFHFGSIVAAIEAHQDLIIYKKGVYQHTVGEMSGG--------HAVKIIGWGVEDGV-- 53
EI G + A ++ H+D YK G+Y+H HAVK+ GWG G
Sbjct: 367 EIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQG 426
Query: 54 ---KYWLCVNSWGELWGDGGLFKIRRGTDESRIESFQVSA 90
K+W+ NSWG+ WG+ G F+I RG +ES IE ++A
Sbjct: 427 RKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.138 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,878,089,180
Number of Sequences: 23463169
Number of extensions: 119982342
Number of successful extensions: 285999
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5033
Number of HSP's successfully gapped in prelim test: 1051
Number of HSP's that attempted gapping in prelim test: 279065
Number of HSP's gapped (non-prelim): 6236
length of query: 175
length of database: 8,064,228,071
effective HSP length: 132
effective length of query: 43
effective length of database: 9,262,057,059
effective search space: 398268453537
effective search space used: 398268453537
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)